An open API service providing repository metadata for many open source software ecosystems.

Topic: "action-recognition"

dmlc/gluon-cv

Gluon CV Toolkit

Language: Python - Size: 37.8 MB - Last synced at: 18 days ago - Pushed at: 6 months ago - Stars: 5,888 - Forks: 1,206

open-mmlab/mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Language: Python - Size: 68.2 MB - Last synced at: 18 days ago - Pushed at: 10 months ago - Stars: 4,601 - Forks: 1,286

kenshohara/3D-ResNets-PyTorch

3D ResNets for Action Recognition (CVPR 2018)

Language: Python - Size: 328 KB - Last synced at: 9 days ago - Pushed at: over 4 years ago - Stars: 3,982 - Forks: 933

jinwchoi/awesome-action-recognition

A curated list of action recognition and related area resources

Size: 270 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 3,893 - Forks: 727

open-mmlab/mmskeleton

An OpenMMLab toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.

Language: Python - Size: 91.2 MB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 3,000 - Forks: 1,045

extreme-assistant/ICCV2023-Paper-Code-Interpretation

ICCV 2021/2019/2017 collection of papers, code, interpretations, and livestreams, compiled by the 极市 (Jishi) team

Size: 697 KB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 2,300 - Forks: 1,404

axinc-ai/ailia-models

The collection of pre-trained, state-of-the-art AI models for ailia SDK

Language: Python - Size: 1.16 GB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 2,199 - Forks: 344

OpenGVLab/InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Language: Python - Size: 53.2 MB - Last synced at: 1 day ago - Pushed at: 13 days ago - Stars: 1,899 - Forks: 112

open-mmlab/mmaction

An open-source toolbox for action understanding based on PyTorch

Language: Python - Size: 3.95 MB - Last synced at: 16 days ago - Pushed at: about 3 years ago - Stars: 1,871 - Forks: 351

PaddlePaddle/PaddleVideo

Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton-based action recognition models, and practical applications for video tagging and sport action detection.

Language: Python - Size: 106 MB - Last synced at: 18 days ago - Pushed at: 4 months ago - Stars: 1,616 - Forks: 384

yjxiong/temporal-segment-networks

Code & Models for Temporal Segment Networks (TSN) in ECCV 2016

Language: Python - Size: 2.01 MB - Last synced at: 4 days ago - Pushed at: over 4 years ago - Stars: 1,562 - Forks: 476

MCG-NJU/VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language: Python - Size: 547 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1,451 - Forks: 142

open-edge-platform/training_extensions

Train, Evaluate, Optimize, Deploy Computer Vision Models via OpenVINO™

Language: Python - Size: 417 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 1,189 - Forks: 451

kenshohara/video-classification-3d-cnn-pytorch

Video classification tools using 3D ResNet

Language: Python - Size: 154 KB - Last synced at: 15 days ago - Pushed at: over 6 years ago - Stars: 1,122 - Forks: 260

kennymckormick/pyskl

A toolbox for skeleton-based action recognition.

Language: Python - Size: 2.09 MB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 1,099 - Forks: 200

yjxiong/tsn-pytorch

Temporal Segment Networks (TSN) in PyTorch

Language: Python - Size: 30.3 KB - Last synced at: 4 days ago - Pushed at: almost 6 years ago - Stars: 1,073 - Forks: 310

HHTseng/video-classification

Tutorial for video classification/action recognition using 3D CNN and CNN+RNN on UCF101

Language: Jupyter Notebook - Size: 9.84 MB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 958 - Forks: 217

felixchenfy/Realtime-Action-Recognition

Apply ML to the skeletons from OpenPose; 9 actions; multiple people. (WARNING: I'm sorry that this is only good for a course demo, not for real-world applications! Those are very difficult!)

Language: Python - Size: 6.68 MB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 913 - Forks: 262

jeffreyyihuang/two-stream-action-recognition

Using two stream architecture to implement a classic action recognition method on UCF101 dataset

Language: Python - Size: 25.8 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 824 - Forks: 257

nghorbani/amass

Data preparation and loader for AMASS

Language: Jupyter Notebook - Size: 8.06 MB - Last synced at: 15 days ago - Pushed at: 11 months ago - Stars: 774 - Forks: 93

niais/Awesome-Skeleton-based-Action-Recognition

Skeleton-based Action Recognition

Language: HTML - Size: 219 KB - Last synced at: 25 days ago - Pushed at: about 2 years ago - Stars: 681 - Forks: 121

yjxiong/action-detection

Temporal action detection with SSN

Language: Python - Size: 7.78 MB - Last synced at: 4 days ago - Pushed at: almost 6 years ago - Stars: 644 - Forks: 177

OpenGVLab/VideoMAEv2

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Language: Python - Size: 935 KB - Last synced at: 1 day ago - Pushed at: 8 months ago - Stars: 642 - Forks: 76

DirtyHarryLYL/HOI-Learning-List

A list of Human-Object Interaction Learning.

Size: 327 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 617 - Forks: 58

bryanyzhu/two-stream-pytorch

PyTorch implementation of two-stream networks for video action recognition

Language: Python - Size: 43.6 MB - Last synced at: 14 days ago - Pushed at: over 4 years ago - Stars: 580 - Forks: 148

firework8/Awesome-Skeleton-based-Action-Recognition

A curated paper list of awesome skeleton-based action recognition.

Size: 443 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 517 - Forks: 65

RaivoKoot/Video-Dataset-Loading-Pytorch

Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.

Language: Python - Size: 6.54 MB - Last synced at: 15 days ago - Pushed at: over 2 years ago - Stars: 463 - Forks: 44

DmitryRyumin/CVPR-2023-24-Papers

CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included. ⭐ support visual intelligence development!

Language: Python - Size: 10.3 MB - Last synced at: 6 days ago - Pushed at: 11 months ago - Stars: 451 - Forks: 30

yoosan/video-understanding-dataset

A collection of recent video understanding datasets, under construction!

Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 438 - Forks: 79

MVIG-SJTU/AlphAction

Spatio-Temporal Action Localization System

Language: Python - Size: 296 KB - Last synced at: 13 days ago - Pushed at: about 3 years ago - Stars: 421 - Forks: 76

kenziyuliu/MS-G3D

[CVPR 2020 Oral] PyTorch implementation of "Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition"

Language: Python - Size: 84.7 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 392 - Forks: 94

MCG-NJU/TDN

[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition

Language: Python - Size: 690 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 361 - Forks: 54

kracwarlock/action-recognition-visual-attention

Action recognition using soft attention based deep recurrent neural networks

Language: Jupyter Notebook - Size: 985 KB - Last synced at: 19 days ago - Pushed at: over 8 years ago - Stars: 350 - Forks: 158

gurkirt/realtime-action-detection

This repository hosts the code for the real-time action detection paper

Language: MATLAB - Size: 114 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 315 - Forks: 97

movienet/movienet-tools

Tools for movie and video research

Language: C++ - Size: 6.56 MB - Last synced at: 2 days ago - Pushed at: almost 3 years ago - Stars: 290 - Forks: 35

rlleshi/phar

deep learning sex position classifier

Language: Python - Size: 1.72 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 270 - Forks: 28

cmhungsteve/TA3N

[ICCV 2019 (Oral)] Temporal Attentive Alignment for Large-Scale Video Domain Adaptation (PyTorch)

Language: Python - Size: 1.68 MB - Last synced at: 15 days ago - Pushed at: 7 months ago - Stars: 263 - Forks: 40

moabitcoin/ig65m-pytorch

PyTorch 3D video classification models pre-trained on 65 million Instagram videos

Language: Python - Size: 20.1 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 263 - Forks: 28

dukebw/lintel

A Python module to decode video frames directly, using the FFmpeg C API.

Language: C - Size: 73.2 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 261 - Forks: 38

vt-vl-lab/iCAN

[BMVC 2018] iCAN: Instance-Centric Attention Network for Human-Object Interaction Detection

Language: Python - Size: 25.5 MB - Last synced at: about 2 months ago - Pushed at: almost 5 years ago - Stars: 260 - Forks: 57

mx-mark/VideoTransformer-pytorch

PyTorch implementation of a collection of scalable Video Transformer benchmarks.

Language: Python - Size: 4.17 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 257 - Forks: 35

rohitgirdhar/AttentionalPoolingAction

Code/Model release for NIPS 2017 paper "Attentional Pooling for Action Recognition"

Language: Python - Size: 3.69 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 256 - Forks: 65

guiggh/hand_pose_action

Dataset and code for the paper "First-Person Hand Action Benchmark with RGB-D Videos and 3D Hand Pose Annotations", CVPR 2018.

Language: Python - Size: 21.1 MB - Last synced at: 11 months ago - Pushed at: over 6 years ago - Stars: 253 - Forks: 32

NVlabs/STEP

STEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral)

Language: Python - Size: 4.04 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 246 - Forks: 48

coderSkyChen/Action_Recognition_Zoo

Code for popular action recognition models, verified on the Something-Something dataset.

Language: Python - Size: 43.4 MB - Last synced at: 2 months ago - Pushed at: about 6 years ago - Stars: 244 - Forks: 35

fandulu/DD-Net

A lightweight network for body/hand action recognition

Language: Jupyter Notebook - Size: 53.3 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 237 - Forks: 49

alibaba-mmai-research/TAdaConv

[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.

Language: Python - Size: 1.64 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 234 - Forks: 33

DirtyHarryLYL/Transferable-Interactiveness-Network

Code for Transferable Interactiveness Knowledge for Human-Object Interaction Detection. (CVPR'19, TPAMI'21)

Language: Python - Size: 12.2 MB - Last synced at: 18 days ago - Pushed at: about 2 years ago - Stars: 232 - Forks: 42

DirtyHarryLYL/HAKE-Action-Torch

HAKE-Action in PyTorch

Size: 128 MB - Last synced at: 18 days ago - Pushed at: 11 months ago - Stars: 230 - Forks: 48

DirtyHarryLYL/HAKE

HAKE: Human Activity Knowledge Engine (CVPR'18/19/20, NeurIPS'20, TPAMI'21)

Language: Python - Size: 21.8 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 224 - Forks: 14

sutdcv/UAV-Human

[CVPR2021] UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles

Language: Python - Size: 977 KB - Last synced at: 26 days ago - Pushed at: almost 3 years ago - Stars: 224 - Forks: 14

MichiganCOG/ViP

Video Platform for Action Recognition and Object Detection in Pytorch

Language: Python - Size: 694 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 221 - Forks: 35

rohitgirdhar/ActionVLAD

ActionVLAD for video action classification (CVPR 2017)

Language: Python - Size: 13 MB - Last synced at: 25 days ago - Pushed at: over 6 years ago - Stars: 215 - Forks: 61

wmcnally/golfdb

GolfDB is a video database for Golf Swing Sequencing, which involves detecting 8 golf swing events in trimmed golf swing videos. This repo demos the baseline model, SwingNet.

Language: Python - Size: 646 KB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 212 - Forks: 64

imsoo/fight_detection

Real time Fight Detection Based on 2D Pose Estimation and RNN Action Recognition

Language: C++ - Size: 502 KB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 209 - Forks: 43

shukkkur/VolleyVision

Applying Deep Learning Approaches to Volleyball Data

Language: Python - Size: 919 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 201 - Forks: 36

eriklindernoren/Action-Recognition

Exploration of different solutions to action recognition in video, using neural networks implemented in PyTorch.

Language: Python - Size: 8.51 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 197 - Forks: 75

kevin-ssy/Optical-Flow-Guided-Feature

Implementation Code of the paper Optical Flow Guided Feature, CVPR 2018

Language: C++ - Size: 7.33 MB - Last synced at: 2 months ago - Pushed at: about 6 years ago - Stars: 196 - Forks: 44

bryanyzhu/Hidden-Two-Stream

Caffe implementation for "Hidden Two-Stream Convolutional Networks for Action Recognition"

Language: C++ - Size: 10.4 MB - Last synced at: 2 months ago - Pushed at: over 7 years ago - Stars: 194 - Forks: 68

woodfrog/ActionRecognition

Explore Action Recognition

Language: Python - Size: 1.03 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 193 - Forks: 54

hbilen/dynamic-image-nets

Dynamic Image Networks for Action Recognition

Language: Matlab - Size: 3.48 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 180 - Forks: 66

chuckcho/video-caffe

Video-friendly caffe -- comes with the most recent version of Caffe (as of Jan 2019), a video reader, 3D(ND) pooling layer, and an example training script for C3D network and UCF-101 data

Language: C++ - Size: 46.3 MB - Last synced at: 12 months ago - Pushed at: over 6 years ago - Stars: 176 - Forks: 93

axon-research/c3d-keras

C3D for Keras + TensorFlow

Language: Python - Size: 2.94 MB - Last synced at: 7 months ago - Pushed at: almost 8 years ago - Stars: 176 - Forks: 77

ykotseruba/JAAD

Annotation data for JAAD (Joint Attention in Autonomous Driving) Dataset

Language: Python - Size: 41.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 173 - Forks: 57

chinancheng/awesome-activity-prediction

Paper list of activity prediction and related area

Size: 30.3 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 167 - Forks: 34

wanglimin/UntrimmedNet

Weakly Supervised Action Recognition and Detection

Language: Matlab - Size: 1.23 MB - Last synced at: 4 days ago - Pushed at: over 6 years ago - Stars: 161 - Forks: 48

xyzforever/BEVT

PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529

Language: Python - Size: 19.2 MB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 158 - Forks: 19

noureldien/timeception

Timeception for Complex Action Recognition, CVPR 2019 (Oral Presentation)

Language: Python - Size: 603 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 158 - Forks: 34

whwu95/BIKE

【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models

Language: Python - Size: 9.01 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 155 - Forks: 18

daili0015/ModelFeast

PyTorch model zoo for humans, including all kinds of 2D CNN, 3D CNN, and CRNN models

Language: Python - Size: 1.89 MB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 155 - Forks: 38

epic-kitchens/epic-kitchens-55-annotations

🍴 Annotations for the EPIC KITCHENS-55 Dataset.

Language: Python - Size: 29.4 MB - Last synced at: 17 days ago - Pushed at: about 4 years ago - Stars: 151 - Forks: 26

hikvision-research/skelact

Skeleton-based action recognition models in PyTorch, including Two-Stream CNN, HCN, HCN-Baseline, Ta-CNN and Dynamic GCN

Language: Python - Size: 35.2 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 150 - Forks: 21

cagbal/Skeleton-Based-Action-Recognition-Papers-and-Notes

Skeleton-based Action Recognition Papers and Small Notes and Top 2 Leaderboard for NTU-RGBD

Size: 109 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 150 - Forks: 25

whwu95/Text4Vis

【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective

Language: Python - Size: 8.66 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 149 - Forks: 13

USTC-Video-Understanding/I3D_Finetune

TensorFlow code for finetuning I3D model on UCF101.

Language: Python - Size: 731 KB - Last synced at: about 1 month ago - Pushed at: almost 7 years ago - Stars: 145 - Forks: 43

perseus784/Vehicle_Collision_Prediction_Using_CNN-LSTMs

Predict vehicle collisions moments before they happen in CARLA! A hybrid CNN and LSTM architecture is used to understand a series of images.

Language: Python - Size: 55.1 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 144 - Forks: 29

dlpbc/keras-kinetics-i3d

Keras implementation of Inflated 3D (I3D) from the Quo Vadis paper + weights

Language: Python - Size: 2.7 MB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 137 - Forks: 45

sujoyp/wtalc-pytorch

W-TALC: Weakly-supervised Temporal Activity Localization and Classification

Language: Python - Size: 35.9 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 131 - Forks: 25

sutdcv/Animal-Kingdom

[CVPR2022] Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding

Language: Python - Size: 165 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 128 - Forks: 12

Cogito2012/DEAR

[ICCV 2021 Oral] Deep Evidential Action Recognition

Language: Python - Size: 210 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 128 - Forks: 19

xlliu7/TadTR

[TIP 2022] End-to-end Temporal Action Detection with Transformer

Language: Python - Size: 194 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 128 - Forks: 13

ChenFengYe/SportsCap

[IJCV 2021] SportsCap: Monocular 3D Human Motion Capture and Fine-grained Understanding in Challenging Sports Videos

Language: Python - Size: 17.3 MB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 127 - Forks: 13

akshitac8/tfvaegan

[ECCV 2020] Official Pytorch implementation for "Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification". SOTA results for ZSL and GZSL

Language: Python - Size: 1000 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 124 - Forks: 29

wushidonguc/two-stream-action-recognition-keras

Two-stream CNNs for video action recognition implemented in Keras

Language: Python - Size: 93.8 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 117 - Forks: 46

kenshohara/3D-ResNets

3D ResNets for Action Recognition

Language: Lua - Size: 28.3 KB - Last synced at: 2 months ago - Pushed at: over 7 years ago - Stars: 117 - Forks: 21

lRomul/ball-action-spotting

SoccerNet@CVPR | 1st place solution for Ball Action Spotting Challenge 2023

Language: Python - Size: 619 KB - Last synced at: 9 days ago - Pushed at: almost 2 years ago - Stars: 114 - Forks: 16

haamoon/mmtm

Implementation of CVPR 2020 paper "MMTM: Multimodal Transfer Module for CNN Fusion"

Language: Python - Size: 47.9 KB - Last synced at: 5 months ago - Pushed at: almost 5 years ago - Stars: 112 - Forks: 21

MichiganCOG/M-PACT

A one stop shop for all of your activity recognition needs.

Language: Python - Size: 458 MB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 106 - Forks: 24

shlizee/Predict-Cluster

Repository for PREDICT & CLUSTER: Unsupervised Skeleton Based Action Recognition

Language: Jupyter Notebook - Size: 52 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 104 - Forks: 23

ldkong1205/TranSVAE

Unsupervised Video Domain Adaptation for Action Recognition: A Disentanglement Perspective

Language: Python - Size: 64.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 103 - Forks: 10

zhang-can/PAN-PyTorch

[Codes of paper]: PAN: Towards Fast Action Recognition via Learning Persistence of Appearance

Language: Python - Size: 47.9 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 103 - Forks: 11

wanglimin/TDD

Trajectory-pooled Deep-Convolutional Descriptors

Language: Matlab - Size: 520 KB - Last synced at: 12 months ago - Pushed at: almost 8 years ago - Stars: 103 - Forks: 75

DirtyHarryLYL/HAKE-Action

As a part of the HAKE project, includes the reproduced SOTA models and the corresponding HAKE-enhanced versions (CVPR2020).

Size: 39.8 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 100 - Forks: 13

xdshang/VidVRD-helper

To keep up to date with the VRU Grand Challenge, please use https://github.com/NExTplusplus/VidVRD-helper

Language: Python - Size: 192 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 99 - Forks: 28

ASMIftekhar/VSGNet

VSGNet: Spatial Attention Network for Detecting Human Object Interactions Using Graph Convolutions.

Language: Python - Size: 5.66 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 98 - Forks: 21

ekazakos/temporal-binding-network

Implementation of "EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition, ICCV, 2019" in PyTorch

Language: Python - Size: 43.5 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 98 - Forks: 25

AlexanderMelde/SPHAR-Dataset

Surveillance Perspective Human Action Recognition Dataset: 7759 videos from 14 action classes, aggregated from multiple sources, all cropped spatio-temporally and filmed from a surveillance-camera-like position.

Language: Python - Size: 5.47 GB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 95 - Forks: 18

rohitgirdhar/CATER

CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning

Language: Python - Size: 105 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 93 - Forks: 19

xingyul/cpnet

Learning Video Representations from Correspondence Proposals (CVPR 2019 Oral)

Language: Python - Size: 610 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 93 - Forks: 12

kiyoon/nvim-hand-gesture

Write programs with hand gestures

Language: Python - Size: 36.1 KB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 92 - Forks: 2

Related Topics
deep-learning 146 pytorch 115 computer-vision 104 video-understanding 63 tensorflow 42 video-classification 37 machine-learning 36 lstm 31 python 30 video 30 dataset 28 cnn 28 video-recognition 26 action-detection 24 activity-recognition 23 self-supervised-learning 22 skeleton-based-action-recognition 22 pose-estimation 21 convolutional-neural-networks 21 keras 21 ucf101 19 transfer-learning 18 deep-neural-networks 17 object-detection 17 human-activity-recognition 15 c3d 14 video-processing 14 image-classification 13 transformer 13 opencv 12 hmdb51 11 domain-adaptation 11 skeleton 11 representation-learning 11 unsupervised-learning 11 mediapipe 10 gesture-recognition 10 keras-tensorflow 10 vision-transformer 9 feature-extraction 9 python3 9 ucf-101 9 artificial-intelligence 9 i3d 8 human-action-recognition 8 ntu-rgbd 8 optical-flow 8 temporal-action-detection 8 human-object-interaction 8 neural-network 8 temporal-action-localization 7 caffe 7 two-stream-cnn 7 image-segmentation 7 semantic-segmentation 7 video-dataset 7 action-localization 7 action-classification 7 slowfast 6 epic-kitchens 6 video-analysis 6 pytorch-implementation 6 openpose 6 human-pose-estimation 6 lstm-neural-networks 6 temporal-segment-networks 6 lrcn 6 spatio-temporal 6 benchmark 6 tsm 6 zero-shot-learning 6 action-triplet 5 emotion-recognition 5 kinetics-datasets 5 attention-mechanism 5 video-representation-learning 5 tensorflow2 5 image-generation 5 graph-neural-networks 5 fine-grained-classification 5 resnet 5 classification 5 action 5 3dcnn 5 recognition 5 tsn 5 anomaly-detection 5 neural-networks 5 few-shot-learning 5 ava 5 youtube 5 unsupervised-machine-learning 4 group-activity-recognition 4 resnet-50 4 annotations 4 action-anticipation 4 object-recognition 4 multimodal 4 cctv 4 action-prediction 4