An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: action-recognition

HoBeom/cv-arxiv-daily Fork of Vincentqyw/cv-arxiv-daily

🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)

Language: Python - Size: 6.04 MB - Last synced at: about 3 hours ago - Pushed at: about 4 hours ago - Stars: 1 - Forks: 0

PaddlePaddle/PaddleVideo

Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video tagging and sport action detection.

Language: Python - Size: 106 MB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 1,624 - Forks: 385

perseus784/Vehicle_Collision_Prediction_Using_CNN-LSTMs

Predict Vehicle collision moments before it happens in Carla!. CNN and LSTM hybrid architecture is used to understand a series of images.

Language: Python - Size: 55.1 MB - Last synced at: about 6 hours ago - Pushed at: about 1 year ago - Stars: 146 - Forks: 29

OpenGVLab/InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Language: Python - Size: 53.2 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1,915 - Forks: 112

axinc-ai/ailia-models

The collection of pre-trained, state-of-the-art AI models for ailia SDK

Language: Python - Size: 1.16 GB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2,205 - Forks: 344

extreme-assistant/ICCV2023-Paper-Code-Interpretation

ICCV2021/2019/2017 论文/代码/解读/直播合集,极市团队整理

Size: 697 KB - Last synced at: 7 days ago - Pushed at: almost 2 years ago - Stars: 2,300 - Forks: 1,403

minghu0830/OphNet-benchmark

OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding

Language: Python - Size: 37 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 50 - Forks: 5

open-mmlab/mmskeleton

A OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.

Language: Python - Size: 91.2 MB - Last synced at: 8 days ago - Pushed at: over 2 years ago - Stars: 3,001 - Forks: 1,046

isLinXu/paper-list

autoupdate paper list

Language: Python - Size: 140 MB - Last synced at: 8 days ago - Pushed at: 10 days ago - Stars: 84 - Forks: 9

HarishValliappan/Fewshot-PoseTransformer-ActionClassifier

Language: Jupyter Notebook - Size: 3.25 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

sayakpaul/Action-Recognition-in-TensorFlow

Contains additional materials for two keras.io blog posts.

Language: Jupyter Notebook - Size: 19.6 MB - Last synced at: about 20 hours ago - Pushed at: almost 4 years ago - Stars: 17 - Forks: 3

lRomul/ball-action-spotting

SoccerNet@CVPR | 1st place solution for Ball Action Spotting Challenge 2023

Language: Python - Size: 619 KB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 115 - Forks: 16

niais/Awesome-Skeleton-based-Action-Recognition

Skeleton-based Action Recognition

Language: HTML - Size: 219 KB - Last synced at: 7 days ago - Pushed at: about 2 years ago - Stars: 683 - Forks: 121

open-edge-platform/training_extensions

Train, Evaluate, Optimize, Deploy Computer Vision Models via OpenVINO™

Language: Python - Size: 416 MB - Last synced at: 11 days ago - Pushed at: 14 days ago - Stars: 1,192 - Forks: 451

yjxiong/temporal-segment-networks

Code & Models for Temporal Segment Networks (TSN) in ECCV 2016

Language: Python - Size: 2.01 MB - Last synced at: 8 days ago - Pushed at: over 4 years ago - Stars: 1,563 - Forks: 476

yjxiong/tsn-pytorch

Temporal Segment Networks (TSN) in PyTorch

Language: Python - Size: 30.3 KB - Last synced at: 8 days ago - Pushed at: almost 6 years ago - Stars: 1,074 - Forks: 310

movienet/movienet-tools

Tools for movie and video research

Language: C++ - Size: 6.56 MB - Last synced at: 7 days ago - Pushed at: about 3 years ago - Stars: 290 - Forks: 35

hnuzhy/CV_DL_Gather

Gather research papers, corresponding codes (if having), reading notes and any other related materials about Hot🔥🔥🔥 fields in Computer Vision based on Deep Learning.

Size: 37.6 MB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 74 - Forks: 6

OpenGVLab/VideoMAEv2

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Language: Python - Size: 935 KB - Last synced at: 14 days ago - Pushed at: 9 months ago - Stars: 642 - Forks: 76

AdaptiveMotorControlLab/LLaVAction

Language: Jupyter Notebook - Size: 16.9 MB - Last synced at: 21 days ago - Pushed at: 3 months ago - Stars: 35 - Forks: 1

kennymckormick/pyskl

A toolbox for skeleton-based action recognition.

Language: Python - Size: 2.09 MB - Last synced at: 22 days ago - Pushed at: 3 months ago - Stars: 1,099 - Forks: 200

kenshohara/3D-ResNets-PyTorch

3D ResNets for Action Recognition (CVPR 2018)

Language: Python - Size: 328 KB - Last synced at: 22 days ago - Pushed at: over 4 years ago - Stars: 3,982 - Forks: 933

ParitoshParmar/Fitness-AQA

Fitness Action Quality Assessment or your AI-Fitness Coach [ECCV 2022]

Language: Python - Size: 4.21 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 75 - Forks: 37

rlleshi/phar

deep learning sex position classifier

Language: Python - Size: 1.72 MB - Last synced at: 19 days ago - Pushed at: about 1 year ago - Stars: 270 - Forks: 28

nghorbani/amass

Data preparation and loader for AMASS

Language: Jupyter Notebook - Size: 8.06 MB - Last synced at: 27 days ago - Pushed at: 11 months ago - Stars: 774 - Forks: 93

RaivoKoot/Video-Dataset-Loading-Pytorch

Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.

Language: Python - Size: 6.54 MB - Last synced at: 27 days ago - Pushed at: over 2 years ago - Stars: 463 - Forks: 44

open-mmlab/mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Language: Python - Size: 68.2 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 4,601 - Forks: 1,286

DmitryRyumin/CVPR-2023-24-Papers

CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included. ⭐ support visual intelligence development!

Language: Python - Size: 10.3 MB - Last synced at: 18 days ago - Pushed at: 11 months ago - Stars: 451 - Forks: 30

dmlc/gluon-cv

Gluon CV Toolkit

Language: Python - Size: 37.8 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 5,888 - Forks: 1,206

kenshohara/video-classification-3d-cnn-pytorch

Video classification tools using 3D ResNet

Language: Python - Size: 154 KB - Last synced at: 27 days ago - Pushed at: over 6 years ago - Stars: 1,122 - Forks: 260

NVlabs/STEP

STEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR'19 (Oral)

Language: Python - Size: 4.04 MB - Last synced at: 8 days ago - Pushed at: over 5 years ago - Stars: 251 - Forks: 48

MVIG-SJTU/AlphAction

Spatio-Temporal Action Localization System

Language: Python - Size: 296 KB - Last synced at: 25 days ago - Pushed at: about 3 years ago - Stars: 421 - Forks: 76

bryanyzhu/two-stream-pytorch

PyTorch implementation of two-stream networks for video action recognition

Language: Python - Size: 43.6 MB - Last synced at: 26 days ago - Pushed at: over 4 years ago - Stars: 580 - Forks: 148

sutdcv/UAV-Human

[CVPR2021] UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles

Language: Python - Size: 977 KB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 224 - Forks: 14

ZJCV/TSN

[ECCV 2016] Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

Language: Python - Size: 519 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 7 - Forks: 3

ParitoshParmar/MTL-AQA

What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]

Language: Python - Size: 27.7 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 68 - Forks: 15

jinwchoi/awesome-action-recognition

A curated list of action recognition and related area resources

Size: 270 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 3,893 - Forks: 727

Necolizer/ISTA-Net

[IROS 2023] Interactive Spatiotemporal Token Attention Network for Skeleton-based General Interactive Action Recognition

Language: Python - Size: 9.98 MB - Last synced at: about 16 hours ago - Pushed at: over 1 year ago - Stars: 20 - Forks: 1

cagbal/Skeleton-Based-Action-Recognition-Papers-and-Notes

Skeleton-based Action Recognition Papers and Small Notes and Top 2 Leaderboard for NTU-RGBD

Size: 109 KB - Last synced at: about 2 months ago - Pushed at: almost 5 years ago - Stars: 150 - Forks: 25

firework8/Awesome-Skeleton-based-Action-Recognition

A curated paper list of awesome skeleton-based action recognition.

Size: 443 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 517 - Forks: 65

Necolizer/CHASE

[NeurIPS 2024] CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition

Language: Python - Size: 3.04 MB - Last synced at: about 16 hours ago - Pushed at: 5 months ago - Stars: 14 - Forks: 0

open-mmlab/mmaction

An open-source toolbox for action understanding based on PyTorch

Language: Python - Size: 3.95 MB - Last synced at: 29 days ago - Pushed at: about 3 years ago - Stars: 1,871 - Forks: 351

epic-kitchens/epic-kitchens-55-annotations

🍴 Annotations for the EPIC KITCHENS-55 Dataset.

Language: Python - Size: 29.4 MB - Last synced at: 30 days ago - Pushed at: over 4 years ago - Stars: 151 - Forks: 26

hwang-cs-ime/IVAC-P2L

[TMM-2025] The official implementation of "IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting".

Language: Python - Size: 1.01 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 24 - Forks: 1

RubenAMtz/ai-sports-assistant

Olympic weightlifting videos analysis, action recognition and assessment.

Language: Python - Size: 41.3 MB - Last synced at: 16 days ago - Pushed at: almost 2 years ago - Stars: 31 - Forks: 2

alibaba-mmai-research/TAdaConv

[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.

Language: Python - Size: 1.64 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 234 - Forks: 33

cmhungsteve/TA3N

[ICCV 2019 (Oral)] Temporal Attentive Alignment for Large-Scale Video Domain Adaptation (PyTorch)

Language: Python - Size: 1.68 MB - Last synced at: 27 days ago - Pushed at: 7 months ago - Stars: 263 - Forks: 40

eriklindernoren/Action-Recognition

Exploration of different solutions to action recognition in video, using neural networks implemented in PyTorch.

Language: Python - Size: 8.51 MB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 197 - Forks: 75

DirtyHarryLYL/HAKE-Action-Torch

HAKE-Action in PyTorch

Size: 128 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 230 - Forks: 48

tomar840/two-stream-fusion-for-action-recognition-in-videos

Language: Python - Size: 208 KB - Last synced at: 14 days ago - Pushed at: almost 7 years ago - Stars: 91 - Forks: 29

jeffreyyihuang/two-stream-action-recognition

Using two stream architecture to implement a classic action recognition method on UCF101 dataset

Language: Python - Size: 25.8 MB - Last synced at: 4 days ago - Pushed at: over 5 years ago - Stars: 873 - Forks: 249

USTC-Video-Understanding/I3D_Finetune

TensorFlow code for finetuning I3D model on UCF101.

Language: Python - Size: 731 KB - Last synced at: about 2 months ago - Pushed at: about 7 years ago - Stars: 145 - Forks: 43

kiyoon/channel_sampling

Official implementation of "Capturing Temporal Information in a Single Frame: Channel Sampling Strategies for Action Recognition", BMVC 2022

Language: Python - Size: 1.25 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 2

DirtyHarryLYL/Transferable-Interactiveness-Network

Code for Transferable Interactiveness Knowledge for Human-Object Interaction Detection. (CVPR'19, TPAMI'21)

Language: Python - Size: 12.2 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 232 - Forks: 42

felixchenfy/Realtime-Action-Recognition

Apply ML to the skeletons from OpenPose; 9 actions; multiple people. (WARNING: I'm sorry that this is only good for course demo, not for real world applications !!! Those ary very difficult !!!)

Language: Python - Size: 6.68 MB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 913 - Forks: 262

Cogito2012/DEAR

[ICCV 2021 Oral] Deep Evidential Action Recognition

Language: Python - Size: 210 MB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 128 - Forks: 19

ADL-X/LLAVIDAL

This is the offical repository of LLAVIDAL

Language: Python - Size: 32 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 13 - Forks: 1

yjxiong/action-detection

temporal action detection with SSN

Language: Python - Size: 7.78 MB - Last synced at: 8 days ago - Pushed at: almost 6 years ago - Stars: 644 - Forks: 177

AlexanderMelde/SPHAR-Dataset

Surveillance Perspective Human Action Recognition Dataset: 7759 Videos from 14 Action Classes, aggregated from multiple sources, all cropped spatio-temporally and filmed from a surveillance-camera like position.

Language: Python - Size: 5.47 GB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 95 - Forks: 18

CAMMA-public/rendezvous

A transformer-inspired neural network for surgical action triplet recognition from laparoscopic videos.

Language: Python - Size: 2.24 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 24 - Forks: 10

dukebw/lintel

A Python module to decode video frames directly, using the FFmpeg C API.

Language: C - Size: 73.2 KB - Last synced at: 2 months ago - Pushed at: about 6 years ago - Stars: 261 - Forks: 38

ChenFengYe/SportsCap

[IJCV 2021] SportsCap: Monocular 3D Human Motion Capture and Fine-grained Understanding in Challenging Sports Videos

Language: Python - Size: 17.3 MB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 127 - Forks: 13

mahshid1378/mmskeleton

A OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.

Language: Python - Size: 54 MB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

amathislab/DLC2action

DLC2Action is an action segmentation package that makes running and tracking of machine learning experiments easy.

Language: HTML - Size: 10.3 MB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 25 - Forks: 3

mmact19/2019

MMAct: A Large-Scale Dataset for Cross Modal Learning on Human Action Understanding

Language: JavaScript - Size: 43.5 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 1

vra/action-recognition-using-3d-resnet

Use 3D ResNet to extract features of UCF101 and HMDB51 and then classify them.

Language: Python - Size: 166 KB - Last synced at: about 2 months ago - Pushed at: over 6 years ago - Stars: 42 - Forks: 12

ZJCV/X3D

[CVPR 2020] X3D: Expanding Architectures for Efficient Video Recognition

Language: Python - Size: 146 KB - Last synced at: 16 days ago - Pushed at: over 4 years ago - Stars: 20 - Forks: 4

HHTseng/video-classification

Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101

Language: Jupyter Notebook - Size: 9.84 MB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 958 - Forks: 217

shukkkur/VolleyVision

Applying Deep Learning Approaches to Volleyball Data

Language: Python - Size: 919 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 201 - Forks: 36

DirtyHarryLYL/HAKE

HAKE: Human Activity Knowledge Engine (CVPR'18/19/20, NeurIPS'20, TPAMI'21)

Language: Python - Size: 21.8 MB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 224 - Forks: 14

laura-wang/video_repres_mas

code for CVPR-2019 paper: Self-supervised Spatio-temporal Representation Learning for Videos by Predicting Motion and Appearance Statistics

Language: Python - Size: 1.09 MB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 62 - Forks: 10

DirtyHarryLYL/HOI-Learning-List

A list of Human-Object Interaction Learning.

Size: 327 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 617 - Forks: 58

imsoo/fight_detection

Real time Fight Detection Based on 2D Pose Estimation and RNN Action Recognition

Language: C++ - Size: 502 KB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 209 - Forks: 43

kenshohara/video-classification-3d-cnn

Video classification tools using 3D ResNet

Language: Lua - Size: 143 KB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 23 - Forks: 6

kenshohara/3D-ResNets

3D ResNets for Action Recognition

Language: Lua - Size: 28.3 KB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 117 - Forks: 21

MCG-NJU/VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language: Python - Size: 547 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1,451 - Forks: 142

vt-vl-lab/SDN

[NeurIPS 2019] Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition

Language: Python - Size: 40 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 83 - Forks: 13

masashi-hatano/MM-CDFSL

[ECCV 2024] Official code release for "Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition"

Language: Python - Size: 6.39 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 26 - Forks: 1

ykotseruba/JAAD

Annotation data for JAAD (Joint Attention in Autonomous Driving) Dataset

Language: Python - Size: 41.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 173 - Forks: 57

kracwarlock/action-recognition-visual-attention

Action recognition using soft attention based deep recurrent neural networks

Language: Jupyter Notebook - Size: 985 KB - Last synced at: about 1 month ago - Pushed at: over 8 years ago - Stars: 350 - Forks: 158

EdoWhite/Gate-Shift-Pose

A sport-tailored, pose-enhanced action recognition framework

Language: Python - Size: 673 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

bryanyzhu/Hidden-Two-Stream

Caffe implementation for "Hidden Two-Stream Convolutional Networks for Action Recognition"

Language: C++ - Size: 10.4 MB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 194 - Forks: 68

VNOpenAI/pushup-counter-app

Count pushups from video/webcam. Tech stack: Keypoint detection, BlazePose, action recognition.

Language: Python - Size: 30.3 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 75 - Forks: 28

kiyoon/verb_ambiguity

Official implementation of "An Action Is Worth Multiple Words: Handling Ambiguity in Action Recognition", BMVC 2022

Language: Python - Size: 1.07 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 12 - Forks: 0

miquelmarti/Okutama-Action

Okutama-Action: An Aerial View Video Dataset for Concurrent Human Action Detection

Language: CSS - Size: 836 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 64 - Forks: 6

elicassion/3DTRL

Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"

Language: Python - Size: 24.3 MB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 20 - Forks: 0

sovit-123/Video-Recognition-using-Deep-Learning

This project uses deep learning and the PyTorch framework to detect sports action categories in videos in real-time. The neural network is a simple custom neural network built with PyTorch.

Language: Python - Size: 61.8 MB - Last synced at: 2 months ago - Pushed at: about 5 years ago - Stars: 4 - Forks: 2

kiyoon/nvim-hand-gesture

Write programs with hand gestures

Language: Python - Size: 36.1 KB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 92 - Forks: 2

markbotros1/action-recognition

A Video Vision Transformer (ViViT) model for detecting incidences of contact between between NFL players in play-by-play video footage

Language: Python - Size: 4.3 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 2

rohitgirdhar/ActionVLAD

ActionVLAD for video action classification (CVPR 2017)

Language: Python - Size: 13 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 215 - Forks: 61

wanglimin/UntrimmedNet

Weakly Supervised Action Recognition and Detection

Language: Matlab - Size: 1.23 MB - Last synced at: 8 days ago - Pushed at: over 6 years ago - Stars: 161 - Forks: 48

berlin0308/Video-Behavior-Recognition

Language: Python - Size: 1.3 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

The-Martyr/OccludeNet-Dataset

OccludeNet: A Causal Journey into Mixed-View Actor-Centric Video Action Recognition amidst Occlusions

Size: 7.34 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 5 - Forks: 0

vt-vl-lab/video-data-aug

Learning Representational Invariances for Data-Efficient Action Recognition

Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 33 - Forks: 5

Marc-Kruiss/SignLanguage-ActionDetection

This project allows to train and test sign language data to identify numbers, the alphabet and poses with the help of opencv and mediapipe

Language: Python - Size: 6.29 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

daili0015/ModelFeast

Pytorch model zoo for human, include all kinds of 2D CNN, 3D CNN, and CRNN

Language: Python - Size: 1.89 MB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 155 - Forks: 38

karolzak/conv3d-video-action-recognition

My experimentation around action recognition in videos. Contains Keras implementation for C3D network based on original paper "Learning Spatiotemporal Features with 3D Convolutional Networks", Tran et al. and it includes video processing pipelines coded using mPyPl package. Model is being benchmarked on popular UCF101 dataset and achieves results similar to those reported by authors

Language: Python - Size: 1.8 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 54 - Forks: 10

DmitryRyumin/FG-2024-Papers

FG 2024 Papers: Explore a comprehensive collection of research papers presented at one of the premier conferences on automatic face and gesture recognition. Seamlessly integrate code implementations for better understanding. ⭐ Experience the cutting edge of progress in facial analysis, gesture recognition, and biometrics with this repository!

Size: 6.17 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 13 - Forks: 1

YangLiu9208/SAKDN

[IEEE T-IP 2021] Semantics-aware Adaptive Knowledge Distillation for Cross-modal Action Recognition

Language: Python - Size: 86.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 23 - Forks: 3

JunweiLiang/MultiTrain

Code and model for "Multi-dataset Training of Transformers for Robust Action Recognition", NeurIPS 2022 Spotlight

Language: Python - Size: 400 KB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 20 - Forks: 1

Related Keywords
action-recognition 516 deep-learning 146 pytorch 115 computer-vision 104 video-understanding 63 tensorflow 42 video-classification 37 machine-learning 36 lstm 31 video 30 python 29 dataset 28 cnn 28 video-recognition 26 action-detection 24 activity-recognition 23 self-supervised-learning 22 pose-estimation 22 skeleton-based-action-recognition 22 keras 21 convolutional-neural-networks 21 ucf101 19 transfer-learning 18 deep-neural-networks 17 object-detection 17 human-activity-recognition 15 c3d 14 transformer 14 video-processing 14 image-classification 13 opencv 12 domain-adaptation 11 unsupervised-learning 11 skeleton 11 hmdb51 11 representation-learning 11 mediapipe 10 keras-tensorflow 10 gesture-recognition 10 vision-transformer 9 feature-extraction 9 artificial-intelligence 9 python3 9 ucf-101 9 optical-flow 8 neural-network 8 human-action-recognition 8 ntu-rgbd 8 i3d 8 human-object-interaction 8 temporal-action-detection 8 image-segmentation 7 video-dataset 7 temporal-action-localization 7 two-stream-cnn 7 action-localization 7 semantic-segmentation 7 caffe 7 action-classification 7 tsm 6 video-analysis 6 slowfast 6 lrcn 6 epic-kitchens 6 openpose 6 human-pose-estimation 6 pytorch-implementation 6 spatio-temporal 6 temporal-segment-networks 6 benchmark 6 lstm-neural-networks 6 zero-shot-learning 6 anomaly-detection 5 fine-grained-classification 5 recognition 5 graph-neural-networks 5 image-generation 5 3dcnn 5 emotion-recognition 5 video-representation-learning 5 youtube 5 resnet 5 tensorflow2 5 attention-mechanism 5 action-triplet 5 few-shot-learning 5 tsn 5 action 5 kinetics-datasets 5 ava 5 classification 5 neural-networks 5 annotations 4 multimodal 4 cnn-keras 4 video-representation 4 contrastive-learning 4 seq2seq 4 papers 4 action-anticipation 4