Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: video-classification

OpenGVLab/InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal dialogue model with performance approaching GPT-4V.

Language: Python - Size: 21.8 MB - Last synced: about 17 hours ago - Pushed: 1 day ago - Stars: 3,258 - Forks: 249

comp-imaging-sci/attention-based-bilstm-sleep-scoring

Code related to the paper "Attention-Based CNN-BiLSTM for Sleep States Classification of Spatiotemporal Wide-Field Calcium Imaging Data"

Language: Jupyter Notebook - Size: 54.2 MB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 2 - Forks: 0

TasneemMohammed/Engagement-Detection-Using-Hybrid-EfficientNetB7-Together-With-TCN-LSTM-and-Bi-LSTM

Students Engagement Detection Using Hybrid EfficientNetB7 Together With TCN, LSTM, and Bi-LSTM (DAiSEE and VRESEE datasets)

Language: Jupyter Notebook - Size: 3.5 MB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 1 - Forks: 0

cmhungsteve/TA3N

[ICCV 2019 (Oral)] Temporal Attentive Alignment for Large-Scale Video Domain Adaptation (PyTorch)

Language: Python - Size: 1.64 MB - Last synced: 7 days ago - Pushed: 8 days ago - Stars: 256 - Forks: 41

olivesgatech/TA3N

[ICCV 2019 Oral] TA3N: https://github.com/cmhungsteve/TA3N (Most updated repo)

Language: Python - Size: 839 KB - Last synced: 7 days ago - Pushed: 8 days ago - Stars: 45 - Forks: 2

kili-technology/kili-python-sdk

Simplest and fastest image and text annotation tool.

Language: Jupyter Notebook - Size: 388 MB - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 223 - Forks: 27

matin-ghorbani/Video-Classification-Transformers

Implement a video classification using transformers

Language: Jupyter Notebook - Size: 7.81 KB - Last synced: 16 days ago - Pushed: 16 days ago - Stars: 1 - Forks: 0

fcakyon/video-transformers

Easiest way of fine-tuning HuggingFace video classification models

Language: Python - Size: 72.3 KB - Last synced: 2 days ago - Pushed: about 1 year ago - Stars: 127 - Forks: 12

majd-alhafi/Video-classification

Language: Jupyter Notebook - Size: 14.6 KB - Last synced: 18 days ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

PavloFesenko/gif_analyzer

TV show recognizer from GIF images

Language: Jupyter Notebook - Size: 369 KB - Last synced: 19 days ago - Pushed: 19 days ago - Stars: 4 - Forks: 4

cosmaadrian/multimodal-depression-from-video

Official source code for the paper: "Reading Between the Frames Multi-Modal Non-Verbal Depression Detection in Videos"

Language: Python - Size: 370 KB - Last synced: 21 days ago - Pushed: 21 days ago - Stars: 20 - Forks: 2

daniel-code/TubeViT

An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"

Language: Python - Size: 1.11 MB - Last synced: 22 days ago - Pushed: 23 days ago - Stars: 75 - Forks: 7

lucidrains/TimeSformer-pytorch

Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification

Language: Python - Size: 181 KB - Last synced: 3 days ago - Pushed: almost 3 years ago - Stars: 672 - Forks: 85

masouduut94/volleyball_analytics

This project is designed to display how we can utilize deep learning methods for Sports Data Analytics.

Language: Jupyter Notebook - Size: 97.1 MB - Last synced: 26 days ago - Pushed: 26 days ago - Stars: 11 - Forks: 1

eriklindernoren/Action-Recognition

Exploration of different solutions to action recognition in video, using neural networks implemented in PyTorch.

Language: Python - Size: 8.51 MB - Last synced: 15 days ago - Pushed: over 4 years ago - Stars: 174 - Forks: 72

maxime7770/Videos-You-Love-To-Take

Personal video classification using Deep Learning techniques

Language: Jupyter Notebook - Size: 549 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

agrawal-rohit/youtube-video-classification

Youtube video classification using machine learning

Language: Jupyter Notebook - Size: 1.37 MB - Last synced: about 1 month ago - Pushed: 5 months ago - Stars: 2 - Forks: 5

Andreas-UI/VCAB

Video Classification Autism Behaviour

Language: Python - Size: 121 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

Sense-X/UniFormer

[ICLR2022] official implementation of UniFormer

Language: Python - Size: 31.4 MB - Last synced: about 2 months ago - Pushed: 2 months ago - Stars: 777 - Forks: 107

lucidrains/STAM-pytorch

Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification

Language: Python - Size: 64.5 KB - Last synced: 5 days ago - Pushed: about 3 years ago - Stars: 122 - Forks: 15

open-mmlab/mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Language: Python - Size: 68.2 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 3,866 - Forks: 1,173

innat/VideoMAE

[NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language: Jupyter Notebook - Size: 13.9 MB - Last synced: about 2 months ago - Pushed: 5 months ago - Stars: 12 - Forks: 2

innat/mae-spatiotemporal

Unofficial keras implementation of masked autoencoders as spatiotemporal learners

Size: 2.02 MB - Last synced: about 2 months ago - Pushed: 9 months ago - Stars: 2 - Forks: 0

kingardor/Activity-Recognition-TensorRT

3D ResNet Video Classification accelerated by TensorRT

Language: Python - Size: 9.9 MB - Last synced: about 1 month ago - Pushed: almost 3 years ago - Stars: 44 - Forks: 11

deepankarvarma/Skin-Cancer-Detection--OpenCV-TensorFlow-Keras

This repository contains Python code for generating a skin cancer detection model and utilizing it to detect skin cancer from user-inputted images or videos. The model architecture follows a sequential structure consisting of convolutional and pooling layers, with the final output layer using a sigmoid activation function.

Language: Python - Size: 5.86 KB - Last synced: about 1 month ago - Pushed: about 1 year ago - Stars: 2 - Forks: 2

Event-AHU/SSTFormer

[PokerEvent Benchmark Dataset & SNN-ANN Baseline] Official PyTorch implementation of "SSTFormer: Bridging Spiking Neural Network and Memory Support Transformer for Frame-Event based Recognition"

Language: Python - Size: 7.75 MB - Last synced: 26 days ago - Pushed: 4 months ago - Stars: 13 - Forks: 1

HuaizhengZhang/Awsome-Deep-Learning-for-Video-Analysis

Papers, code and datasets about deep learning and multi-modal learning for video analysis

Size: 98.6 KB - Last synced: about 2 months ago - Pushed: over 2 years ago - Stars: 714 - Forks: 166

17Skye17/VideoLT

Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)

Language: Python - Size: 36.2 MB - Last synced: 23 days ago - Pushed: about 2 years ago - Stars: 33 - Forks: 3

HHTseng/video-classification

Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101

Language: Jupyter Notebook - Size: 9.84 MB - Last synced: about 2 months ago - Pushed: over 3 years ago - Stars: 898 - Forks: 215

MCG-NJU/TDN

[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition

Language: Python - Size: 690 KB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 361 - Forks: 54

lucidrains/uniformer-pytorch

Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, debuted in ICLR 2022

Language: Python - Size: 443 KB - Last synced: 21 days ago - Pushed: about 2 years ago - Stars: 97 - Forks: 4

VinAIResearch/fsvc-ata

Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments (ECCV 2022)

Language: Python - Size: 149 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 20 - Forks: 0

innat/VideoSwin

Keras Implementation of Video Swin Transformers for 3D Video Modeling

Language: Jupyter Notebook - Size: 7.69 MB - Last synced: about 2 months ago - Pushed: 2 months ago - Stars: 18 - Forks: 2

mwoodson1/temporal-pooling-networks

Code for my entry in the Youtube8M Kaggle competition. Currently exploring applications to other video based problems

Language: Python - Size: 258 KB - Last synced: 3 months ago - Pushed: almost 7 years ago - Stars: 2 - Forks: 1

n1ghtf4l1/decipher-engine

Detect and Translate American Sign Language (ASL) fingerspelling into text.

Size: 1.14 MB - Last synced: about 2 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

zuliani99/VideoClassification-CNN

Lite version of the following Oxford project: Large-scale Video Classification with Convolutional Neural Networks

Language: Jupyter Notebook - Size: 60 MB - Last synced: 26 days ago - Pushed: about 1 year ago - Stars: 6 - Forks: 1

amitparag/Attention-Classification

Slip detection with Franka Emika and GelSight Sensors

Language: Jupyter Notebook - Size: 1.19 GB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

sportzhang/paddle_youtube

Video classification with the NeXtVLAD model, implemented using Baidu's Paddle framework.

Language: Python - Size: 140 KB - Last synced: 2 months ago - Pushed: over 4 years ago - Stars: 11 - Forks: 6

dager23/SlowFast-Custom-Data

An advanced action recognition and Video Classifier

Language: Python - Size: 60.5 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

alibaba-mmai-research/TAdaConv

[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.

Language: Python - Size: 1.64 MB - Last synced: 3 months ago - Pushed: 10 months ago - Stars: 208 - Forks: 29

SathyasriS27/Automated-Crime-Detection

A crime detection and campus management system that helps in the detection, classification and subsequent mitigation of crimes occurring in a region of surveillance.

Language: Jupyter Notebook - Size: 447 MB - Last synced: 4 months ago - Pushed: almost 3 years ago - Stars: 2 - Forks: 2

chenxuluo/GST-video

[ICCV 2019] Grouped Spatial-Temporal Aggregation for Efficient Action Recognition

Language: Python - Size: 14.6 KB - Last synced: 5 months ago - Pushed: over 4 years ago - Stars: 43 - Forks: 11

Pk13055/transcript-based-classification

Video safety classification on the basis of transcripts.

Language: Jupyter Notebook - Size: 165 MB - Last synced: 5 months ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0

khanhdq109/Pipeline-for-Hand-Gesture-Recognition

Developing a pipeline for Hand Gesture Recognition

Language: Python - Size: 402 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 2 - Forks: 0

wadhwasahil/Video-Classification-2-Stream-CNN

Video Classification using 2 stream CNN

Language: Python - Size: 62 MB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 309 - Forks: 109

Abdulrahman-Adel/Real-Life-Violence-Detection

Training a vision-transformer-based model to detect violence in real-life videos

Language: Python - Size: 1020 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

adlyZaroui/Event-Lip-Reading

Classification of event data video

Language: Jupyter Notebook - Size: 422 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 1 - Forks: 0

rlleshi/phar

deep learning sex position classifier

Language: Python - Size: 1.72 MB - Last synced: 7 months ago - Pushed: 8 months ago - Stars: 164 - Forks: 20

JohnPPinto/HMDB51_human_motion_recognition_pytorch

A project on video classification using PyTorch 2.0.

Language: Jupyter Notebook - Size: 4.54 MB - Last synced: about 1 month ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0

sujiongming/UCF-101_video_classification

Classify UCF101 videos using one frame at a time with a CNN(InceptionV3)

Language: Python - Size: 88.9 KB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 194 - Forks: 59

kevin-ssy/Optical-Flow-Guided-Feature

Implementation Code of the paper Optical Flow Guided Feature, CVPR 2018

Language: C++ - Size: 7.33 MB - Last synced: 7 months ago - Pushed: about 5 years ago - Stars: 195 - Forks: 43

hooman007/ProtoASNet

Official repository for the paper "ProtoASNet: Dynamic Prototypes for Inherently Interpretable and Uncertainty-Aware Aortic Stenosis Classification in Echocardiography" in MICCAI 2023 Conference

Language: Python - Size: 71.3 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 1 - Forks: 1

AKASH2907/deepfakes_video_classification

Deepfakes Video classification via CNN, LSTM, C3D and triplets [IWBF'20]

Language: Python - Size: 2.64 MB - Last synced: 8 months ago - Pushed: about 1 year ago - Stars: 53 - Forks: 17

shreyash2610/Convolutional-Long-Short-Term-Memory-based-IOT-node-for-Violence-Detection

Abstract: Violence detection has been investigated extensively in the literature. Recently, IoT-based violence video surveillance has become an intelligent component integrated into the security systems of smart buildings. A violence video detector is a specific kind of detection model that should be highly accurate, to increase the model's sensitivity and reduce the false alarm rate. This paper proposes a novel ConvLSTM architecture that can run on a low-cost Internet of Things (IoT) device such as a Raspberry Pi board. It uses convolutional neural networks (CNNs) to learn spatial features from video frames, which are then fed to a Long Short-Term Memory (LSTM) network to classify videos as violent or non-violent. A complex dataset combining two public datasets, RWF-2000 and RLVS-2000, was used for model training and evaluation. The challenging video content includes crowds and chaos, small objects at a far distance, low resolution, and transient actions. Additionally, the videos were captured in various environments such as streets, prisons, and schools, with several human actions such as playing football, basketball, tennis, swimming, and eating. The experimental results show high performance of the proposed violence detection model in terms of average metrics, with an accuracy of 73.35%, recall of 76.90%, precision of 72.53%, F1 score of 74.01%, false negative rate of 23.10%, false positive rate of 30.20%, and AUC of 82.0%.

Language: Jupyter Notebook - Size: 1.42 MB - Last synced: 8 months ago - Pushed: almost 3 years ago - Stars: 4 - Forks: 3

michaelnation26/skateboard_trick_classification

Classifying skateboard tricks from video clips using DeepMind's I3D model and an audio feature extractor.

Language: Python - Size: 5.34 MB - Last synced: 8 months ago - Pushed: almost 5 years ago - Stars: 8 - Forks: 1

sagarvegad/Video-Classification-CNN-and-LSTM-

To classify video into various classes using keras library with tensorflow as back-end.

Language: Python - Size: 12.7 KB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 264 - Forks: 115

woodfrog/ActionRecognition

Explore Action Recognition

Language: Python - Size: 1.03 MB - Last synced: 7 months ago - Pushed: over 4 years ago - Stars: 193 - Forks: 54

peanutsee/Gender-Classification

Gender Classification Projects. My agenda for this project is to learn how to deploy a model on opencv.

Language: Jupyter Notebook - Size: 4.39 MB - Last synced: 8 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 1

kenshohara/video-classification-3d-cnn-pytorch

Video classification tools using 3D ResNet

Language: Python - Size: 154 KB - Last synced: 8 months ago - Pushed: over 5 years ago - Stars: 1,060 - Forks: 259

russellllaputa/region-based-non-local-network

[Codes of paper]: Region-based Non-local operation for Video Classification

Language: Python - Size: 4.27 MB - Last synced: 8 months ago - Pushed: over 2 years ago - Stars: 19 - Forks: 3

tqvinhcs/C3D-tensorflow

Action recognition with C3D network implemented in tensorflow

Language: Python - Size: 2 MB - Last synced: 8 months ago - Pushed: about 6 years ago - Stars: 33 - Forks: 17

OValery16/Tutorial-about-3D-convolutional-network

Tutorial about 3D convolutional network

Language: Python - Size: 49.1 MB - Last synced: 7 months ago - Pushed: over 5 years ago - Stars: 221 - Forks: 34

AlexanderMelde/SPHAR-Dataset

Surveillance Perspective Human Action Recognition Dataset: 7759 Videos from 14 Action Classes, aggregated from multiple sources, all cropped spatio-temporally and filmed from a surveillance-camera like position.

Language: Python - Size: 5.47 GB - Last synced: 8 months ago - Pushed: over 3 years ago - Stars: 49 - Forks: 11

MohsenFayyaz89/T3D

Temporal 3D ConvNet

Language: Python - Size: 8.79 KB - Last synced: 8 months ago - Pushed: about 6 years ago - Stars: 106 - Forks: 34

maximus009/VisLangResearch

Experiments for my Research and Master's Thesis

Language: Jupyter Notebook - Size: 2.71 MB - Last synced: 9 months ago - Pushed: about 6 years ago - Stars: 5 - Forks: 0

miladpayandehh/Data-Prediction-and-Text-or-video-Classification-using-LSTM

Predictive analytics is a branch of advanced analytics that makes predictions about future outcomes using historical data combined with statistical modeling, data mining techniques, machine learning, and deep learning. Researchers employ predictive analytics to find patterns in this data to identify risks and opportunities.

Language: MATLAB - Size: 11.7 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

rishikonapure/Advertisement-Recommendation

Video Content-Based Advertisement Recommendation Using Text Classification

Language: Jupyter Notebook - Size: 3.83 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 6 - Forks: 9

vaishnavipatil29/Sign-Language-Recognition

Sign Language Recognition involves hand gesture detection techniques using machine and deep learning techniques.

Language: Jupyter Notebook - Size: 4.24 MB - Last synced: 10 months ago - Pushed: about 3 years ago - Stars: 24 - Forks: 8

Dev-R/multi-modal-classification

An all-in-one Python script for real-time audio and video processing with deep learning models. Simultaneously handling live video and audio streams, it accomplishes action recognition, object detection, and audio classification. Additionally, it seamlessly integrates Twilio for notifications and utilizes Azure for efficient data management.

Language: Python - Size: 37.5 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

karolzak/conv3d-video-action-recognition

My experimentation around action recognition in videos. Contains Keras implementation for C3D network based on original paper "Learning Spatiotemporal Features with 3D Convolutional Networks", Tran et al. and it includes video processing pipelines coded using mPyPl package. Model is being benchmarked on popular UCF101 dataset and achieves results similar to those reported by authors

Language: Python - Size: 1.79 MB - Last synced: about 2 months ago - Pushed: almost 2 years ago - Stars: 51 - Forks: 10

prakashjayy/C3D 📦

Implementation of https://arxiv.org/abs/1412.0767 using pytorch and keras.

Language: Python - Size: 3.23 MB - Last synced: 10 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0

okyksl/action-classification

Action classification using small UCF11 dataset

Language: Jupyter Notebook - Size: 75.9 MB - Last synced: 10 months ago - Pushed: almost 5 years ago - Stars: 3 - Forks: 1

Ha0Tang/HandGestureRecognition

[Neurocomputing 2019] Fast and Robust Dynamic Hand Gesture Recognition via Key Frames Extraction and Feature Fusion

Language: C++ - Size: 13.4 MB - Last synced: 10 months ago - Pushed: almost 3 years ago - Stars: 87 - Forks: 25

pomonam/LearnablePoolingMethods

TensorFlow Implementation of "Learnable Pooling Methods for Video Classification".

Language: Python - Size: 946 KB - Last synced: 10 months ago - Pushed: over 5 years ago - Stars: 38 - Forks: 2

yasharmaster/motion-influence-map-in-mapreduce

Implementation of Motion Influence Map Technique for Video Classification in Apache Spark's Map Reduce Framework.

Language: Python - Size: 4.2 MB - Last synced: 10 months ago - Pushed: about 7 years ago - Stars: 0 - Forks: 0

vijay4313/youtube-8m

The 2nd YouTube-8M Video Understanding Challenge

Language: Python - Size: 8.19 MB - Last synced: 10 months ago - Pushed: almost 6 years ago - Stars: 0 - Forks: 0

comp-imaging-sci/MVG-CNN

Code related to the paper "Automated sleep stage classification of wide-field calcium imaging data via multiplex visibility graphs and deep learning"

Language: Python - Size: 19.4 MB - Last synced: 10 months ago - Pushed: over 2 years ago - Stars: 5 - Forks: 1

wjun0830/MOVE

Official PyTorch Repository of "Minority-Oriented Vicinity Expansion with Attentive Aggregation for Video Long-Tailed Recognition" (AAAI 2023 Oral Paper) and Imbalanced-MiniKinetics200 dataset.

Language: Python - Size: 11.4 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 5 - Forks: 2

temur-kh/video-classification-cv

Video classification on UCF50 dataset

Language: Python - Size: 2.28 MB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 7 - Forks: 3

SiyuanYan1/Stress-Recognition-in-Thermal-Videos-using-BDLRCN

This is the official implementation of the paper "Stress Recognition in Thermal Videos using Bi-Directional Long-Term Recurrent Convolutional Neural Network"

Language: Python - Size: 298 KB - Last synced: 11 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

wanglimin/ARTNet

Appearance-and-Relation Networks

Language: Python - Size: 1.16 MB - Last synced: 6 months ago - Pushed: over 5 years ago - Stars: 204 - Forks: 58

akon1te/UrFunny-humor-detection

3rd grade HSE course paper about detecting humor in short videos using ML/DL models

Language: Jupyter Notebook - Size: 23.4 MB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 0 - Forks: 0

pilarcode/action-recognition-in-videos

Action Recognition (dance styles) in videos.

Language: Python - Size: 5.58 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

shraddhavijay/IFAKE

IFAKE is an application for detecting image and video forgery, designed to help users verify the authenticity of digital media. This repository also contains the AI model and dataset that we developed for image tampering detection, providing an effective solution for detecting image and video manipulations.

Language: Jupyter Notebook - Size: 4.28 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 6 - Forks: 0

NANDINI-star/Real-life-violence-detection

Language: Jupyter Notebook - Size: 1.98 MB - Last synced: 12 months ago - Pushed: over 2 years ago - Stars: 18 - Forks: 10

ascuet/CricShot10

CricShot10 is a video action recognition dataset consisting of 10 cricket batting shots. This dataset was developed using the videos from YouTube.

Size: 6.02 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 54 - Forks: 1

ascuet/SoccerAct10

SoccerAct10 is a dataset which contains 10 different soccer actions. This dataset was developed using the videos from YouTube.

Size: 30 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 55 - Forks: 0

pomonam/AttentionCluster

TensorFlow Implementation of "Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification".

Language: Python - Size: 259 KB - Last synced: 10 months ago - Pushed: over 5 years ago - Stars: 39 - Forks: 8

abhiram-ds/Gesture_Recognition_CNN

Gesture recognition using Convolutional Neural Networks

Language: Jupyter Notebook - Size: 258 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

JohnPPinto/UCF50_human_activity_recognition_tensorflow

A project on video classification using Tensorflow with UCF50 dataset.

Language: Jupyter Notebook - Size: 74.7 MB - Last synced: about 1 month ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

xingyul/cpnet

Learning Video Representations from Correspondence Proposals (CVPR 2019 Oral)

Language: Python - Size: 610 KB - Last synced: 12 months ago - Pushed: over 4 years ago - Stars: 93 - Forks: 12

fmahoudeau/MiCT-Net-PyTorch

Video Recognition using Mixed Convolutional Tube (MiCT) on PyTorch with a ResNet backbone

Language: Python - Size: 818 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 50 - Forks: 11

elliottloveridge/compressed-video-classification

Quantization, Element-Wise Pruning and Knowledge Distillation applied to 3D CNNs for Video Classification.

Language: Python - Size: 17.6 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 2 - Forks: 0

kahnchana/svt

Official repository for "Self-Supervised Video Transformer" (CVPR'22)

Language: Python - Size: 767 KB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 82 - Forks: 12

yakhyo/video-classification-pytorch

Video Classification using R(2+1)D based on ResNet18 on UCF-101 dataset. PyTorch Implementation.

Language: Python - Size: 37.1 KB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 0

bryant1410/fitclip

Code for the FitCLIP method

Language: Python - Size: 111 KB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 5 - Forks: 0

junyongyou/Attention-boosted-deep-networks-for-video-classification

This is an implementation that integrates a simple but efficient attention block into a CNN + bidirectional LSTM for video classification.

Language: Python - Size: 195 KB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 16 - Forks: 7

davide-coccomini/TimeSformer-Video-Classification

The notebook explains the steps needed to obtain the results of the publication "Is Space-Time Attention All You Need for Video Understanding?"

Language: Jupyter Notebook - Size: 237 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 31 - Forks: 4

FrederikSchorr/sign-language

Sign Language Recognition for Deaf People

Language: Python - Size: 4.61 MB - Last synced: over 1 year ago - Pushed: over 5 years ago - Stars: 88 - Forks: 33

ozgurkara99/video-dataset-preprocessing-meta-learning

The Something-Something-v2 video dataset is split into 3 meta-sets: meta-training, meta-validation, and meta-test. Overall, the dataset includes 100 classes, divided according to CMU [1]. The code also provides a dataloader that creates episodes for a given n-way k-shot learning task. Videos are converted to frames under the sparse-sampling protocol described in TSN [2].

Language: Python - Size: 92.8 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 4 - Forks: 0
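
The episode construction and sparse frame sampling mentioned in the entry above can be illustrated with a minimal Python sketch. It is not taken from that repository: the function names `sparse_sample_indices` and `sample_episode`, the `videos_by_class` mapping, and the `n_query` parameter are all hypothetical, and the sampling follows only the general idea of one frame per equal-length segment plus n classes with k support videos each.

```python
import random


def sparse_sample_indices(num_frames, num_segments, rng=None):
    """Pick one frame index per equal-length segment (TSN-style sparse sampling).

    With rng=None the middle frame of each segment is returned (deterministic);
    otherwise a random frame inside each segment is drawn.
    """
    if num_frames < 1 or num_segments < 1:
        raise ValueError("num_frames and num_segments must be positive")
    indices = []
    for k in range(num_segments):
        start = k * num_frames // num_segments
        # Exclusive end; forced to cover at least one frame for short videos.
        end = max(start + 1, (k + 1) * num_frames // num_segments)
        if rng is None:
            indices.append((start + end - 1) // 2)
        else:
            indices.append(rng.randrange(start, end))
    return indices


def sample_episode(videos_by_class, n_way, k_shot, n_query, rng):
    """Build one n-way k-shot episode from a {class_name: [video_id, ...]} mapping.

    Returns (support, query) lists of (video_id, episode_label) pairs, where
    episode_label runs from 0 to n_way - 1 within the episode.
    """
    classes = rng.sample(sorted(videos_by_class), n_way)
    support, query = [], []
    for label, cls in enumerate(classes):
        # Draw support and query videos together so they never overlap.
        picks = rng.sample(videos_by_class[cls], k_shot + n_query)
        support += [(video, label) for video in picks[:k_shot]]
        query += [(video, label) for video in picks[k_shot:]]
    return support, query


# Hypothetical toy data: 5 classes with 6 video ids each.
toy_videos = {f"class{i}": [f"class{i}_video{j}" for j in range(6)] for i in range(5)}
support_set, query_set = sample_episode(toy_videos, n_way=3, k_shot=2, n_query=1,
                                        rng=random.Random(0))
center_frames = sparse_sample_indices(10, 5)  # [0, 2, 4, 6, 8]
```

Passing a seeded `random.Random` keeps episodes reproducible, which is convenient when the same meta-validation episodes need to be revisited across runs.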

Related Keywords
video-classification (151), deep-learning (52), action-recognition (35), pytorch (35), tensorflow (19), computer-vision (18), cnn (14), machine-learning (14), python (13), keras (13), lstm (12), video (12), ucf101 (8), image-classification (8), video-dataset (8), convolutional-neural-networks (8), transformers (8), video-processing (7), video-understanding (7), python3 (6), artificial-intelligence (6), 3d-convolutional-network (5), video-recognition (5), action-classification (5), lstm-neural-networks (5), opencv (5), attention-mechanism (5), video-classification-pytorch (4), transfer-learning (4), deep-neural-networks (4), 3d-cnn (4), image-processing (4), slowfast (4), i3d (4), human-action-recognition (4), violence-detection (4), text-classification (4), torch (3), semantic-segmentation (3), multi-modal (3), neural-network (3), rnn (3), fine-grained-classification (3), self-supervised-learning (3), sports-1m (3), ucf-101 (3), dataset (3), video-classification-models (3), hmdb51 (3), human-activity-recognition (3), kinetics (3), c3d (2), activity-recognition (2), kaggle (2), pretrained-models (2), video-prediction (2), tutorial (2), cctv (2), non-local (2), cctv-detection (2), keras-tensorflow (2), yolov8 (2), videomae (2), object-detection (2), surveillance-systems (2), kaggle-competition (2), audio-classification (2), django (2), hand-gesture-recognition (2), cnn-classification (2), action-recognition-dataset (2), pytorch-lightning (2), youtube-8m (2), interpretable-deep-learning (2), surveillance (2), vision-transformer (2), human-computer-interaction (2), gesture-recognition (2), few-shot-learning (2), cnn-model (2), temporal-modeling (2), video-action-recognition (2), long-tailed-recognition (2), attention (2), event-camera (2), ucf-50 (2), computer-vision-tools (2), vision (2), domain-adaptation (2), wide-field-optical-imaging (2), cvpr2019 (2), iccv2019 (2), spatialtemporal (2), sleep-stage-classification (2), sleep-scoring (2), neuroimaging (2), mouse-brain (2), pytorch-video (2), classification (2), huggingface (2)