GitHub topics: visual-features
naver/kapture
kapture is a file format as well as a set of tools for manipulating datasets, and in particular Visual Localization and Structure from Motion data.
Language: Python - Size: 54 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 510 - Forks: 68

cuixing158/Visual-Based-Odometry-Estimation1-cpp
Stitching and fusion of on-board surround view BEV real world image sequences, odometer estimation and output of large pixel map
Language: C++ - Size: 5.99 MB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 3 - Forks: 3

multimodal/multimodal
A collection of multimodal datasets, and visual features for VQA and captionning in pytorch. Just run "pip install multimodal"
Language: Python - Size: 2.21 MB - Last synced at: 14 days ago - Pushed at: over 3 years ago - Stars: 82 - Forks: 7

cuixing158/Visual-Based-Odometry-Estimation-cpp
Stitching and fusion of on-board surround view BEV real world image sequences, odometer estimation and output of large pixel map
Language: C++ - Size: 989 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 1

v-iashin/video_features
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
Language: Python - Size: 282 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 497 - Forks: 94

ritika-0111/Amazon-Apparel-Recommendations
Recommends Apparel based on Text, Visual features, and weighted similarity using brand and color similarity.
Language: Jupyter Notebook - Size: 6.79 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

gcunhase/Emotional-Video-to-Audio-with-ANFIS-DeepRNN
Emotional Video to Audio Transformation with ANFIS-DeepRNN (Vanilla RNN and LSTM-DeepRNN) [MPE 2020]
Language: MATLAB - Size: 2.57 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 25 - Forks: 6

michelecafagna26/vinvl-visualbackbone
Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.
Language: Python - Size: 18.3 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 1
