GitHub topics: vision-models
LMMMEng/OverLoCK
[CVPR 2025 Oral] OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
Language: Python - Size: 1.8 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 120 - Forks: 11

afondiel/awesome-smol
An awesome list of "small but mighty" models and resources.
Size: 142 KB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 2

afondiel/computer-vision-challenge
A hands-on collection of computer vision projects for everyone.
Language: Jupyter Notebook - Size: 151 MB - Last synced at: 19 days ago - Pushed at: 6 months ago - Stars: 40 - Forks: 8

MDGrey33/pyvisionai
The PyVisionAI Official Repo
Language: Python - Size: 9.93 MB - Last synced at: 20 days ago - Pushed at: about 2 months ago - Stars: 98 - Forks: 10

D2I-Group/awesome-vision-time-series
This is an official repository for "Harnessing Vision Models for Time Series Analysis: A Survey".
Language: Python - Size: 3.76 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 17 - Forks: 1

Pavansomisetty21/Image-Caption-Generation-using-LLMs-GEMINI-
we generate captions to the images which are given by user(user input) using prompt engineering and Generative AI
Language: Jupyter Notebook - Size: 366 KB - Last synced at: 27 days ago - Pushed at: 8 months ago - Stars: 7 - Forks: 1

AstraZeneca/PerfCam
PoC Code for PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models
Language: Jupyter Notebook - Size: 189 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1

AstraZeneca/PerfCam-Dataset
Dataset for PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models
Language: Python - Size: 37.2 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1

afondiel/how-diffusion-models-work-crash-course-DLAI
Diffusion Models crash course with Pytorch from DeepLearningAI
Language: Jupyter Notebook - Size: 6.93 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

ksm26/Prompt-Engineering-for-Vision-Models
Enhance your skills in prompt engineering for vision models. Learn to effectively prompt, fine-tune, and track experiments for models like SAM, OWL-ViT, and Stable Diffusion 2.0 to achieve precise image generation, segmentation, and object detection.
Language: Jupyter Notebook - Size: 22.3 MB - Last synced at: 29 days ago - Pushed at: 12 months ago - Stars: 6 - Forks: 2

shivendrra/ava
building AVA from ex-machina; a lightweight multi-modal system from scratch, just for learning & experimentation
Language: Jupyter Notebook - Size: 11.8 MB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

regokan/deep-vision-lab
A comprehensive repository for research, code, and insights on convolutional neural networks and deep vision models
Language: Jupyter Notebook - Size: 33.1 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

afondiel/Prompt-Engineering-for-Vision-Models-DeepLearningAI
These notes and resources are compiled from the crash course Prompt Engineering for Vision Models offered by DeepLearning.AI.
Language: Jupyter Notebook - Size: 103 MB - Last synced at: 13 days ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

The-Swarm-Corporation/swarm-models
A simple to use package to call various model providers such as openai, anthropic, and others with utmost reliability, security, and performance.
Language: Python - Size: 2.53 MB - Last synced at: 6 months ago - Pushed at: 7 months ago - Stars: 4 - Forks: 1

kyegomez/VisionLLaMA
Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta
Language: Python - Size: 2.19 MB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 16 - Forks: 0

kyegomez/Midas
Implementation of Midas from [Towards Robust Monocular Depth Estimation] in Pytorch and Zeta
Language: Shell - Size: 2.16 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 0

ArashAkbarinia/DeepTHS
A framework to compute threshold sensitivity of deep networks to visual stimuli.
Language: Python - Size: 446 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

duynamrcv/vision_flocking
Vision-based swarms in the Presence of Occlusions
Language: Python - Size: 25.2 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

antonio-f/Moondream
Testing the Moondream tiny vision model
Language: Jupyter Notebook - Size: 19.5 KB - Last synced at: 27 days ago - Pushed at: 12 months ago - Stars: 0 - Forks: 1

Amr-Abdellatif/Fine-Tuninng-Pre-Trained-Vision-models-PyTorch
In This repo i FineTuned a Pretrained ResNet18 model from PyTorch library
Language: Jupyter Notebook - Size: 61 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

major196512/vistem
General Vision Model Training Template
Language: Python - Size: 502 KB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0
