GitHub topics: vision-models
busradeveci/Gemini-1.5-Vision-Tryout
AI-powered image & text demo using Gemini 1.5 Flash
Language: Jupyter Notebook - Size: 775 KB - Last synced at: about 9 hours ago - Pushed at: about 10 hours ago - Stars: 0 - Forks: 0

LMMMEng/OverLoCK
[CVPR 2025 Oral] OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
Language: Python - Size: 1.87 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 188 - Forks: 20

AstraZeneca/PerfCam
PoC Code for PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models
Language: Jupyter Notebook - Size: 235 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 3 - Forks: 2

D2I-Group/awesome-vision-time-series
This is an official repository for "Harnessing Vision Models for Time Series Analysis: A Survey".
Language: Python - Size: 3.71 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 25 - Forks: 1

MDGrey33/pyvisionai
The PyVisionAI Official Repo
Language: Python - Size: 9.93 MB - Last synced at: 1 day ago - Pushed at: 2 months ago - Stars: 102 - Forks: 11

The-Swarm-Corporation/DART
DART (Diffusion-Autoregressive Recursive Transformer) is a novel hybrid architecture that combines diffusion-based and autoregressive approaches for text generation.
Language: Python - Size: 46.9 KB - Last synced at: 19 days ago - Pushed at: 21 days ago - Stars: 2 - Forks: 0

The-Swarm-Corporation/swarm-models
A simple to use package to call various model providers such as openai, anthropic, and others with utmost reliability, security, and performance.
Language: Python - Size: 2.89 MB - Last synced at: 20 days ago - Pushed at: about 1 month ago - Stars: 13 - Forks: 9

Pavansomisetty21/Image-Caption-Generation-using-LLMs-GEMINI-
we generate captions to the images which are given by user(user input) using prompt engineering and Generative AI
Language: Jupyter Notebook - Size: 366 KB - Last synced at: 19 days ago - Pushed at: 9 months ago - Stars: 9 - Forks: 1

afondiel/awesome-smol
An awesome list of "small but mighty" models and resources.
Size: 142 KB - Last synced at: 15 days ago - Pushed at: 5 months ago - Stars: 1 - Forks: 2

afondiel/computer-vision-challenge
A hands-on collection of computer vision projects for everyone.
Language: Jupyter Notebook - Size: 151 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 40 - Forks: 8

AstraZeneca/PerfCam-Dataset
Dataset for PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models
Language: Python - Size: 37.2 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

afondiel/how-diffusion-models-work-crash-course-DLAI
Diffusion Models crash course with Pytorch from DeepLearningAI
Language: Jupyter Notebook - Size: 6.93 MB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

ksm26/Prompt-Engineering-for-Vision-Models
Enhance your skills in prompt engineering for vision models. Learn to effectively prompt, fine-tune, and track experiments for models like SAM, OWL-ViT, and Stable Diffusion 2.0 to achieve precise image generation, segmentation, and object detection.
Language: Jupyter Notebook - Size: 22.3 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 2

shivendrra/ava
building AVA from ex-machina; a lightweight multi-modal system from scratch, just for learning & experimentation
Language: Jupyter Notebook - Size: 11.8 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

regokan/deep-vision-lab
A comprehensive repository for research, code, and insights on convolutional neural networks and deep vision models
Language: Jupyter Notebook - Size: 33.1 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

afondiel/Prompt-Engineering-for-Vision-Models-DeepLearningAI
These notes and resources are compiled from the crash course Prompt Engineering for Vision Models offered by DeepLearning.AI.
Language: Jupyter Notebook - Size: 103 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

kyegomez/VisionLLaMA
Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta
Language: Python - Size: 2.19 MB - Last synced at: 12 days ago - Pushed at: 6 months ago - Stars: 16 - Forks: 0

kyegomez/Midas
Implementation of Midas from [Towards Robust Monocular Depth Estimation] in Pytorch and Zeta
Language: Shell - Size: 2.16 MB - Last synced at: 12 days ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 0

ArashAkbarinia/DeepTHS
A framework to compute threshold sensitivity of deep networks to visual stimuli.
Language: Python - Size: 446 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

duynamrcv/vision_flocking
Vision-based swarms in the Presence of Occlusions
Language: Python - Size: 25.2 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 1

antonio-f/Moondream
Testing the Moondream tiny vision model
Language: Jupyter Notebook - Size: 19.5 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

Amr-Abdellatif/Fine-Tuninng-Pre-Trained-Vision-models-PyTorch
In This repo i FineTuned a Pretrained ResNet18 model from PyTorch library
Language: Jupyter Notebook - Size: 61 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

major196512/vistem
General Vision Model Training Template
Language: Python - Size: 502 KB - Last synced at: 11 days ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0
