An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: vision-models

LMMMEng/OverLoCK

[CVPR 2025 Oral] OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels

Language: Python - Size: 1.8 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 120 - Forks: 11

afondiel/awesome-smol

An awesome list of "small but mighty" models and resources.

Size: 142 KB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 2

afondiel/computer-vision-challenge

A hands-on collection of computer vision projects for everyone.

Language: Jupyter Notebook - Size: 151 MB - Last synced at: 19 days ago - Pushed at: 6 months ago - Stars: 40 - Forks: 8

MDGrey33/pyvisionai

The PyVisionAI Official Repo

Language: Python - Size: 9.93 MB - Last synced at: 20 days ago - Pushed at: about 2 months ago - Stars: 98 - Forks: 10

D2I-Group/awesome-vision-time-series

This is an official repository for "Harnessing Vision Models for Time Series Analysis: A Survey".

Language: Python - Size: 3.76 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 17 - Forks: 1

Pavansomisetty21/Image-Caption-Generation-using-LLMs-GEMINI-

we generate captions to the images which are given by user(user input) using prompt engineering and Generative AI

Language: Jupyter Notebook - Size: 366 KB - Last synced at: 27 days ago - Pushed at: 8 months ago - Stars: 7 - Forks: 1

AstraZeneca/PerfCam

PoC Code for PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models

Language: Jupyter Notebook - Size: 189 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1

AstraZeneca/PerfCam-Dataset

Dataset for PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models

Language: Python - Size: 37.2 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1

afondiel/how-diffusion-models-work-crash-course-DLAI

Diffusion Models crash course with Pytorch from DeepLearningAI

Language: Jupyter Notebook - Size: 6.93 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

ksm26/Prompt-Engineering-for-Vision-Models

Enhance your skills in prompt engineering for vision models. Learn to effectively prompt, fine-tune, and track experiments for models like SAM, OWL-ViT, and Stable Diffusion 2.0 to achieve precise image generation, segmentation, and object detection.

Language: Jupyter Notebook - Size: 22.3 MB - Last synced at: 29 days ago - Pushed at: 12 months ago - Stars: 6 - Forks: 2

shivendrra/ava

building AVA from ex-machina; a lightweight multi-modal system from scratch, just for learning & experimentation

Language: Jupyter Notebook - Size: 11.8 MB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

regokan/deep-vision-lab

A comprehensive repository for research, code, and insights on convolutional neural networks and deep vision models

Language: Jupyter Notebook - Size: 33.1 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

afondiel/Prompt-Engineering-for-Vision-Models-DeepLearningAI

These notes and resources are compiled from the crash course Prompt Engineering for Vision Models offered by DeepLearning.AI.

Language: Jupyter Notebook - Size: 103 MB - Last synced at: 13 days ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

The-Swarm-Corporation/swarm-models

A simple to use package to call various model providers such as openai, anthropic, and others with utmost reliability, security, and performance.

Language: Python - Size: 2.53 MB - Last synced at: 6 months ago - Pushed at: 7 months ago - Stars: 4 - Forks: 1

kyegomez/VisionLLaMA

Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta

Language: Python - Size: 2.19 MB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 16 - Forks: 0

kyegomez/Midas

Implementation of Midas from [Towards Robust Monocular Depth Estimation] in Pytorch and Zeta

Language: Shell - Size: 2.16 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 0

ArashAkbarinia/DeepTHS

A framework to compute threshold sensitivity of deep networks to visual stimuli.

Language: Python - Size: 446 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

duynamrcv/vision_flocking

Vision-based swarms in the Presence of Occlusions

Language: Python - Size: 25.2 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

antonio-f/Moondream

Testing the Moondream tiny vision model

Language: Jupyter Notebook - Size: 19.5 KB - Last synced at: 27 days ago - Pushed at: 12 months ago - Stars: 0 - Forks: 1

Amr-Abdellatif/Fine-Tuninng-Pre-Trained-Vision-models-PyTorch

In This repo i FineTuned a Pretrained ResNet18 model from PyTorch library

Language: Jupyter Notebook - Size: 61 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

major196512/vistem

General Vision Model Training Template

Language: Python - Size: 502 KB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0

Related Keywords
vision-models 21 computer-vision 7 vision 4 ai 4 fine-tuning 3 cnn 3 pytorch 3 diffusion-models 3 artificial-intelligence 3 deep-learning 3 generative-ai 3 object-detection 3 image-generation 2 ml 2 multi-modal 2 vision-transformers 2 image-captioning 2 machine-learning 2 image-processing 2 vision-transformer 2 vlm 2 vit 2 openai 2 python 2 large-vision-models 2 yolo 2 gaussian-splatting 2 digital-twin 2 3d-reconstruction 2 llms 2 vision-language-model 2 prompt-engineering 2 image-descriptions 2 image-segmentation 1 convnets 1 vision-engine 1 hyperparameter-tuning 1 in-painting 1 dreambooth 1 transformer 1 lvm 1 swin-transformer 1 llm 1 large-language-models 1 audio-transformers 1 audio-engine 1 audio-classification 1 visual-workflows 1 stable-diffusion 1 owl-vit 1 sam 1 pretrained-models 1 tutorial 1 tiny-models 1 running-locally 1 language-models 1 huggingface-transformers 1 hands-on 1 swarm-robotics 1 python3 1 behavior-control 1 sensitivity-analysis 1 linear-probing 1 linear-classifier 1 human-machine-behavior 1 explainable-ai 1 deep-neural-networks 1 cognitive-neuroscience 1 tensorflow 1 parallel 1 usage 1 tool 1 swarms 1 production-ready 1 library 1 enterprise-grade 1 agents 1 visual-prompting 1 vision-model-prompting 1 video-processing 1 meta-sam 1 large-vision-language-models 1 generative-models 1 image-detection 1 image-classification 1 cv-challenge 1 computer-vision-tools 1 computer-vision-python 1 computer-vision-projects 1 computer-vision-opencv 1 computer-vision-hello-world 1 computer-vision-datasets 1 computer-vision-challenge 1 computer-vision-algorithms 1 smol-models 1 smls 1 on-device-ai 1 multimodality 1 lightweight-models 1 embedded-ai 1