An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: vision-models

busradeveci/Gemini-1.5-Vision-Tryout

AI-powered image & text demo using Gemini 1.5 Flash

Language: Jupyter Notebook - Size: 775 KB - Last synced at: about 9 hours ago - Pushed at: about 10 hours ago - Stars: 0 - Forks: 0

LMMMEng/OverLoCK

[CVPR 2025 Oral] OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels

Language: Python - Size: 1.87 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 188 - Forks: 20

AstraZeneca/PerfCam

PoC Code for PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models

Language: Jupyter Notebook - Size: 235 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 3 - Forks: 2

D2I-Group/awesome-vision-time-series

This is an official repository for "Harnessing Vision Models for Time Series Analysis: A Survey".

Language: Python - Size: 3.71 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 25 - Forks: 1

MDGrey33/pyvisionai

The PyVisionAI Official Repo

Language: Python - Size: 9.93 MB - Last synced at: 1 day ago - Pushed at: 2 months ago - Stars: 102 - Forks: 11

The-Swarm-Corporation/DART

DART (Diffusion-Autoregressive Recursive Transformer) is a novel hybrid architecture that combines diffusion-based and autoregressive approaches for text generation.

Language: Python - Size: 46.9 KB - Last synced at: 19 days ago - Pushed at: 21 days ago - Stars: 2 - Forks: 0

The-Swarm-Corporation/swarm-models

A simple-to-use package for calling various model providers such as OpenAI, Anthropic, and others with utmost reliability, security, and performance.

Language: Python - Size: 2.89 MB - Last synced at: 20 days ago - Pushed at: about 1 month ago - Stars: 13 - Forks: 9

Pavansomisetty21/Image-Caption-Generation-using-LLMs-GEMINI-

Generates captions for user-supplied images using prompt engineering and generative AI.

Language: Jupyter Notebook - Size: 366 KB - Last synced at: 19 days ago - Pushed at: 9 months ago - Stars: 9 - Forks: 1

afondiel/awesome-smol

An awesome list of "small but mighty" models and resources.

Size: 142 KB - Last synced at: 15 days ago - Pushed at: 5 months ago - Stars: 1 - Forks: 2

afondiel/computer-vision-challenge

A hands-on collection of computer vision projects for everyone.

Language: Jupyter Notebook - Size: 151 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 40 - Forks: 8

AstraZeneca/PerfCam-Dataset

Dataset for PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models

Language: Python - Size: 37.2 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

afondiel/how-diffusion-models-work-crash-course-DLAI

Diffusion models crash course with PyTorch from DeepLearning.AI

Language: Jupyter Notebook - Size: 6.93 MB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

ksm26/Prompt-Engineering-for-Vision-Models

Enhance your skills in prompt engineering for vision models. Learn to effectively prompt, fine-tune, and track experiments for models like SAM, OWL-ViT, and Stable Diffusion 2.0 to achieve precise image generation, segmentation, and object detection.

Language: Jupyter Notebook - Size: 22.3 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 2

shivendrra/ava

Building AVA from Ex Machina: a lightweight multi-modal system from scratch, just for learning and experimentation

Language: Jupyter Notebook - Size: 11.8 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

regokan/deep-vision-lab

A comprehensive repository for research, code, and insights on convolutional neural networks and deep vision models

Language: Jupyter Notebook - Size: 33.1 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

afondiel/Prompt-Engineering-for-Vision-Models-DeepLearningAI

These notes and resources are compiled from the crash course Prompt Engineering for Vision Models offered by DeepLearning.AI.

Language: Jupyter Notebook - Size: 103 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

kyegomez/VisionLLaMA

Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta

Language: Python - Size: 2.19 MB - Last synced at: 12 days ago - Pushed at: 6 months ago - Stars: 16 - Forks: 0

kyegomez/Midas

Implementation of Midas from "Towards Robust Monocular Depth Estimation" in PyTorch and Zeta

Language: Shell - Size: 2.16 MB - Last synced at: 12 days ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 0

ArashAkbarinia/DeepTHS

A framework to compute threshold sensitivity of deep networks to visual stimuli.

Language: Python - Size: 446 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

duynamrcv/vision_flocking

Vision-based swarms in the Presence of Occlusions

Language: Python - Size: 25.2 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 1

antonio-f/Moondream

Testing the Moondream tiny vision model

Language: Jupyter Notebook - Size: 19.5 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

Amr-Abdellatif/Fine-Tuninng-Pre-Trained-Vision-models-PyTorch

In this repo I fine-tuned a pretrained ResNet18 model from the PyTorch library

Language: Jupyter Notebook - Size: 61 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

major196512/vistem

General Vision Model Training Template

Language: Python - Size: 502 KB - Last synced at: 11 days ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0

Related Keywords
vision-models (23), computer-vision (7), ai (4), vision (4), generative-ai (3), python (3), openai (3), fine-tuning (3), deep-learning (3), pytorch (3), llms (3), diffusion-models (3), cnn (3), artificial-intelligence (3), object-detection (3), machine-learning (2), ml (2), vit (2), image-descriptions (2), vlm (2), agents (2), vision-language-model (2), vision-transformers (2), multi-modal (2), image-captioning (2), prompt-engineering (2), vision-transformer (2), large-vision-models (2), yolo (2), gaussian-splatting (2), digital-twin (2), 3d-reconstruction (2), image-processing (2), image-generation (2), text-generation (2), in-painting (1), owl-vit (1), sam (1), stable-diffusion (1), visual-workflows (1), cognitive-neuroscience (1), image-segmentation (1), hyperparameter-tuning (1), dreambooth (1), comet-library (1), unconditional-generation (1), latent-space (1), latent-diffusion (1), genai (1), conditional-generation (1), conditional-diffusion (1), lvm (1), image-detection (1), image-classification (1), cv-challenge (1), computer-vision-tools (1), deep-neural-networks (1), explainable-ai (1), human-machine-behavior (1), linear-classifier (1), linear-probing (1), sensitivity-analysis (1), behavior-control (1), python3 (1), swarm-robotics (1), hands-on (1), huggingface-transformers (1), language-models (1), running-locally (1), tiny-models (1), tutorial (1), pretrained-models (1), audio-classification (1), audio-engine (1), audio-transformers (1), large-language-models (1), llm (1), swin-transformer (1), transformer (1), vision-engine (1), convnets (1), generative-models (1), large-vision-language-models (1), meta-sam (1), video-processing (1), vision-model-prompting (1), visual-prompting (1), parallel (1), tensorflow (1), chess (1), anthropic (1), attention (1), autogressive (1), diffusion (1), dit (1), gpts (1), midjourney (1), research (1), torch (1), transformers (1)