An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: image-descriptions

baaivision/DenseFusion

DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception

Language: Python - Size: 18.1 MB - Last synced at: 21 days ago - Pushed at: 5 months ago - Stars: 137 - Forks: 1

google/imageinwords

Data release for the ImageInWords (IIW) paper.

Language: JavaScript - Size: 21.4 MB - Last synced at: 7 days ago - Pushed at: 5 months ago - Stars: 209 - Forks: 9

DevExpress-Examples/office-file-api-ai-implementation

Integrate AI capabilities into a DevExpress-powered Office File API Web API application.

Language: C# - Size: 35.2 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 1

alterism/mastodon-alt-text

Experimenting with mastodon.social client alt-text usage dataset.

Language: HTML - Size: 9.85 MB - Last synced at: 20 days ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0

Pavansomisetty21/Image-Caption-Generation-using-LLMs-GEMINI-

Generates captions for user-supplied images using prompt engineering and generative AI.

Language: Jupyter Notebook - Size: 366 KB - Last synced at: 27 days ago - Pushed at: 8 months ago - Stars: 7 - Forks: 1

dhruvik-patel/image-description

This repo contains our machine learning project, Image Description, which generates a description of an image based on the activities and objects detected in it.

Language: CSS - Size: 130 MB - Last synced at: 25 days ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 1

antonio-f/Moondream

Testing the Moondream tiny vision model

Language: Jupyter Notebook - Size: 19.5 KB - Last synced at: 27 days ago - Pushed at: 12 months ago - Stars: 0 - Forks: 1

TatjanaChernenko/image_description_generation

Natural language generation from structured inputs. Focuses on generating natural language descriptions for images by exploring the relationship between textual descriptions and image attributes. Leveraging an encoder-decoder architecture with LSTM cells, the system transforms normalized vector representations of attributes into a fixed-length vector.

Language: Jupyter Notebook - Size: 182 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

meng1994412/CBIR

Content-Based Image Retrieval System

Language: Python - Size: 14.5 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 23 - Forks: 10

mariliafernandez/hilbert-curves-descriptor

Final undergraduate project in Computer Engineering (UTFPR): an image descriptor based on Hilbert curves

Language: Jupyter Notebook - Size: 9.09 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

aviralchharia/Neural-Image-Captioning

In this project, we use a deep recurrent architecture: a CNN (VGG-16) pretrained on ImageNet extracts a 4096-dimensional image feature vector, and an LSTM generates a caption from these feature vectors.

Language: Jupyter Notebook - Size: 4.5 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

caiocarneloz/lire-oasis

Lucene Image Retrieval (LIRe) code to extract Open Access Series of Imaging Studies (OASIS) features.

Language: Java - Size: 5.53 MB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

ShivaliGoel/Paper-Explanations

Key pointers and exhaustive notes for various machine learning research papers

Size: 3.02 MB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0

Related Keywords
image-descriptions (13), image-captioning (6), computer-vision (3), lstm (3), show-and-tell (2), neural-networks (2), machine-learning (2), image-processing (2), cnn (2), vision-models (2), image-descriptor (2), image (2), artificial-intelligence (2), data-science (2), accessibility (2), database (1), vision-language-models (1), natural-language-generation (1), lstm-neural-networks (1), encoder-decoder (1), vision-transformers (1), tutorial (1), tiny-models (1), running-locally (1), language-models (1), huggingface-transformers (1), hands-on (1), tflite-models (1), tensorflow (1), python (1), visual-perception (1), vlm (1), dataset (1), flask (1), university (1), research-paper-explanation (1), research-paper (1), karpathy (1), deep-visual-semantic-alignments (1), deep-neural-networks (1), deep-learning-tutorial (1), deep-learning-papers (1), lire (1), feature-extraction (1), alzheimers-disease (1), vgg16 (1), mllm (1), multimodal-large-language-models (1), natural-language-processing (1), flickr8k-dataset (1), feature-vectors (1), bleu-score (1), descriptors (1), keypoints-detector (1), information-retrieval (1), mastodon-social (1), mastodon (1), image-description (1), fediverse (1), datascience (1), i2t (1), alttext (1), alt-text (1), aiss-master (1), a11y (1), word-processing (1), web-api (1), spreadsheet-document-api (1), office-file-api (1), devexpress (1), ai (1), image-text (1), t2i (1), image-to-text (1), dataset-generation (1), visual-models (1), detailed-annotations (1), vision-language-model (1), vision (1), openai (1), multi-model-learning (1), detailed-descriptions (1), image-caption-generator (1), evaluation (1), generativemodel (1), generative-ai (1), generate-contents (1), gen-ai (1), gemini-api (1), gemini (1), encoder-decoder-architecture (1), description (1), human-annotation (1), university-project (1)