An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: image-caption

alpertunga-bile/image-caption-comfyui

Using image caption models to extract prompts in ComfyUI

Language: Python - Size: 9.29 MB - Last synced at: about 24 hours ago - Pushed at: 1 day ago - Stars: 8 - Forks: 2

jmisilo/clip-gpt-captioning

CLIPxGPT Captioner is an image captioning model based on OpenAI's CLIP and GPT-2.

Language: Python - Size: 873 KB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 117 - Forks: 32

SemanticMediaWiki/SemanticImageCaption

Generates image caption information from annotations

Language: PHP - Size: 63.5 KB - Last synced at: about 16 hours ago - Pushed at: 20 days ago - Stars: 7 - Forks: 2

Vision-CAIR/VisualGPT

VisualGPT, CVPR 2022 Proceeding, GPT as a decoder for vision-language models

Language: Python - Size: 6.18 MB - Last synced at: 13 days ago - Pushed at: almost 2 years ago - Stars: 328 - Forks: 53

fireicewolf/wd-llm-caption-cli

A Python-based CLI tool for captioning images with WD series, Joy-caption-pre-alpha, Meta Llama 3.2 Vision Instruct, and Qwen2 VL Instruct models.

Language: Python - Size: 1.92 MB - Last synced at: 2 days ago - Pushed at: about 1 month ago - Stars: 34 - Forks: 8

krishnakumarbhat/Visionsarathi

This is an innovative project aimed at enhancing the visual experience for individuals with impairments. Leveraging machine learning and natural language processing, this repository houses the codebase for generating efficient and coherent natural language descriptions of captured images. The project integrates seamlessly with image recognition,

Language: Jupyter Notebook - Size: 10.5 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 2

haoyu-he/ImageCaption

Image captioning project.

Language: Python - Size: 4.2 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 1

jianjieluo/SCD-Net

[CVPR23] A cascaded diffusion captioning model with a novel semantic-conditional diffusion process that upgrades the conventional diffusion model with an additional semantic prior.

Language: Python - Size: 407 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 46 - Forks: 3

sudhakaranjain/marshall

Marshall: Modality-Agnostic Representation learning by SHAred pre-training of muLtiple modaLities

Language: Python - Size: 10.2 MB - Last synced at: 10 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Aldenhovel/ConceptualCaptions-940k

A subset of Google's ConceptualCaptions (3M) dataset that includes 940k samples.

Size: 5.86 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

mostafax/Image-Caption

End-to-end deep learning model that generates image captions

Language: Python - Size: 12.5 MB - Last synced at: 12 months ago - Pushed at: over 6 years ago - Stars: 8 - Forks: 4

bhushan2311/image_caption_generator

An image captioning web application that combines React.js for the front end with Flask and Node.js for the back end, utilizing the MERN stack. Users can upload images and instantly receive automatic captions. Authenticated users have access to extra features such as caption translation and text-to-speech.

Language: JavaScript - Size: 191 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 13 - Forks: 3

Allenpandas/BLIP-ImageCaption

BLIP-ImageCaption

Language: Python - Size: 1.81 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Junjue-Wang/CapFormer

[IGARSS 2022] CapFormer: Pure transformer for remote sensing image caption

Size: 1.25 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 0

purveshpatel511/imageCaptioning

Pre-trained model and source code for generating descriptions of images.

Language: Python - Size: 636 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 25 - Forks: 9

women-in-ai-ireland/August-2020-WaiLEARN-Image-Caption-Generation

Image Caption Generation using Keras' Pre-Trained Image Feature Extraction models and LSTM

Language: Jupyter Notebook - Size: 243 MB - Last synced at: 10 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 3

Allenpandas/BLIP-ImageCaptioning (fork of salesforce/BLIP)

Fork of BLIP ImageCaptioning from Salesforce

Language: Python - Size: 8.13 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

dabasajay/Image-Caption-Generator

A neural network that generates captions for an image using a CNN and an RNN with beam search.

Language: Python - Size: 2.4 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 247 - Forks: 76

ashishyadav2/SeptaSEM

Major Project Repository

Language: Jupyter Notebook - Size: 1.15 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

anshdavid/pytorch-image-caption

Image caption using VGG16 + LSTM

Language: Python - Size: 2.02 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

linjianz/tf-image-caption

Image captioning for AI Challenger

Language: Jupyter Notebook - Size: 17.8 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 1

NicholasKX/ShowAttendTell

A MindSpore implementation of "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention".

Language: Python - Size: 5.74 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

ludwig7685/image-caption-generation-with-ai-and-api

Language: Python - Size: 10.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

CAOANJIA/image-caption

PyTorch implementation of image captioning based on attention mechanism

Language: Python - Size: 66.3 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 0

rodrigo-barraza/inscriptor

BLIP-2 captioning, mass captioning, question answering, and other tools.

Language: Jupyter Notebook - Size: 491 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

VAIBHAV-2303/ImageCaptionRetrieval

PyTorch image-caption retrieval model

Language: Python - Size: 14.6 KB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

Hsuxu/Paper-Notes

Paper notes in deep learning/machine learning and computer vision

Size: 6.69 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 40 - Forks: 13

kenya-sk/show_attend_and_tell

This repository reimplements the "Show, Attend and Tell" model and adds extra deep learning techniques.

Language: Python - Size: 19.2 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 1

hongkiat/css3-image-captions

Say goodbye to jQuery plugins. Today, we can create similar image caption effects with CSS3 alone. This demo shows how these effects run.

Language: CSS - Size: 189 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 4

zyj0021200/simpleImageCaptionZoo

Simple but Comprehensive PyTorch Implementation of Image Captioning Models.

Language: OpenEdge ABL - Size: 122 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 13 - Forks: 1

maxy0524/image_captioning (fork of DeepRNN/image_captioning)

TensorFlow implementation of "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention". Supports Python 3.6 and 3.7; TensorFlow 1.8, 1.12, 1.13, and 1.14; NumPy 1.12 or newer.

Language: Python - Size: 74 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 32 - Forks: 9

zlsh80826/image-caption-tf

Image Caption

Language: Python - Size: 17.1 MB - Last synced at: 9 days ago - Pushed at: about 2 years ago - Stars: 17 - Forks: 1

abideenml/Image-Captioning-with-MobileNet-and-LSTM

Image Captioning with Visual Attention

Language: Jupyter Notebook - Size: 682 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

lychengrex/Image-Descriptor

Image Descriptor with Visual Attention Mechanism Using Long Short-term Memory

Language: Jupyter Notebook - Size: 2.26 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 1

jkooy/image-caption

Image caption using soft-attention

Language: Jupyter Notebook - Size: 294 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

parmarsuraj99/keras-transformer-flex

Transformer block in tf.keras similar to PyTorch's nn.Transformer block.

Language: Jupyter Notebook - Size: 963 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 8 - Forks: 1

WuLC/ImageCaption

Image Captioning with Google's NIC for AI Challenger

Language: Python - Size: 74.8 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 5

pedrorio/image_caption_augmentation

A text generation library to paraphrase image captions using back translations or transfer learning.

Language: Python - Size: 10.9 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

aqweteddy/ImageCaptioning

CCU Computer Vision final project

Language: Python - Size: 42 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

wasifferoze/image-caption

Language: Jupyter Notebook - Size: 4.51 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

Related Keywords
image-caption (40), image-captioning (17), image-caption-generator (9), deep-learning (8), pytorch (8), tensorflow (7), lstm (5), vgg16 (4), keras (4), python (4), blip (3), image-caption-generation (3), computer-vision (3)

2 each: object-detection, cnn-keras, python3, cnn, multimodal, image-to-text, soft-attention, pytorch-implementation, lstm-neural-networks, image-captions, imagecaptioning, transformer, neural-network, attention, attention-mechanism, image-recognition, beam-search, convolutional-neural-networks, flickr-dataset, nlp, machine-learning

1 each: dataset-creation, dataset-analysis, detr, virtex, back-translation, encoder-decoder, website, image, hcaptcha-solver, hcaptcha, google-translate, api, ai, mindspore, tenforflow, resnet, object-recognition, paraphrase-generation, pytorch-lightning, t5, text-generation, embeddings, recurrent-neural-networks, inceptionv3, inception-v3, transformers, mscoco-dataset, self-critical-sequence-training, image-show-attend-tell, coco-datasets, css3, numpy-1-17-2, opencv-python-4-1-1-16, python-3-7-4, mscoco-image-dataset, tensorflow-1-14, semantic-segmentation, cs565600, kaggle-competition, research, mscoco, papers, papanoptic-segmentation, visual-attention, cnn-lstm, medical-image-computing, instance-segmentation, image-classification, coco, generative-adversarial-network, gan, vist, coco-dataset, dataset-generator, image-descriptor, dataset-generation, flickr-8k, nuralnetwork, image-classifier, text-to-image, dataset, representation-learning, pretraining, multi-modal, diffusion-model, ml