GitHub topics: caption-generation
aimagelab/meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
Language: Python - Size: 7.07 MB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 538 - Forks: 134

photoprism/photoprism-vision
Computer Vision Models for PhotoPrism®
Language: Python - Size: 135 KB - Last synced at: about 16 hours ago - Pushed at: 24 days ago - Stars: 33 - Forks: 7

Vinventive/live-captions-vr
Accessibility-focused SteamVR Overlay improving communication between deaf, hard-of-hearing, and hearing users in VR. It leverages AI so that users can see real-time speech transcription in their 3D space. DISCLAIMER: Voice recognition technology is prone to errors, and this project should not be used as a replacement for a medical hearing aid.
Language: Python - Size: 131 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 4 - Forks: 0

khushalimakani/image-captioning
Captionify: Describing Images with AI. An AI-powered image captioning system that uses CNNs and LSTMs to generate human-like captions for images. Trained on the Flickr8k dataset and evaluated with BLEU scores, it bridges computer vision and natural language processing for real-world applications like accessibility, social media, and e-commerce.
Language: Python - Size: 42 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

aimagelab/DiCO
[BMVC 2024 Oral ✨] Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization
Language: Python - Size: 6.76 MB - Last synced at: 3 days ago - Pushed at: 9 months ago - Stars: 18 - Forks: 0

aimagelab/show-control-and-tell
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
Language: Python - Size: 1.71 MB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 283 - Forks: 61

oshtz/tagmeister-pc
Efficient image captioning using OpenAI API
Language: TypeScript - Size: 14.3 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 6 - Forks: 0

Aavtic/thamburaan
Auto-caption program for generating word-by-word captions on a green-screen video
Language: Rust - Size: 43 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

AlphaV2/AutoImageCaptioning
AutoImageCaptioning🖼️ – A lightweight image captioning system using the BLIP model, designed for efficiency and minimal computation. Automatically generate meaningful captions for images with ease! 🚀
Language: Jupyter Notebook - Size: 40 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

NormVg/AutoCaptionGenAI
A Python project that extracts audio from video files, transcribes the speech, translates it into a target language, and generates SRT subtitles.
Language: Python - Size: 4.88 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 1

OpenShapeLab/ShapeGPT
ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model, a unified and user-friendly shape-language model
Size: 1.2 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 95 - Forks: 1

trucaption/trucaption
A real-time captioning system with support for large and small screen display.
Language: JavaScript - Size: 2.68 MB - Last synced at: 5 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

rahulsonone1234/Traffic-Sign-Recognition
Helps drivers identify traffic signs and supports the efficient operation of self-driving cars.
Language: Python - Size: 3.21 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 18 - Forks: 7

Iteranya/Captioner
Image To Text with Florence 2
Language: Python - Size: 9.77 KB - Last synced at: 16 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

abachaa/3D-MIR
3D Medical Image Retrieval in Radiology
Language: Jupyter Notebook - Size: 1.3 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 11 - Forks: 1

ailimcgregor/subtext
TreeHacks 2024 SubText project
Language: TypeScript - Size: 67.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 1

shunk031/huggingface-datasets_MSCOCO
Microsoft COCO: Common Objects in Context for huggingface datasets
Language: Python - Size: 176 KB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 4 - Forks: 0

Arunesh-Tiwari/ClipCaption
Allows users to upload videos, extract subtitles using ffmpeg, and search within the video for specific phrases, providing an easy-to-use platform for subtitle management and video search functionality.
Language: Python - Size: 31.3 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

debrajhyper/videocap
The Video Caption App is a simple web application that allows users to add captions to a video at specific timestamps. This app ensures that captions are displayed synchronously with the video playback. Users can input video URLs, add captions with their corresponding timestamps, and view the video with captions overlaid on it.
Language: TypeScript - Size: 1.37 MB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

ch3cook-fdu/Vote2Cap-DETR
[CVPR 2023] Vote2Cap-DETR and [T-PAMI 2024] Vote2Cap-DETR++; A set-to-set perspective towards 3D Dense Captioning; State-of-the-Art 3D Dense Captioning methods
Language: Python - Size: 308 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 82 - Forks: 5

Nexdata-AI/100-Hours-Indonesian-Children-Spontaneous-Speech-Data
Indonesian Child's Spontaneous Speech Data
Size: 1.23 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Nexdata-AI/101-Hours-Italian-Children-Spontaneous-Speech-Data
Italian Child's Spontaneous Speech Data
Size: 2.52 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Nexdata-AI/100-Hours-Thai-Children-Spontaneous-Speech-Data
Thai Child's Spontaneous Speech Data
Size: 652 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

RisshiN24/Image-Captioning-App (fork of petermartens98/SceneXplain-LangChain-Image-Captioning-App)
Caption generator that allows users to upload or provide links to images. Partly powered by GPT-4o.
Size: 1.22 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

ababiyaworku/GPT4V_Captioner
A simple and powerful GPT4V image captioner. Processes a single image or batch-processes multiple images in the directory where you run the script.
Language: Python - Size: 101 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Imiloin/Capoom
A real-time subtitle generator based on Whisper.
Language: Python - Size: 1.5 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

sabirdvd/BLIP_image_caption_demo
BLIP image caption demo - Medium blog post
Language: Jupyter Notebook - Size: 2.28 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 4 - Forks: 3

NaimLehbiben/NLP
NLP project from a Paris-Dauphine University lecture. This project aims to predict dataset captions using the "Show and Tell" approach developed by Oriol Vinyals, Alexander Toshev, Samy Bengio, and Dumitru Erhan.
Language: Jupyter Notebook - Size: 14.8 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

ashishyadav2/Image-Captioning
Language: Jupyter Notebook - Size: 66.7 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 1

qowngus33/q-align-caption
q-align model with caption capacity
Language: Python - Size: 49 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

seanwongg/python-scraping-projects
for web scraping & AI caption generation with Python
Language: Jupyter Notebook - Size: 76.2 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Neloy-Barman/Artwork-Description-Generator
Language: Jupyter Notebook - Size: 6.77 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

adarshanand67/SegCapNet
SegCapNet
Language: Python - Size: 310 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Kiwinicki/img-caption-py
Small image captioning program with automatic caption generation option
Language: Jupyter Notebook - Size: 83 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

daveredrum/D3Net
[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Language: Python - Size: 105 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 36 - Forks: 5

LaurentVeyssier/Image-Captioning-Project-with-full-Encoder-Decoder-model
Generate captions for images using a CNN encoder-LSTM decoder structure
Language: Jupyter Notebook - Size: 2.34 MB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 2

jeyprabu/Image-Decoding-and-Captioning
A web application developed to generate captions from images. It can also detect edges and corners of an image. Furthermore, it can perform comparative anomaly detection.
Language: HTML - Size: 45.3 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

daveredrum/Scan2Cap
[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
Language: Python - Size: 108 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 94 - Forks: 16

imanom/Generating-Subtitles
Generates subtitles from a video/audio file. Developed in Python and uses Google Cloud APIs.
Language: Jupyter Notebook - Size: 21.5 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 0

lachhabw/Image-Captioning-Extension-for-LM-Studio
LM Studio extension for automatic image captioning.
Language: Python - Size: 9.77 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

leeyunjai/image2text
caption generator using lavis and argostranslate
Language: Python - Size: 128 KB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 1

jayant1211/A-Multi-Modal-Approach-to-Improve-Scene-Context
This GitHub repository focuses on an integrated approach to scene classification and image caption generation, aiming to improve the accuracy of scene evaluation in computer vision applications.
Language: Jupyter Notebook - Size: 10.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ihaeyong/drama-graph
The Drama-Graph repository produces both a knowledge base from drama scripts and a video graph for the Video Turing Test (VTT).
Language: Jupyter Notebook - Size: 201 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 1

Anshler/ICG_sd_extension
Image caption extension for A1111 Webui 👁️📜🖋️
Language: Python - Size: 181 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

yash-sarwaswa/Image-Caption-Generator
A Python application that generates a caption for a selected image. Uses deep learning and NLP frameworks (TensorFlow, Keras, and NLTK) for data processing and for creating and evaluating deep learning models.
Language: Jupyter Notebook - Size: 110 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 1

ghostofpokemon/oCaption
oCaption: Leveraging OpenAI's GPT-4 Vision for Advanced Image Captioning
Language: Python - Size: 9.77 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dabasajay/Image-Caption-Generator
A neural network that generates captions for an image using a CNN and an RNN with beam search.
Language: Python - Size: 2.4 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 247 - Forks: 76

chenxinpeng/ARNet
CVPR 2018 - Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present
Language: Python - Size: 190 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 95 - Forks: 22

aimagelab/speaksee
PyTorch library for Visual-Semantic tasks
Language: Python - Size: 68.7 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 28 - Forks: 8

devinzhang415/captioner
Image caption generator Chrome extension for WaffleHacks 2023
Language: JavaScript - Size: 27.3 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

devinzhang415/caption-gen
Deep Learning Image Caption Generator
Language: Jupyter Notebook - Size: 1.78 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Merterm/COSMic
Public repo for the paper: "COSMic: A Coherence-Aware Generation Metric for Image Descriptions" by Mert İnan, Piyush Sharma, Baber Khalid, Radu Soricut, Matthew Stone, Malihe Alikhani
Language: Python - Size: 396 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 0

apivideo/caption.new
Sample app to add captions to an uploaded video. From api.video (https://api.video)
Language: JavaScript - Size: 692 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 0

naumanthunder22/mscs-thesis
Thesis Topic: Transfer Learning Based Food Item Recognition and Estimation of Attributes.
Size: 4.44 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

dibyansu24-maker/Neural-Image-Caption-Generator
Automatically describing the content of an image is a fundamental problem in artificial intelligence that connects computer vision and natural language processing.
Language: Jupyter Notebook - Size: 1.21 GB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 3

todd-gavin/DSCI550-PixstoryMediaExtractionAndAnalysis
Extraction and analysis of the PixStory social media dataset using language detection, language translation, the Tika GeoTopic parser, Tika image object recognition/image caption generation, and PyTorch Detoxify.
Language: Jupyter Notebook - Size: 349 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

frederikgramkortegaard/describe
Content-based Deep Image-Search for Conversational Language
Language: Python - Size: 51.7 MB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

TanyaChutani/Image-Captioning-Generator
Image Captioning Generator Keras
Language: Jupyter Notebook - Size: 156 KB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

nrb2310/AI_ML-Caption-Generator
The code lets users upload an image and generates captions for it using the ViT-GPT2 vision encoder-decoder model. It provides an easy-to-use interface for caption generation.
Language: Jupyter Notebook - Size: 10.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

pritishmishra703/Image-Captioning
This project uses a transformer to generate captions for images.
Language: Python - Size: 229 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 6

prabhat-ranjan50/Cricket-Caption-Generator-using-CNN_RNN-model
Our model generates descriptive captions for input images from the cricket domain.
Language: Jupyter Notebook - Size: 434 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

juletx/image-caption-generation
Automatic image caption generation model that uses a CNN to condition an LSTM-based language model
Language: Jupyter Notebook - Size: 1.05 GB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 0

heng-hw/SpaCap3D
[IJCAI 2022] Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds (official PyTorch implementation)
Language: Python - Size: 91 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 15 - Forks: 6

damminhtien/deep-learning-image-caption-generator
Deep CNN-LSTM for Generating Image Descriptions
Language: Jupyter Notebook - Size: 3.32 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 27 - Forks: 7

sabirdvd/DivStats_Caption
DivStats: Div-1, Div-2, and mBLEU for caption diversity evaluation
Language: Python - Size: 30.4 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

tanishqgautam/Image-Captioning
Implements three architectures for the image captioning problem: merged encoder-decoder, Bahdanau attention, and Transformers.
Language: Jupyter Notebook - Size: 2.23 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 35 - Forks: 18

ApoorvGit/god-s-eye
Aid for blind users. This AI describes the surroundings, tells the user who is in front of them (if that person is known to the AI through facial recognition), and helps them read written text (optical character recognition).
Language: Python - Size: 37.5 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 6

nalbert9/Image-Captioning
Computer Vision: Generate captions that describe the contents of images using PyTorch
Language: Jupyter Notebook - Size: 129 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 21 - Forks: 6

connorguy/MemeToAlt
Self-contained alt text generator for memes
Language: Python - Size: 40.9 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

akshat-khosya/video-text-search
Can we fetch the timestamp of any sentence spoken in a particular video? If so, how? We can do it by extracting the subtitles from the video with CCExtractor and applying a search algorithm. Let's see.
Language: TypeScript - Size: 55.3 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

MADHAVAN001/image-captioning-approaches
A modular repository for developing Image Captioning Approaches
Language: Jupyter Notebook - Size: 53 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

nirajankarki5/Flickr30k-Image-Caption-Generator-Using-Deep-Learning
A deep learning model that generates descriptions of an image.
Language: Jupyter Notebook - Size: 49.4 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 2

amdnsr/machinelearningapps
Flask frontend for ML-Apps, a collection of text-summarization, caption-generation and cartoonization services.
Language: Python - Size: 86.2 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

amdnsr/mlmodelsapi
FastAPI backend for ML-Apps, a collection of text-summarization, caption-generation and cartoonization services.
Language: Python - Size: 85.1 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

akarsh-saxena/Image-Caption-Generator
A deep learning based image caption generator.
Language: Jupyter Notebook - Size: 16.4 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 2

mahendranandi/Image_Captioning
Image captioning using ResNet50 and LSTM in the Keras library. An application of both CV (Computer Vision) and NLP (Natural Language Processing) concepts.
Language: Jupyter Notebook - Size: 27.6 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

AbhinavS99/Image-Caption-Generation-using-Attention-Networks
Neural Image Caption Generation Using Bahdanau Attention
Language: Jupyter Notebook - Size: 1.92 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

rockleona/fotocaptioner
FotoCaptioner is software that can easily generate captions for social media (e.g. Instagram, Facebook, Twitter). The app can also read EXIF metadata from pictures (if available).
Language: HTML - Size: 7.96 MB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

abdullahzia510/Effecient-Urdu-Caption-Generation-using-Attention-Mechanism
This repository contains code and results for the course project of the Deep Learning Spring 2020 course offered at Information Technology University, Lahore, Pakistan. This repository is only for learning purposes and is not intended to be used for commercial purposes.
Language: Jupyter Notebook - Size: 4.37 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 3

SaiHari-N/Image-Caption-Generator-Using-CNN
Image caption generation is a task that involves computer vision and natural language processing concepts to recognize the context of an image and describe it in a natural language like English.
Language: Python - Size: 143 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

scionoftech/image_caption_generation
Image caption generation using Deep Learning-LSTM
Language: Jupyter Notebook - Size: 1.32 MB - Last synced at: about 9 hours ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

Lucasfrota/imdex
Imdex is a library that allows semantic searches over image sets
Language: Python - Size: 320 MB - Last synced at: 21 days ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

SilasKati/Image-Caption-Generator
An Image Caption Generator which generates a caption describing the given image.
Language: Python - Size: 216 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

VSJMilewski/disentangled-caption-generator
Code for an attempt at disentangling the decoder part of a caption generator.
Language: Python - Size: 135 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

leesharma/WhatDoesItMean
A native caption generation application using the Show and Tell model
Language: Java - Size: 2.65 MB - Last synced at: 22 days ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0
