An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: caption-generation

aimagelab/meshed-memory-transformer

Meshed-Memory Transformer for Image Captioning. CVPR 2020

Language: Python - Size: 7.07 MB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 538 - Forks: 134

photoprism/photoprism-vision

Computer Vision Models for PhotoPrism®

Language: Python - Size: 135 KB - Last synced at: about 16 hours ago - Pushed at: 24 days ago - Stars: 33 - Forks: 7

Vinventive/live-captions-vr

Accessibility-focused SteamVR Overlay improving communication between deaf, hard-of-hearing, and hearing users in VR. It leverages AI to let users see real-time speech transcription in their 3D space. DISCLAIMER: Voice recognition technology is prone to errors, and this project should not be used as a replacement for a medical hearing aid.

Language: Python - Size: 131 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 4 - Forks: 0

khushalimakani/image-captioning

Captionify: Describing Images with AI. An AI-powered image captioning system that uses CNNs and LSTMs to generate human-like captions for images. Trained on the Flickr8k dataset and evaluated with BLEU scores, it bridges computer vision and natural language processing for real-world applications like accessibility, social media, and e-commerce.

Language: Python - Size: 42 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

aimagelab/DiCO

[BMVC 2024 Oral ✨] Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization

Language: Python - Size: 6.76 MB - Last synced at: 3 days ago - Pushed at: 9 months ago - Stars: 18 - Forks: 0

aimagelab/show-control-and-tell

Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019

Language: Python - Size: 1.71 MB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 283 - Forks: 61

oshtz/tagmeister-pc

Efficient image captioning using OpenAI API

Language: TypeScript - Size: 14.3 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 6 - Forks: 0

Aavtic/thamburaan

Auto-caption program for generating word-by-word captions on a green-screen video

Language: Rust - Size: 43 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

AlphaV2/AutoImageCaptioning

AutoImageCaptioning🖼️ – A lightweight image captioning system using the BLIP model, designed for efficiency and minimal computation. Automatically generate meaningful captions for images with ease! 🚀

Language: Jupyter Notebook - Size: 40 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

NormVg/AutoCaptionGenAI

A Python project that extracts audio from video files, transcribes the speech, translates it into a target language, and generates SRT subtitles.

Language: Python - Size: 4.88 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 1

OpenShapeLab/ShapeGPT

ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model, a unified and user-friendly shape-language model

Size: 1.2 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 95 - Forks: 1

trucaption/trucaption

A real-time captioning system with support for large and small screen display.

Language: JavaScript - Size: 2.68 MB - Last synced at: 5 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

rahulsonone1234/Traffic-Sign-Recognition

Helps drivers identify traffic signs and supports the efficient operation of self-driving cars.

Language: Python - Size: 3.21 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 18 - Forks: 7

Iteranya/Captioner

Image To Text with Florence 2

Language: Python - Size: 9.77 KB - Last synced at: 16 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

abachaa/3D-MIR

3D Medical Image Retrieval in Radiology

Language: Jupyter Notebook - Size: 1.3 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 11 - Forks: 1

ailimcgregor/subtext

TreeHacks 2024 SubText project

Language: TypeScript - Size: 67.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 1

shunk031/huggingface-datasets_MSCOCO

Microsoft COCO: Common Objects in Context for huggingface datasets

Language: Python - Size: 176 KB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 4 - Forks: 0

Arunesh-Tiwari/ClipCaption

Allows users to upload videos, extract subtitles using ffmpeg, and search within the video for specific phrases, providing an easy-to-use platform for subtitle management and video search functionality.

Language: Python - Size: 31.3 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

debrajhyper/videocap

The Video Caption App is a simple web application that allows users to add captions to a video at specific timestamps. This app ensures that captions are displayed synchronously with the video playback. Users can input video URLs, add captions with their corresponding timestamps, and view the video with captions overlaid on it.

Language: TypeScript - Size: 1.37 MB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

ch3cook-fdu/Vote2Cap-DETR

[CVPR 2023] Vote2Cap-DETR and [T-PAMI 2024] Vote2Cap-DETR++; A set-to-set perspective towards 3D Dense Captioning; State-of-the-Art 3D Dense Captioning methods

Language: Python - Size: 308 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 82 - Forks: 5

Nexdata-AI/100-Hours-Indonesian-Children-Spontaneous-Speech-Data

Indonesian Child's Spontaneous Speech Data

Size: 1.23 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Nexdata-AI/101-Hours-Italian-Children-Spontaneous-Speech-Data

Italian Child's Spontaneous Speech Data

Size: 2.52 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Nexdata-AI/100-Hours-Thai-Children-Spontaneous-Speech-Data

Thai Child's Spontaneous Speech Data

Size: 652 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

RisshiN24/Image-Captioning-App (fork of petermartens98/SceneXplain-LangChain-Image-Captioning-App)

Caption generator that allows users to upload or provide links to images. Partly powered by GPT-4o.

Size: 1.22 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

ababiyaworku/GPT4V_Captioner

A simple and powerful GPT4V image captioner. Processes a single image or batch-processes multiple images in the directory where you run the script.

Language: Python - Size: 101 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Imiloin/Capoom

A real-time subtitle generator, based on whisper.

Language: Python - Size: 1.5 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

sabirdvd/BLIP_image_caption_demo

BLIP image caption demo for a Medium blog post

Language: Jupyter Notebook - Size: 2.28 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 4 - Forks: 3

NaimLehbiben/NLP

NLP project from a Paris-Dauphine University lecture. This project aims to predict dataset captions using the "Show and Tell" approach developed by Oriol Vinyals, Alexander Toshev, Samy Bengio, and Dumitru Erhan.

Language: Jupyter Notebook - Size: 14.8 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

ashishyadav2/Image-Captioning

Language: Jupyter Notebook - Size: 66.7 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 1

qowngus33/q-align-caption

q-align model with caption capacity

Language: Python - Size: 49 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

seanwongg/python-scraping-projects

for web scraping & AI caption generation with Python

Language: Jupyter Notebook - Size: 76.2 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Neloy-Barman/Artwork-Description-Generator

Language: Jupyter Notebook - Size: 6.77 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

adarshanand67/SegCapNet

SegCapNet

Language: Python - Size: 310 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Kiwinicki/img-caption-py

Small image captioning program with automatic caption generation option

Language: Jupyter Notebook - Size: 83 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

daveredrum/D3Net

[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding

Language: Python - Size: 105 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 36 - Forks: 5

LaurentVeyssier/Image-Captioning-Project-with-full-Encoder-Decoder-model

Generate captions for images using a CNN encoder-LSTM decoder structure

Language: Jupyter Notebook - Size: 2.34 MB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 2

jeyprabu/Image-Decoding-and-Captioning

A web application developed to generate captions from images. It can also detect edges and corners of an image. Furthermore, it can perform comparative anomaly detection.

Language: HTML - Size: 45.3 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

daveredrum/Scan2Cap

[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans

Language: Python - Size: 108 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 94 - Forks: 16

imanom/Generating-Subtitles

Generates subtitles from a video/audio file. Developed in Python and uses Google Cloud APIs.

Language: Jupyter Notebook - Size: 21.5 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 0

lachhabw/Image-Captioning-Extension-for-LM-Studio

LM Studio extension for automatic image captioning.

Language: Python - Size: 9.77 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

leeyunjai/image2text

caption generator using lavis and argostranslate

Language: Python - Size: 128 KB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 1

jayant1211/A-Multi-Modal-Approach-to-Improve-Scene-Context

This GitHub repository focuses on an integrated approach to scene classification and image caption generation, aiming to improve the accuracy of scene evaluation in computer vision applications.

Language: Jupyter Notebook - Size: 10.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ihaeyong/drama-graph

The Drama-Graph repository produces both a knowledge base from drama scripts and a video graph for the Video Turing Test (VTT).

Language: Jupyter Notebook - Size: 201 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 1

Anshler/ICG_sd_extension

Image caption extension for A1111 Webui 👁️📜🖋️

Language: Python - Size: 181 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

yash-sarwaswa/Image-Caption-Generator

Building a Python application that generates a caption for a selected image. Uses deep learning and NLP frameworks (TensorFlow, Keras, and NLTK) for data processing and for creating and evaluating deep learning models.

Language: Jupyter Notebook - Size: 110 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 1

ghostofpokemon/oCaption

oCaption: Leveraging OpenAI's GPT-4 Vision for Advanced Image Captioning

Language: Python - Size: 9.77 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dabasajay/Image-Caption-Generator

A neural network to generate captions for an image using CNN and RNN with BEAM Search.

Language: Python - Size: 2.4 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 247 - Forks: 76

chenxinpeng/ARNet

CVPR 2018 - Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present

Language: Python - Size: 190 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 95 - Forks: 22

aimagelab/speaksee

PyTorch library for Visual-Semantic tasks

Language: Python - Size: 68.7 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 28 - Forks: 8

devinzhang415/captioner

Image caption generator Chrome extension for WaffleHacks 2023

Language: JavaScript - Size: 27.3 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

devinzhang415/caption-gen

Deep Learning Image Caption Generator

Language: Jupyter Notebook - Size: 1.78 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Merterm/COSMic

Public repo for the paper: "COSMic: A Coherence-Aware Generation Metric for Image Descriptions" by Mert İnan, Piyush Sharma, Baber Khalid, Radu Soricut, Matthew Stone, Malihe Alikhani

Language: Python - Size: 396 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 0

apivideo/caption.new

Sample app to add captions to an uploaded video. From api.video (https://api.video)

Language: JavaScript - Size: 692 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 0

naumanthunder22/mscs-thesis

Thesis Topic: Transfer Learning Based Food Item Recognition and Estimation of Attributes.

Size: 4.44 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

dibyansu24-maker/Neural-Image-Caption-Generator

Automatically describing the content of an image is a fundamental problem in artificial intelligence that connects computer vision and natural language processing.

Language: Jupyter Notebook - Size: 1.21 GB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 3

todd-gavin/DSCI550-PixstoryMediaExtractionAndAnalysis

Extraction and analysis of the PixStory Social Media Dataset using language detection, language translation, Tika GeoTopic parser, Tika image object recognition/image caption generation, and PyTorch Detoxify.

Language: Jupyter Notebook - Size: 349 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

frederikgramkortegaard/describe

Content-based Deep Image-Search for Conversational Language

Language: Python - Size: 51.7 MB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

TanyaChutani/Image-Captioning-Generator

Image Captioning Generator Keras

Language: Jupyter Notebook - Size: 156 KB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

nrb2310/AI_ML-Caption-Generator

The code allows users to upload an image and generates captions for it using the ViT-GPT2 vision encoder-decoder model. It provides an easy-to-use interface for caption generation.

Language: Jupyter Notebook - Size: 10.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

pritishmishra703/Image-Captioning

This project uses a transformer to generate captions for images.

Language: Python - Size: 229 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 6

prabhat-ranjan50/Cricket-Caption-Generator-using-CNN_RNN-model

Our model generates descriptive captions for input images from the cricket domain.

Language: Jupyter Notebook - Size: 434 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

juletx/image-caption-generation

Automatic Image Caption Generation model that uses a CNN to condition an LSTM-based language model

Language: Jupyter Notebook - Size: 1.05 GB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 0

heng-hw/SpaCap3D

[IJCAI 2022] Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds (official pytorch implementation)

Language: Python - Size: 91 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 15 - Forks: 6

damminhtien/deep-learning-image-caption-generator

Deep CNN-LSTM for Generating Image Descriptions 😈

Language: Jupyter Notebook - Size: 3.32 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 27 - Forks: 7

sabirdvd/DivStats_Caption

DivStats Div 1 and Div 2 and mBLEU for caption diversity evaluation

Language: Python - Size: 30.4 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

tanishqgautam/Image-Captioning

Implemented three different architectures to tackle the image captioning problem: Merged Encoder-Decoder, Bahdanau Attention, and Transformers

Language: Jupyter Notebook - Size: 2.23 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 35 - Forks: 18

ApoorvGit/god-s-eye

An aid for blind users. This AI describes the surroundings, tells the user who is in front of them (if that person is known to the AI through facial recognition), and helps them know what is written (optical character recognition).

Language: Python - Size: 37.5 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 6

nalbert9/Image-Captioning

Computer Vision: Generate captions that describe the contents of images using PyTorch

Language: Jupyter Notebook - Size: 129 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 21 - Forks: 6

connorguy/MemeToAlt

Self-contained alt text generator for memes

Language: Python - Size: 40.9 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

akshat-khosya/video-text-search

Can we fetch the timestamp of any sentence spoken in a particular video? If so, how? We can do it by extracting subtitles from the video with a CC extractor and applying a search algorithm. Let's see.

Language: TypeScript - Size: 55.3 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

MADHAVAN001/image-captioning-approaches

A modular repository for developing Image Captioning Approaches

Language: Jupyter Notebook - Size: 53 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

nirajankarki5/Flickr30k-Image-Caption-Generator-Using-Deep-Learning

A deep learning model that generates descriptions of an image.

Language: Jupyter Notebook - Size: 49.4 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 2

amdnsr/machinelearningapps

Flask frontend for ML-Apps, a collection of text-summarization, caption-generation and cartoonization services.

Language: Python - Size: 86.2 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

amdnsr/mlmodelsapi

FastAPI backend for ML-Apps, a collection of text-summarization, caption-generation and cartoonization services.

Language: Python - Size: 85.1 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

akarsh-saxena/Image-Caption-Generator

A deep learning based image caption generator.

Language: Jupyter Notebook - Size: 16.4 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 2

mahendranandi/Image_Captioning

Image captioning using ResNet50 and LSTM with the Keras library. An application of both CV (Computer Vision) and NLP (Natural Language Processing) concepts.

Language: Jupyter Notebook - Size: 27.6 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

AbhinavS99/Image-Caption-Generation-using-Attention-Networks

Neural Image Caption Generation Using Bahdanau Attention

Language: Jupyter Notebook - Size: 1.92 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

rockleona/fotocaptioner

FotoCaptioner is software that can easily generate captions for social media (e.g. Instagram, Facebook, Twitter). The app can also read EXIF metadata from pictures (if available).

Language: HTML - Size: 7.96 MB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

abdullahzia510/Effecient-Urdu-Caption-Generation-using-Attention-Mechanism

This repository contains code and results for the course project of the Deep Learning Spring 2020 course offered at Information Technology University, Lahore, Pakistan. This repository is only for learning purposes and is not intended to be used for commercial purposes.

Language: Jupyter Notebook - Size: 4.37 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 3

SaiHari-N/Image-Caption-Generator-Using-CNN

Image caption generation is a task that involves computer vision and natural language processing concepts to recognize the context of an image and describe it in a natural language like English.

Language: Python - Size: 143 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

scionoftech/image_caption_generation

Image caption generation using Deep Learning-LSTM

Language: Jupyter Notebook - Size: 1.32 MB - Last synced at: about 9 hours ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

Lucasfrota/imdex

Imdex is a library that allows semantic searches over image sets

Language: Python - Size: 320 MB - Last synced at: 21 days ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

SilasKati/Image-Caption-Generator

An Image Caption Generator which generates a caption describing the given image.

Language: Python - Size: 216 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

VSJMilewski/disentangled-caption-generator

Code for an attempt at disentangling the decoder part of a caption generator.

Language: Python - Size: 135 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

leesharma/WhatDoesItMean

A native caption generation application using the Show and Tell model

Language: Java - Size: 2.65 MB - Last synced at: 22 days ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Related Keywords
caption-generation (85), captioning-images (19), image-captioning (19), deep-learning (14), python (14), pytorch (12), machine-learning (11), cnn (9), lstm (9), computer-vision (8), convolutional-neural-networks (8), captions (8), tensorflow (7), nlp (7), ai (6), natural-language-processing (6), keras (6), beam-search (5), rnn (5), bleu-score (5), speech-recognition (5), inceptionv3 (5), recurrent-neural-networks (5), speech-to-text (4), image-processing (4), transformer (4), deep-neural-networks (4), flickr-8k (3), python3 (3), attention (3), artificial-intelligence (3), transfer-learning (3), cnn-keras (3), visual-semantic (3), tkinter (3), multimodal-deep-learning (3), lstm-neural-networks (3), video-processing (3), object-detection (3), encoder-decoder (3), deeplearning (3), image-recognition (3), subtitles (3), flask (3), vision-and-language (3), captioning (3), vgg16 (3), blip (3), 3d (3), point-cloud (3), flickr-dataset (2), bleu (2), gradio (2), transformers (2), typescript (2), dense-captioning (2), nodejs (2), feature-extraction (2), children-speech-recognition (2), asr (2), audio (2), gpt-4 (2), image (2), openai-api (2), images (2), inception-v3 (2), openai (2), nlp-machine-learning (2), scene-understanding (2), lstm-model (2), cnn-model (2), rnn-lstm (2), rnn-encoder-decoder (2), caption (2), attention-model (2), attention-mechanism (2), image-caption-generator (2), caption-generator (2), gpt (2), textsummarization (2), accessibility (2), cartoonization (2), image-classification (2), img2txt (2), multi-modal (2), ocr (2), multimodal-learning (1), multimodal (1), tensorflow2 (1), automatic-metrics (1), wafflehacks (1), flickr30k (1), chrome-extension (1), regularizing-rnns (1), code-captioning (1), sound-event-localization (1), transformer-architecture (1), flickr (1), transformer-models (1), transformer-pytorch (1)