An open API service providing repository metadata for many open source software ecosystems.

Topic: "image-captioning"

salesforce/LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook - Size: 79.3 MB - Last synced at: about 11 hours ago - Pushed at: 6 months ago - Stars: 10,534 - Forks: 1,026

salesforce/BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language: Jupyter Notebook - Size: 6.34 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 5,165 - Forks: 681

OpenGVLab/InternGPT

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)

Language: Python - Size: 41.9 MB - Last synced at: 2 days ago - Pushed at: 9 months ago - Stars: 3,214 - Forks: 230

sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

Language: Python - Size: 12.6 MB - Last synced at: 29 days ago - Pushed at: almost 3 years ago - Stars: 2,830 - Forks: 722

OFA-Sys/OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Language: Python - Size: 120 MB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 2,491 - Forks: 249

ttengwang/Caption-Anything

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything

Language: Python - Size: 51.9 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 1,732 - Forks: 104

peteanderson80/bottom-up-attention

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

Language: Jupyter Notebook - Size: 13.4 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 1,429 - Forks: 379

imaginary-cloud/CameraManager

Simple Swift class to provide all the configurations you need to create custom camera view in your app

Language: Swift - Size: 4.7 MB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 1,383 - Forks: 329

NVlabs/prismer

The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".

Language: Python - Size: 4.25 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 1,310 - Forks: 73

microsoft/Oscar 📦

Oscar and VinVL

Language: Python - Size: 715 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 1,049 - Forks: 252

ruotianluo/self-critical.pytorch

Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.

Language: Python - Size: 600 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 977 - Forks: 286

YehLi/xmodaler

X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

Language: Python - Size: 12.2 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 970 - Forks: 105

yunjey/show-attend-and-tell

TensorFlow Implementation of "Show, Attend and Tell"

Language: Jupyter Notebook - Size: 49.1 MB - Last synced at: 9 months ago - Pushed at: almost 7 years ago - Stars: 908 - Forks: 324

jhc13/taggui

Tag manager and captioner for image datasets

Language: Python - Size: 22.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 891 - Forks: 41

SkalskiP/awesome-foundation-and-multimodal-models

👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

Language: Python - Size: 58.6 KB - Last synced at: about 17 hours ago - Pushed at: about 1 year ago - Stars: 615 - Forks: 46

kdexd/virtex

[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations

Language: Python - Size: 3.65 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 561 - Forks: 61

aimagelab/meshed-memory-transformer

Meshed-Memory Transformer for Image Captioning. CVPR 2020

Language: Python - Size: 7.07 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 531 - Forks: 135

subho406/OmniNet

Official Pytorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain

Language: Python - Size: 17.6 MB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 513 - Forks: 58

kuanghuei/SCAN

PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)

Language: Python - Size: 34.2 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 490 - Forks: 106

ufal/neuralmonkey

An open-source tool for sequence learning in NLP built on TensorFlow.

Language: Python - Size: 13.5 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 410 - Forks: 106

MahanFathi/CS231

Complete Assignments for CS231n: Convolutional Neural Networks for Visual Recognition

Language: Jupyter Notebook - Size: 12.1 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 364 - Forks: 154

scopeInfinity/Video2Description

Video to Text: Natural language description generator for some given video. [Video Captioning]

Language: Python - Size: 33 MB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 343 - Forks: 70

jiasenlu/AdaptiveAttention

Implementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning"

Language: Jupyter Notebook - Size: 3.75 MB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 335 - Forks: 74

yashk2810/Image-Captioning

Image Captioning using InceptionV3 and beam search

Language: Jupyter Notebook - Size: 74.6 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 327 - Forks: 122

husthuaan/AoANet

Code for paper "Attention on Attention for Image Captioning". ICCV 2019

Language: Python - Size: 104 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 315 - Forks: 66

krasserm/fairseq-image-captioning

Transformer-based image captioning extension for pytorch/fairseq

Language: Python - Size: 3.09 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 315 - Forks: 57

sethuiyer/Image-to-Image-Search 📦

A reverse image search engine powered by elastic search and tensorflow

Language: Python - Size: 1.98 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 310 - Forks: 51

cuixing158/Awesome-CV-MasterHub

:fire: :fire: :fire: A paper list of some recent Computer Vision(CV) works

Size: 22.4 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 289 - Forks: 23

aimagelab/show-control-and-tell

Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019

Language: Python - Size: 1.71 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 281 - Forks: 61

JDAI-CV/image-captioning

Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]

Language: Python - Size: 733 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 273 - Forks: 54

DataTurks/DataTurks

ML data annotations made super easy for teams. Just upload data, add your team and build training/evaluation dataset in hours.

Language: JavaScript - Size: 3.95 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 266 - Forks: 125

anuragmishracse/caption_generator

A modular library built on top of Keras and TensorFlow to generate a caption in natural language for any input image.

Language: Python - Size: 902 KB - Last synced at: about 1 month ago - Pushed at: almost 7 years ago - Stars: 265 - Forks: 119

yxuansu/MAGIC

Language Models Can See: Plugging Visual Controls in Text Generation

Language: Python - Size: 132 MB - Last synced at: 15 days ago - Pushed at: almost 3 years ago - Stars: 256 - Forks: 27

dabasajay/Image-Caption-Generator

A neural network to generate captions for an image using CNN and RNN with BEAM Search.

Language: Python - Size: 2.4 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 247 - Forks: 76

HanXinzi-AI/awesome-computer-vision-resources

a collection of computer vision projects&tools. 计算机视觉方向项目和工具集合。

Size: 49.8 MB - Last synced at: 6 days ago - Pushed at: 11 months ago - Stars: 246 - Forks: 33

peteanderson80/Up-Down-Captioner

Automatic image captioning model based on Caffe, using features from bottom-up attention.

Language: Jupyter Notebook - Size: 2.6 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 245 - Forks: 69

j-min/CLIP-Caption-Reward

PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)

Language: Python - Size: 2.64 MB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 241 - Forks: 26

gokayfem/ComfyUI_VLM_nodes

Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation

Language: Python - Size: 285 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 235 - Forks: 14

saahiluppal/catr

Image Captioning Using Transformer

Language: Python - Size: 2.99 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 231 - Forks: 57

google/imageinwords

Data release for the ImageInWords (IIW) paper.

Language: JavaScript - Size: 21.4 MB - Last synced at: 23 days ago - Pushed at: 6 months ago - Stars: 209 - Forks: 9

zjuchenlong/sca-cnn.cvpr17

Image Captions Generation with Spatial and Channel-wise Attention

Language: Python - Size: 27.3 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 206 - Forks: 73

li-xirong/coco-cn

Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks

Language: OpenEdge ABL - Size: 195 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 187 - Forks: 21

luo3300612/image-captioning-DLCT

Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).

Language: Jupyter Notebook - Size: 1.17 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 183 - Forks: 30

ZexinYan/Medical-Report-Generation

A pytorch implementation of On the Automatic Generation of Medical Imaging Reports.

Language: Python - Size: 70.1 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 179 - Forks: 64

milaan9/Deep_Learning_Algorithms_from_Scratch

This repository explores the variety of techniques and algorithms commonly used in deep learning and the implementation in MATLAB and PYTHON

Language: Jupyter Notebook - Size: 9.85 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 173 - Forks: 171

davidnvq/grit

GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)

Language: Python - Size: 84.2 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 151 - Forks: 20

neural-nuts/image-caption-generator 📦

[DEPRECATED] A Neural Network based generative model for captioning images using Tensorflow

Language: Jupyter Notebook - Size: 9.64 MB - Last synced at: 6 months ago - Pushed at: over 5 years ago - Stars: 148 - Forks: 57

tsenghungchen/show-adapt-and-tell

Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017

Language: Python - Size: 2.51 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 148 - Forks: 40

peteanderson80/SPICE

Semantic Propositional Image Caption Evaluation

Language: Java - Size: 27.4 MB - Last synced at: 27 days ago - Pushed at: over 2 years ago - Stars: 140 - Forks: 31

snrazavi/Deep_Learning_in_Python_2018

Deep Learning workshop including image classification, face recognition, Object detection, language modelling, image captioning and neural machine translation.

Language: Jupyter Notebook - Size: 449 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 125 - Forks: 61

zhiqwang/sightseq

Computer vision tools for fairseq, containing PyTorch implementation of text recognition and object detection

Language: Python - Size: 203 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 125 - Forks: 34

phachon/gis

gis (go image server) go 实现的图片服务,实现基本的上传,下载,存储,按比例裁剪等功能

Language: Go - Size: 1.84 MB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 123 - Forks: 36

jmisilo/clip-gpt-captioning

CLIPxGPT Captioner is Image Captioning Model based on OpenAI's CLIP and GPT-2.

Language: Python - Size: 873 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 117 - Forks: 32

zsdonghao/Image-Captioning

TensorFlow (TensorLayer) Implementation of Image Captioning

Language: Python - Size: 199 KB - Last synced at: 28 days ago - Pushed at: over 2 years ago - Stars: 115 - Forks: 55

hlamba28/Automatic-Image-Captioning

Generating Captions for images using Deep Learning

Language: Jupyter Notebook - Size: 252 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 112 - Forks: 80

terry-r123/Awesome-Captioning

A curated list of Multimodal Captioning related research(including image captioning, video captioning, and text captioning)

Size: 56.6 KB - Last synced at: 7 days ago - Pushed at: almost 3 years ago - Stars: 110 - Forks: 10

MIMICLab/L-Verse

L-Verse: Bidirectional Generation Between Image and Text

Language: Python - Size: 1.83 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 108 - Forks: 6

iOPENCap/awesome-remote-image-captioning

A list of awesome remote sensing image captioning resources

Language: Python - Size: 438 KB - Last synced at: 4 days ago - Pushed at: 12 days ago - Stars: 105 - Forks: 1

yufengm/Adaptive

Pytorch Implementation of Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning

Language: Jupyter Notebook - Size: 230 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 104 - Forks: 45

zhangxuying1004/RSTNet

Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)

Language: Python - Size: 6.61 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 99 - Forks: 30

Bjarten/computer-vision-ND

Projects and exercises for the Udacity Computer Vision Nanodegree

Language: Jupyter Notebook - Size: 690 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 99 - Forks: 44

njchoma/transformer_image_caption

Image Captioning based on Bottom-Up and Top-Down Attention model

Language: Jupyter Notebook - Size: 108 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 98 - Forks: 17

TheoCoombes/ClipCap

Using pretrained encoder and language models to generate captions from multimedia inputs.

Language: Python - Size: 92.7 MB - Last synced at: 29 days ago - Pushed at: about 2 years ago - Stars: 96 - Forks: 13

chenxinpeng/ARNet

CVPR 2018 - Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present

Language: Python - Size: 190 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 95 - Forks: 22

alasdairtran/transform-and-tell

[CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning

Language: Python - Size: 14.2 MB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 90 - Forks: 15

X-PLUG/mPLUG

mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)

Language: Python - Size: 1.56 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 89 - Forks: 7

nikhilmaram/Show_and_Tell

Show and Tell : A Neural Image Caption Generator

Language: Python - Size: 8.9 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 88 - Forks: 43

MiteshPuthran/Image-Caption-Generator

The LSTM model generates captions for the input images after extracting features from pre-trained VGG-16 model. (Computer Vision, NLP, Deep Learning, Python)

Language: Jupyter Notebook - Size: 69.8 MB - Last synced at: 28 days ago - Pushed at: over 5 years ago - Stars: 86 - Forks: 32

jchenghu/ExpansionNet_v2

Implementation code of the work "Exploiting Multiple Sequence Lengths in Fast End to End Training for Image Captioning"

Language: Python - Size: 98.7 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 85 - Forks: 24

richardaecn/cvpr18-caption-eval

Learning to Evaluate Image Captioning. CVPR 2018

Language: Python - Size: 6.11 MB - Last synced at: about 1 month ago - Pushed at: almost 7 years ago - Stars: 84 - Forks: 11

google/localized-narratives

Localized Narratives

Language: HTML - Size: 9.4 MB - Last synced at: 4 days ago - Pushed at: over 3 years ago - Stars: 82 - Forks: 14

anubhavshrimal/Machine-Learning

The projects I do in Machine Learning with PyTorch, keras, Tensorflow, scikit learn and Python.

Language: Jupyter Notebook - Size: 31.6 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 80 - Forks: 27

zchoi/S2-Transformer

[IJCAI 2022] Official Pytorch code for paper “S2 Transformer for Image Captioning”

Language: Python - Size: 70.8 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 78 - Forks: 4

tangbinh/image-captioning

Language: Python - Size: 2.39 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 74 - Forks: 19

Markin-Wang/awesome_radiology_report_generation

Awesome radiology report generation and image captioning papers.

Size: 59.6 KB - Last synced at: 2 days ago - Pushed at: 7 months ago - Stars: 73 - Forks: 6

nocaps-org/updown-baseline

Baseline model for nocaps benchmark, ICCV 2019 paper "nocaps: novel object captioning at scale".

Language: Python - Size: 633 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 73 - Forks: 12

RoyalSkye/Image-Caption

Using LSTM or Transformer to solve Image Captioning in Pytorch

Language: Python - Size: 68.6 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 70 - Forks: 25

aehrc/cvt2distilgpt2

Improving Chest X-Ray Report Generation by Leveraging Warm-Starting

Language: Python - Size: 93.5 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 67 - Forks: 7

TRoboto/Udacity

This repo includes all the projects I have finished in the Udacity Nanodegree programs

Language: Jupyter Notebook - Size: 1.29 GB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 67 - Forks: 58

watsonyanghx/Image-Text-Papers

Image Caption and Text to Image papers.

Size: 13.7 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 67 - Forks: 8

ntrang086/image_captioning

generate captions for images using a CNN-RNN model that is trained on the Microsoft Common Objects in COntext (MS COCO) dataset

Language: Python - Size: 3.55 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 66 - Forks: 42

tanyuqian/redco

NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference

Language: Python - Size: 11.5 MB - Last synced at: 2 days ago - Pushed at: 5 months ago - Stars: 65 - Forks: 7

fregu856/CS224n_project

Neural Image Captioning in TensorFlow.

Language: Jupyter Notebook - Size: 290 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 65 - Forks: 30

fenglinliu98/MIA

Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)

Language: Python - Size: 971 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 63 - Forks: 14

Div99/Image-Captioning

Image Captioning with Keras

Language: Jupyter Notebook - Size: 1.18 MB - Last synced at: 28 days ago - Pushed at: almost 5 years ago - Stars: 63 - Forks: 45

coldmanck/show-attend-and-tell Fork of DeepRNN/image_captioning

[Python 3] Tensorflow implementation of "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

Language: Python - Size: 73.7 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 63 - Forks: 36

kacky24/stylenet

A pytorch implemention of "StyleNet: Generating Attractive Visual Captions with Styles"

Language: Python - Size: 13.2 MB - Last synced at: 29 days ago - Pushed at: over 4 years ago - Stars: 62 - Forks: 10

AaronCCWong/Show-Attend-and-Tell

A PyTorch implementation of the paper Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

Language: Python - Size: 7.88 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 62 - Forks: 23

Shobhit20/Image-Captioning

Image Captioning: Implementing the Neural Image Caption Generator with python

Language: Python - Size: 2.63 MB - Last synced at: 12 months ago - Pushed at: over 7 years ago - Stars: 62 - Forks: 36

GT-RIPL/Xmodal-Ctx

Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning

Language: Python - Size: 93.6 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 60 - Forks: 10

bearcatt/LaBERT

A length-controllable and non-autoregressive image captioning model.

Language: Python - Size: 34.2 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 56 - Forks: 10

neural-nuts/Cam2Caption 📦

[DEPRECATED] An Android application which converts camera feed to captions in real time

Language: Java - Size: 30.4 MB - Last synced at: 4 months ago - Pushed at: over 7 years ago - Stars: 53 - Forks: 18

milhidaka/chainer-image-caption Fork of dsanno/chainer-image-caption

Image caption generator using Chainer, Python 3 and ResNet feature version

Language: Python - Size: 49.2 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 50 - Forks: 11

peteanderson80/coco-caption

Adds SPICE metric to coco-caption evaluation server codes

Language: Jupyter Notebook - Size: 121 MB - Last synced at: 27 days ago - Pushed at: over 2 years ago - Stars: 49 - Forks: 42

israfelsr/CS231n

CS231n Assignments Solutions - Spring 2020

Language: Jupyter Notebook - Size: 173 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 47 - Forks: 20

daqingliu/CAVP

Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Network for Fine-Grained Image Captioning (TPAMI 2019)

Language: Python - Size: 25.4 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 47 - Forks: 4

Curt-Park/cs231n_assignments

[Assignments] CS231N: Convolutional Neural Networks for Visual Recognition (2016 & 2017)

Language: Jupyter Notebook - Size: 37.9 MB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 46 - Forks: 10

chenxinpeng/im2p Fork of jcjohnson/densecap

Tensorflow implement of paper: A Hierarchical Approach for Generating Descriptive Image Paragraphs

Language: Lua - Size: 14.7 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 45 - Forks: 16

thushv89/packt_nlp_tensorflow_2

This will contain the code for the 2nd edition of NLP with TensorFlow (Edition 2)

Language: Jupyter Notebook - Size: 8.67 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 40 - Forks: 17

srinadhu/CS231n

My solutions for Assignments of CS231n: Convolutional Neural Networks for Visual Recognition

Language: Jupyter Notebook - Size: 11.3 MB - Last synced at: 10 months ago - Pushed at: over 6 years ago - Stars: 40 - Forks: 23

Related Topics
deep-learning 237 pytorch 173 computer-vision 154 lstm 118 cnn 89 tensorflow 88 machine-learning 87 python 83 nlp 76 rnn 69 keras 61 natural-language-processing 49 transformer 44 convolutional-neural-networks 44 attention-mechanism 39 neural-networks 38 lstm-neural-networks 35 image-processing 35 recurrent-neural-networks 34 object-detection 34 encoder-decoder 30 image-classification 30 neural-network 30 transformers 28 image-caption-generator 28 deep-neural-networks 26 python3 26 captioning-images 24 artificial-intelligence 23 attention 22 flask 21 ai 21 image-to-text 20 caption-generation 19 flickr8k-dataset 18 vgg16 18 multimodal 18 clip 17 image-caption 17 visual-question-answering 17 resnet-50 16 huggingface 16 blip 16 generative-ai 16 inceptionv3 16 show-attend-and-tell 15 keras-tensorflow 15 image-recognition 15 llm 15 beam-search 14 bleu-score 14 transfer-learning 14 image-generation 13 image 13 mscoco-dataset 13 resnet 13 ocr 12 streamlit 12 multimodal-learning 12 vision-and-language 12 mscoco 12 vqa 12 vision-transformer 11 face-recognition 11 dataset 11 image-segmentation 11 coco-dataset 11 opencv 10 jupyter-notebook 10 attention-model 10 show-and-tell 10 tensorflow2 10 vision-language 10 reinforcement-learning 10 docker 9 encoder 9 huggingface-transformers 9 video-captioning 9 face-detection 9 nlp-machine-learning 9 gan 9 stable-diffusion 9 coco 8 cnn-keras 8 machine-translation 8 data-science 8 encoder-decoder-model 8 llava 8 inception-v3 8 text-to-image 8 torch 8 word-embeddings 8 captioning 8 gpt-2 8 decoder 8 text-generation 7 cs231n 7 deeplearning 7 gru 7 rnn-tensorflow 7