Topic: "image-captioning"
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language: Jupyter Notebook - Size: 79.3 MB - Last synced at: about 11 hours ago - Pushed at: 6 months ago - Stars: 10,534 - Forks: 1,026

salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Language: Jupyter Notebook - Size: 6.34 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 5,165 - Forks: 681

OpenGVLab/InternGPT
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)
Language: Python - Size: 41.9 MB - Last synced at: 2 days ago - Pushed at: 9 months ago - Stars: 3,214 - Forks: 230

sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
Language: Python - Size: 12.6 MB - Last synced at: 29 days ago - Pushed at: almost 3 years ago - Stars: 2,830 - Forks: 722

OFA-Sys/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Language: Python - Size: 120 MB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 2,491 - Forks: 249

ttengwang/Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
Language: Python - Size: 51.9 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 1,732 - Forks: 104

peteanderson80/bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Language: Jupyter Notebook - Size: 13.4 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 1,429 - Forks: 379

imaginary-cloud/CameraManager
Simple Swift class to provide all the configurations you need to create custom camera view in your app
Language: Swift - Size: 4.7 MB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 1,383 - Forks: 329

NVlabs/prismer
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
Language: Python - Size: 4.25 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 1,310 - Forks: 73

microsoft/Oscar 📦
Oscar and VinVL
Language: Python - Size: 715 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 1,049 - Forks: 252

ruotianluo/self-critical.pytorch
Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.
Language: Python - Size: 600 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 977 - Forks: 286

YehLi/xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
Language: Python - Size: 12.2 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 970 - Forks: 105

yunjey/show-attend-and-tell
TensorFlow Implementation of "Show, Attend and Tell"
Language: Jupyter Notebook - Size: 49.1 MB - Last synced at: 9 months ago - Pushed at: almost 7 years ago - Stars: 908 - Forks: 324

jhc13/taggui
Tag manager and captioner for image datasets
Language: Python - Size: 22.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 891 - Forks: 41

SkalskiP/awesome-foundation-and-multimodal-models
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
Language: Python - Size: 58.6 KB - Last synced at: about 17 hours ago - Pushed at: about 1 year ago - Stars: 615 - Forks: 46

kdexd/virtex
[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations
Language: Python - Size: 3.65 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 561 - Forks: 61

aimagelab/meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
Language: Python - Size: 7.07 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 531 - Forks: 135

subho406/OmniNet
Official Pytorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain
Language: Python - Size: 17.6 MB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 513 - Forks: 58

kuanghuei/SCAN
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
Language: Python - Size: 34.2 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 490 - Forks: 106

ufal/neuralmonkey
An open-source tool for sequence learning in NLP built on TensorFlow.
Language: Python - Size: 13.5 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 410 - Forks: 106

MahanFathi/CS231
Complete Assignments for CS231n: Convolutional Neural Networks for Visual Recognition
Language: Jupyter Notebook - Size: 12.1 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 364 - Forks: 154

scopeInfinity/Video2Description
Video to Text: Natural language description generator for some given video. [Video Captioning]
Language: Python - Size: 33 MB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 343 - Forks: 70

jiasenlu/AdaptiveAttention
Implementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning"
Language: Jupyter Notebook - Size: 3.75 MB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 335 - Forks: 74

yashk2810/Image-Captioning
Image Captioning using InceptionV3 and beam search
Language: Jupyter Notebook - Size: 74.6 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 327 - Forks: 122

husthuaan/AoANet
Code for paper "Attention on Attention for Image Captioning". ICCV 2019
Language: Python - Size: 104 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 315 - Forks: 66

krasserm/fairseq-image-captioning
Transformer-based image captioning extension for pytorch/fairseq
Language: Python - Size: 3.09 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 315 - Forks: 57

sethuiyer/Image-to-Image-Search 📦
A reverse image search engine powered by elastic search and tensorflow
Language: Python - Size: 1.98 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 310 - Forks: 51

cuixing158/Awesome-CV-MasterHub
:fire: :fire: :fire: A paper list of some recent Computer Vision(CV) works
Size: 22.4 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 289 - Forks: 23

aimagelab/show-control-and-tell
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
Language: Python - Size: 1.71 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 281 - Forks: 61

JDAI-CV/image-captioning
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
Language: Python - Size: 733 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 273 - Forks: 54

DataTurks/DataTurks
ML data annotations made super easy for teams. Just upload data, add your team and build training/evaluation dataset in hours.
Language: JavaScript - Size: 3.95 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 266 - Forks: 125

anuragmishracse/caption_generator
A modular library built on top of Keras and TensorFlow to generate a caption in natural language for any input image.
Language: Python - Size: 902 KB - Last synced at: about 1 month ago - Pushed at: almost 7 years ago - Stars: 265 - Forks: 119

yxuansu/MAGIC
Language Models Can See: Plugging Visual Controls in Text Generation
Language: Python - Size: 132 MB - Last synced at: 15 days ago - Pushed at: almost 3 years ago - Stars: 256 - Forks: 27

dabasajay/Image-Caption-Generator
A neural network to generate captions for an image using CNN and RNN with BEAM Search.
Language: Python - Size: 2.4 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 247 - Forks: 76

HanXinzi-AI/awesome-computer-vision-resources
a collection of computer vision projects&tools. 计算机视觉方向项目和工具集合。
Size: 49.8 MB - Last synced at: 6 days ago - Pushed at: 11 months ago - Stars: 246 - Forks: 33

peteanderson80/Up-Down-Captioner
Automatic image captioning model based on Caffe, using features from bottom-up attention.
Language: Jupyter Notebook - Size: 2.6 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 245 - Forks: 69

j-min/CLIP-Caption-Reward
PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)
Language: Python - Size: 2.64 MB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 241 - Forks: 26

gokayfem/ComfyUI_VLM_nodes
Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
Language: Python - Size: 285 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 235 - Forks: 14

saahiluppal/catr
Image Captioning Using Transformer
Language: Python - Size: 2.99 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 231 - Forks: 57

google/imageinwords
Data release for the ImageInWords (IIW) paper.
Language: JavaScript - Size: 21.4 MB - Last synced at: 23 days ago - Pushed at: 6 months ago - Stars: 209 - Forks: 9

zjuchenlong/sca-cnn.cvpr17
Image Captions Generation with Spatial and Channel-wise Attention
Language: Python - Size: 27.3 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 206 - Forks: 73

li-xirong/coco-cn
Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks
Language: OpenEdge ABL - Size: 195 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 187 - Forks: 21

luo3300612/image-captioning-DLCT
Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).
Language: Jupyter Notebook - Size: 1.17 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 183 - Forks: 30

ZexinYan/Medical-Report-Generation
A pytorch implementation of On the Automatic Generation of Medical Imaging Reports.
Language: Python - Size: 70.1 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 179 - Forks: 64

milaan9/Deep_Learning_Algorithms_from_Scratch
This repository explores the variety of techniques and algorithms commonly used in deep learning and the implementation in MATLAB and PYTHON
Language: Jupyter Notebook - Size: 9.85 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 173 - Forks: 171

davidnvq/grit
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
Language: Python - Size: 84.2 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 151 - Forks: 20

neural-nuts/image-caption-generator 📦
[DEPRECATED] A Neural Network based generative model for captioning images using Tensorflow
Language: Jupyter Notebook - Size: 9.64 MB - Last synced at: 6 months ago - Pushed at: over 5 years ago - Stars: 148 - Forks: 57

tsenghungchen/show-adapt-and-tell
Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
Language: Python - Size: 2.51 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 148 - Forks: 40

peteanderson80/SPICE
Semantic Propositional Image Caption Evaluation
Language: Java - Size: 27.4 MB - Last synced at: 27 days ago - Pushed at: over 2 years ago - Stars: 140 - Forks: 31

snrazavi/Deep_Learning_in_Python_2018
Deep Learning workshop including image classification, face recognition, Object detection, language modelling, image captioning and neural machine translation.
Language: Jupyter Notebook - Size: 449 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 125 - Forks: 61

zhiqwang/sightseq
Computer vision tools for fairseq, containing PyTorch implementation of text recognition and object detection
Language: Python - Size: 203 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 125 - Forks: 34

phachon/gis
gis (go image server) go 实现的图片服务,实现基本的上传,下载,存储,按比例裁剪等功能
Language: Go - Size: 1.84 MB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 123 - Forks: 36

jmisilo/clip-gpt-captioning
CLIPxGPT Captioner is Image Captioning Model based on OpenAI's CLIP and GPT-2.
Language: Python - Size: 873 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 117 - Forks: 32

zsdonghao/Image-Captioning
TensorFlow (TensorLayer) Implementation of Image Captioning
Language: Python - Size: 199 KB - Last synced at: 28 days ago - Pushed at: over 2 years ago - Stars: 115 - Forks: 55

hlamba28/Automatic-Image-Captioning
Generating Captions for images using Deep Learning
Language: Jupyter Notebook - Size: 252 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 112 - Forks: 80

terry-r123/Awesome-Captioning
A curated list of Multimodal Captioning related research(including image captioning, video captioning, and text captioning)
Size: 56.6 KB - Last synced at: 7 days ago - Pushed at: almost 3 years ago - Stars: 110 - Forks: 10

MIMICLab/L-Verse
L-Verse: Bidirectional Generation Between Image and Text
Language: Python - Size: 1.83 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 108 - Forks: 6

iOPENCap/awesome-remote-image-captioning
A list of awesome remote sensing image captioning resources
Language: Python - Size: 438 KB - Last synced at: 4 days ago - Pushed at: 12 days ago - Stars: 105 - Forks: 1

yufengm/Adaptive
Pytorch Implementation of Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning
Language: Jupyter Notebook - Size: 230 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 104 - Forks: 45

zhangxuying1004/RSTNet
Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)
Language: Python - Size: 6.61 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 99 - Forks: 30

Bjarten/computer-vision-ND
Projects and exercises for the Udacity Computer Vision Nanodegree
Language: Jupyter Notebook - Size: 690 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 99 - Forks: 44

njchoma/transformer_image_caption
Image Captioning based on Bottom-Up and Top-Down Attention model
Language: Jupyter Notebook - Size: 108 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 98 - Forks: 17

TheoCoombes/ClipCap
Using pretrained encoder and language models to generate captions from multimedia inputs.
Language: Python - Size: 92.7 MB - Last synced at: 29 days ago - Pushed at: about 2 years ago - Stars: 96 - Forks: 13

chenxinpeng/ARNet
CVPR 2018 - Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present
Language: Python - Size: 190 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 95 - Forks: 22

alasdairtran/transform-and-tell
[CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning
Language: Python - Size: 14.2 MB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 90 - Forks: 15

X-PLUG/mPLUG
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)
Language: Python - Size: 1.56 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 89 - Forks: 7

nikhilmaram/Show_and_Tell
Show and Tell : A Neural Image Caption Generator
Language: Python - Size: 8.9 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 88 - Forks: 43

MiteshPuthran/Image-Caption-Generator
The LSTM model generates captions for the input images after extracting features from pre-trained VGG-16 model. (Computer Vision, NLP, Deep Learning, Python)
Language: Jupyter Notebook - Size: 69.8 MB - Last synced at: 28 days ago - Pushed at: over 5 years ago - Stars: 86 - Forks: 32

jchenghu/ExpansionNet_v2
Implementation code of the work "Exploiting Multiple Sequence Lengths in Fast End to End Training for Image Captioning"
Language: Python - Size: 98.7 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 85 - Forks: 24

richardaecn/cvpr18-caption-eval
Learning to Evaluate Image Captioning. CVPR 2018
Language: Python - Size: 6.11 MB - Last synced at: about 1 month ago - Pushed at: almost 7 years ago - Stars: 84 - Forks: 11

google/localized-narratives
Localized Narratives
Language: HTML - Size: 9.4 MB - Last synced at: 4 days ago - Pushed at: over 3 years ago - Stars: 82 - Forks: 14

anubhavshrimal/Machine-Learning
The projects I do in Machine Learning with PyTorch, keras, Tensorflow, scikit learn and Python.
Language: Jupyter Notebook - Size: 31.6 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 80 - Forks: 27

zchoi/S2-Transformer
[IJCAI 2022] Official Pytorch code for paper “S2 Transformer for Image Captioning”
Language: Python - Size: 70.8 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 78 - Forks: 4

tangbinh/image-captioning
Language: Python - Size: 2.39 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 74 - Forks: 19

Markin-Wang/awesome_radiology_report_generation
Awesome radiology report generation and image captioning papers.
Size: 59.6 KB - Last synced at: 2 days ago - Pushed at: 7 months ago - Stars: 73 - Forks: 6

nocaps-org/updown-baseline
Baseline model for nocaps benchmark, ICCV 2019 paper "nocaps: novel object captioning at scale".
Language: Python - Size: 633 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 73 - Forks: 12

RoyalSkye/Image-Caption
Using LSTM or Transformer to solve Image Captioning in Pytorch
Language: Python - Size: 68.6 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 70 - Forks: 25

aehrc/cvt2distilgpt2
Improving Chest X-Ray Report Generation by Leveraging Warm-Starting
Language: Python - Size: 93.5 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 67 - Forks: 7

TRoboto/Udacity
This repo includes all the projects I have finished in the Udacity Nanodegree programs
Language: Jupyter Notebook - Size: 1.29 GB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 67 - Forks: 58

watsonyanghx/Image-Text-Papers
Image Caption and Text to Image papers.
Size: 13.7 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 67 - Forks: 8

ntrang086/image_captioning
generate captions for images using a CNN-RNN model that is trained on the Microsoft Common Objects in COntext (MS COCO) dataset
Language: Python - Size: 3.55 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 66 - Forks: 42

tanyuqian/redco
NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference
Language: Python - Size: 11.5 MB - Last synced at: 2 days ago - Pushed at: 5 months ago - Stars: 65 - Forks: 7

fregu856/CS224n_project
Neural Image Captioning in TensorFlow.
Language: Jupyter Notebook - Size: 290 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 65 - Forks: 30

fenglinliu98/MIA
Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)
Language: Python - Size: 971 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 63 - Forks: 14

Div99/Image-Captioning
Image Captioning with Keras
Language: Jupyter Notebook - Size: 1.18 MB - Last synced at: 28 days ago - Pushed at: almost 5 years ago - Stars: 63 - Forks: 45

coldmanck/show-attend-and-tell Fork of DeepRNN/image_captioning
[Python 3] Tensorflow implementation of "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
Language: Python - Size: 73.7 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 63 - Forks: 36

kacky24/stylenet
A pytorch implemention of "StyleNet: Generating Attractive Visual Captions with Styles"
Language: Python - Size: 13.2 MB - Last synced at: 29 days ago - Pushed at: over 4 years ago - Stars: 62 - Forks: 10

AaronCCWong/Show-Attend-and-Tell
A PyTorch implementation of the paper Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Language: Python - Size: 7.88 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 62 - Forks: 23

Shobhit20/Image-Captioning
Image Captioning: Implementing the Neural Image Caption Generator with python
Language: Python - Size: 2.63 MB - Last synced at: 12 months ago - Pushed at: over 7 years ago - Stars: 62 - Forks: 36

GT-RIPL/Xmodal-Ctx
Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
Language: Python - Size: 93.6 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 60 - Forks: 10

bearcatt/LaBERT
A length-controllable and non-autoregressive image captioning model.
Language: Python - Size: 34.2 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 56 - Forks: 10

neural-nuts/Cam2Caption 📦
[DEPRECATED] An Android application which converts camera feed to captions in real time
Language: Java - Size: 30.4 MB - Last synced at: 4 months ago - Pushed at: over 7 years ago - Stars: 53 - Forks: 18

milhidaka/chainer-image-caption Fork of dsanno/chainer-image-caption
Image caption generator using Chainer, Python 3 and ResNet feature version
Language: Python - Size: 49.2 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 50 - Forks: 11

peteanderson80/coco-caption
Adds SPICE metric to coco-caption evaluation server codes
Language: Jupyter Notebook - Size: 121 MB - Last synced at: 27 days ago - Pushed at: over 2 years ago - Stars: 49 - Forks: 42

israfelsr/CS231n
CS231n Assignments Solutions - Spring 2020
Language: Jupyter Notebook - Size: 173 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 47 - Forks: 20

daqingliu/CAVP
Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Network for Fine-Grained Image Captioning (TPAMI 2019)
Language: Python - Size: 25.4 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 47 - Forks: 4

Curt-Park/cs231n_assignments
[Assignments] CS231N: Convolutional Neural Networks for Visual Recognition (2016 & 2017)
Language: Jupyter Notebook - Size: 37.9 MB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 46 - Forks: 10

chenxinpeng/im2p Fork of jcjohnson/densecap
Tensorflow implement of paper: A Hierarchical Approach for Generating Descriptive Image Paragraphs
Language: Lua - Size: 14.7 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 45 - Forks: 16

thushv89/packt_nlp_tensorflow_2
This will contain the code for the 2nd edition of NLP with TensorFlow (Edition 2)
Language: Jupyter Notebook - Size: 8.67 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 40 - Forks: 17

srinadhu/CS231n
My solutions for Assignments of CS231n: Convolutional Neural Networks for Visual Recognition
Language: Jupyter Notebook - Size: 11.3 MB - Last synced at: 10 months ago - Pushed at: over 6 years ago - Stars: 40 - Forks: 23
