Topic: "mscoco"
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Language: Python - Size: 1.05 MB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 14,662 - Forks: 2,126

sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
Language: Python - Size: 12.6 MB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 2,830 - Forks: 722

SwinTransformer/Swin-Transformer-Object-Detection Fork of open-mmlab/mmdetection
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.
Language: Python - Size: 19.9 MB - Last synced at: 25 days ago - Pushed at: about 2 years ago - Stars: 1,859 - Forks: 381

apple/ml-cvnets
CVNets: A library for training computer vision networks
Language: Python - Size: 5.76 MB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 1,851 - Forks: 240

peteanderson80/bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Language: Jupyter Notebook - Size: 13.4 MB - Last synced at: 6 months ago - Pushed at: about 2 years ago - Stars: 1,429 - Forks: 379

HRNet/HRNet-Object-Detection Fork of open-mmlab/mmdetection
Object detection with multi-level representations generated from deep high-resolution representation learning (HRNetV2h). This is an official implementation for our TPAMI paper "Deep High-Resolution Representation Learning for Visual Recognition". https://arxiv.org/abs/1908.07919
Language: Python - Size: 1.03 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 625 - Forks: 98

JDAI-CV/CoTNet
This is an official implementation for "Contextual Transformer Networks for Visual Recognition".
Language: Python - Size: 451 KB - Last synced at: 16 days ago - Pushed at: over 3 years ago - Stars: 531 - Forks: 82

sacmehta/EdgeNets
This repository contains the source code of our work on designing efficient CNNs for computer vision
Language: Python - Size: 470 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 401 - Forks: 83

hyz-xmaster/VarifocalNet
VarifocalNet: An IoU-aware Dense Object Detector
Language: Python - Size: 20.5 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 332 - Forks: 52

ViTAE-Transformer/ViTAE-Transformer
The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond"
Language: Python - Size: 22.2 MB - Last synced at: 12 days ago - Pushed at: about 2 years ago - Stars: 269 - Forks: 29

hyz-xmaster/swa_object_detection
SWA Object Detection
Language: Python - Size: 18.6 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 244 - Forks: 25

MichiganCOG/ViP
Video Platform for Action Recognition and Object Detection in Pytorch
Language: Python - Size: 694 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 221 - Forks: 35

hustvl/BMaskR-CNN
[ECCV 2020] Boundary-preserving Mask R-CNN
Language: Python - Size: 16.5 MB - Last synced at: 14 days ago - Pushed at: over 4 years ago - Stars: 197 - Forks: 41

YehLi/ImageNetModel
Official ImageNet Model repository
Language: Jupyter Notebook - Size: 3.31 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 157 - Forks: 27

peteanderson80/SPICE
Semantic Propositional Image Caption Evaluation
Language: Java - Size: 27.4 MB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 140 - Forks: 31

HRNet/HRNet-FCOS
High-resolution Networks for the Fully Convolutional One-Stage Object Detection (FCOS) algorithm
Language: Python - Size: 4.15 MB - Last synced at: 9 days ago - Pushed at: over 5 years ago - Stars: 125 - Forks: 37

610265158/mobilenetv3_centernet
A tensorflow implement mobilenetv3 centernet, which can be easily deployeed on android(MNN) and ios(CoreML).
Language: Python - Size: 2.78 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 69 - Forks: 15

ntrang086/image_captioning
generate captions for images using a CNN-RNN model that is trained on the Microsoft Common Objects in COntext (MS COCO) dataset
Language: Python - Size: 3.55 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 66 - Forks: 42

lightly-ai/labelformat
A tool for converting computer vision label formats.
Language: Python - Size: 1.7 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 62 - Forks: 6

Weed-AI/Weed-AI
A repository to support the development of a repository and interchange format for weed identification annotation
Language: Python - Size: 105 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 49 - Forks: 6

peteanderson80/coco-caption
Adds SPICE metric to coco-caption evaluation server codes
Language: Jupyter Notebook - Size: 121 MB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 49 - Forks: 42

utahnlp/consistency
Implementation of models in our EMNLP 2019 paper: A Logic-Driven Framework for Consistency of Neural Models
Language: Python - Size: 12.9 MB - Last synced at: 5 months ago - Pushed at: almost 4 years ago - Stars: 30 - Forks: 3

oswaldoludwig/visually-informed-embedding-of-word-VIEW-
Visually informed embedding of word (VIEW) is a tool for transferring multimodal background knowledge to NLP algorithms.
Language: Python - Size: 12.8 MB - Last synced at: over 1 year ago - Pushed at: over 8 years ago - Stars: 30 - Forks: 11

ayansengupta17/GAN
We aim to generate realistic images from text descriptions using GAN architecture. The network that we have designed is used for image generation for two datasets: MSCOCO and CUBS.
Language: HTML - Size: 1.3 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 17 - Forks: 8

gautamchitnis/cocoapi Fork of philferriere/cocoapi
Clone of COCO API - Dataset @ http://cocodataset.org/ - with changes to support Windows build and python3
Language: Jupyter Notebook - Size: 12 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 7

leftthomas/DeepMask
A Keras implementation of DeepMask based on NIPS 2015 paper "Learning to Segment Object Candidates"
Language: Python - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 15 - Forks: 6

howardyclo/ImageNet2COCO
A demo for mapping class labels from ImageNet to COCO.
Language: Jupyter Notebook - Size: 33.2 KB - Last synced at: 3 days ago - Pushed at: almost 6 years ago - Stars: 11 - Forks: 2

deepplants/ViT-PCM
Official implementation of "Max Pooling with Vision Transformers reconciles class and shape in weakly supervised semantic segmentation"
Language: Python - Size: 13.4 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 0

CLT29/semantic_neighborhoods
Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval [ECCV 2020]
Language: Python - Size: 3.17 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 9 - Forks: 6

jakarto3d/jakarnotator
The Jakarnotator is an annotation tool to create your own database for instance segmentation problem.
Language: JavaScript - Size: 38.2 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 0

canesee-project/Arabic-COCO
MS COCO captions in Arabic
Size: 12.8 MB - Last synced at: 10 months ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 1

BUAADreamer/CCRK
[KDD 2024] Improving the Consistency in Cross-Lingual Cross-Modal Retrieval with 1-to-K Contrastive Learning
Language: Python - Size: 644 KB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 5 - Forks: 0

Lukeasargen/Show-Attend-and-Tell-Pytorch-Lightning
Encoder-Decoder CNN-LSTM Model with an attention mechanism for image captioning. Trained using the Microsoft COCO Dataset.
Language: Jupyter Notebook - Size: 114 MB - Last synced at: 11 months ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 0

computer-vision/kwcoco
The Kitware COCO Image Annotation Module
Last synced at: 9 months ago - Stars: 5 - Forks: 4

shunk031/huggingface-datasets_MSCOCO
Microsoft COCO: Common Objects in Context for huggingface datasets
Language: Python - Size: 176 KB - Last synced at: 16 days ago - Pushed at: 10 months ago - Stars: 4 - Forks: 0

shunk031/huggingface-datasets_COCOA
COCOA: Semantic Amodal Segmentation for huggingface datasets
Language: Python - Size: 75.2 KB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

waikato-ufdl/wai-annotations
Python library for converting annotated datasets into various formats (e.g., image classification, object detection and speech datasets).
Language: Dockerfile - Size: 643 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1

VladimirSinitsin/labelme_converter
LabelMe to MsCOCO, PascalVOC, Yolo
Language: Python - Size: 83 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

ma-xu/ParameterFree
Official code for “Cascaded Context Dependency: An Extremely Lightweight Module for Deep Convolutional Neural Networks”
Language: Python - Size: 1.05 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

CedricPicron/FQDet
FQDet: Fast-converging Query-based Detector
Language: Python - Size: 178 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

XPhyro/image-entropy 📦
An ongoing research project on image entropy assessment using machine learning.
Language: Python - Size: 5.64 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

vgupta123/consistency Fork of utahnlp/consistency
Implementation of models in our EMNLP 2019 paper: A Logic-Driven Framework for Consistency of Neural Models
Language: Python - Size: 12.9 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 0

altruistcoder/Digivision
A deep learning based application which is entitled to help the visually impaired people. The application automatically generates the textual description of what's happening in front of the camera and conveys it to person through audio. It is capable of recognising faces and tell user whether a known person is present in front of him or not.
Language: Jupyter Notebook - Size: 82.4 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

pnkvalavala/image-captioning
Image Caption Generator using a Pretrained ResNet-50 and an LSTM architecture. Trained on COCO 2017 dataset, it's accessible via a Streamlit app.
Language: Python - Size: 117 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

biyoml/PyTorch-SSD
PyTorch implementation of SSD: Single Shot MultiBox Detector.
Language: Python - Size: 15.6 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

yao-zhao/EDGAN
EDGAN: StackGAN with Embedding Distance Training
Language: Python - Size: 122 MB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 0

MaximumEntropy/IFT6266
Inpainting on MSCOCO
Language: Jupyter Notebook - Size: 2.25 MB - Last synced at: about 2 months ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 0

k2-gc/object-detection-format-converter
Object Detection Dataset Format Converter
Language: Python - Size: 39.1 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

shunk031/huggingface-datasets_cocoapi-tools
A helper library for easily converting MSCOCO format data using the loading script of huggingface datasets.
Language: Python - Size: 109 KB - Last synced at: 5 days ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

CedricPicron/TPN
Trident Pyramid Networks for Object Detection (BMVC 2022)
Size: 5.86 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

lumeilevel/NNDL_project3
The third project for Neural Network and Deep Learning: Image Captioning of Novel Objects
Language: Python - Size: 13.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

shunk031/huggingface-datasets_cocostuff
COCO-Stuff dataset for huggingface datasets
Language: Python - Size: 47.9 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

abideenml/Image-Captioning-with-MobileNet-and-LSTM
Image Captioning with Visual Attention
Language: Jupyter Notebook - Size: 682 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

trannam710/NIC-with-Soft-Attention
Show, Attend, and Tell. Modified to use on UIT-ViIC dataset.
Language: Jupyter Notebook - Size: 1.16 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

rishabh1323/Object-Detection-YOLOv3-OpenCV
A deep-learning object detection project pre-trained on COCO dataset
Language: Python - Size: 869 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

StiliyanDr/neural-image-caption
A simple Python API (built on top of TensorFlow) for neural image captioning with MSCOCO data.
Language: Python - Size: 299 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

khlee369/MSCOCO_ObjectDetection
MSCOCO data format details and how to evaluate mAP with pycocotools
Language: Jupyter Notebook - Size: 17.6 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

waikato-datamining/mscocodata 📦
Scripts for converting annotations into MS COCO JSON format.
Language: Python - Size: 37.1 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 1

junjiedong/Image_Captioning_MSCOCO
Image Captioning on Microsoft Coco Dataset
Language: Jupyter Notebook - Size: 104 MB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 2

ogrenenmakine/Phototouch
Phototouch is multimodal photo editing tool
Language: Python - Size: 19.5 KB - Last synced at: 12 months ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 1
