An open API service providing repository metadata for many open source software ecosystems.

Topic: "mscoco"

microsoft/Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language: Python - Size: 1.05 MB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 14,662 - Forks: 2,126

sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

Language: Python - Size: 12.6 MB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 2,830 - Forks: 722

SwinTransformer/Swin-Transformer-Object-Detection Fork of open-mmlab/mmdetection

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

Language: Python - Size: 19.9 MB - Last synced at: 25 days ago - Pushed at: about 2 years ago - Stars: 1,859 - Forks: 381

apple/ml-cvnets

CVNets: A library for training computer vision networks

Language: Python - Size: 5.76 MB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 1,851 - Forks: 240

peteanderson80/bottom-up-attention

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

Language: Jupyter Notebook - Size: 13.4 MB - Last synced at: 6 months ago - Pushed at: about 2 years ago - Stars: 1,429 - Forks: 379

HRNet/HRNet-Object-Detection Fork of open-mmlab/mmdetection

Object detection with multi-level representations generated from deep high-resolution representation learning (HRNetV2h). This is an official implementation for our TPAMI paper "Deep High-Resolution Representation Learning for Visual Recognition". https://arxiv.org/abs/1908.07919

Language: Python - Size: 1.03 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 625 - Forks: 98

JDAI-CV/CoTNet

This is an official implementation for "Contextual Transformer Networks for Visual Recognition".

Language: Python - Size: 451 KB - Last synced at: 16 days ago - Pushed at: over 3 years ago - Stars: 531 - Forks: 82

sacmehta/EdgeNets

This repository contains the source code of our work on designing efficient CNNs for computer vision

Language: Python - Size: 470 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 401 - Forks: 83

hyz-xmaster/VarifocalNet

VarifocalNet: An IoU-aware Dense Object Detector

Language: Python - Size: 20.5 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 332 - Forks: 52

ViTAE-Transformer/ViTAE-Transformer

The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond"

Language: Python - Size: 22.2 MB - Last synced at: 12 days ago - Pushed at: about 2 years ago - Stars: 269 - Forks: 29

hyz-xmaster/swa_object_detection

SWA Object Detection

Language: Python - Size: 18.6 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 244 - Forks: 25

MichiganCOG/ViP

Video Platform for Action Recognition and Object Detection in Pytorch

Language: Python - Size: 694 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 221 - Forks: 35

hustvl/BMaskR-CNN

[ECCV 2020] Boundary-preserving Mask R-CNN

Language: Python - Size: 16.5 MB - Last synced at: 14 days ago - Pushed at: over 4 years ago - Stars: 197 - Forks: 41

YehLi/ImageNetModel

Official ImageNet Model repository

Language: Jupyter Notebook - Size: 3.31 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 157 - Forks: 27

peteanderson80/SPICE

Semantic Propositional Image Caption Evaluation

Language: Java - Size: 27.4 MB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 140 - Forks: 31

HRNet/HRNet-FCOS

High-resolution Networks for the Fully Convolutional One-Stage Object Detection (FCOS) algorithm

Language: Python - Size: 4.15 MB - Last synced at: 9 days ago - Pushed at: over 5 years ago - Stars: 125 - Forks: 37

610265158/mobilenetv3_centernet

A tensorflow implement mobilenetv3 centernet, which can be easily deployeed on android(MNN) and ios(CoreML).

Language: Python - Size: 2.78 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 69 - Forks: 15

ntrang086/image_captioning

generate captions for images using a CNN-RNN model that is trained on the Microsoft Common Objects in COntext (MS COCO) dataset

Language: Python - Size: 3.55 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 66 - Forks: 42

lightly-ai/labelformat

A tool for converting computer vision label formats.

Language: Python - Size: 1.7 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 62 - Forks: 6

Weed-AI/Weed-AI

A repository to support the development of a repository and interchange format for weed identification annotation

Language: Python - Size: 105 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 49 - Forks: 6

peteanderson80/coco-caption

Adds SPICE metric to coco-caption evaluation server codes

Language: Jupyter Notebook - Size: 121 MB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 49 - Forks: 42

utahnlp/consistency

Implementation of models in our EMNLP 2019 paper: A Logic-Driven Framework for Consistency of Neural Models

Language: Python - Size: 12.9 MB - Last synced at: 5 months ago - Pushed at: almost 4 years ago - Stars: 30 - Forks: 3

oswaldoludwig/visually-informed-embedding-of-word-VIEW-

Visually informed embedding of word (VIEW) is a tool for transferring multimodal background knowledge to NLP algorithms.

Language: Python - Size: 12.8 MB - Last synced at: over 1 year ago - Pushed at: over 8 years ago - Stars: 30 - Forks: 11

ayansengupta17/GAN

We aim to generate realistic images from text descriptions using GAN architecture. The network that we have designed is used for image generation for two datasets: MSCOCO and CUBS.

Language: HTML - Size: 1.3 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 17 - Forks: 8

gautamchitnis/cocoapi Fork of philferriere/cocoapi

Clone of COCO API - Dataset @ http://cocodataset.org/ - with changes to support Windows build and python3

Language: Jupyter Notebook - Size: 12 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 7

leftthomas/DeepMask

A Keras implementation of DeepMask based on NIPS 2015 paper "Learning to Segment Object Candidates"

Language: Python - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 15 - Forks: 6

howardyclo/ImageNet2COCO

A demo for mapping class labels from ImageNet to COCO.

Language: Jupyter Notebook - Size: 33.2 KB - Last synced at: 3 days ago - Pushed at: almost 6 years ago - Stars: 11 - Forks: 2

deepplants/ViT-PCM

Official implementation of "Max Pooling with Vision Transformers reconciles class and shape in weakly supervised semantic segmentation"

Language: Python - Size: 13.4 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 0

CLT29/semantic_neighborhoods

Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval [ECCV 2020]

Language: Python - Size: 3.17 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 9 - Forks: 6

jakarto3d/jakarnotator

The Jakarnotator is an annotation tool to create your own database for instance segmentation problem.

Language: JavaScript - Size: 38.2 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 0

canesee-project/Arabic-COCO

MS COCO captions in Arabic

Size: 12.8 MB - Last synced at: 10 months ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 1

BUAADreamer/CCRK

[KDD 2024] Improving the Consistency in Cross-Lingual Cross-Modal Retrieval with 1-to-K Contrastive Learning

Language: Python - Size: 644 KB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 5 - Forks: 0

Lukeasargen/Show-Attend-and-Tell-Pytorch-Lightning

Encoder-Decoder CNN-LSTM Model with an attention mechanism for image captioning. Trained using the Microsoft COCO Dataset.

Language: Jupyter Notebook - Size: 114 MB - Last synced at: 11 months ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 0

computer-vision/kwcoco

The Kitware COCO Image Annotation Module

Last synced at: 9 months ago - Stars: 5 - Forks: 4

shunk031/huggingface-datasets_MSCOCO

Microsoft COCO: Common Objects in Context for huggingface datasets

Language: Python - Size: 176 KB - Last synced at: 16 days ago - Pushed at: 10 months ago - Stars: 4 - Forks: 0

shunk031/huggingface-datasets_COCOA

COCOA: Semantic Amodal Segmentation for huggingface datasets

Language: Python - Size: 75.2 KB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

waikato-ufdl/wai-annotations

Python library for converting annotated datasets into various formats (e.g., image classification, object detection and speech datasets).

Language: Dockerfile - Size: 643 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1

VladimirSinitsin/labelme_converter

LabelMe to MsCOCO, PascalVOC, Yolo

Language: Python - Size: 83 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

ma-xu/ParameterFree

Official code for “Cascaded Context Dependency: An Extremely Lightweight Module for Deep Convolutional Neural Networks”

Language: Python - Size: 1.05 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

CedricPicron/FQDet

FQDet: Fast-converging Query-based Detector

Language: Python - Size: 178 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

XPhyro/image-entropy 📦

An ongoing research project on image entropy assessment using machine learning.

Language: Python - Size: 5.64 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

vgupta123/consistency Fork of utahnlp/consistency

Implementation of models in our EMNLP 2019 paper: A Logic-Driven Framework for Consistency of Neural Models

Language: Python - Size: 12.9 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 0

altruistcoder/Digivision

A deep learning based application which is entitled to help the visually impaired people. The application automatically generates the textual description of what's happening in front of the camera and conveys it to person through audio. It is capable of recognising faces and tell user whether a known person is present in front of him or not.

Language: Jupyter Notebook - Size: 82.4 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

pnkvalavala/image-captioning

Image Caption Generator using a Pretrained ResNet-50 and an LSTM architecture. Trained on COCO 2017 dataset, it's accessible via a Streamlit app.

Language: Python - Size: 117 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

biyoml/PyTorch-SSD

PyTorch implementation of SSD: Single Shot MultiBox Detector.

Language: Python - Size: 15.6 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

yao-zhao/EDGAN

EDGAN: StackGAN with Embedding Distance Training

Language: Python - Size: 122 MB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 0

MaximumEntropy/IFT6266

Inpainting on MSCOCO

Language: Jupyter Notebook - Size: 2.25 MB - Last synced at: about 2 months ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 0

k2-gc/object-detection-format-converter

Object Detection Dataset Format Converter

Language: Python - Size: 39.1 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

shunk031/huggingface-datasets_cocoapi-tools

A helper library for easily converting MSCOCO format data using the loading script of huggingface datasets.

Language: Python - Size: 109 KB - Last synced at: 5 days ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

CedricPicron/TPN

Trident Pyramid Networks for Object Detection (BMVC 2022)

Size: 5.86 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

lumeilevel/NNDL_project3

The third project for Neural Network and Deep Learning: Image Captioning of Novel Objects

Language: Python - Size: 13.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

shunk031/huggingface-datasets_cocostuff

COCO-Stuff dataset for huggingface datasets

Language: Python - Size: 47.9 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

abideenml/Image-Captioning-with-MobileNet-and-LSTM

Image Captioning with Visual Attention

Language: Jupyter Notebook - Size: 682 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

trannam710/NIC-with-Soft-Attention

Show, Attend, and Tell. Modified to use on UIT-ViIC dataset.

Language: Jupyter Notebook - Size: 1.16 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

rishabh1323/Object-Detection-YOLOv3-OpenCV

A deep-learning object detection project pre-trained on COCO dataset

Language: Python - Size: 869 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

StiliyanDr/neural-image-caption

A simple Python API (built on top of TensorFlow) for neural image captioning with MSCOCO data.

Language: Python - Size: 299 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

khlee369/MSCOCO_ObjectDetection

MSCOCO data format details and how to evaluate mAP with pycocotools

Language: Jupyter Notebook - Size: 17.6 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

waikato-datamining/mscocodata 📦

Scripts for converting annotations into MS COCO JSON format.

Language: Python - Size: 37.1 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 1

junjiedong/Image_Captioning_MSCOCO

Image Captioning on Microsoft Coco Dataset

Language: Jupyter Notebook - Size: 104 MB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 2

ogrenenmakine/Phototouch

Phototouch is multimodal photo editing tool

Language: Python - Size: 19.5 KB - Last synced at: 12 months ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 1