GitHub topics: ms-coco
ashnair1/COCO-Assistant
Helper for dealing with MS-COCO annotations
Language: Python - Size: 12.4 MB - Last synced at: 18 days ago - Pushed at: 8 months ago - Stars: 91 - Forks: 33

SamsungLabs/adaptis
[ICCV19] AdaptIS: Adaptive Instance Selection Network, https://arxiv.org/abs/1909.07829
Language: Jupyter Notebook - Size: 5.41 MB - Last synced at: 6 days ago - Pushed at: about 4 years ago - Stars: 336 - Forks: 32

PINTO0309/coco-viewer
Drawing and visualizing bounding boxes and key points.
Language: Python - Size: 356 KB - Last synced at: 4 days ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

bethgelab/robust-detection-benchmark
Code, data and benchmark from the paper "Benchmarking Robustness in Object Detection: Autonomous Driving when Winter is Coming" (NeurIPS 2019 ML4AD)
Language: Jupyter Notebook - Size: 29.3 MB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 181 - Forks: 24

zchoi/S2-Transformer
[IJCAI 2022] Official Pytorch code for paper “S2 Transformer for Image Captioning”
Language: Python - Size: 70.8 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 78 - Forks: 4

mohtasimhadi/resnet_exploration
Course project for COMP 6130 Data Mining, Summer'24, Auburn University
Language: Python - Size: 207 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

bbenligiray/ms_coco_formatter
A tool to download and format MS COCO dataset for multilabel classification
Language: Python - Size: 358 KB - Last synced at: 14 days ago - Pushed at: almost 7 years ago - Stars: 5 - Forks: 1

hemanthh17/Image_Segmentation_Parking
Using Image Segmentation for identifying free car parking slots
Language: Jupyter Notebook - Size: 554 KB - Last synced at: 10 days ago - Pushed at: about 4 years ago - Stars: 5 - Forks: 0

arghyawning/image-captioning-using-knn
Using Fast KNN for an image captioning task
Language: Jupyter Notebook - Size: 795 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

roshanr11/image-captioning
Used deep learning to train a CNN + RNN/LSTM on the MS-COCO dataset to automatically generate captions.
Language: HTML - Size: 187 MB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 1

ChryssaNab/Image-Colorization
PyTorch implementation of Conditional Generative Adversarial Networks (cGAN) for image colorization of the MS COCO dataset
Language: Python - Size: 3.3 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

jaleedkhan/neusire
NeuSyRE: A Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph Enrichment
Language: Jupyter Notebook - Size: 46.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 3

tldoan/-HYP-OW-AAAI-2024-
[AAAI 2024] Official code for "Hyp-OW: Exploiting Hierarchical Structure Learning with Hyperbolic Distance Enhances Open World Object Detection"
Size: 1.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

boschresearch/Hyp-OW
[AAAI 2024] Official code for "Hyp-OW: Exploiting Hierarchical Structure Learning with Hyperbolic Distance Enhances Open World Object Detection"
Language: Python - Size: 13.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

DuncanZauss/Keypoint_Communities
[ICCV '21] This repository contains the code for our paper "Keypoint Communities".
Language: Python - Size: 28.6 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 268 - Forks: 16

ozayr/detection-assisted-annotation-tool
Labeling tool that allows easy plug-in of detection networks to assist in the labeling process
Language: Python - Size: 133 KB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

ashual/scene_generation
A PyTorch implementation of the paper: Specifying Object Attributes and Relations in Interactive Scene Generation
Language: Python - Size: 2.08 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 181 - Forks: 32

tooth2/Automatic-Image-Captioning
A Pytorch implementation of the CNN+RNN architecture on the MS-COCO dataset
Language: Jupyter Notebook - Size: 1.68 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

abhinav-neil/lavise
Reproduction of LaVisE: Explaining Deep Convolutional Neural Networks via Latent Visual-Semantic Filter Attention
Language: Python - Size: 150 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

eric-yyjau/pytorch-superpoint
Superpoint Implemented in PyTorch: https://arxiv.org/abs/1712.07629
Language: Jupyter Notebook - Size: 193 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 658 - Forks: 151

chrise96/image-to-coco-json-converter
Convert segmentation RGB mask images to COCO JSON format
Language: Jupyter Notebook - Size: 878 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 173 - Forks: 57

akshitac8/OW-DETR
[CVPR 2022] Official Pytorch code for OW-DETR: Open-world Detection Transformer
Language: Python - Size: 1.08 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 192 - Forks: 32

tohinz/multiple-objects-gan
Implementation for "Generating Multiple Objects at Spatially Distinct Locations" (ICLR 2019)
Language: Python - Size: 22.5 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 112 - Forks: 14

fazeVaib/DigiVision
A deep learning-based application intended to help visually impaired people. It automatically generates a textual description of what is happening in front of the camera and conveys it to the user through audio. It can also recognise faces and tell the user whether a known person is standing in front of them.
Language: Python - Size: 148 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 11 - Forks: 5

tohinz/semantic-object-accuracy-for-generative-text-to-image-synthesis
Code for "Semantic Object Accuracy for Generative Text-to-Image Synthesis" (TPAMI 2020)
Language: Python - Size: 7.06 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 105 - Forks: 23

gaobb/MCAR
Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition
Language: Python - Size: 12.7 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 39 - Forks: 6

cdluminate/ladderloss
Ladder Loss for Coherent Visual-Semantic Embedding, AAAI, 2020
Language: Python - Size: 18.9 MB - Last synced at: 19 days ago - Pushed at: over 3 years ago - Stars: 13 - Forks: 1

fmahoudeau/ShelfNet-Human-Pose-Estimation
Fast and accurate Human Pose Estimation using ShelfNet with PyTorch
Language: Python - Size: 13.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 114 - Forks: 33

jchenghu/captioning_eos
SacreEOS experiments
Language: Python - Size: 91.3 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

devparanjay/Multi-Auto-Annotate (fork of mdhmz1/Auto-Annotate)
Multi-Auto-Annotate: Automatically annotate multiple labels in your entire image directory with a single command. Works with the COCO dataset and can also train on a custom dataset.
Language: Python - Size: 3.84 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

ZJLAB-AMMI/DeMix
Python codes to implement DeMix, a DETR assisted CutMix method for image data augmentation
Language: Python - Size: 1.04 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Daniboy370/Deep-Learning
Side projects and hands-on work
Language: Jupyter Notebook - Size: 162 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 15 - Forks: 1

ngun7/Annotation-Converters
This Repo covers all formats of annotations for Object Detection and can easily convert from one form to another using attached scripts
Language: Python - Size: 1.21 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 23 - Forks: 8

JoshuaPlacidi/ms_coco_object_tags
Python dictionary storing object tags for MS-COCO images. Data from 3 different sources (COCO ground truths, VG classifier and Microsoft's VinVL) are available.
Language: Jupyter Notebook - Size: 5.35 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

Sshanu/civic_issue_dataset
Civic Issue Detection Dataset from Adversarial Adaptation of Scene Graph Models for Understanding Civic Issues
Language: HTML - Size: 5.55 MB - Last synced at: 21 days ago - Pushed at: almost 5 years ago - Stars: 6 - Forks: 0

louisyuzhe/car-damage-detector
Mask R-CNN Model to detect the area of damage on a car. The rationale for such a model is that it can be used by insurance companies for faster processing of claims if users can upload pics and they can assess damage from them. This model can also be used by lenders if they are underwriting a car loan especially for a used car.
Language: Jupyter Notebook - Size: 81.4 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 39 - Forks: 29

HaydenFaulkner/VideoYOLO
Object Detection for Video with MXNet and GluonCV using YOLOv3
Language: Python - Size: 6.7 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 20 - Forks: 3

msindev/YOLO-v3-Object-Detection
This repository contains code for YOLO v3 Object detection, and is capable of fast object detection. Input can be given through images, videos and webcam input feed.
Language: Python - Size: 2.36 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 18 - Forks: 7

shunk031/huggingface-datasets_cocostuff
COCO-Stuff dataset for huggingface datasets
Language: Python - Size: 47.9 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

CISiPLab/cisip-CoCa
Compact Image Captioning (CoCA) is an open source image captioning project to promote Green Computer Vision, as well as to make image captioning research accessible to universities, research labs and individual practitioners with limited financial resources.
Language: Python - Size: 125 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 2

sailee2781/obstacle_detection_recognition-
Deep learning-based project developed using YOLO-v5 (You Only Look Once) that helps detect and recognize obstacles for autonomous vehicles. The model also estimates the distance of each obstacle from the initial position considered.
Language: Jupyter Notebook - Size: 1.05 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

charlie6echo/VBDLDSCC
Vision Based Document Layout Detection, Segmentation and context classification using MaskRCNN on Tensorflow-Keras, PyTorch & Detectron2.
Language: Jupyter Notebook - Size: 15 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

sharanp98/Intelligent-Advertisement-Generation
Intelligent Advertisement Generation for e-commerce websites using deep learning.
Language: Python - Size: 141 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 2

gwinndr/YOLOv4-Pytorch
Implementation of Darknet with You Only Look Once (YOLO) in Pytorch
Language: Python - Size: 2.42 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 14 - Forks: 4

MADHAVAN001/semantic-segmentation
A collection of semantic segmentation approaches
Language: Python - Size: 115 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

Ritik-Sharma38/YoloV3_training_Hands
Performed object detection and logging of time periods by deploying YOLO-V3 with transfer learning and fine-tuning classifications for all layers of the network. The model is fine-tuned using the pre-trained MS-COCO weights and modified accordingly for a custom dataset.
Language: C - Size: 170 MB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 1
