Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: datasets

awesomedata/awesome-public-datasets

A topic-centric list of HQ open datasets.

Size: 1.04 MB - Last synced: 28 days ago - Pushed: 5 months ago - Stars: 58,210 - Forks: 9,685

huggingface/datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Language: Python - Size: 84.2 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 18,474 - Forks: 2,528

HumanSignal/label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language: JavaScript - Size: 1.86 GB - Last synced: 27 days ago - Pushed: 27 days ago - Stars: 16,367 - Forks: 2,018

tonybeltramelli/pix2code

pix2code: Generating Code from a Graphical User Interface Screenshot

Language: Python - Size: 1.15 GB - Last synced: 15 days ago - Pushed: 3 months ago - Stars: 11,899 - Forks: 1,429

doccano/doccano

Open source annotation tool for machine learning practitioners.

Language: Python - Size: 53.7 MB - Last synced: about 19 hours ago - Pushed: 2 months ago - Stars: 9,029 - Forks: 1,663

simonw/datasette

An open source multi-tool for exploring and publishing data

Language: Python - Size: 6.15 MB - Last synced: about 3 hours ago - Pushed: 5 days ago - Stars: 8,965 - Forks: 631

cleanlab/cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Language: Python - Size: 11.1 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 8,710 - Forks: 670

akfamily/akshare

AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库

Language: Python - Size: 375 MB - Last synced: about 2 hours ago - Pushed: about 5 hours ago - Stars: 8,458 - Forks: 1,753

satellite-image-deep-learning/techniques

Techniques for deep learning with satellite & aerial imagery

Size: 27.7 MB - Last synced: 11 days ago - Pushed: 20 days ago - Stars: 7,780 - Forks: 1,347

activeloopai/deeplake

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

Language: Python - Size: 65.1 MB - Last synced: about 2 hours ago - Pushed: 2 days ago - Stars: 7,734 - Forks: 593

imaNNeo/fl_chart

FL Chart is a highly customizable Flutter chart library that supports Line Chart, Bar Chart, Pie Chart, Scatter Chart, and Radar Chart.

Language: Dart - Size: 57.3 MB - Last synced: 1 day ago - Pushed: 3 days ago - Stars: 6,434 - Forks: 1,658

liuruoze/EasyPR

(CGCSTCD'2017) An easy, flexible, and accurate plate recognition project for Chinese licenses in unconstrained situations. CGCSTCD = China Graduate Contest on Smart-city Technology and Creative Design

Language: C++ - Size: 186 MB - Last synced: 2 months ago - Pushed: over 4 years ago - Stars: 6,316 - Forks: 2,507

tensorflow/datasets

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...

Language: Python - Size: 945 MB - Last synced: 28 days ago - Pushed: about 1 month ago - Stars: 4,157 - Forks: 1,507

CLUEbenchmark/CLUEDatasetSearch

搜索所有中文NLP数据集,附常用英文NLP数据集

Language: Python - Size: 8.87 MB - Last synced: 2 months ago - Pushed: over 1 year ago - Stars: 3,772 - Forks: 581

jdorfman/awesome-json-datasets

A curated list of awesome JSON datasets that don't require authentication.

Language: JavaScript - Size: 236 KB - Last synced: about 19 hours ago - Pushed: 10 months ago - Stars: 3,200 - Forks: 372

roapi/roapi

Create full-fledged APIs for slowly moving datasets without writing a single line of code.

Language: Rust - Size: 1.13 MB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 3,089 - Forks: 170

justinzm/gopup

数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…

Language: Python - Size: 689 KB - Last synced: 5 days ago - Pushed: 8 months ago - Stars: 2,531 - Forks: 383

microsoft/torchgeo

TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data

Language: Python - Size: 129 MB - Last synced: 11 days ago - Pushed: 11 days ago - Stars: 2,232 - Forks: 287

github/CodeSearchNet 📦

Datasets, tools, and benchmarks for representation learning of code.

Language: Jupyter Notebook - Size: 28.6 MB - Last synced: 5 days ago - Pushed: over 2 years ago - Stars: 2,117 - Forks: 377

zhulf0804/3D-PointCloud

Papers and Datasets about Point Cloud.

Language: Python - Size: 1.47 MB - Last synced: 26 days ago - Pushed: 26 days ago - Stars: 2,096 - Forks: 287

jsbroks/coco-annotator

:pencil2: Web-based image segmentation tool for object detection, localization, and keypoints

Language: Vue - Size: 2.02 MB - Last synced: 31 minutes ago - Pushed: 6 months ago - Stars: 2,019 - Forks: 442

FreedomIntelligence/Medical_NLP

Medical NLP Competition, dataset, large models, paper

Size: 439 KB - Last synced: 15 days ago - Pushed: 17 days ago - Stars: 1,984 - Forks: 385

colour-science/colour

Colour Science for Python

Language: Python - Size: 122 MB - Last synced: 29 days ago - Pushed: about 1 month ago - Stars: 1,969 - Forks: 246

snap-stanford/ogb

Benchmark datasets, data loaders, and evaluators for graph machine learning

Language: Python - Size: 4.24 MB - Last synced: 10 days ago - Pushed: 3 months ago - Stars: 1,871 - Forks: 396

prabhuomkar/pytorch-cpp

C++ Implementation of PyTorch Tutorials for Everyone

Language: C++ - Size: 482 KB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 1,837 - Forks: 249

diffgram/diffgram

The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

Language: Python - Size: 56.1 MB - Last synced: 18 days ago - Pushed: 26 days ago - Stars: 1,796 - Forks: 114

ChineseGLUE/ChineseGLUE

Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard

Language: Python - Size: 2.65 MB - Last synced: about 1 month ago - Pushed: about 1 year ago - Stars: 1,763 - Forks: 246

JuliaData/DataFrames.jl

In-memory tabular data in Julia

Language: Julia - Size: 28.3 MB - Last synced: about 22 hours ago - Pushed: 23 days ago - Stars: 1,698 - Forks: 360

isl-org/Open3D-ML

An extension of Open3D to address 3D Machine Learning tasks

Language: Python - Size: 45.7 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1,632 - Forks: 302

jim-schwoebel/voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

Size: 136 KB - Last synced: 1 day ago - Pushed: about 2 months ago - Stars: 1,555 - Forks: 218

logpai/loghub

A large collection of system log datasets for AI-driven log analytics [ISSRE'23]

Size: 7.01 MB - Last synced: 29 days ago - Pushed: 30 days ago - Stars: 1,512 - Forks: 560

juand-r/entity-recognition-datasets

A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.

Language: Python - Size: 2.47 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1,425 - Forks: 246

explosion/projects

🪐 End-to-end NLP workflows from prototype to production

Language: Python - Size: 18.5 MB - Last synced: 3 days ago - Pushed: about 1 month ago - Stars: 1,249 - Forks: 470

PolyAI-LDN/conversational-datasets

Large datasets for conversational AI

Language: Python - Size: 178 KB - Last synced: about 2 months ago - Pushed: over 4 years ago - Stars: 1,224 - Forks: 163

MobilityData/awesome-transit

Community list of transit APIs, apps, datasets, research, and software :bus::star2::train::star2::steam_locomotive:

Size: 644 KB - Last synced: 29 days ago - Pushed: about 1 month ago - Stars: 1,223 - Forks: 191

shramos/Awesome-Cybersecurity-Datasets

A curated list of amazingly awesome Cybersecurity datasets

Size: 26.4 KB - Last synced: 4 days ago - Pushed: 2 months ago - Stars: 1,197 - Forks: 234

PKU-Alignment/safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language: Python - Size: 4.01 MB - Last synced: 27 days ago - Pushed: about 1 month ago - Stars: 1,137 - Forks: 92

eosphoros-ai/DB-GPT-Hub

A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL

Language: Python - Size: 27 MB - Last synced: 24 days ago - Pushed: 24 days ago - Stars: 1,014 - Forks: 137

yaodongC/awesome-instruction-dataset

A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)

Size: 33.2 KB - Last synced: about 14 hours ago - Pushed: 4 months ago - Stars: 1,007 - Forks: 55

midas-research/audino

Open source audio annotation tool for humans

Language: JavaScript - Size: 10.7 MB - Last synced: 3 months ago - Pushed: 4 months ago - Stars: 1,005 - Forks: 118

jbrownlee/Datasets

Machine learning datasets used in tutorials on MachineLearningMastery.com

Size: 215 MB - Last synced: 7 months ago - Pushed: 9 months ago - Stars: 977 - Forks: 1,454

iamaziz/PyDataset

Instant access to many datasets in Python.

Language: Python - Size: 14.9 MB - Last synced: 17 days ago - Pushed: about 2 years ago - Stars: 932 - Forks: 86

mims-harvard/TDC

Therapeutics Commons: Artificial Intelligence Foundation for Therapeutic Science

Language: Jupyter Notebook - Size: 67.6 MB - Last synced: 11 days ago - Pushed: 11 days ago - Stars: 930 - Forks: 167

shaypal5/awesome-twitter-data

A list of Twitter datasets and related resources.

Size: 68.4 KB - Last synced: about 12 hours ago - Pushed: 6 months ago - Stars: 908 - Forks: 121

ahundt/awesome-robotics

A curated list of awesome links and software libraries that are useful for robots.

Size: 190 KB - Last synced: about 14 hours ago - Pushed: 4 months ago - Stars: 901 - Forks: 148

CLUEbenchmark/CLUECorpus2020

Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

Size: 308 KB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 877 - Forks: 78

DmitryRyumin/ICCV-2023-Papers

ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!

Language: Python - Size: 16.8 MB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 869 - Forks: 39

JizhiziLi/GFM

[IJCV 2022] Bridging Composite and Real: Towards End-to-end Deep Image Matting

Language: Python - Size: 38.7 MB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 862 - Forks: 134

OYE93/Chinese-NLP-Corpus

Collections of Chinese NLP corpus

Language: Python - Size: 7.14 MB - Last synced: 8 days ago - Pushed: over 3 years ago - Stars: 848 - Forks: 207

caserec/Datasets-for-Recommender-Systems

This is a repository of a topic-centric public data sources in high quality for Recommender Systems (RS)

Language: Jupyter Notebook - Size: 72.2 MB - Last synced: 7 months ago - Pushed: 9 months ago - Stars: 843 - Forks: 159

WLiK/LLM4Rec-Awesome-Papers

A list of awesome papers and resources of recommender system on large language model (LLM).

Size: 1.13 MB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 812 - Forks: 72

jsbroks/awesome-dataset-tools

🔧 A curated list of awesome dataset tools

Size: 44.9 KB - Last synced: 4 days ago - Pushed: 11 months ago - Stars: 798 - Forks: 119

zjunlp/Prompt4ReasoningPapers

[ACL 2023] Reasoning with Language Model Prompting: A Survey

Size: 7.29 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 773 - Forks: 61

ipeaGIT/geobr

Easy access to official spatial data sets of Brazil in R and Python

Language: R - Size: 46.3 MB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 766 - Forks: 117

DeepTecher/awesome-autonomous-vehicle

无人驾驶的资源列表中文版

Size: 85 KB - Last synced: 4 days ago - Pushed: over 2 years ago - Stars: 756 - Forks: 211

datasets/awesome-data

Curated list of quality open datasets

Size: 238 KB - Last synced: 1 day ago - Pushed: 26 days ago - Stars: 735 - Forks: 92

davidsbatista/Annotated-Semantic-Relationships-Datasets

A collections of public and free annotated datasets of relationships between entities/nominals (Portuguese and English)

Size: 51.1 MB - Last synced: 25 days ago - Pushed: almost 3 years ago - Stars: 679 - Forks: 132

adalca/medical-datasets

tracking medical datasets, with a focus on medical imaging

Size: 92.8 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 674 - Forks: 110

codefuse-ai/Awesome-Code-LLM

A curated list of language modeling researches for code and related datasets.

Size: 4.76 MB - Last synced: 29 days ago - Pushed: 30 days ago - Stars: 667 - Forks: 48

openml/OpenML

Open Machine Learning

Language: PHP - Size: 581 MB - Last synced: 11 days ago - Pushed: about 1 month ago - Stars: 636 - Forks: 90

huggingface/dataset-viewer

Lightweight web API for visualizing and exploring any dataset - computer vision, speech, text, and tabular - stored on the Hugging Face Hub

Language: Python - Size: 21.2 MB - Last synced: 10 days ago - Pushed: 10 days ago - Stars: 619 - Forks: 59

st-tech/zr-obp

Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation

Language: Python - Size: 28.6 MB - Last synced: 21 days ago - Pushed: 9 months ago - Stars: 613 - Forks: 83

holistic-3d/awesome-holistic-3d

A list of papers and resources (data,code,etc) for holistic 3D reconstruction in computer vision

Size: 2.97 MB - Last synced: 3 days ago - Pushed: about 3 years ago - Stars: 607 - Forks: 89

saltudelft/ml4se

A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering

Size: 549 KB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 588 - Forks: 85

lartpang/awesome-segmentation-saliency-dataset

A collection of some datasets for segmentation / saliency detection. Welcome to PR...:smile:

Size: 23.3 MB - Last synced: 3 days ago - Pushed: 11 months ago - Stars: 509 - Forks: 94

IndoNLP/indonlu

The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models, and a starter code! (AACL-IJCNLP 2020)

Language: Jupyter Notebook - Size: 9.21 MB - Last synced: 15 days ago - Pushed: over 1 year ago - Stars: 501 - Forks: 179

satellite-image-deep-learning/datasets

Datasets for deep learning with satellite & aerial imagery

Size: 315 KB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 498 - Forks: 56

openvinotoolkit/datumaro

Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.

Language: Python - Size: 240 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 481 - Forks: 121

BaseModelAI/cleora

Cleora AI is a general-purpose model for efficient, scalable learning of stable and inductive entity embeddings for heterogeneous relational data.

Language: Jupyter Notebook - Size: 4.82 MB - Last synced: about 2 hours ago - Pushed: 7 months ago - Stars: 476 - Forks: 51

JuliaData/DataFramesMeta.jl

Metaprogramming tools for DataFrames

Language: Julia - Size: 1.34 MB - Last synced: 29 days ago - Pushed: about 1 month ago - Stars: 470 - Forks: 55

EagleW/PaperRobot

Code for PaperRobot: Incremental Draft Generation of Scientific Ideas

Language: Python - Size: 63.8 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 469 - Forks: 135

mahmoudnafifi/Exposure_Correction

Project page of the paper "Learning Multi-Scale Photo Exposure Correction" (CVPR 2021).

Language: MATLAB - Size: 30.9 MB - Last synced: 3 months ago - Pushed: 5 months ago - Stars: 466 - Forks: 59

codingonion/awesome-llm-and-aigc

🚀🚀🚀A collection of some awesome public projects about Large Language Model, Vision Foundation Model and AI Generated Content.

Size: 180 KB - Last synced: 1 day ago - Pushed: 3 days ago - Stars: 459 - Forks: 43

CLUEbenchmark/pCLUE

pCLUE: 1000000+多任务提示学习数据集

Language: Jupyter Notebook - Size: 192 MB - Last synced: about 2 months ago - Pushed: over 1 year ago - Stars: 448 - Forks: 52

yoosan/video-understanding-dataset

A collection of recent video understanding datasets, under construction!

Size: 21.5 KB - Last synced: 6 months ago - Pushed: almost 6 years ago - Stars: 438 - Forks: 79

mathiasmantelli/awesome-mobile-robotics

Useful links of different content related to AI, Computer Vision, and Robotics.

Size: 961 KB - Last synced: 3 days ago - Pushed: 22 days ago - Stars: 431 - Forks: 85

RenzeLou/awesome-instruction-learning

Papers and Datasets on Instruction Tuning and Following. ✨✨✨

Language: Python - Size: 6.25 MB - Last synced: 4 days ago - Pushed: about 1 month ago - Stars: 402 - Forks: 21

datascienceid/machine-learning-resources

A curated list of awesome machine learning frameworks, libraries, courses, books and many more.

Size: 24.4 KB - Last synced: 5 days ago - Pushed: about 1 year ago - Stars: 383 - Forks: 117

bytewax/awesome-public-real-time-datasets

A list of publicly available datasets with real-time data maintained by the team at bytewax.io

Size: 52.7 KB - Last synced: about 4 hours ago - Pushed: 2 days ago - Stars: 369 - Forks: 13

OpenCSGs/CSGHub

CSGHub is an opensource large model assets platform just like on-premise huggingface which helps to manage datasets, model files, codes and more. CSGHub是一个开源、可信的大模型资产管理平台,可帮助用户治理LLM和LLM应用生命周期中涉及到的资产(数据集、模型文件、代码等)。CSGHub提供类似私有化的Huggingface功能,以类似OpenStack Glance管理虚拟机镜像、Harbor管理容器镜像以及Sonatype Nexus管理制品的方式,实现对LLM资产的管理。欢迎关注反馈和Star⭐️

Language: Vue - Size: 26 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 369 - Forks: 21

JAIJANYANI/Automated-Resume-Screening-System

Automated Resume Screening System using Machine Learning (With Dataset)

Language: CSS - Size: 5.53 MB - Last synced: 7 months ago - Pushed: 10 months ago - Stars: 360 - Forks: 189

JizhiziLi/AIM

[IJCAI'21] Deep Automatic Natural Image Matting

Language: Python - Size: 53 MB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 351 - Forks: 33

MOLAorg/mola

A Modular Optimization framework for Localization and mApping (MOLA)

Language: C++ - Size: 2.44 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 347 - Forks: 69

dkulagin/kartaslov

Открытые лингвистические датасеты: тональный словарь русского языка КартаСловСент, датасет по семантике, ассоциативный граф и датасет по орфографическим ошибкам и опечаткам.

Size: 20.1 MB - Last synced: 6 months ago - Pushed: over 2 years ago - Stars: 346 - Forks: 50

Koziev/NLP_Datasets

My NLP datasets for Russian language

Language: C# - Size: 1.1 GB - Last synced: about 1 month ago - Pushed: about 1 year ago - Stars: 342 - Forks: 52

chaoswork/sft_datasets

开源SFT数据集整理,随时补充

Size: 3.91 KB - Last synced: 23 days ago - Pushed: 12 months ago - Stars: 340 - Forks: 29

jianzhnie/awesome-instruction-datasets

A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。

Size: 182 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 339 - Forks: 15

cleardusk/MeGlass

An eyeglass face dataset collected and cleaned for face recognition evaluation, CCBR 2018.

Size: 7.41 MB - Last synced: 7 months ago - Pushed: over 5 years ago - Stars: 336 - Forks: 64

chakki-works/chakin

Simple downloader for pre-trained word vectors

Language: Python - Size: 172 KB - Last synced: 10 days ago - Pushed: almost 2 years ago - Stars: 334 - Forks: 49

davidsbatista/NER-datasets

Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)

Language: Python - Size: 59.5 MB - Last synced: 25 days ago - Pushed: over 1 year ago - Stars: 333 - Forks: 83

jasonmanesis/Satellite-Imagery-Datasets-Containing-Ships

A list of radar and optical satellite datasets for ship detection, classification, semantic segmentation and instance segmentation tasks.

Size: 213 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 330 - Forks: 62

CelebV-HQ/CelebV-HQ

[ECCV 2022] CelebV-HQ: A Large-Scale Video Facial Attributes Dataset

Language: Python - Size: 3.6 MB - Last synced: 2 months ago - Pushed: over 1 year ago - Stars: 324 - Forks: 21

arjunmann73/Data-Analytics-Projects

:mag_right: Data analysis with real world data sets using Python :mag:

Language: Jupyter Notebook - Size: 436 KB - Last synced: 6 months ago - Pushed: over 1 year ago - Stars: 317 - Forks: 103

CambridgeUniversityPress/FirstCourseNetworkScience

Tutorials, datasets, and other material associated with textbook "A First Course in Network Science" by Menczer, Fortunato & Davis

Language: Jupyter Notebook - Size: 175 MB - Last synced: about 1 month ago - Pushed: 6 months ago - Stars: 316 - Forks: 172

jumpingrivers/datasauRus

R Package 📦 Containing the Datasaurus Dozen datasets :bar_chart:

Language: R - Size: 19.2 MB - Last synced: 3 days ago - Pushed: 2 months ago - Stars: 309 - Forks: 46

JovianHQ/opendatasets

A Python library for downloading datasets from Kaggle, Google Drive, and other online sources.

Language: Python - Size: 25.9 MB - Last synced: 2 days ago - Pushed: 6 months ago - Stars: 308 - Forks: 141

src-d/datasets

source{d} datasets ("big code") for source code analysis and machine learning on source code

Language: Jupyter Notebook - Size: 47.5 MB - Last synced: 7 months ago - Pushed: over 4 years ago - Stars: 307 - Forks: 82

weecology/retriever

Quickly download, clean up, and install public datasets into a database management system

Language: Python - Size: 77.4 MB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 302 - Forks: 134

uoneway/Text-Summarization-Repo

텍스트 요약 분야의 주요 연구 주제, Must-read Papers, 이용 가능한 model 및 data 등을 추천 자료와 함께 정리한 저장소입니다.

Size: 1.2 MB - Last synced: 6 months ago - Pushed: about 2 years ago - Stars: 300 - Forks: 45

waico/SKAB

SKAB - Skoltech Anomaly Benchmark. Time-series data for evaluating Anomaly Detection algorithms.

Language: Jupyter Notebook - Size: 30.8 MB - Last synced: 2 days ago - Pushed: 8 months ago - Stars: 295 - Forks: 52