Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: datasets
awesomedata/awesome-public-datasets
A topic-centric list of HQ open datasets.
Size: 1.04 MB - Last synced: 28 days ago - Pushed: 5 months ago - Stars: 58,210 - Forks: 9,685
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Language: Python - Size: 84.2 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 18,474 - Forks: 2,528
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Language: JavaScript - Size: 1.86 GB - Last synced: 27 days ago - Pushed: 27 days ago - Stars: 16,367 - Forks: 2,018
tonybeltramelli/pix2code
pix2code: Generating Code from a Graphical User Interface Screenshot
Language: Python - Size: 1.15 GB - Last synced: 15 days ago - Pushed: 3 months ago - Stars: 11,899 - Forks: 1,429
doccano/doccano
Open source annotation tool for machine learning practitioners.
Language: Python - Size: 53.7 MB - Last synced: about 19 hours ago - Pushed: 2 months ago - Stars: 9,029 - Forks: 1,663
simonw/datasette
An open source multi-tool for exploring and publishing data
Language: Python - Size: 6.15 MB - Last synced: about 3 hours ago - Pushed: 5 days ago - Stars: 8,965 - Forks: 631
cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Language: Python - Size: 11.1 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 8,710 - Forks: 670
akfamily/akshare
AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.
Language: Python - Size: 375 MB - Last synced: about 2 hours ago - Pushed: about 5 hours ago - Stars: 8,458 - Forks: 1,753
satellite-image-deep-learning/techniques
Techniques for deep learning with satellite & aerial imagery
Size: 27.7 MB - Last synced: 11 days ago - Pushed: 20 days ago - Stars: 7,780 - Forks: 1,347
activeloopai/deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
Language: Python - Size: 65.1 MB - Last synced: about 2 hours ago - Pushed: 2 days ago - Stars: 7,734 - Forks: 593
imaNNeo/fl_chart
FL Chart is a highly customizable Flutter chart library that supports Line Chart, Bar Chart, Pie Chart, Scatter Chart, and Radar Chart.
Language: Dart - Size: 57.3 MB - Last synced: 1 day ago - Pushed: 3 days ago - Stars: 6,434 - Forks: 1,658
liuruoze/EasyPR
(CGCSTCD'2017) An easy, flexible, and accurate plate recognition project for Chinese licenses in unconstrained situations. CGCSTCD = China Graduate Contest on Smart-city Technology and Creative Design
Language: C++ - Size: 186 MB - Last synced: 2 months ago - Pushed: over 4 years ago - Stars: 6,316 - Forks: 2,507
tensorflow/datasets
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Language: Python - Size: 945 MB - Last synced: 28 days ago - Pushed: about 1 month ago - Stars: 4,157 - Forks: 1,507
CLUEbenchmark/CLUEDatasetSearch
Search all Chinese NLP datasets, with commonly used English NLP datasets included
Language: Python - Size: 8.87 MB - Last synced: 2 months ago - Pushed: over 1 year ago - Stars: 3,772 - Forks: 581
jdorfman/awesome-json-datasets
A curated list of awesome JSON datasets that don't require authentication.
Language: JavaScript - Size: 236 KB - Last synced: about 19 hours ago - Pushed: 10 months ago - Stars: 3,200 - Forks: 372
roapi/roapi
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
Language: Rust - Size: 1.13 MB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 3,089 - Forks: 170
justinzm/gopup
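Data interfaces: Baidu, Google, Toutiao, and Weibo indices, macroeconomic data, interest rate data, currency exchange rates, rising-star ("qianlima") and unicorn companies, Xinwen Lianbo news transcripts, film and TV box office data, university lists, epidemic data…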
Language: Python - Size: 689 KB - Last synced: 5 days ago - Pushed: 8 months ago - Stars: 2,531 - Forks: 383
microsoft/torchgeo
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Language: Python - Size: 129 MB - Last synced: 11 days ago - Pushed: 11 days ago - Stars: 2,232 - Forks: 287
github/CodeSearchNet 📦
Datasets, tools, and benchmarks for representation learning of code.
Language: Jupyter Notebook - Size: 28.6 MB - Last synced: 5 days ago - Pushed: over 2 years ago - Stars: 2,117 - Forks: 377
zhulf0804/3D-PointCloud
Papers and Datasets about Point Cloud.
Language: Python - Size: 1.47 MB - Last synced: 26 days ago - Pushed: 26 days ago - Stars: 2,096 - Forks: 287
jsbroks/coco-annotator
✏️ Web-based image segmentation tool for object detection, localization, and keypoints
Language: Vue - Size: 2.02 MB - Last synced: 31 minutes ago - Pushed: 6 months ago - Stars: 2,019 - Forks: 442
FreedomIntelligence/Medical_NLP
Medical NLP Competition, dataset, large models, paper
Size: 439 KB - Last synced: 15 days ago - Pushed: 17 days ago - Stars: 1,984 - Forks: 385
colour-science/colour
Colour Science for Python
Language: Python - Size: 122 MB - Last synced: 29 days ago - Pushed: about 1 month ago - Stars: 1,969 - Forks: 246
snap-stanford/ogb
Benchmark datasets, data loaders, and evaluators for graph machine learning
Language: Python - Size: 4.24 MB - Last synced: 10 days ago - Pushed: 3 months ago - Stars: 1,871 - Forks: 396
prabhuomkar/pytorch-cpp
C++ Implementation of PyTorch Tutorials for Everyone
Language: C++ - Size: 482 KB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 1,837 - Forks: 249
diffgram/diffgram
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
Language: Python - Size: 56.1 MB - Last synced: 18 days ago - Pushed: 26 days ago - Stars: 1,796 - Forks: 114
ChineseGLUE/ChineseGLUE
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus, and leaderboard
Language: Python - Size: 2.65 MB - Last synced: about 1 month ago - Pushed: about 1 year ago - Stars: 1,763 - Forks: 246
JuliaData/DataFrames.jl
In-memory tabular data in Julia
Language: Julia - Size: 28.3 MB - Last synced: about 22 hours ago - Pushed: 23 days ago - Stars: 1,698 - Forks: 360
isl-org/Open3D-ML
An extension of Open3D to address 3D Machine Learning tasks
Language: Python - Size: 45.7 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1,632 - Forks: 302
jim-schwoebel/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Size: 136 KB - Last synced: 1 day ago - Pushed: about 2 months ago - Stars: 1,555 - Forks: 218
logpai/loghub
A large collection of system log datasets for AI-driven log analytics [ISSRE'23]
Size: 7.01 MB - Last synced: 29 days ago - Pushed: 30 days ago - Stars: 1,512 - Forks: 560
juand-r/entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Language: Python - Size: 2.47 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1,425 - Forks: 246
explosion/projects
🪐 End-to-end NLP workflows from prototype to production
Language: Python - Size: 18.5 MB - Last synced: 3 days ago - Pushed: about 1 month ago - Stars: 1,249 - Forks: 470
PolyAI-LDN/conversational-datasets
Large datasets for conversational AI
Language: Python - Size: 178 KB - Last synced: about 2 months ago - Pushed: over 4 years ago - Stars: 1,224 - Forks: 163
MobilityData/awesome-transit
Community list of transit APIs, apps, datasets, research, and software 🚌🌟🚋🌟🚂
Size: 644 KB - Last synced: 29 days ago - Pushed: about 1 month ago - Stars: 1,223 - Forks: 191
shramos/Awesome-Cybersecurity-Datasets
A curated list of amazingly awesome Cybersecurity datasets
Size: 26.4 KB - Last synced: 4 days ago - Pushed: 2 months ago - Stars: 1,197 - Forks: 234
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language: Python - Size: 4.01 MB - Last synced: 27 days ago - Pushed: about 1 month ago - Stars: 1,137 - Forks: 92
eosphoros-ai/DB-GPT-Hub
A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL
Language: Python - Size: 27 MB - Last synced: 24 days ago - Pushed: 24 days ago - Stars: 1,014 - Forks: 137
yaodongC/awesome-instruction-dataset
A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)
Size: 33.2 KB - Last synced: about 14 hours ago - Pushed: 4 months ago - Stars: 1,007 - Forks: 55
midas-research/audino
Open source audio annotation tool for humans
Language: JavaScript - Size: 10.7 MB - Last synced: 3 months ago - Pushed: 4 months ago - Stars: 1,005 - Forks: 118
jbrownlee/Datasets
Machine learning datasets used in tutorials on MachineLearningMastery.com
Size: 215 MB - Last synced: 7 months ago - Pushed: 9 months ago - Stars: 977 - Forks: 1,454
iamaziz/PyDataset
Instant access to many datasets in Python.
Language: Python - Size: 14.9 MB - Last synced: 17 days ago - Pushed: about 2 years ago - Stars: 932 - Forks: 86
mims-harvard/TDC
Therapeutics Commons: Artificial Intelligence Foundation for Therapeutic Science
Language: Jupyter Notebook - Size: 67.6 MB - Last synced: 11 days ago - Pushed: 11 days ago - Stars: 930 - Forks: 167
shaypal5/awesome-twitter-data
A list of Twitter datasets and related resources.
Size: 68.4 KB - Last synced: about 12 hours ago - Pushed: 6 months ago - Stars: 908 - Forks: 121
ahundt/awesome-robotics
A curated list of awesome links and software libraries that are useful for robots.
Size: 190 KB - Last synced: about 14 hours ago - Pushed: 4 months ago - Stars: 901 - Forks: 148
CLUEbenchmark/CLUECorpus2020
Large-scale Pre-training Corpus for Chinese: 100G Chinese pre-training corpus
Size: 308 KB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 877 - Forks: 78
DmitryRyumin/ICCV-2023-Papers
ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!
Language: Python - Size: 16.8 MB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 869 - Forks: 39
JizhiziLi/GFM
[IJCV 2022] Bridging Composite and Real: Towards End-to-end Deep Image Matting
Language: Python - Size: 38.7 MB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 862 - Forks: 134
OYE93/Chinese-NLP-Corpus
Collections of Chinese NLP corpus
Language: Python - Size: 7.14 MB - Last synced: 8 days ago - Pushed: over 3 years ago - Stars: 848 - Forks: 207
caserec/Datasets-for-Recommender-Systems
This is a repository of topic-centric, high-quality public data sources for Recommender Systems (RS)
Language: Jupyter Notebook - Size: 72.2 MB - Last synced: 7 months ago - Pushed: 9 months ago - Stars: 843 - Forks: 159
WLiK/LLM4Rec-Awesome-Papers
A list of awesome papers and resources on recommender systems with large language models (LLMs).
Size: 1.13 MB - Last synced: 3 days ago - Pushed: 4 days ago - Stars: 812 - Forks: 72
jsbroks/awesome-dataset-tools
🔧 A curated list of awesome dataset tools
Size: 44.9 KB - Last synced: 4 days ago - Pushed: 11 months ago - Stars: 798 - Forks: 119
zjunlp/Prompt4ReasoningPapers
[ACL 2023] Reasoning with Language Model Prompting: A Survey
Size: 7.29 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 773 - Forks: 61
ipeaGIT/geobr
Easy access to official spatial data sets of Brazil in R and Python
Language: R - Size: 46.3 MB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 766 - Forks: 117
DeepTecher/awesome-autonomous-vehicle
A list of autonomous driving resources (Chinese edition)
Size: 85 KB - Last synced: 4 days ago - Pushed: over 2 years ago - Stars: 756 - Forks: 211
datasets/awesome-data
Curated list of quality open datasets
Size: 238 KB - Last synced: 1 day ago - Pushed: 26 days ago - Stars: 735 - Forks: 92
davidsbatista/Annotated-Semantic-Relationships-Datasets
A collection of public and free annotated datasets of relationships between entities/nominals (Portuguese and English)
Size: 51.1 MB - Last synced: 25 days ago - Pushed: almost 3 years ago - Stars: 679 - Forks: 132
adalca/medical-datasets
tracking medical datasets, with a focus on medical imaging
Size: 92.8 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 674 - Forks: 110
codefuse-ai/Awesome-Code-LLM
A curated list of language modeling research for code and related datasets.
Size: 4.76 MB - Last synced: 29 days ago - Pushed: 30 days ago - Stars: 667 - Forks: 48
openml/OpenML
Open Machine Learning
Language: PHP - Size: 581 MB - Last synced: 11 days ago - Pushed: about 1 month ago - Stars: 636 - Forks: 90
huggingface/dataset-viewer
Lightweight web API for visualizing and exploring any dataset - computer vision, speech, text, and tabular - stored on the Hugging Face Hub
Language: Python - Size: 21.2 MB - Last synced: 10 days ago - Pushed: 10 days ago - Stars: 619 - Forks: 59
st-tech/zr-obp
Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation
Language: Python - Size: 28.6 MB - Last synced: 21 days ago - Pushed: 9 months ago - Stars: 613 - Forks: 83
holistic-3d/awesome-holistic-3d
A list of papers and resources (data, code, etc.) for holistic 3D reconstruction in computer vision
Size: 2.97 MB - Last synced: 3 days ago - Pushed: about 3 years ago - Stars: 607 - Forks: 89
saltudelft/ml4se
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
Size: 549 KB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 588 - Forks: 85
lartpang/awesome-segmentation-saliency-dataset
A collection of datasets for segmentation / saliency detection. PRs are welcome. 😄
Size: 23.3 MB - Last synced: 3 days ago - Pushed: 11 months ago - Stars: 509 - Forks: 94
IndoNLP/indonlu
The first-ever vast natural language processing benchmark for the Indonesian language. We provide multiple downstream tasks, pre-trained IndoBERT models, and starter code! (AACL-IJCNLP 2020)
Language: Jupyter Notebook - Size: 9.21 MB - Last synced: 15 days ago - Pushed: over 1 year ago - Stars: 501 - Forks: 179
satellite-image-deep-learning/datasets
Datasets for deep learning with satellite & aerial imagery
Size: 315 KB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 498 - Forks: 56
openvinotoolkit/datumaro
Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.
Language: Python - Size: 240 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 481 - Forks: 121
BaseModelAI/cleora
Cleora AI is a general-purpose model for efficient, scalable learning of stable and inductive entity embeddings for heterogeneous relational data.
Language: Jupyter Notebook - Size: 4.82 MB - Last synced: about 2 hours ago - Pushed: 7 months ago - Stars: 476 - Forks: 51
JuliaData/DataFramesMeta.jl
Metaprogramming tools for DataFrames
Language: Julia - Size: 1.34 MB - Last synced: 29 days ago - Pushed: about 1 month ago - Stars: 470 - Forks: 55
EagleW/PaperRobot
Code for PaperRobot: Incremental Draft Generation of Scientific Ideas
Language: Python - Size: 63.8 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 469 - Forks: 135
mahmoudnafifi/Exposure_Correction
Project page of the paper "Learning Multi-Scale Photo Exposure Correction" (CVPR 2021).
Language: MATLAB - Size: 30.9 MB - Last synced: 3 months ago - Pushed: 5 months ago - Stars: 466 - Forks: 59
codingonion/awesome-llm-and-aigc
🚀🚀🚀A collection of some awesome public projects about Large Language Model, Vision Foundation Model and AI Generated Content.
Size: 180 KB - Last synced: 1 day ago - Pushed: 3 days ago - Stars: 459 - Forks: 43
CLUEbenchmark/pCLUE
pCLUE: a 1,000,000+ multi-task prompt learning dataset
Language: Jupyter Notebook - Size: 192 MB - Last synced: about 2 months ago - Pushed: over 1 year ago - Stars: 448 - Forks: 52
yoosan/video-understanding-dataset
A collection of recent video understanding datasets, under construction!
Size: 21.5 KB - Last synced: 6 months ago - Pushed: almost 6 years ago - Stars: 438 - Forks: 79
mathiasmantelli/awesome-mobile-robotics
Useful links of different content related to AI, Computer Vision, and Robotics.
Size: 961 KB - Last synced: 3 days ago - Pushed: 22 days ago - Stars: 431 - Forks: 85
RenzeLou/awesome-instruction-learning
Papers and Datasets on Instruction Tuning and Following. ✨✨✨
Language: Python - Size: 6.25 MB - Last synced: 4 days ago - Pushed: about 1 month ago - Stars: 402 - Forks: 21
datascienceid/machine-learning-resources
A curated list of awesome machine learning frameworks, libraries, courses, books and many more.
Size: 24.4 KB - Last synced: 5 days ago - Pushed: about 1 year ago - Stars: 383 - Forks: 117
bytewax/awesome-public-real-time-datasets
A list of publicly available datasets with real-time data maintained by the team at bytewax.io
Size: 52.7 KB - Last synced: about 4 hours ago - Pushed: 2 days ago - Stars: 369 - Forks: 13
OpenCSGs/CSGHub
CSGHub is an open-source large model asset platform, similar to an on-premise Hugging Face, that helps manage datasets, model files, code, and more. It is an open-source, trustworthy platform that helps users govern the assets involved in the lifecycle of LLMs and LLM applications. CSGHub manages LLM assets in the way OpenStack Glance manages virtual machine images, Harbor manages container images, and Sonatype Nexus manages artifacts. Feedback and stars ⭐️ are welcome.
Language: Vue - Size: 26 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 369 - Forks: 21
JAIJANYANI/Automated-Resume-Screening-System
Automated Resume Screening System using Machine Learning (With Dataset)
Language: CSS - Size: 5.53 MB - Last synced: 7 months ago - Pushed: 10 months ago - Stars: 360 - Forks: 189
JizhiziLi/AIM
[IJCAI'21] Deep Automatic Natural Image Matting
Language: Python - Size: 53 MB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 351 - Forks: 33
MOLAorg/mola
A Modular Optimization framework for Localization and mApping (MOLA)
Language: C++ - Size: 2.44 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 347 - Forks: 69
dkulagin/kartaslov
Open linguistic datasets: the КартаСловСент (KartaSlovSent) sentiment lexicon of the Russian language, a semantics dataset, an association graph, and a dataset of spelling errors and typos.
Size: 20.1 MB - Last synced: 6 months ago - Pushed: over 2 years ago - Stars: 346 - Forks: 50
Koziev/NLP_Datasets
My NLP datasets for Russian language
Language: C# - Size: 1.1 GB - Last synced: about 1 month ago - Pushed: about 1 year ago - Stars: 342 - Forks: 52
chaoswork/sft_datasets
A compilation of open-source SFT datasets, updated on an ongoing basis
Size: 3.91 KB - Last synced: 23 days ago - Pushed: 12 months ago - Stars: 340 - Forks: 29
jianzhnie/awesome-instruction-datasets
A collection of awesome-prompt-datasets and awesome-instruction-dataset for training ChatLLMs such as ChatGPT. It gathers a wide variety of instruction datasets for training ChatLLM models.
Size: 182 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 339 - Forks: 15
cleardusk/MeGlass
An eyeglass face dataset collected and cleaned for face recognition evaluation, CCBR 2018.
Size: 7.41 MB - Last synced: 7 months ago - Pushed: over 5 years ago - Stars: 336 - Forks: 64
chakki-works/chakin
Simple downloader for pre-trained word vectors
Language: Python - Size: 172 KB - Last synced: 10 days ago - Pushed: almost 2 years ago - Stars: 334 - Forks: 49
davidsbatista/NER-datasets
Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)
Language: Python - Size: 59.5 MB - Last synced: 25 days ago - Pushed: over 1 year ago - Stars: 333 - Forks: 83
jasonmanesis/Satellite-Imagery-Datasets-Containing-Ships
A list of radar and optical satellite datasets for ship detection, classification, semantic segmentation and instance segmentation tasks.
Size: 213 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 330 - Forks: 62
CelebV-HQ/CelebV-HQ
[ECCV 2022] CelebV-HQ: A Large-Scale Video Facial Attributes Dataset
Language: Python - Size: 3.6 MB - Last synced: 2 months ago - Pushed: over 1 year ago - Stars: 324 - Forks: 21
arjunmann73/Data-Analytics-Projects
🔎 Data analysis with real world data sets using Python 🔍
Language: Jupyter Notebook - Size: 436 KB - Last synced: 6 months ago - Pushed: over 1 year ago - Stars: 317 - Forks: 103
CambridgeUniversityPress/FirstCourseNetworkScience
Tutorials, datasets, and other material associated with textbook "A First Course in Network Science" by Menczer, Fortunato & Davis
Language: Jupyter Notebook - Size: 175 MB - Last synced: about 1 month ago - Pushed: 6 months ago - Stars: 316 - Forks: 172
jumpingrivers/datasauRus
R Package 📦 Containing the Datasaurus Dozen datasets 📊
Language: R - Size: 19.2 MB - Last synced: 3 days ago - Pushed: 2 months ago - Stars: 309 - Forks: 46
JovianHQ/opendatasets
A Python library for downloading datasets from Kaggle, Google Drive, and other online sources.
Language: Python - Size: 25.9 MB - Last synced: 2 days ago - Pushed: 6 months ago - Stars: 308 - Forks: 141
src-d/datasets
source{d} datasets ("big code") for source code analysis and machine learning on source code
Language: Jupyter Notebook - Size: 47.5 MB - Last synced: 7 months ago - Pushed: over 4 years ago - Stars: 307 - Forks: 82
weecology/retriever
Quickly download, clean up, and install public datasets into a database management system
Language: Python - Size: 77.4 MB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 302 - Forks: 134
uoneway/Text-Summarization-Repo
A repository that organizes the main research topics in text summarization, must-read papers, available models and data, and more, along with recommended resources.
Size: 1.2 MB - Last synced: 6 months ago - Pushed: about 2 years ago - Stars: 300 - Forks: 45
waico/SKAB
SKAB - Skoltech Anomaly Benchmark. Time-series data for evaluating Anomaly Detection algorithms.
Language: Jupyter Notebook - Size: 30.8 MB - Last synced: 2 days ago - Pushed: 8 months ago - Stars: 295 - Forks: 52