An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: summarization-corpora

csebuetnlp/xl-sum

This repository contains the code, data, and models for the paper "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages", published in Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021.

Language: Python - Size: 5.41 MB - Last synced at: 9 months ago - Pushed at: about 1 year ago - Stars: 249 - Forks: 42

nakhunchumpolsathien/ThaiSum

A dataset for Thai text summarization with over 350,000 articles from Thairath, ThaiPBS, Prachathai, and The Standard. Trained models are provided.

Language: Jupyter Notebook - Size: 1.73 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 37 - Forks: 13

Georgetown-IR-Lab/ExtendedSumm

On Generating Extended Summaries of Long Documents

Language: Python - Size: 305 KB - Last synced at: 12 months ago - Pushed at: about 4 years ago - Stars: 77 - Forks: 13

IlyaGusev/gazeta

Gazeta: Dataset for automatic summarization of Russian news

Language: Python - Size: 76.2 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 27 - Forks: 1

dykang/biassum

Summarization benchmark for studying corpus bias of your system

Language: Python - Size: 30.3 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 0