GitHub topics: language-data
WeblateOrg/language-data
Language definitions used by Weblate
Language: Python - Size: 73.1 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 20 - Forks: 23

dotWee/structured-stern-neon-articles
This repository contains approximately 16k user written texts, articles, and poetry pulled from archives of the Stern NEON website. Stern NEON was a community platform where users could write and publish their own articles. Many of the articles are personal stories, poems, or opinion pieces.
Size: 1.34 MB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

dart-community/linguist_lang_info
A collection of language information tracked by the linguist project.
Language: Dart - Size: 103 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 2 - Forks: 0

Aatlantise/k-snacs-ud
k-sncacs dataset for Universal Depdencies
Language: Python - Size: 12.9 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

termsurf/chat
Natural Language Grammars in TypeScript
Language: TypeScript - Size: 5.63 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

erhankilic/languagesSqlTable 📦
Languages Sql Table - Diller Sql Tablosu
Size: 2.93 KB - Last synced at: 3 months ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0

eliyetres/lt2316-ht19-a1
Language identification with as few characters as possible
Language: Python - Size: 269 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

lexicalcomputing/hamod
a High Agreement Multi-lingual Outlier Detection dataset
Language: Python - Size: 541 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

jonsafari/toy-data
Embeddable submodule of parallel/monolingual text data, for use in testing code and sanity checks
Language: JavaScript - Size: 269 KB - Last synced at: about 1 month ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 0
