Topic: "language-documentation"
RichardLitt/low-resource-languages
Resources for conservation, development, and documentation of low resource (human) languages.
Language: TeX - Size: 1.33 MB - Last synced at: about 21 hours ago - Pushed at: about 2 months ago - Stars: 419 - Forks: 59

timarkh/tsakorpus
Yet another search platform for linguistic corpora.
Language: Python - Size: 3.53 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 25 - Forks: 15

agricolamz/lingglosses
R package that helps to render interlinear glossed linguistic examples in html rmarkdown documents and then semi-automatically compiles the glosses list
Language: R - Size: 26.2 MB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 16 - Forks: 3

langdoc/elan-fst
Script for workflow to add morphological analysis into ELAN files
Language: Python - Size: 907 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 11 - Forks: 1

CNRS-LACITO/Pangloss_website
Tools for the Pangloss Collection, an online archive of under-documented languages
Language: HTML - Size: 249 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 1

digitallinguistics/scription
A specification for formatting interlinear glossed texts in a way that is computationally parseable
Size: 83 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

digitallinguistics/scription2dlx
A JavaScript library that converts scription text files to the Data Format for Digital Linguistics
Language: JavaScript - Size: 754 KB - Last synced at: 22 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

digitallinguistics/data-explorer
The DLx portal for viewing, searching, and aggregating data
Language: JavaScript - Size: 7.66 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

digitallinguistics/tools
Tools for Linguistic Productivity (TooLiP)
Language: JavaScript - Size: 348 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

iljackb/Mixtepec_Mixtec
Mostly XML (TEI) markup of Mixtepec-Mixtec Language resources
Language: HTML - Size: 92.3 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

dwhieb/Nuuchahnulth
Linguistic data on the Nuuchahnulth (Wakashan) language
Language: JavaScript - Size: 7.06 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

digitallinguistics/dlx2html
A JavaScript library for converting linguistic data to HTML
Language: JavaScript - Size: 392 KB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

CNRS-LACITO/eastlingeditor
This is the official repository of the Eastling editor. It is part of the Eastling suite: Easy Annotation and Synchronization Tool for linguists.
Language: JavaScript - Size: 125 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

digitallinguistics/app
The Lotus web app for managing linguistic data
Language: JavaScript - Size: 10.7 MB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

digitallinguistics/javascript
A JavaScript library for working with linguistic data in DLx format
Language: JavaScript - Size: 842 KB - Last synced at: about 12 hours ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

Nisinoon/Nisinoon
Website for the Algonquian Components Project (Nisinoon)
Language: TeX - Size: 21.2 MB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 1 - Forks: 0

l12maro/SileroVAD-Elan
An implementation of SileroVAD as a recognizer for ELAN
Language: Python - Size: 25.4 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

langdoc/ocr-pipeline
Size: 18.6 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

engganolang/digitised-holle-list
Cross-Linguistic Data Format (CLDF) dataset for The Digitised, Searchable Holle List in Stokhof (1980). The interactive version is deployed as a webpage 👇.
Language: HTML - Size: 8.99 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

engganolang/holle-list-enggano-1895
Cross-Linguistic Data Format (CLDF) dataset for the Enggano word list from the late 19th century (c1895) based on the Holle List.
Language: R - Size: 478 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

engganolang/modigliani-1894
Digitised comparative word list in Modigliani's "L'isola delle donne" from 1894. The word list captures forms in Nias, Toba-Batak, Enggano, and Malay, with Italian reference. The Enggano forms are included in the EnoLEX database (https://doi.org/10.25446/oxford.28282169).
Language: R - Size: 7.81 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

engganolang/oudemans1889
Digitised comparative Enggano word list from Oudemans (1889). This publication contains the unpublished Enggano word list by Francis (1870) put in comparison with those by Boewang (1854), van de Straaten & Severijn (1855), von Rosenberg (1855).
Language: TeX - Size: 44.9 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

BonnieMcLean/JapaneseMimeticNetwork
A network visualisation of Japanese mimetics. You can view and interact with the network at:
Language: R - Size: 6.84 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

BonnieMcLean/SiwuIdeophoneNetwork
A network visualisation of Siwu ideophones. You can view and interact with the network at:
Language: R - Size: 22.5 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

engganolang/enggano-flora-fauna
A repository to track R codes in (pre-)processing the flora and fauna Google Spreadsheet. The original, lightly annotated data is now archived in Oxford SDS 👇
Language: R - Size: 12.7 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

engganolang/flora-fauna-lexicon
A repository of raw datasets of the Enggano flora and fauna lexicon as part of the AHRC-funded research titled Lexical Resources for Enggano, A Threatened Language of Indonesia (AH/W007290/1).
Size: 40 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

amaliaskilton/auto-ffmpeg
Scripts to automate common ffmpeg commands for processing video in language documentation.
Language: Shell - Size: 3.91 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

CaterinaBi/my-academic-work
Papers and books I published during my 7 years of research at the Universities of Geneva and Cambridge.
Size: 1.95 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

matthmr/docs.sd
SD language documentation
Language: HTML - Size: 38.1 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

alexis-michaud/pangloss
This repository hosts scripts and materials related to the Pangloss Collection. The official repository of the Pangloss Collection is at: https://github.com/CNRS/Pangloss
Size: 1.26 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

dwhieb/mixtec-conversational-phrases 📦
A collection of conversational phrases in Tlahuapa Mixtec (Oto-Manguean)
Language: HTML - Size: 66 MB - Last synced at: 4 months ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

aryamanarora/kholosi
The Kholosi language of Iran.
Language: JavaScript - Size: 24.5 MB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

langdoc/kpv-lit
Collection of Public Domain data in Komi-Zyrian
Size: 3.52 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0
