An open API service providing repository metadata for many open source software ecosystems.

Topic: "language-documentation"

RichardLitt/low-resource-languages

Resources for conservation, development, and documentation of low resource (human) languages.

Language: TeX - Size: 1.33 MB - Last synced at: about 21 hours ago - Pushed at: about 2 months ago - Stars: 419 - Forks: 59

timarkh/tsakorpus

Yet another search platform for linguistic corpora.

Language: Python - Size: 3.53 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 25 - Forks: 15

agricolamz/lingglosses

R package that helps to render interlinear glossed linguistic examples in html rmarkdown documents and then semi-automatically compiles the glosses list

Language: R - Size: 26.2 MB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 16 - Forks: 3

langdoc/elan-fst

Script for workflow to add morphological analysis into ELAN files

Language: Python - Size: 907 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 11 - Forks: 1

CNRS-LACITO/Pangloss_website

Tools for the Pangloss Collection, an online archive of under-documented languages

Language: HTML - Size: 249 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 1

digitallinguistics/scription

A specification for formatting interlinear glossed texts in a way that is computationally parseable

Size: 83 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

digitallinguistics/scription2dlx

A JavaScript library that converts scription text files to the Data Format for Digital Linguistics

Language: JavaScript - Size: 754 KB - Last synced at: 22 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

digitallinguistics/data-explorer

The DLx portal for viewing, searching, and aggregating data

Language: JavaScript - Size: 7.66 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

digitallinguistics/tools

Tools for Linguistic Productivity (TooLiP)

Language: JavaScript - Size: 348 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

iljackb/Mixtepec_Mixtec

Mostly XML (TEI) markup of Mixtepec-Mixtec Language resources

Language: HTML - Size: 92.3 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

dwhieb/Nuuchahnulth

Linguistic data on the Nuuchahnulth (Wakashan) language

Language: JavaScript - Size: 7.06 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

digitallinguistics/dlx2html

A JavaScript library for converting linguistic data to HTML

Language: JavaScript - Size: 392 KB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

CNRS-LACITO/eastlingeditor

This is the official repository of the Eastling editor. It is part of the Eastling suite: Easy Annotation and Synchronization Tool for linguists.

Language: JavaScript - Size: 125 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

digitallinguistics/app

The Lotus web app for managing linguistic data

Language: JavaScript - Size: 10.7 MB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

digitallinguistics/javascript

A JavaScript library for working with linguistic data in DLx format

Language: JavaScript - Size: 842 KB - Last synced at: about 12 hours ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

Nisinoon/Nisinoon

Website for the Algonquian Components Project (Nisinoon)

Language: TeX - Size: 21.2 MB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 1 - Forks: 0

l12maro/SileroVAD-Elan

An implementation of SileroVAD as a recognizer for ELAN

Language: Python - Size: 25.4 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

langdoc/ocr-pipeline

Size: 18.6 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

engganolang/digitised-holle-list

Cross-Linguistic Data Format (CLDF) dataset for The Digitised, Searchable Holle List in Stokhof (1980). The interactive version is deployed as a webpage 👇.

Language: HTML - Size: 8.99 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

engganolang/holle-list-enggano-1895

Cross-Linguistic Data Format (CLDF) dataset for the Enggano word list from the late 19th century (c1895) based on the Holle List.

Language: R - Size: 478 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

engganolang/modigliani-1894

Digitised comparative word list in Modigliani's "L'isola delle donne" from 1894. The word list captures forms in Nias, Toba-Batak, Enggano, and Malay, with Italian reference. The Enggano forms are included in the EnoLEX database (https://doi.org/10.25446/oxford.28282169).

Language: R - Size: 7.81 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

engganolang/oudemans1889

Digitised comparative Enggano word list from Oudemans (1889). This publication contains the unpublished Enggano word list by Francis (1870) put in comparison with those by Boewang (1854), van de Straaten & Severijn (1855), von Rosenberg (1855).

Language: TeX - Size: 44.9 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

BonnieMcLean/JapaneseMimeticNetwork

A network visualisation of Japanese mimetics. You can view and interact with the network at:

Language: R - Size: 6.84 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

BonnieMcLean/SiwuIdeophoneNetwork

A network visualisation of Siwu ideophones. You can view and interact with the network at:

Language: R - Size: 22.5 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

engganolang/enggano-flora-fauna

A repository to track R codes in (pre-)processing the flora and fauna Google Spreadsheet. The original, lightly annotated data is now archived in Oxford SDS 👇

Language: R - Size: 12.7 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

engganolang/flora-fauna-lexicon

A repository of raw datasets of the Enggano flora and fauna lexicon as part of the AHRC-funded research titled Lexical Resources for Enggano, A Threatened Language of Indonesia (AH/W007290/1).

Size: 40 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

amaliaskilton/auto-ffmpeg

Scripts to automate common ffmpeg commands for processing video in language documentation.

Language: Shell - Size: 3.91 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

CaterinaBi/my-academic-work

Papers and books I published during my 7 years of research at the Universities of Geneva and Cambridge.

Size: 1.95 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

matthmr/docs.sd

SD language documentation

Language: HTML - Size: 38.1 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

alexis-michaud/pangloss

This repository hosts scripts and materials related to the Pangloss Collection. The official repository of the Pangloss Collection is at: https://github.com/CNRS/Pangloss

Size: 1.26 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

dwhieb/mixtec-conversational-phrases 📦

A collection of conversational phrases in Tlahuapa Mixtec (Oto-Manguean)

Language: HTML - Size: 66 MB - Last synced at: 4 months ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

aryamanarora/kholosi

The Kholosi language of Iran.

Language: JavaScript - Size: 24.5 MB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

langdoc/kpv-lit

Collection of Public Domain data in Komi-Zyrian

Size: 3.52 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Related Topics
linguistics 15 digital-humanities 9 documentary-linguistics 8 corpus-linguistics 6 digital-linguistics 6 lexicography 6 lexical-database 5 corpus 5 enggano 5 dlx 5 language 5 enggano-language 4 corpora 4 r-programming 3 endangered-languages 3 word-list 3 enggano-lexical-database 3 indonesian-language 3 legacy-material 2 r 2 lexibank1 2 lexibank 2 holle-list 2 cross-linguistic-data-format 2 cldf 2 indonesian-languages 2 barrier-island-languages-indonesia 2 barrier-island-sumatra 2 glosses 2 ideophones 2 scription 2 interlinear-gloss 2 morphology 2 mixtec 2 low-resource-languages 1 lrls 1 scription-files 1 minority-language 1 lexicon 1 languages 1 language-description 1 natural-language 1 descriptive-linguistics 1 parallel-corpora 1 media-aligned-corpora 1 natural-language-processing 1 resourced-languages 1 linguistic-corpora 1 flask 1 elasticsearch 1 corpus-tools 1 nlp 1 complexico 1 barrier-island-languages 1 nuuchahnulth 1 wakashan 1 algonquian 1 historical-linguistics 1 glosses-list 1 rmarkdown 1 typology 1 glossing 1 awesome 1 awesome-list 1 human-language 1 concepticon-gloss 1 concepticon 1 language-learning 1 language-resources 1 list 1 lexical-resource 1 lexical-analysis 1 language-processing 1 multimedia-systems 1 multimedia-player 1 linguistic-annotation-framework 1 interlinear-text 1 uralic-languages 1 ocr 1 open-archives 1 xml 1 praat 1 voice-activity-detection 1 elan 1 giellatekno 1 finite-state-transducer 1 theoretical-linguistics 1 syntax 1 syntactic-theory 1 interrogatives 1 focus 1 cartography 1 ffmpeg 1 siwu 1 mimetics 1 japanese 1 kholosi 1 indo-european 1 indo-aryan 1 toba-batak 1