An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: lexibank1

engganolang/digitised-holle-list

Cross-Linguistic Data Format (CLDF) dataset for The Digitised, Searchable Holle List in Stokhof (1980). The interactive version is deployed as a webpage 👇.

Language: HTML - Size: 8.99 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

engganolang/holle-list-enggano-1895

Cross-Linguistic Data Format (CLDF) dataset for the Enggano word list from the late 19th century (c1895) based on the Holle List.

Language: R - Size: 478 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

engganolang/enolex

A repository of R codes and curated dataset for a Shiny web database titled "EnoLEX, a diachronic lexical database for the Enggano language". Access the database via the link below 👇.

Language: R - Size: 29.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 2

complexico/vrosenberg1853-numeral

Cross-Linguistic Data Format (CLDF) dataset derived from von Rosenberg's "De Mentawei-Eilanden en Hunne Bewoners" from 1853 for the comparative numeral data (p. 434). It is another practice session with CLDF to handle/test multple languages.

Language: Python - Size: 85 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

complexico/lingpy-practice

A repository for my (GPWR) practice with the lingpy module.

Language: Jupyter Notebook - Size: 233 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

lexibank/abvd

CLDF dataset derived from Greenhill et al.'s "Austronesian Basic Vocabulary Database" from 2020.

Language: TeX - Size: 78 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 2

lexibank/csd

CLDF dataset derived from Rankin et al.'s "Comparative Siouan Dictionary" from 2015.

Language: Python - Size: 10.5 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

lexibank/tppsr

Tableaux Phonétiques des Patois Suisses Romands

Language: Python - Size: 10.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

lexibank/transnewguineaorg

CLDF dataset derived from Greenhill's "TransNewGuinea.org" from 2015

Language: TeX - Size: 30.1 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 1

complexico/mentawai-word-list-1853

Cross-Linguistic Data Format (CLDF) dataset derived from von Rosenberg's "De Mentawei-Eilanden en Hunne Bewoners" from 1853.

Language: Python - Size: 223 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

lexibank/johanssonsoundsymbolic

CLDF dataset derived from the Johansson et al.'s "The typology of sound symbolism" from 2020

Language: TeX - Size: 24.8 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

lexibank/hantganbangime

CLDF dataset supplementing Hantgan and List's "Bangime: Secret Language, Language Isolate, or Language Island?" (to appear)

Language: TeX - Size: 507 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/lieberherrkhobwa

CLDF dataset derived from Lieberherr and Bodt's "Comparative Wordlists of Kho-Bwa" from 2017

Language: Python - Size: 582 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/lundgrenomagoa

CLDF dataset derived from Lundgren's "Phonological Reconstruction of Proto-Omagua–Kokama–Tupinambá" from 2020

Language: Python - Size: 719 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

lexibank/yanglalo

CLDF dataset derived from Yang's "Lalo Regional Varieties" from 2011

Language: TeX - Size: 2.01 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/suntb

CLDF dataset derived from Sūn's "Tibeto-Burman Phonology and Lexicon" from 1991

Language: Python - Size: 9.51 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/pharaocoracholaztecan

CLDF dataset derived from Pharao Hansen's "Investigation of the Relation between Proto-Náhuatl and Proto-Corachol" from 2020

Language: Python - Size: 437 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/leeainu

CLDF dataset derived from Lee and Hasegawa's "Evolution of the Ainu Language in Space and Time" from 2013

Language: Python - Size: 443 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/northeuralex

CLDF dataset derived from Dellert et al.'s "NorthEuraLex" from 2020

Language: TeX - Size: 36.7 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 2

lexibank/mcelhanonhuon

CLDF dataset derived from McElhanon's "Preliminary Observations on Huon Peninsula Languages" from 1967

Language: Python - Size: 393 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/hsiuhmongmien

Language: Python - Size: 1.73 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/joophonosemantic

CLDF dataset derived from Joo's "Phonosemantic Biases" from 2019

Language: TeX - Size: 633 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

lexibank/luangthongkumkaren

CLDF dataset derived from Luangthongkum's "Proto-Karen Phonology and Lexicon" from 2019

Language: Python - Size: 908 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 1

lexibank/leekoreanic

CLDF dataset derived from Lee's "Sketch of Language History in the Korean Peninsula" from 2015

Language: Python - Size: 353 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/leejaponic

CLDF dataset derived from Lee and Hasegawa's "Bayesian phylogenetic analysis supports an agricultural origin of Japonic languages" from 2011

Language: Python - Size: 1.98 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/kraftchadic

CLDF dataset derived from Kraft's "Chadic Wordlists" from 1981

Language: Python - Size: 4.03 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/kleinewillinghoeferbikwinjen

CLDF dataset derived from Kleinewillinghöfer's "Bikwin-Jen Comparative Wordlist" from 2015

Language: Python - Size: 219 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/galuciotupi

CLDF dataset derived from Galucio et al.'s "Lexical Distances within the Tupian Linguistic family" from 2015

Language: Python - Size: 517 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/mannburmish

CLDF dataset derived from Mann's "Reconstruction of Proto-Northern Burmish" from 1998

Language: Python - Size: 359 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/mitterhoferbena

CLDF dataset derived from Mitterhofer's "Dialect Survey of Bena" from 2013

Language: Python - Size: 223 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 2

lexibank/polyglottaafricana

CLDF dataset derived from Koelle's "Polyglotta Africana" from 1854

Language: Python - Size: 7.94 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/walworthpolynesian

CLDF dataset derived from Walworth's "Polynesian Segmented Data" from 2019

Language: TeX - Size: 1.72 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

lexibank/simsrma

CLDF dataset derived from Sims' "Diachrony of Tone in Proto-Rma" from 2020

Language: Python - Size: 313 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/wheelerutoaztecan

CLDF dataset derived from Wheeler and Whiteley's "Evolution of Uto-Aztecan Languages" from 2014

Language: Python - Size: 1.07 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/walkerarawakan

Walker and Ribeiro (2011) Arawakan dataset

Language: TeX - Size: 1.17 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/peirosaustroasiatic

Peiros (2004) data on Austro-Asiatic languages

Language: Python - Size: 5.7 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 2

lexibank/starostinkaren

CLDF dataset derived from Starostin's "Annotated Swadesh Wordlists for the Karen Group" from 2017

Language: Python - Size: 131 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/bodtkhobwa

Work on Kho-Bwa subgrouping with Tim Bodt

Language: Python - Size: 1.4 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/birchallchapacuran

CLDF dataset derived from Birchall et al.'s "A Combined Comparative and Phylogenetic Analysis of the Chapacuran Language Family" from 2016

Language: Rich Text Format - Size: 503 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/syrjaenenuralic

CLDF datasets derived from Syrjänen et al.'s "Shedding more light on language classification" from 2013

Language: Python - Size: 130 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/sawkatokaleya

CLDF dataset derived from Sawka et al.'s "Toka-Leya of Zambia" from 2019

Language: Python - Size: 156 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/saenkoromance

CLDF Dataset derived from Saenko's "Annotated Swadesh wordlists for the Romance group" from 2015

Language: Python - Size: 323 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/robinsonap

CLDF dataset derived from Robinson and Holton's "Internal Classification of the Alor-Pantar Language Family" from 2012

Language: Python - Size: 1.03 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/tls

CLDF dataset derived from Nurse and Philippson's "Tanzania Language Survey" from 1975

Language: Python - Size: 30.7 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

lexibank/zgraggenmadang

CLDF dataset derived from Z'graggen's "Madang Comparative Wordlists" from 1980.

Language: Python - Size: 6.47 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 2

lexibank/savelyevturkic

CLDF dataset derived from Savelyev and Robbeet's "Internal Structure of the Turkic Language Family" from 2020

Language: TeX - Size: 1.56 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/wichmannmixezoquean

CLDF dataset derived from Wichmann's "Lexicostatistical Dataset of Mixe-Zoquean" from 2006

Language: TeX - Size: 164 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

lexibank/zhaobai

CLDF dataset derived from Zhao's "Investigations of Zhaozhuang Bai" from 2006

Language: Python - Size: 47.9 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/wangbai

CLDF dataset derived from Wang's "Language Contact and Language Comparison" from 2004

Language: Python - Size: 737 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/sagartst

Dataset of Sino-Tibetan Languages (Cognate-Coded)

Language: TeX - Size: 5.73 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 4 - Forks: 1

lexibank/tryonsolomon

CLDF dataset derived from Tryon and Hackman's "Solomon Islands Languages" from 1983

Language: Python - Size: 3.74 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/dravlex

CLDF dataset derived from Kolipakam et al.'s "DravLex:" from 2018.

Language: Python - Size: 371 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/davletshinaztecan

CLDF dataset derived from Davletshin's "Proto-Aztecan languages" from 2012

Language: TeX - Size: 302 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/clarkkimmun

CLDF dataset derived from Clark's "Phonological Analysis and Comparison of Two Kim Mun Varieties" from 2008

Language: Python - Size: 1.35 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/chindialectsurvey

CLDF dataset derived from the LSDO's "Chin Dialect Data" from 2019

Language: Python - Size: 2.73 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/chenhmongmien

CLDF dataset derived from Chén's "Miao and Yao Language" from 2012

Language: Python - Size: 8.44 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 2

lexibank/chaconcolumbian

CLDF dataset derived from Chacon's "Annotated Swadesh Lists for Tukanoan Languages" from 2017

Language: Python - Size: 3.63 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/chaconbaniwa

CLDF dataset derived from Chacon et al.'s "Diversity of Arawakan Languages" from 2019

Language: Python - Size: 641 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/chaconarawakan

CLDF dataset derived from Chacon's "Annotated Swadesh Lists for Arawakan Languages" from 2017

Language: TeX - Size: 254 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/castroyi

CLDF dataset derived from Castro et al.'s "Yi Varieties in Heqing" from 2010

Language: Python - Size: 385 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/castrosui

CLDF dataset derived from Castro's "Sui Dialect Research" from 2015

Language: Python - Size: 1.07 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/cals

CLDF dataset derived from Mennecier et al.'s "Central Asian Language Survey" from 2016

Language: Python - Size: 2.66 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/allenbai

CLDF dataset derived from Allen's "Bai Dialect Survey" from 2007

Language: Python - Size: 888 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/abrahammonpa

Dataset by Abraham et al. (2018) on selected varieties in Western Arunachal Pradesh

Language: Python - Size: 732 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/tolmiebritishcolumbia

CLDF dataset derived from Tolmie and Dawson's "Comparative Vocabulary of the Indigenous Peoples in British Columbia" from 1884

Language: Python - Size: 386 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/bremerberta

CLDF dataset derived from Bremer's "Sociolinguistic Survey of Six Berta Speech Varieties in Ethiopia" from 2016

Language: Python - Size: 165 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/halenepal

CLDF dataset derived from Hale's "Wordlists in Selected Languages of Nepal" from 1973

Language: Python - Size: 3.14 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

lexibank/bantubvd

CLDF dataset derived from Greenhill and Gray’s "Bantu Basic Vocabulary Database" from 2015

Language: TeX - Size: 581 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/bowernpny

CLDF dataset derived from Bowern and Atkinson's "Internal Structure of Pama-Nyungan" from 2012

Language: Python - Size: 12.4 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/aaleykusunda

CLDF dataset derived from Aaley and Bodt's "New Kusunda data: A list of 250 concepts" from 2020

Language: Python - Size: 239 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/beidasinitic

CLDF dataset derived from Beijing University's "Chinese Dialect Vocabularies" from 1964

Language: Python - Size: 3.96 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

lexibank/zhangrgyalrong

Old Chinese Gyalrong cognates

Language: Python - Size: 333 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

lexibank/kitchensemitic

CLDF dataset derived from Kitchen et al.'s "Bayesian phylogenetic analysis of Semitic languages" from 2009

Language: TeX - Size: 601 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 1

lexibank/diacl

CLDF dataset derived from Carling's "Diachronic Atlas of Comparative Linguistics" from 2017

Language: TeX - Size: 20.6 MB - Last synced at: 4 months ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

intercontinental-dictionary-series/ids

CLDF dataset derived from Key and Comrie's "Intercontinental Dictionary Series" from 2015

Language: TeX - Size: 105 MB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

lexibank/servamalagasy 📦

CLDF dataset derived from Serva et al.'s "Malagasy dialects and the peopling of Madagascar" from 2011

Language: Python - Size: 116 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 1

lexibank/asjp

CLDF dataset derived from Wichmann et al.'s "ASJP Database"

Language: TeX - Size: 62.3 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

lexibank/wold

CLDF dataset derived from Haspelmath and Tadmor's "World Loanword Database" from 2009

Language: TeX - Size: 41.5 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 1

lexibank/wanghmongmien

CLDF datset derived from Wang's "Comparative Study of Miao-Yao Languages" from 2015

Language: Python - Size: 270 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 1

lexibank/bowerntasmanian

CLDF dataset derived from Bowerns's "The riddle of the Tasmanian languages" from 2012

Language: Python - Size: 459 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

lexibank/hayniecolorterms

CLDF dataset derived from Haynie and Bowern's "Phylogenetic Approach to the Evolution of Color Term Systems" from 2016

Language: Python - Size: 142 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

lexibank/powerma

CLDF dataset accompanying Power et al.'s "Evolutionary Dynamics in the Dispersal of Sign Languages" from 2020

Language: TeX - Size: 306 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1