GitHub topics: lexibank1
engganolang/digitised-holle-list
Cross-Linguistic Data Format (CLDF) dataset for The Digitised, Searchable Holle List in Stokhof (1980). The interactive version is deployed as a webpage 👇.
Language: HTML - Size: 8.99 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

engganolang/holle-list-enggano-1895
Cross-Linguistic Data Format (CLDF) dataset for the Enggano word list from the late 19th century (c1895) based on the Holle List.
Language: R - Size: 478 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

engganolang/enolex
A repository of R codes and curated dataset for a Shiny web database titled "EnoLEX, a diachronic lexical database for the Enggano language". Access the database via the link below 👇.
Language: R - Size: 29.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 2

complexico/vrosenberg1853-numeral
Cross-Linguistic Data Format (CLDF) dataset derived from von Rosenberg's "De Mentawei-Eilanden en Hunne Bewoners" from 1853 for the comparative numeral data (p. 434). It is another practice session with CLDF to handle/test multple languages.
Language: Python - Size: 85 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

complexico/lingpy-practice
A repository for my (GPWR) practice with the lingpy module.
Language: Jupyter Notebook - Size: 233 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

lexibank/abvd
CLDF dataset derived from Greenhill et al.'s "Austronesian Basic Vocabulary Database" from 2020.
Language: TeX - Size: 78 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 2

lexibank/csd
CLDF dataset derived from Rankin et al.'s "Comparative Siouan Dictionary" from 2015.
Language: Python - Size: 10.5 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

lexibank/tppsr
Tableaux Phonétiques des Patois Suisses Romands
Language: Python - Size: 10.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

lexibank/transnewguineaorg
CLDF dataset derived from Greenhill's "TransNewGuinea.org" from 2015
Language: TeX - Size: 30.1 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 1

complexico/mentawai-word-list-1853
Cross-Linguistic Data Format (CLDF) dataset derived from von Rosenberg's "De Mentawei-Eilanden en Hunne Bewoners" from 1853.
Language: Python - Size: 223 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

lexibank/johanssonsoundsymbolic
CLDF dataset derived from the Johansson et al.'s "The typology of sound symbolism" from 2020
Language: TeX - Size: 24.8 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

lexibank/hantganbangime
CLDF dataset supplementing Hantgan and List's "Bangime: Secret Language, Language Isolate, or Language Island?" (to appear)
Language: TeX - Size: 507 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/lieberherrkhobwa
CLDF dataset derived from Lieberherr and Bodt's "Comparative Wordlists of Kho-Bwa" from 2017
Language: Python - Size: 582 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/lundgrenomagoa
CLDF dataset derived from Lundgren's "Phonological Reconstruction of Proto-Omagua–Kokama–Tupinambá" from 2020
Language: Python - Size: 719 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

lexibank/yanglalo
CLDF dataset derived from Yang's "Lalo Regional Varieties" from 2011
Language: TeX - Size: 2.01 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/suntb
CLDF dataset derived from Sūn's "Tibeto-Burman Phonology and Lexicon" from 1991
Language: Python - Size: 9.51 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/pharaocoracholaztecan
CLDF dataset derived from Pharao Hansen's "Investigation of the Relation between Proto-Náhuatl and Proto-Corachol" from 2020
Language: Python - Size: 437 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/leeainu
CLDF dataset derived from Lee and Hasegawa's "Evolution of the Ainu Language in Space and Time" from 2013
Language: Python - Size: 443 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/northeuralex
CLDF dataset derived from Dellert et al.'s "NorthEuraLex" from 2020
Language: TeX - Size: 36.7 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 2

lexibank/mcelhanonhuon
CLDF dataset derived from McElhanon's "Preliminary Observations on Huon Peninsula Languages" from 1967
Language: Python - Size: 393 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/hsiuhmongmien
Language: Python - Size: 1.73 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/joophonosemantic
CLDF dataset derived from Joo's "Phonosemantic Biases" from 2019
Language: TeX - Size: 633 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

lexibank/luangthongkumkaren
CLDF dataset derived from Luangthongkum's "Proto-Karen Phonology and Lexicon" from 2019
Language: Python - Size: 908 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 1

lexibank/leekoreanic
CLDF dataset derived from Lee's "Sketch of Language History in the Korean Peninsula" from 2015
Language: Python - Size: 353 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/leejaponic
CLDF dataset derived from Lee and Hasegawa's "Bayesian phylogenetic analysis supports an agricultural origin of Japonic languages" from 2011
Language: Python - Size: 1.98 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/kraftchadic
CLDF dataset derived from Kraft's "Chadic Wordlists" from 1981
Language: Python - Size: 4.03 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/kleinewillinghoeferbikwinjen
CLDF dataset derived from Kleinewillinghöfer's "Bikwin-Jen Comparative Wordlist" from 2015
Language: Python - Size: 219 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/galuciotupi
CLDF dataset derived from Galucio et al.'s "Lexical Distances within the Tupian Linguistic family" from 2015
Language: Python - Size: 517 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/mannburmish
CLDF dataset derived from Mann's "Reconstruction of Proto-Northern Burmish" from 1998
Language: Python - Size: 359 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/mitterhoferbena
CLDF dataset derived from Mitterhofer's "Dialect Survey of Bena" from 2013
Language: Python - Size: 223 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 2

lexibank/polyglottaafricana
CLDF dataset derived from Koelle's "Polyglotta Africana" from 1854
Language: Python - Size: 7.94 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/walworthpolynesian
CLDF dataset derived from Walworth's "Polynesian Segmented Data" from 2019
Language: TeX - Size: 1.72 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

lexibank/simsrma
CLDF dataset derived from Sims' "Diachrony of Tone in Proto-Rma" from 2020
Language: Python - Size: 313 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/wheelerutoaztecan
CLDF dataset derived from Wheeler and Whiteley's "Evolution of Uto-Aztecan Languages" from 2014
Language: Python - Size: 1.07 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/walkerarawakan
Walker and Ribeiro (2011) Arawakan dataset
Language: TeX - Size: 1.17 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/peirosaustroasiatic
Peiros (2004) data on Austro-Asiatic languages
Language: Python - Size: 5.7 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 2

lexibank/starostinkaren
CLDF dataset derived from Starostin's "Annotated Swadesh Wordlists for the Karen Group" from 2017
Language: Python - Size: 131 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/bodtkhobwa
Work on Kho-Bwa subgrouping with Tim Bodt
Language: Python - Size: 1.4 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/birchallchapacuran
CLDF dataset derived from Birchall et al.'s "A Combined Comparative and Phylogenetic Analysis of the Chapacuran Language Family" from 2016
Language: Rich Text Format - Size: 503 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/syrjaenenuralic
CLDF datasets derived from Syrjänen et al.'s "Shedding more light on language classification" from 2013
Language: Python - Size: 130 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/sawkatokaleya
CLDF dataset derived from Sawka et al.'s "Toka-Leya of Zambia" from 2019
Language: Python - Size: 156 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/saenkoromance
CLDF Dataset derived from Saenko's "Annotated Swadesh wordlists for the Romance group" from 2015
Language: Python - Size: 323 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/robinsonap
CLDF dataset derived from Robinson and Holton's "Internal Classification of the Alor-Pantar Language Family" from 2012
Language: Python - Size: 1.03 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/tls
CLDF dataset derived from Nurse and Philippson's "Tanzania Language Survey" from 1975
Language: Python - Size: 30.7 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

lexibank/zgraggenmadang
CLDF dataset derived from Z'graggen's "Madang Comparative Wordlists" from 1980.
Language: Python - Size: 6.47 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 2

lexibank/savelyevturkic
CLDF dataset derived from Savelyev and Robbeet's "Internal Structure of the Turkic Language Family" from 2020
Language: TeX - Size: 1.56 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/wichmannmixezoquean
CLDF dataset derived from Wichmann's "Lexicostatistical Dataset of Mixe-Zoquean" from 2006
Language: TeX - Size: 164 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

lexibank/zhaobai
CLDF dataset derived from Zhao's "Investigations of Zhaozhuang Bai" from 2006
Language: Python - Size: 47.9 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/wangbai
CLDF dataset derived from Wang's "Language Contact and Language Comparison" from 2004
Language: Python - Size: 737 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/sagartst
Dataset of Sino-Tibetan Languages (Cognate-Coded)
Language: TeX - Size: 5.73 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 4 - Forks: 1

lexibank/tryonsolomon
CLDF dataset derived from Tryon and Hackman's "Solomon Islands Languages" from 1983
Language: Python - Size: 3.74 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/dravlex
CLDF dataset derived from Kolipakam et al.'s "DravLex:" from 2018.
Language: Python - Size: 371 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/davletshinaztecan
CLDF dataset derived from Davletshin's "Proto-Aztecan languages" from 2012
Language: TeX - Size: 302 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/clarkkimmun
CLDF dataset derived from Clark's "Phonological Analysis and Comparison of Two Kim Mun Varieties" from 2008
Language: Python - Size: 1.35 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/chindialectsurvey
CLDF dataset derived from the LSDO's "Chin Dialect Data" from 2019
Language: Python - Size: 2.73 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/chenhmongmien
CLDF dataset derived from Chén's "Miao and Yao Language" from 2012
Language: Python - Size: 8.44 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 2

lexibank/chaconcolumbian
CLDF dataset derived from Chacon's "Annotated Swadesh Lists for Tukanoan Languages" from 2017
Language: Python - Size: 3.63 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/chaconbaniwa
CLDF dataset derived from Chacon et al.'s "Diversity of Arawakan Languages" from 2019
Language: Python - Size: 641 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/chaconarawakan
CLDF dataset derived from Chacon's "Annotated Swadesh Lists for Arawakan Languages" from 2017
Language: TeX - Size: 254 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/castroyi
CLDF dataset derived from Castro et al.'s "Yi Varieties in Heqing" from 2010
Language: Python - Size: 385 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/castrosui
CLDF dataset derived from Castro's "Sui Dialect Research" from 2015
Language: Python - Size: 1.07 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/cals
CLDF dataset derived from Mennecier et al.'s "Central Asian Language Survey" from 2016
Language: Python - Size: 2.66 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/allenbai
CLDF dataset derived from Allen's "Bai Dialect Survey" from 2007
Language: Python - Size: 888 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/abrahammonpa
Dataset by Abraham et al. (2018) on selected varieties in Western Arunachal Pradesh
Language: Python - Size: 732 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/tolmiebritishcolumbia
CLDF dataset derived from Tolmie and Dawson's "Comparative Vocabulary of the Indigenous Peoples in British Columbia" from 1884
Language: Python - Size: 386 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/bremerberta
CLDF dataset derived from Bremer's "Sociolinguistic Survey of Six Berta Speech Varieties in Ethiopia" from 2016
Language: Python - Size: 165 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/halenepal
CLDF dataset derived from Hale's "Wordlists in Selected Languages of Nepal" from 1973
Language: Python - Size: 3.14 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

lexibank/bantubvd
CLDF dataset derived from Greenhill and Gray’s "Bantu Basic Vocabulary Database" from 2015
Language: TeX - Size: 581 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/bowernpny
CLDF dataset derived from Bowern and Atkinson's "Internal Structure of Pama-Nyungan" from 2012
Language: Python - Size: 12.4 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

lexibank/aaleykusunda
CLDF dataset derived from Aaley and Bodt's "New Kusunda data: A list of 250 concepts" from 2020
Language: Python - Size: 239 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lexibank/beidasinitic
CLDF dataset derived from Beijing University's "Chinese Dialect Vocabularies" from 1964
Language: Python - Size: 3.96 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

lexibank/zhangrgyalrong
Old Chinese Gyalrong cognates
Language: Python - Size: 333 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

lexibank/kitchensemitic
CLDF dataset derived from Kitchen et al.'s "Bayesian phylogenetic analysis of Semitic languages" from 2009
Language: TeX - Size: 601 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 1

lexibank/diacl
CLDF dataset derived from Carling's "Diachronic Atlas of Comparative Linguistics" from 2017
Language: TeX - Size: 20.6 MB - Last synced at: 4 months ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

intercontinental-dictionary-series/ids
CLDF dataset derived from Key and Comrie's "Intercontinental Dictionary Series" from 2015
Language: TeX - Size: 105 MB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

lexibank/servamalagasy 📦
CLDF dataset derived from Serva et al.'s "Malagasy dialects and the peopling of Madagascar" from 2011
Language: Python - Size: 116 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 1

lexibank/asjp
CLDF dataset derived from Wichmann et al.'s "ASJP Database"
Language: TeX - Size: 62.3 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

lexibank/wold
CLDF dataset derived from Haspelmath and Tadmor's "World Loanword Database" from 2009
Language: TeX - Size: 41.5 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 1

lexibank/wanghmongmien
CLDF datset derived from Wang's "Comparative Study of Miao-Yao Languages" from 2015
Language: Python - Size: 270 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 1

lexibank/bowerntasmanian
CLDF dataset derived from Bowerns's "The riddle of the Tasmanian languages" from 2012
Language: Python - Size: 459 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

lexibank/hayniecolorterms
CLDF dataset derived from Haynie and Bowern's "Phylogenetic Approach to the Evolution of Color Term Systems" from 2016
Language: Python - Size: 142 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

lexibank/powerma
CLDF dataset accompanying Power et al.'s "Evolutionary Dynamics in the Dispersal of Sign Languages" from 2020
Language: TeX - Size: 306 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1
