GitHub topics: universal-dependencies
stanfordnlp/stanza
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
Language: Python - Size: 82.3 MB - Last synced at: 2 days ago - Pushed at: 16 days ago - Stars: 7,461 - Forks: 907

udapi/udapi-python
Python framework for processing Universal Dependencies data
Language: Python - Size: 3.03 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 57 - Forks: 32

UniversalDependencies/UD_Portuguese-Bosque
This Universal Dependencies (UD) Portuguese treebank.
Language: Common Lisp - Size: 209 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 50 - Forks: 12

nlp-uoregon/trankit
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Language: Python - Size: 1.06 MB - Last synced at: 4 days ago - Pushed at: 7 months ago - Stars: 753 - Forks: 103

mehmetoguzderin/ud-2025h1-otk
Universal Dependencies and Old Turkish in 2025H1
Size: 1.29 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1 - Forks: 0

adobe/NLP-Cube
Natural Language Processing Pipeline - Sentence Splitting, Tokenization, Lemmatization, Part-of-speech Tagging and Dependency Parsing
Language: HTML - Size: 11.1 MB - Last synced at: 18 days ago - Pushed at: 6 months ago - Stars: 558 - Forks: 94

amir-zeldes/gum
Repository for the Georgetown University Multilayer Corpus (GUM)
Language: Python - Size: 1.35 GB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 94 - Forks: 49

hulln/UD-WALS-Linguistic-Patterns
Repository for a university project exploring linguistic patterns with UD and WALS, featuring Slovenian corpora analysis and transparent documentation.
Language: HTML - Size: 18.4 MB - Last synced at: 27 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 0

Aatlantise/k-snacs-ud
k-sncacs dataset for Universal Depdencies
Language: Python - Size: 12.9 MB - Last synced at: 29 days ago - Pushed at: 30 days ago - Stars: 0 - Forks: 0

pyconll/pyconll
A minimal, pure Python library to interface with CoNLL-U format files.
Language: Python - Size: 505 KB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 151 - Forks: 12

huspacy/huspacy
HuSpaCy: industrial-strength Hungarian natural language processing
Language: Python - Size: 2.2 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 165 - Forks: 15

StarlangSoftware/TurkishDependencyParser-Cy
Dependency Parse Tree Processing Library
Language: Cython - Size: 25.7 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

StarlangSoftware/TurkishDependencyParser-Py
Dependency Parse Tree Processing Library
Language: Python - Size: 25.8 MB - Last synced at: about 20 hours ago - Pushed at: about 2 months ago - Stars: 10 - Forks: 2

StarlangSoftware/UniversalDependencyParser
Universal Dependency Annotation Interface and Basic Dependency Parsing Algorithms
Language: Java - Size: 1.3 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1

amir-zeldes/HebPipe
An NLP pipeline for Hebrew
Language: Lex - Size: 8.4 MB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 37 - Forks: 12

Hyperparticle/udify
A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology tags, lemmas, and dependency trees.
Language: Python - Size: 945 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 222 - Forks: 58

ivan-kleshnin/spacy-benchmarks
Comparison of Spacy performance with different architectures, corpuses, hyperparams...
Size: 43.9 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

EmilStenstrom/json-tagger
A JSON API to tag a sentence with part of speech tags. Uses UDPipe, so support for hundreds of languages.
Language: HTML - Size: 663 KB - Last synced at: 26 days ago - Pushed at: 5 months ago - Stars: 14 - Forks: 2

alexeyev/apertium2ud
tag parser and converter between the two tagsets: Apertium (enhanced Leipzig?) and the one used in UD
Language: Python - Size: 72.3 KB - Last synced at: 19 days ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

TakeLab/spacy-udpipe
spaCy + UDPipe
Language: Python - Size: 104 KB - Last synced at: 8 days ago - Pushed at: about 3 years ago - Stars: 161 - Forks: 10

StarlangSoftware/TurkishDependencyParser-CPP
Dependency Parse Tree Processing Library
Language: C++ - Size: 51.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

StarlangSoftware/TurkishDependencyParser-CS
Dependency Parse Tree Processing Library
Language: C# - Size: 25.8 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

alexeyev/tratreetra
simple syntactic transfer based on the treebank translation
Language: Python - Size: 42 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

StarlangSoftware/UniversalDependencyParser-CPP
Universal Dependency Annotation Interface and Basic Dependency Parsing Algorithms
Language: C++ - Size: 24.4 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 1

360er0/COMBO
COMBO is jointly trained tagger, lemmatizer and dependency parser.
Language: Python - Size: 38.1 KB - Last synced at: 7 days ago - Pushed at: about 2 years ago - Stars: 35 - Forks: 8

rhdunn/opennlp-extensions
A set of OpenNLP extensions for reading and training CoNLL-X and CoNLL-U files.
Language: Kotlin - Size: 236 KB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

LoicGrobol/ginger
Format conversion and graphical representation of [Universal Dependencies](http://universaldependencies.org) trees.
Language: Python - Size: 289 KB - Last synced at: 29 days ago - Pushed at: 8 months ago - Stars: 12 - Forks: 1

unimorph/ud-compatibility
marry.py: A utility for converting Universal Dependencies–annotated corpora to UniMorph
Language: Python - Size: 48.8 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 11 - Forks: 6

interrogator/conll-df
CONLL-U to Pandas DataFrame
Language: Python - Size: 14.6 KB - Last synced at: 26 days ago - Pushed at: over 7 years ago - Stars: 31 - Forks: 9

kscanne/grammatach
Cód a bhaineann le gramadach Ghaeilge na hÉireann, Gaeilge na hAlban, agus Gaeilge Mhanann
Language: Python - Size: 245 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

StarlangSoftware/UniversalDependencyParser-CS
Universal Dependency Annotation Interface and Basic Dependency Parsing Algorithms
Language: C# - Size: 220 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

StarlangSoftware/UniversalDependencyParser-Js
Universal Dependency Annotation Interface and Basic Dependency Parsing Algorithms
Language: TypeScript - Size: 127 KB - Last synced at: 16 days ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

kscanne/gaelg
NLP resources for Manx Gaelic, mainly in support of the gv2ga MT engine
Language: Perl - Size: 11.3 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 1

fractaldragonflies/universal-dependencies
Modules related to reporting and work with universal dependencies for various languages
Language: Jupyter Notebook - Size: 875 KB - Last synced at: 9 months ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

furkanakkurt1335/boat
Boğaziçi University Annotation Tool
Language: TeX - Size: 16.2 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 0

udon2/udon2
A package for manipulating Universal Dependencies trees
Language: C++ - Size: 6.92 MB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 0

StarlangSoftware/TurkishDependencyParser
Dependency Parse Tree Processing Library
Language: Java - Size: 30.3 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 4 - Forks: 3

ud-turkic/udtw23
Resources for UD Turkic Workshop 2023
Size: 6.29 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 3 - Forks: 4

StarlangSoftware/TurkishDependencyParser-Swift
Dependency Parse Tree Processing Library
Language: Swift - Size: 18.3 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

StarlangSoftware/TurkishDependencyParser-Js
Dependency Parse Tree Processing Library
Language: TypeScript - Size: 32.9 MB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

div5yesh/natural-language-processing
Implementation of unigram/bigram language models, noisy channel and pointwise mutual information for natural language processing.
Language: Python - Size: 9.77 KB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

soutsios/pos-tagger-bert-tensorflow
BERT fine-tuning for POS tagging task (google's tensorflow)
Language: Jupyter Notebook - Size: 194 KB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 9 - Forks: 4

fostroll/mordl
Morphological parser (POS, lemmata, NER etc.)
Language: Python - Size: 3.75 MB - Last synced at: 23 days ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

asiryk/natural-language-processing
Natural language processing on Universal Dependencies framework
Language: TeX - Size: 1.03 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

sivareddyg/UDepLambda
A framework to convert Universal Dependencies to Logical Forms
Language: Java - Size: 20.2 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 87 - Forks: 23

soutsios/pos-tagger-bert
BERT fine-tuning for POS tagging task (Keras)
Language: Jupyter Notebook - Size: 711 KB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 72 - Forks: 27

eric11eca/Udep2Mono
Universal Dependency polarization for monotonicity based natural language inference
Language: Jupyter Notebook - Size: 13.8 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 7 - Forks: 1

PhyloStar/UniversalCEFRScoring Fork of nishkalavallabhi/UniversalCEFRScoring
Exploring the idea of a generic, language agnostic, CEFR level classifier
Language: Python - Size: 16.5 MB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 1

habeanf/yap
Yet Another (natural language) Parser
Language: Go - Size: 47.4 MB - Last synced at: 11 months ago - Pushed at: about 6 years ago - Stars: 43 - Forks: 30

mova-institute/zoloto
розмічений руками морфо’, синт’, кореф’ корпус української мови
Size: 50.6 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 26 - Forks: 2

ElisaDiNuovo/VALICO-UD_guidelines
Annotation guidelines of the VALICO-UD treebank
Size: 7.83 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

AIRI-Institute/Probing_framework
Framework for probing tasks
Language: Python - Size: 65.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 23 - Forks: 10

dsindex/syntaxnet
reference code for syntaxnet
Language: Python - Size: 102 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 197 - Forks: 57

rhdunn/conllu-en-validator
A tool for validating English CoNLL-U data files.
Language: Python - Size: 334 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vukbatanovic/SETimes.SR
A Reference Training Corpus of Serbian
Language: Python - Size: 1.33 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

fostroll/corpuscula
Toolkit that simplifies corpus processing
Language: Python - Size: 36.1 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

qanastek/French-Part-Of-Speech-Tagging
Repository for the source code of the HuggingFace Space named qanastek/French-Part-Of-Speech-Tagging
Language: Python - Size: 41 KB - Last synced at: 15 days ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 1

soutsios/pos_tagger_rnn
BiRNN POS tagger experiments with pre-trained word embeddings, character embeddings and ELMo contextualized representations
Language: Jupyter Notebook - Size: 1.36 MB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 1

zacateras/udstats
Visual statistics for Universal Dependencies
Language: Python - Size: 3.31 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

harisont/concept-alignment
Syntax-based Concept Alignment for Machine Translation
Language: TeX - Size: 36.2 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

kasteph/CoNLL-U.tmLanguage
Syntax highlighting for CoNLL-U (.conllu, .conll) files on Sublime Text.
Size: 757 KB - Last synced at: 3 days ago - Pushed at: about 7 years ago - Stars: 9 - Forks: 0

Akshayanti/Masters-Thesis-CUNI-2020 📦
Experimental Data and Annotations for Master's Thesis as submitted to CUNI in July 2020
Language: Python - Size: 441 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

sleepyrob/cl2-project
Contains the files of the group project for the "Computational linguistics II" course (A.Y. 2020-21, University of Pisa).
Size: 2.79 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

arademaker/hs-conllu
CoNLL-U/UD library
Language: Haskell - Size: 141 KB - Last synced at: 13 days ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 3

ioverho/morph_tag_lemmatize
Research code used to implement SoTA joint morphological taggers and lemmatizers in context. Reproduction and extension of the SIGMORPHON/CONLL 2019 Shared Task 2.
Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

Esukhia/ud-pos-tagger-bo
Basic Universal Dependencies Part-of-Speech Tagger for Tibetan
Language: Python - Size: 2.43 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 5 - Forks: 2

zoltan-nz/typescript-sandbox
Typescript Sandbox
Language: TypeScript - Size: 6.33 MB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 2 - Forks: 0

unipv-larl/preverbs
The repository for the paper Annotating “Absolute” Preverbs in the Homeric and Vedic Treebanks.
Language: Python - Size: 4.14 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

tnitn/SPARQL-project
treebanks querying from TüNDRA to SPARQL
Language: Python - Size: 5.05 MB - Last synced at: 7 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

prannerta100/ud-pos-tagger
Hidden Markov Model based POS tagging for 60+ languages on universal dependencies (UD) data
Language: HTML - Size: 1.05 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

conllul/UL_Hebrew-HTB
CoNLL-UL Repository for UD_Hebrew
Size: 9.24 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

yyellin/tacred-enrichment
tacred-encrichment is a set of modules for supplementing the TACRED dataset with additional attributes, to be used by downstream RE neural networks
Language: Python - Size: 3.97 MB - Last synced at: 8 months ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

evelinacs/semantic_parsing_with_IRTGs
Experiments of developing an IRTG which simultaneously encodes transformations between phrase structure trees, dependency graphs and semantic graphs.
Language: Python - Size: 6.76 MB - Last synced at: 6 months ago - Pushed at: 9 months ago - Stars: 4 - Forks: 3

bureaucratic-labs/conllu
CoNLL-U format parser
Language: Go - Size: 7.81 KB - Last synced at: 11 months ago - Pushed at: almost 8 years ago - Stars: 3 - Forks: 0

hellomasaya/linguistics-data
Assignments covered as part of the course - Linguistics Data: Collection and Modeling
Language: Lex - Size: 52.6 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

lakhdar/ZConllu
Universal dependencies and Conll-u tool
Language: CSS - Size: 142 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

fostroll/morra
Morphological parser (POS, lemmata, NER etc.)
Language: Python - Size: 1.26 MB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 1

odanoburu/conllu-mode
CoNLL-U major mode for emacs
Language: Emacs Lisp - Size: 430 KB - Last synced at: about 1 month ago - Pushed at: about 5 years ago - Stars: 10 - Forks: 3

conllul/conllul.github.io
ConLL-UL Homepage
Language: Python - Size: 37.1 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 0

olesar/UD_Lithuanian
Resourses and documentation for a Lithuanian Universal Dependencies treebank
Size: 13.7 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

MartinXPN/sentence2tags
Extraction of lemma, morphological tags, and part of speech tag for each word in a given sentence
Language: Python - Size: 40 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

matanox/convert-tool
Scaffold for converting between CoNLL-U and spreadsheet formats
Language: Clojure - Size: 15.6 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

okalldal/gf-exjobb
Probabilistic natural language disambiguation using expectation maximization
Language: Grammatical Framework - Size: 197 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 1
