An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: universal-dependencies

stanfordnlp/stanza

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Language: Python - Size: 82.3 MB - Last synced at: 2 days ago - Pushed at: 16 days ago - Stars: 7,461 - Forks: 907

udapi/udapi-python

Python framework for processing Universal Dependencies data

Language: Python - Size: 3.03 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 57 - Forks: 32

UniversalDependencies/UD_Portuguese-Bosque

This Universal Dependencies (UD) Portuguese treebank.

Language: Common Lisp - Size: 209 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 50 - Forks: 12

nlp-uoregon/trankit

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

Language: Python - Size: 1.06 MB - Last synced at: 4 days ago - Pushed at: 7 months ago - Stars: 753 - Forks: 103

mehmetoguzderin/ud-2025h1-otk

Universal Dependencies and Old Turkish in 2025H1

Size: 1.29 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1 - Forks: 0

adobe/NLP-Cube

Natural Language Processing Pipeline - Sentence Splitting, Tokenization, Lemmatization, Part-of-speech Tagging and Dependency Parsing

Language: HTML - Size: 11.1 MB - Last synced at: 18 days ago - Pushed at: 6 months ago - Stars: 558 - Forks: 94

amir-zeldes/gum

Repository for the Georgetown University Multilayer Corpus (GUM)

Language: Python - Size: 1.35 GB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 94 - Forks: 49

hulln/UD-WALS-Linguistic-Patterns

Repository for a university project exploring linguistic patterns with UD and WALS, featuring Slovenian corpora analysis and transparent documentation.

Language: HTML - Size: 18.4 MB - Last synced at: 27 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 0

Aatlantise/k-snacs-ud

k-sncacs dataset for Universal Depdencies

Language: Python - Size: 12.9 MB - Last synced at: 29 days ago - Pushed at: 30 days ago - Stars: 0 - Forks: 0

pyconll/pyconll

A minimal, pure Python library to interface with CoNLL-U format files.

Language: Python - Size: 505 KB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 151 - Forks: 12

huspacy/huspacy

HuSpaCy: industrial-strength Hungarian natural language processing

Language: Python - Size: 2.2 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 165 - Forks: 15

StarlangSoftware/TurkishDependencyParser-Cy

Dependency Parse Tree Processing Library

Language: Cython - Size: 25.7 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

StarlangSoftware/TurkishDependencyParser-Py

Dependency Parse Tree Processing Library

Language: Python - Size: 25.8 MB - Last synced at: about 20 hours ago - Pushed at: about 2 months ago - Stars: 10 - Forks: 2

StarlangSoftware/UniversalDependencyParser

Universal Dependency Annotation Interface and Basic Dependency Parsing Algorithms

Language: Java - Size: 1.3 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1

amir-zeldes/HebPipe

An NLP pipeline for Hebrew

Language: Lex - Size: 8.4 MB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 37 - Forks: 12

Hyperparticle/udify

A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology tags, lemmas, and dependency trees.

Language: Python - Size: 945 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 222 - Forks: 58

ivan-kleshnin/spacy-benchmarks

Comparison of Spacy performance with different architectures, corpuses, hyperparams...

Size: 43.9 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

EmilStenstrom/json-tagger

A JSON API to tag a sentence with part of speech tags. Uses UDPipe, so support for hundreds of languages.

Language: HTML - Size: 663 KB - Last synced at: 26 days ago - Pushed at: 5 months ago - Stars: 14 - Forks: 2

alexeyev/apertium2ud

tag parser and converter between the two tagsets: Apertium (enhanced Leipzig?) and the one used in UD

Language: Python - Size: 72.3 KB - Last synced at: 19 days ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

TakeLab/spacy-udpipe

spaCy + UDPipe

Language: Python - Size: 104 KB - Last synced at: 8 days ago - Pushed at: about 3 years ago - Stars: 161 - Forks: 10

StarlangSoftware/TurkishDependencyParser-CPP

Dependency Parse Tree Processing Library

Language: C++ - Size: 51.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

StarlangSoftware/TurkishDependencyParser-CS

Dependency Parse Tree Processing Library

Language: C# - Size: 25.8 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

alexeyev/tratreetra

simple syntactic transfer based on the treebank translation

Language: Python - Size: 42 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

StarlangSoftware/UniversalDependencyParser-CPP

Universal Dependency Annotation Interface and Basic Dependency Parsing Algorithms

Language: C++ - Size: 24.4 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 1

360er0/COMBO

COMBO is jointly trained tagger, lemmatizer and dependency parser.

Language: Python - Size: 38.1 KB - Last synced at: 7 days ago - Pushed at: about 2 years ago - Stars: 35 - Forks: 8

rhdunn/opennlp-extensions

A set of OpenNLP extensions for reading and training CoNLL-X and CoNLL-U files.

Language: Kotlin - Size: 236 KB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

LoicGrobol/ginger

Format conversion and graphical representation of [Universal Dependencies](http://universaldependencies.org) trees.

Language: Python - Size: 289 KB - Last synced at: 29 days ago - Pushed at: 8 months ago - Stars: 12 - Forks: 1

unimorph/ud-compatibility

marry.py: A utility for converting Universal Dependencies–annotated corpora to UniMorph

Language: Python - Size: 48.8 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 11 - Forks: 6

interrogator/conll-df

CONLL-U to Pandas DataFrame

Language: Python - Size: 14.6 KB - Last synced at: 26 days ago - Pushed at: over 7 years ago - Stars: 31 - Forks: 9

kscanne/grammatach

Cód a bhaineann le gramadach Ghaeilge na hÉireann, Gaeilge na hAlban, agus Gaeilge Mhanann

Language: Python - Size: 245 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

StarlangSoftware/UniversalDependencyParser-CS

Universal Dependency Annotation Interface and Basic Dependency Parsing Algorithms

Language: C# - Size: 220 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

StarlangSoftware/UniversalDependencyParser-Js

Universal Dependency Annotation Interface and Basic Dependency Parsing Algorithms

Language: TypeScript - Size: 127 KB - Last synced at: 16 days ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

kscanne/gaelg

NLP resources for Manx Gaelic, mainly in support of the gv2ga MT engine

Language: Perl - Size: 11.3 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 1

fractaldragonflies/universal-dependencies

Modules related to reporting and work with universal dependencies for various languages

Language: Jupyter Notebook - Size: 875 KB - Last synced at: 9 months ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

furkanakkurt1335/boat

Boğaziçi University Annotation Tool

Language: TeX - Size: 16.2 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 0

udon2/udon2

A package for manipulating Universal Dependencies trees

Language: C++ - Size: 6.92 MB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 0

StarlangSoftware/TurkishDependencyParser

Dependency Parse Tree Processing Library

Language: Java - Size: 30.3 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 4 - Forks: 3

ud-turkic/udtw23

Resources for UD Turkic Workshop 2023

Size: 6.29 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 3 - Forks: 4

StarlangSoftware/TurkishDependencyParser-Swift

Dependency Parse Tree Processing Library

Language: Swift - Size: 18.3 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

StarlangSoftware/TurkishDependencyParser-Js

Dependency Parse Tree Processing Library

Language: TypeScript - Size: 32.9 MB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

div5yesh/natural-language-processing

Implementation of unigram/bigram language models, noisy channel and pointwise mutual information for natural language processing.

Language: Python - Size: 9.77 KB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

soutsios/pos-tagger-bert-tensorflow

BERT fine-tuning for POS tagging task (google's tensorflow)

Language: Jupyter Notebook - Size: 194 KB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 9 - Forks: 4

fostroll/mordl

Morphological parser (POS, lemmata, NER etc.)

Language: Python - Size: 3.75 MB - Last synced at: 23 days ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

asiryk/natural-language-processing

Natural language processing on Universal Dependencies framework

Language: TeX - Size: 1.03 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

sivareddyg/UDepLambda

A framework to convert Universal Dependencies to Logical Forms

Language: Java - Size: 20.2 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 87 - Forks: 23

soutsios/pos-tagger-bert

BERT fine-tuning for POS tagging task (Keras)

Language: Jupyter Notebook - Size: 711 KB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 72 - Forks: 27

eric11eca/Udep2Mono

Universal Dependency polarization for monotonicity based natural language inference

Language: Jupyter Notebook - Size: 13.8 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 7 - Forks: 1

PhyloStar/UniversalCEFRScoring Fork of nishkalavallabhi/UniversalCEFRScoring

Exploring the idea of a generic, language agnostic, CEFR level classifier

Language: Python - Size: 16.5 MB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 1

habeanf/yap

Yet Another (natural language) Parser

Language: Go - Size: 47.4 MB - Last synced at: 11 months ago - Pushed at: about 6 years ago - Stars: 43 - Forks: 30

mova-institute/zoloto

розмічений руками морфо’, синт’, кореф’ корпус української мови

Size: 50.6 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 26 - Forks: 2

ElisaDiNuovo/VALICO-UD_guidelines

Annotation guidelines of the VALICO-UD treebank

Size: 7.83 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

AIRI-Institute/Probing_framework

Framework for probing tasks

Language: Python - Size: 65.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 23 - Forks: 10

dsindex/syntaxnet

reference code for syntaxnet

Language: Python - Size: 102 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 197 - Forks: 57

rhdunn/conllu-en-validator

A tool for validating English CoNLL-U data files.

Language: Python - Size: 334 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vukbatanovic/SETimes.SR

A Reference Training Corpus of Serbian

Language: Python - Size: 1.33 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

fostroll/corpuscula

Toolkit that simplifies corpus processing

Language: Python - Size: 36.1 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

qanastek/French-Part-Of-Speech-Tagging

Repository for the source code of the HuggingFace Space named qanastek/French-Part-Of-Speech-Tagging

Language: Python - Size: 41 KB - Last synced at: 15 days ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 1

soutsios/pos_tagger_rnn

BiRNN POS tagger experiments with pre-trained word embeddings, character embeddings and ELMo contextualized representations

Language: Jupyter Notebook - Size: 1.36 MB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 1

zacateras/udstats

Visual statistics for Universal Dependencies

Language: Python - Size: 3.31 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

harisont/concept-alignment

Syntax-based Concept Alignment for Machine Translation

Language: TeX - Size: 36.2 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

kasteph/CoNLL-U.tmLanguage

Syntax highlighting for CoNLL-U (.conllu, .conll) files on Sublime Text.

Size: 757 KB - Last synced at: 3 days ago - Pushed at: about 7 years ago - Stars: 9 - Forks: 0

Akshayanti/Masters-Thesis-CUNI-2020 📦

Experimental Data and Annotations for Master's Thesis as submitted to CUNI in July 2020

Language: Python - Size: 441 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

sleepyrob/cl2-project

Contains the files of the group project for the "Computational linguistics II" course (A.Y. 2020-21, University of Pisa).

Size: 2.79 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

arademaker/hs-conllu

CoNLL-U/UD library

Language: Haskell - Size: 141 KB - Last synced at: 13 days ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 3

ioverho/morph_tag_lemmatize

Research code used to implement SoTA joint morphological taggers and lemmatizers in context. Reproduction and extension of the SIGMORPHON/CONLL 2019 Shared Task 2.

Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

Esukhia/ud-pos-tagger-bo

Basic Universal Dependencies Part-of-Speech Tagger for Tibetan

Language: Python - Size: 2.43 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 5 - Forks: 2

zoltan-nz/typescript-sandbox

Typescript Sandbox

Language: TypeScript - Size: 6.33 MB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 2 - Forks: 0

unipv-larl/preverbs

The repository for the paper Annotating “Absolute” Preverbs in the Homeric and Vedic Treebanks.

Language: Python - Size: 4.14 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

tnitn/SPARQL-project

treebanks querying from TüNDRA to SPARQL

Language: Python - Size: 5.05 MB - Last synced at: 7 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

prannerta100/ud-pos-tagger

Hidden Markov Model based POS tagging for 60+ languages on universal dependencies (UD) data

Language: HTML - Size: 1.05 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

conllul/UL_Hebrew-HTB

CoNLL-UL Repository for UD_Hebrew

Size: 9.24 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

yyellin/tacred-enrichment

tacred-encrichment is a set of modules for supplementing the TACRED dataset with additional attributes, to be used by downstream RE neural networks

Language: Python - Size: 3.97 MB - Last synced at: 8 months ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

evelinacs/semantic_parsing_with_IRTGs

Experiments of developing an IRTG which simultaneously encodes transformations between phrase structure trees, dependency graphs and semantic graphs.

Language: Python - Size: 6.76 MB - Last synced at: 6 months ago - Pushed at: 9 months ago - Stars: 4 - Forks: 3

bureaucratic-labs/conllu

CoNLL-U format parser

Language: Go - Size: 7.81 KB - Last synced at: 11 months ago - Pushed at: almost 8 years ago - Stars: 3 - Forks: 0

hellomasaya/linguistics-data

Assignments covered as part of the course - Linguistics Data: Collection and Modeling

Language: Lex - Size: 52.6 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

lakhdar/ZConllu

Universal dependencies and Conll-u tool

Language: CSS - Size: 142 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

fostroll/morra

Morphological parser (POS, lemmata, NER etc.)

Language: Python - Size: 1.26 MB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 1

odanoburu/conllu-mode

CoNLL-U major mode for emacs

Language: Emacs Lisp - Size: 430 KB - Last synced at: about 1 month ago - Pushed at: about 5 years ago - Stars: 10 - Forks: 3

conllul/conllul.github.io

ConLL-UL Homepage

Language: Python - Size: 37.1 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 0

olesar/UD_Lithuanian

Resourses and documentation for a Lithuanian Universal Dependencies treebank

Size: 13.7 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

MartinXPN/sentence2tags

Extraction of lemma, morphological tags, and part of speech tag for each word in a given sentence

Language: Python - Size: 40 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

matanox/convert-tool

Scaffold for converting between CoNLL-U and spreadsheet formats

Language: Clojure - Size: 15.6 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

okalldal/gf-exjobb

Probabilistic natural language disambiguation using expectation maximization

Language: Grammatical Framework - Size: 197 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 1

Related Keywords
universal-dependencies 83 nlp 22 natural-language-processing 18 python 9 dependency-tree 8 dependency-parser 8 linguistics 7 dependency-analysis 7 machine-learning 7 stanford-dependency-tree 7 dependency-parsing 6 conll-u 6 conllu 5 lemmatization 5 syntax 5 corpus 5 pytorch 5 named-entity-recognition 5 computational-linguistics 4 pos-tagger 4 pos-tagging 4 morphological-analysis 4 morphology 4 treebank 4 annotation 4 deep-learning 4 artificial-intelligence 4 tokenization 3 part-of-speech-tagging 3 grammar 3 udpipe 3 hebrew 3 annotations 3 spacy 3 part-of-speech-tagger 3 morphosyntax 2 corpora 2 bert 2 gaelg 2 neural-network 2 penn-treebank 2 unimorph 2 nlp-library 2 treebanks 2 grammatical-framework 2 keras 2 hebrew-analytical-lexicon 2 information-extraction 2 machine-translation 2 corpus-linguistics 2 multilingual 2 language-model 2 coreference 2 transformers 2 morphological-tagging 2 thesis 2 ukrainian 1 transition-systems 1 nlp-parsing 1 nlp-dependency-parsing 1 flair 1 french 1 huggingface 1 part-of-speech 1 morphological-disambiguator 1 space 1 streamlit 1 ud-french-gsd 1 rnn-keras 1 golang 1 sublime-text-3 1 korean 1 dragnn 1 sejong-corpus 1 syntaxnet 1 tests 1 training-parser 1 training-tagger 1 conll 1 brat 1 dependency-syntax 1 morpho-syntactic 1 morpho-syntactic-tagging 1 named-entities 1 probing 1 parsing 1 serbian 1 multilinguality 1 italian-language 1 serbian-language 1 extended 1 guidelines 1 cuni 1 pomegranate 1 gcn 1 re 1 ucca 1 dependency-graph 1 grammar-rules 1 graph-transformation 1