An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: udpipe

bnosac/udpipe

R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit

Language: C++ - Size: 5.74 MB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 214 - Forks: 33

EmilStenstrom/json-tagger

A JSON API to tag a sentence with part of speech tags. Uses UDPipe, so support for hundreds of languages.

Language: HTML - Size: 663 KB - Last synced at: 26 days ago - Pushed at: 5 months ago - Stars: 14 - Forks: 2

veldhub/veld_chain__eltec_udpipe_inference

chain velds using udpipe 1 to infer on five ELTeC corpora.

Language: XSLT - Size: 225 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

veldhub/veld_chain__demo_teitok-tools

Chain velds to demonstrate usage of teitok-tools code velds.

Size: 205 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

veldhub/veld_code__teitok-tools Fork of ufal/teitok-tools

Conversion tools to and from the TEITOK TEI/XML format

Language: Perl - Size: 221 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

BramVanroy/spacy_conll

Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doc and its sentences and tokens. Can also be used as a command-line tool.

Language: Python - Size: 231 KB - Last synced at: 6 days ago - Pushed at: 11 months ago - Stars: 80 - Forks: 16

TakeLab/spacy-udpipe

spaCy + UDPipe

Language: Python - Size: 104 KB - Last synced at: 7 days ago - Pushed at: about 3 years ago - Stars: 161 - Forks: 10

veldhub/veld_code__udpipe

Code velds encapsulating updipe.

Language: C++ - Size: 1.04 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

veldhub/veld_chain__demo_udpipe_ts-vienna-2024

Demo repo of the VELD design, for the CLSInfra Training School Vienna 2024.

Size: 2.41 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

veldhub/veld_code__xmlanntools Fork of czcorpus/xmlanntools

Code velds encapsulating xmlanntools.

Language: Python - Size: 157 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

veldhub/veld_chain__demo_xmlanntools

Chain velds to demonstrate usage of xmlanntools code velds.

Size: 319 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

eaklykova/syntaxcomp

A Python3 package for extracting syntactic complexity measures from CoNLL-U annotations.

Language: Python - Size: 29.3 KB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

cosmoduende/r-twitter

Explore your Twitter activity with R: Sentiment Analysis and Data Visualization. How to analyze your Twitter account (or any account), discover your habits and sentiments with the "rtweet" package and NLP.

Language: R - Size: 48 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

Ciccolo22/R-Projects-

Quick and dirty analyses in R ecosystem to explore various ML frameworks and statistical methods

Language: Jupyter Notebook - Size: 5.6 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

pixelneo/parapipeline

A pipeline for POS tagging, sentence alignment, word alignment, and transliteration of texts in 30+ languages.

Language: Python - Size: 423 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

MatteoFasulo/TextMining

Project of TextMining Course: an analysis on Amazon Alexa Echo Dot

Language: R - Size: 3.52 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ioverho/morph_tag_lemmatize

Research code used to implement SoTA joint morphological taggers and lemmatizers in context. Reproduction and extension of the SIGMORPHON/CONLL 2019 Shared Task 2.

Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

tchewik/isanlp_udpipe Fork of IINemo/isanlp_udpipe

UDPipe containerized module for Russian and English (use with isanlp library).

Language: Python - Size: 1000 Bytes - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

CristinaGHolgado/old-french-lemmatization

Methods to lemmatize Old French using different tools

Language: Jupyter Notebook - Size: 126 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

maks5507/elsa

ELSA combines extractive and abstractive approaches to the automatic text summarization

Language: Python - Size: 29.1 MB - Last synced at: 7 months ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

OzzyProjects/BAO3

Boite à outils 3 XML-RSS Parser and Lemmatizer in pure Perl

Language: Perl - Size: 25.7 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

add1993/article-deduplication-detection-spark

Detect duplicates between large number of articles and store only a single copy of each article.

Language: Python - Size: 16.4 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

detcorpus/udpiper

Tiny UDPipe+Mystem-wrapper

Language: Python - Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

elbersb/depdistance

Calculation of dependency distance

Language: Jupyter Notebook - Size: 11.7 KB - Last synced at: about 2 months ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 3