Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: conllu

proycon/foliatools

A number of command-line tools for working with FoLiA (Format for Linguistic Annotation). Includes validators, converters, visualisers, and more.

Language: Python - Size: 1.06 MB - Last synced: 23 days ago - Pushed: 23 days ago - Stars: 9 - Forks: 4

TajaKuzman/Parlamint-translation

A pipeline for machine translation (using OPUS-MT models) of parliamentary text collections in 30+ languages (ParlaMint corpora). The pipeline includes parsing TEI XLM and CONLL-u files, linguistic processing with the Stanza pipeline, machine translation and word alignment with the Eflomal tool.

Language: Jupyter Notebook - Size: 38.4 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 2 - Forks: 0

pyconll/pyconll

A minimal, pure Python library to interface with CoNLL-U format files.

Language: Python - Size: 505 KB - Last synced: about 1 month ago - Pushed: 12 months ago - Stars: 146 - Forks: 12

stefanrer/CountBigramFreqInConlluCorpus

Count Bigram frequency in a conllu format corpus

Language: Python - Size: 21.5 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

ArbelTepper/NLP-IAHLT_project

Exploring and visualizing CONULLU files in Python

Language: Jupyter Notebook - Size: 213 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

rhdunn/conllu-en-validator

A tool for validating English CoNLL-U data files.

Language: Python - Size: 334 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

rgalhama/spaCy2CoNLLU

Simple script to parse text with spaCy and print the output in CoNLL-U format.

Language: Python - Size: 5.86 KB - Last synced: 8 months ago - Pushed: over 4 years ago - Stars: 7 - Forks: 5

fostroll/corpuscula

Toolkit that simplifies corpus processing

Language: Python - Size: 36.1 MB - Last synced: 23 days ago - Pushed: over 2 years ago - Stars: 3 - Forks: 1

MuhammadYaseenKhan/CoNLL-U_Parser

An small Python script that converts a .conllu file into a tab seprated view (tsv) file.

Language: Jupyter Notebook - Size: 16.6 MB - Last synced: 10 months ago - Pushed: over 4 years ago - Stars: 2 - Forks: 2

acoli-repo/conll

ACoLi CoNLL libraries: Several tools for processing, manipulating and transforming TSV formats (CoNLL-RDF, CoNLL-Merge, CQP4RDF)

Size: 74.2 KB - Last synced: 10 months ago - Pushed: over 2 years ago - Stars: 5 - Forks: 1

udon2/udon2

A package for manipulating Universal Dependencies trees

Language: C++ - Size: 6.92 MB - Last synced: 1 day ago - Pushed: 8 months ago - Stars: 8 - Forks: 0

Nahid01752/Arc-eager-parser

This is a GitHub repository for my Arc-Eager Transition-Based Parser, which utilizes the perceptron algorithm and uni-gram, bigram, and distance features for accurate dependency parsing.

Language: Python - Size: 21.5 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

MinionAttack/conllu-conll-tool

Tool to convert CoNLL-U format files to CoNLL format files and manipulate training, validation and test sets.

Language: Python - Size: 96.7 KB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 4 - Forks: 0

danieldk/conllu-utils

Utilities for working with CoNLL-U

Language: Rust - Size: 74.2 KB - Last synced: 27 days ago - Pushed: over 1 year ago - Stars: 3 - Forks: 0

avramandrei/BERT-Sequence-Labeling

End-to-end integration of HuggingFace's models for sequence labeling.

Language: Python - Size: 1.82 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 8 - Forks: 1

rhdunn/opennlp-extensions

A set of OpenNLP extensions for reading and training CoNLL-X and CoNLL-U files.

Language: Kotlin - Size: 236 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 3 - Forks: 0

MinionAttack/corpus-translator

Tool for translating a corpus file from one language to another.

Language: Python - Size: 10.2 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

arthurdjn/udpos

Universal Dependencies datasets preprocess and autodownloads.

Language: Python - Size: 9.96 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 1 - Forks: 0

fergusq/bils

Small bilar packages

Size: 15.6 KB - Last synced: 10 months ago - Pushed: almost 6 years ago - Stars: 1 - Forks: 0