Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: parallel-corpora
techiaith/alinio
Cod hwyluso alinio testunau gyda hunalign a dogfennaeth ar sut i ddefnyddio LFAligner // Code for simplifying aligning texts with hunalign and documentation for LFAligner
Language: Python - Size: 28.3 KB - Last synced: about 2 months ago - Pushed: about 8 years ago - Stars: 0 - Forks: 0
rggdmonk/hadal
A simple and efficient tool for mining and aligning sentences with pre-trained models.
Language: Python - Size: 680 KB - Last synced: 18 days ago - Pushed: 19 days ago - Stars: 2 - Forks: 0
csebuetnlp/banglanmt
This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 16 - November 20, 2020.
Language: Python - Size: 2.05 MB - Last synced: about 2 months ago - Pushed: over 1 year ago - Stars: 144 - Forks: 45
timarkh/tsakorpus
Yet another search platform for linguistic corpora.
Language: Python - Size: 3.28 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 16 - Forks: 12
shashwatup9k/bho-resources
Size: 4.21 MB - Last synced: about 1 month ago - Pushed: 6 months ago - Stars: 2 - Forks: 0
bitextor/bitextor
Bitextor generates translation memories from multilingual websites
Language: Python - Size: 177 MB - Last synced: 7 months ago - Pushed: 9 months ago - Stars: 265 - Forks: 45
Sohyo/Using-Confidential-Data-for-NMT
Size: 7.59 MB - Last synced: 10 months ago - Pushed: almost 3 years ago - Stars: 1 - Forks: 1
korenyoni/opus-api
OPUS (opus.nlpl.eu) Python3 API
Language: Python - Size: 118 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 14 - Forks: 5
tsuruoka-lab/BSD
The Business Scene Dialogue corpus
Size: 2.91 MB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 55 - Forks: 6
Kartikaggarwal98/Indian_ParallelCorpus
Curated list of publicly available parallel corpus for Indian Languages
Size: 8.79 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 28 - Forks: 1
gederajeg/constructional-equivalence
Repository of supplementary materials and RStudio project for the paper on corpus-based approach to measuring constructional equivalence.
Language: TeX - Size: 2.53 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0
czcorpus/ictools
A program for calculating corpora alignments using a pivot language
Language: Go - Size: 242 KB - Last synced: about 2 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 1
npedrazzini/parallelbibles
Word-alignment models for Bible translations in 100+ historical and contemporary languages
Language: R - Size: 936 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
gederajeg/rob-steal-parallel-corpora
Repository kode pemrograman R dan data untuk analisis dalam penelitian dengan judul MODEL KAJIAN TERJEMAHAN BERBASIS BANK DATA TERJEMAHAN DIGITAL INGGRIS-INDONESIA DAN IMPLIKASI PEDAGOGISNYA
Language: R - Size: 8.51 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0