Topic: "parallel-corpora"
bitextor/bitextor
Bitextor generates translation memories from multilingual websites
Language: Python - Size: 177 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 290 - Forks: 43

csebuetnlp/banglanmt
This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 16 - November 20, 2020.
Language: Python - Size: 2.05 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 144 - Forks: 45

tsuruoka-lab/BSD
The Business Scene Dialogue corpus
Size: 2.91 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 55 - Forks: 6

Kartikaggarwal98/Indian_ParallelCorpus
Curated list of publicly available parallel corpus for Indian Languages
Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 28 - Forks: 1

timarkh/tsakorpus
Yet another search platform for linguistic corpora.
Language: Python - Size: 4.16 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 22 - Forks: 15

korenyoni/opus-api
OPUS (opus.nlpl.eu) Python3 API
Language: Python - Size: 117 KB - Last synced at: 9 days ago - Pushed at: 5 months ago - Stars: 18 - Forks: 5

rggdmonk/hadal
A simple and efficient tool for mining and aligning sentences with pre-trained models.
Language: Python - Size: 680 KB - Last synced at: 30 days ago - Pushed at: 12 months ago - Stars: 6 - Forks: 0

shashwatup9k/BHLTR
Size: 5.19 MB - Last synced at: 11 days ago - Pushed at: 13 days ago - Stars: 3 - Forks: 0

gederajeg/constructional-equivalence
Repository of supplementary materials and RStudio project for the paper on corpus-based approach to measuring constructional equivalence.
Language: TeX - Size: 2.53 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

czcorpus/ictools
A program for calculating corpora alignments using a pivot language
Language: Go - Size: 242 KB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

npedrazzini/parallelbibles
Word-alignment models for Bible translations in 100+ historical and contemporary languages
Language: R - Size: 936 MB - Last synced at: 20 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Sohyo/Using-Confidential-Data-for-NMT
Size: 7.59 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 1

Nexdata-AI/1990000-Groups-Chinese-Czech-Parallel-Corpus-Data
1990000-Groups-Chinese-Czech-Parallel-Corpus-Data
Size: 1.95 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

gederajeg/rob-steal-parallel-corpora
Repository kode pemrograman R dan data untuk analisis dalam penelitian dengan judul MODEL KAJIAN TERJEMAHAN BERBASIS BANK DATA TERJEMAHAN DIGITAL INGGRIS-INDONESIA DAN IMPLIKASI PEDAGOGISNYA
Language: R - Size: 8.51 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

techiaith/alinio
Cod hwyluso alinio testunau gyda hunalign a dogfennaeth ar sut i ddefnyddio LFAligner // Code for simplifying aligning texts with hunalign and documentation for LFAligner
Language: Python - Size: 28.3 KB - Last synced at: about 1 year ago - Pushed at: about 9 years ago - Stars: 0 - Forks: 0
