GitHub topics: low-resource-machine-translation
cambridgeltl/sail-bli
Self-Augmented In-Context Learning for Unsupervised Word Translation (ACL 2024). Keywords: Bilingual Lexicon Induction, Word Translation, Large Language Models, LLMs.
Language: Python - Size: 445 KB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 3 - Forks: 1

cambridgeltl/ContrastiveBLI
Improving Word Translation via Two-Stage Contrastive Learning (ACL 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.
Language: Python - Size: 7.72 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 34 - Forks: 10

cambridgeltl/prompt4bli
On Bilingual Lexicon Induction with Large Language Models (EMNLP 2023). Keywords: Bilingual Lexicon Induction, Word Translation, Large Language Models, LLMs.
Language: Python - Size: 86.9 KB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 10 - Forks: 2

yaoyiran/BLI-Reading-List
A 2024 Reading List for Bilingual Lexicon Induction (BLI) / Word Translation. Frequently Updated.
Language: Python - Size: 116 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 22 - Forks: 2

jchenghu/lowres_uski
Basic implementation of the USKI (Unaligned Sentences Keytokens pre-training) method for Neural Machine Translation
Language: Python - Size: 9.7 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

csebuetnlp/banglanmt
This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 16 - November 20, 2020.
Language: Python - Size: 2.05 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 144 - Forks: 45

clefourrier/CopperMT
[ACL 2021, Findings] Cognate Prediction Per Machine Translation
Language: JavaScript - Size: 37.2 MB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 0

steventan0110/ParaCrawl
On-develop Bitext Mining Tool for low resource languages
Language: Shell - Size: 59.6 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

cambridgeltl/BLICEr
Improving Bilingual Lexicon Induction with Cross-Encoder Reranking (Findings of EMNLP 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.
Language: Python - Size: 158 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 10 - Forks: 3

HenningBuhl/low-resource-machine-translation
This repository is an open-source colleciton of various low-resource machine translation experiments.
Language: Python - Size: 428 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 1

ic1998/Cor-En
Cor-En: a Cornish-English Machine Translator.
Language: Python - Size: 2.43 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Kartikaggarwal98/Indian_ParallelCorpus
Curated list of publicly available parallel corpus for Indian Languages
Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 28 - Forks: 1

harshitadd/indicOCR
Low-Resource OCR
Language: Jupyter Notebook - Size: 35.8 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Pzoom522/L1-Refinement
Code for "Cross-Lingual Word Embedding Refinement by ℓ1 Norm Optimisation" (NAACL 2021)
Language: Python - Size: 29.3 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 15 - Forks: 2

andrea-cavallo-98/Low-resource-Machine-Translation
Multilingual finetuning of Machine Translation model on low-resource languages. Project for Deep Natural Language Processing course.
Language: Jupyter Notebook - Size: 4.32 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 2

machelreid/afromt
Code for the EMNLP 2021 Paper "AfroMT: Pretraining Strategies and Reproducible Benchmarks for Translation of 8 African Languages" by Machel Reid, Junjie Hu, Graham Neubig, Yutaka Matsuo
Language: Python - Size: 4.8 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 2
