Topic: "bitext-mining"
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
Language: Python - Size: 43.7 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 2,810 - Forks: 462

EliasK93/transformer-models-for-domain-specific-machine-translation
Example application for fine-tuning pretrained machine translation models on highly domain-specific, self-extracted translated sentences
Language: Python - Size: 4.09 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 15 - Forks: 0

DOLMA-NLP/bitext-mining
Bitext mining for low-resource Middle Eastern languages - IWSLT2025
Language: Python - Size: 24.7 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

steventan0110/ParaCrawl
Bitext mining tool for low-resource languages (under development)
Language: Shell - Size: 59.6 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0
