An open API service providing repository metadata for many open source software ecosystems.

Topic: "bitext-mining"

embeddings-benchmark/mteb

MTEB: Massive Text Embedding Benchmark

Language: Python - Size: 43.7 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 2,810 - Forks: 462

EliasK93/transformer-models-for-domain-specific-machine-translation

Example application for the task of fine-tuning pretrained machine translation models on highly domain-specific, self-extracted translated sentences

Language: Python - Size: 4.09 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 15 - Forks: 0

DOLMA-NLP/bitext-mining

Bitext mining for low-resource Middle Eastern languages - IWSLT2025

Language: Python - Size: 24.7 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

steventan0110/ParaCrawl

In-development bitext mining tool for low-resource languages

Language: Shell - Size: 59.6 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0