An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: bilingual-lexicon-extraction

cambridgeltl/sail-bli

Self-Augmented In-Context Learning for Unsupervised Word Translation (ACL 2024). Keywords: Bilingual Lexicon Induction, Word Translation, Large Language Models, LLMs.

Language: Python - Size: 445 KB - Last synced at: 7 days ago - Pushed at: 9 months ago - Stars: 3 - Forks: 1

cambridgeltl/ContrastiveBLI

Improving Word Translation via Two-Stage Contrastive Learning (ACL 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.

Language: Python - Size: 7.72 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 34 - Forks: 10

cambridgeltl/prompt4bli

On Bilingual Lexicon Induction with Large Language Models (EMNLP 2023). Keywords: Bilingual Lexicon Induction, Word Translation, Large Language Models, LLMs.

Language: Python - Size: 86.9 KB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 10 - Forks: 2

yaoyiran/BLI-Reading-List

A 2024 Reading List for Bilingual Lexicon Induction (BLI) / Word Translation. Frequently Updated.

Language: Python - Size: 116 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 22 - Forks: 2

kakaobrain/word2word

Easy-to-use word-to-word translations for 3,564 language pairs.

Language: Python - Size: 1.07 MB - Last synced at: 9 months ago - Pushed at: over 4 years ago - Stars: 354 - Forks: 52

kbatsuren/CogNet

CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates

Size: 88.7 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 32 - Forks: 8

zhangmozhi/iternorm

Are Girls Neko or Shōjo? Cross-Lingual Alignment of Non-Isomorphic Embeddings with Iterative Normalization (ACL 2019)

Language: Python - Size: 7.81 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 8 - Forks: 0

cambridgeltl/BLICEr

Improving Bilingual Lexicon Induction with Cross-Encoder Reranking (Findings of EMNLP 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.

Language: Python - Size: 158 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 10 - Forks: 3

jolivaresc/TSTL

Temas Selectos de Tecnologías del Lenguaje

Language: Jupyter Notebook - Size: 159 MB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

fdschmidt93/DynaDict

Bilingual n-gram Phrase Table Induction with Dynamax-Jaccard

Language: Python - Size: 7.25 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

THUNLP-MT/BiLex

A Bilingual Lexicon Inducer From Non-Parallel Data

Language: C - Size: 21.5 KB - Last synced at: 12 days ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 2

THUNLP-MT/UBiLexAT

An Unsupervised Bilingual Lexicon Inducer From Non-Parallel Data by Adversarial Training

Language: Python - Size: 14.6 MB - Last synced at: 12 days ago - Pushed at: over 6 years ago - Stars: 8 - Forks: 1

THUNLP-MT/UBiLexEMD

An Unsupervised Bilingual Lexicon Inducer From Non-Parallel Data by Earth Mover's Distance Minimization

Language: Python - Size: 3.85 MB - Last synced at: 12 days ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 0

fdschmidt93/procrustes

Weakly-supervised bilingual lexicon induction

Language: Python - Size: 20.5 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0