Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: cross-lingual
artitw/text2text
Text2Text: Crosslingual NLP/G toolkit
Language: Python - Size: 686 KB - Last synced: 4 days ago - Pushed: 3 months ago - Stars: 276 - Forks: 33
shimo-lab/Universal-Geometry-with-ICA
Discovering Universal Geometry in Embeddings with ICA
Language: Python - Size: 10.8 MB - Last synced: 12 days ago - Pushed: 12 days ago - Stars: 15 - Forks: 0
Separius/awesome-sentence-embedding 📦
A curated list of pretrained sentence and word embedding models
Language: Python - Size: 282 KB - Last synced: 14 days ago - Pushed: about 3 years ago - Stars: 2,193 - Forks: 259
ictnlp/BayLing
BayLing (“百聆”) is an English/Chinese large language model based on LLaMA with enhanced language alignment. It shows superior capability in English/Chinese generation, instruction following and multi-turn interaction, and reaches 90% of ChatGPT's performance on multiple multilingual and general-task evaluations.
Language: Python - Size: 66.6 MB - Last synced: 2 months ago - Pushed: 6 months ago - Stars: 268 - Forks: 15
sheng-z/cross-lingual-open-ie
MT/IE: Cross-lingual Open Information Extraction with Neural Sequence-to-Sequence Models
Language: Python - Size: 26.2 MB - Last synced: 3 months ago - Pushed: almost 6 years ago - Stars: 22 - Forks: 9
BobXWu/InfoCTM
Code for InfoCTM: A Mutual Information Maximization Perspective of Cross-lingual Topic Modeling (AAAI2023)
Language: Python - Size: 25.2 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 20 - Forks: 0
nutcrtnk/DHGNet
Code for paper "Cross-lingual Transfer for Text Classification with Dictionary-based Heterogeneous Graph", EMNLP 2021 - findings.
Language: Python - Size: 37.4 MB - Last synced: 2 months ago - Pushed: over 2 years ago - Stars: 12 - Forks: 4
salesforce/FewXC
Official code and data release for Efficiently Aligned Cross-Lingual Transfer Learning for Conversational Tasks using Prompt-Tuning, accepted by findings of EACL 2024.
Language: Python - Size: 42 KB - Last synced: about 2 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
mhardalov/exams-qa
A Multi-subject High School Examinations Dataset for Cross-lingual and Multilingual Question Answering
Language: Python - Size: 441 MB - Last synced: about 1 month ago - Pushed: about 2 years ago - Stars: 35 - Forks: 4
Darth-Kronos/Cross-lingual-SER
A cross-lingual SER that can learn language invariant representations without requiring target-language data labels
Language: Python - Size: 13.7 KB - Last synced: 22 days ago - Pushed: about 1 year ago - Stars: 1 - Forks: 1
google-research-datasets/swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 languages, generated using PaLM 2 and summarize-then-ask prompting.
Size: 201 KB - Last synced: 4 months ago - Pushed: 7 months ago - Stars: 32 - Forks: 2
checkstep/senti-stance
Few-Shot Cross-Lingual Stance Detection with Sentiment-Based Pre-Training
Language: Python - Size: 43.7 MB - Last synced: 2 months ago - Pushed: about 2 years ago - Stars: 16 - Forks: 2
shyamupa/xling-el
PyTorch model for cross-lingual entity linking.
Language: Python - Size: 101 KB - Last synced: about 1 month ago - Pushed: about 5 years ago - Stars: 16 - Forks: 3
princeton-nlp/MultilingualAnalysis
Repository for the paper titled: "When is BERT Multilingual? Isolating Crucial Ingredients for Cross-lingual Transfer"
Language: Python - Size: 47.9 MB - Last synced: 12 days ago - Pushed: over 2 years ago - Stars: 13 - Forks: 0
ConsistencyVC/ConsistencyVC-voive-conversion
Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
Language: Python - Size: 1.72 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 83 - Forks: 12
CZWin32768/XNLG
AAAI-20 paper: Cross-Lingual Natural Language Generation via Pre-Training
Language: Python - Size: 149 KB - Last synced: 7 months ago - Pushed: almost 3 years ago - Stars: 126 - Forks: 18
harisbinzia/ZeroshotCrosslingualHateSpeech
Improving Zero-Shot Cross-Lingual Hate Speech Detection with Pseudo-Label Fine-Tuning of Transformer Language Models
Size: 4.88 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0
maum-ai/sane-tts
SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech
Size: 57 MB - Last synced: 10 months ago - Pushed: 11 months ago - Stars: 5 - Forks: 0
andreabac3/cross-lingual-neural-databases
Codebase of Cross-Lingual Neural Databases
Language: Python - Size: 9.38 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
AmanDaVinci/X-Lingual-Transfer-Learning
Evolution of Representations during Cross Lingual Transfer Learning
Language: Jupyter Notebook - Size: 7.31 MB - Last synced: 10 months ago - Pushed: about 4 years ago - Stars: 0 - Forks: 1
g-laz77/Cross-Lingual-Word-Embeddings
Learn a shared embedding space between words in multiple languages.
Language: Python - Size: 91.8 KB - Last synced: 10 months ago - Pushed: about 6 years ago - Stars: 4 - Forks: 2
zliucr/mixed-language-training
Attention-Informed Mixed-Language Training for Zero-shot Cross-lingual Task-oriented Dialogue Systems (AAAI-2020)
Language: Python - Size: 14.5 MB - Last synced: 10 months ago - Pushed: over 4 years ago - Stars: 31 - Forks: 5
zliucr/crosslingual-slu
EMNLP-2020: Cross-lingual Spoken Language Understanding with Regularized Representation Alignment
Language: Python - Size: 17.9 MB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 18 - Forks: 1
zliucr/Crosslingual-NLU
Zero-shot Cross-lingual Task-Oriented Dialogue Systems (EMNLP 2019)
Language: Python - Size: 50.2 MB - Last synced: 10 months ago - Pushed: over 4 years ago - Stars: 23 - Forks: 5
pauli31/czech-subjectivity-dataset
This is the repository for the newly created Czech Subjectivity Dataset (Subj-CS) and our paper:
Size: 12.7 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
Youggls/ACROSS-ACL23
Official code repo for paper: ACROSS: An Alignment-based Framework for Low-Resource Many-to-One Cross-Lingual Summarization
Size: 2.93 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 11 - Forks: 0
seahore/PPG-GradVC
A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis
Language: Python - Size: 204 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 35 - Forks: 6
ikergarcia1996/MVM-Embeddings
A monolingual and cross-lingual meta-embedding generation framework
Language: Python - Size: 10.4 MB - Last synced: 11 months ago - Pushed: over 4 years ago - Stars: 4 - Forks: 0
ramiyappan/X-Gear
Reproducing baseline model from ACL-2022 paper X-GEAR for Zero-shot Cross-Lingual EAE
Language: Python - Size: 10.3 MB - Last synced: 9 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
GeekDream-x/SemEval2022-Task8-TonyX
Deep-learning system proposed by HFL for SemEval-2022 Task 8: Multilingual News Similarity
Language: Python - Size: 2.82 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 50 - Forks: 5
Xiefeng69/Awesome-Entity-Alignment
Awesome Entity Alignment is a collection of EA techniques, including papers, codes, and datasets.
Size: 32.2 KB - Last synced: about 2 months ago - Pushed: over 1 year ago - Stars: 7 - Forks: 2
yaushian/mSimCSE
mSimCSE: Multilingual SimCSE
Language: Python - Size: 2.62 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 14 - Forks: 1
Pzoom522/xANLG
Data and code for "Understanding Linearity of Cross-Lingual Word Embedding Mappings" (TMLR 2022)
Language: Python - Size: 31.3 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 11 - Forks: 0
chiachienhung/ZusammenQA
ZusammenQA: Data Augmentation with Specialized Models for Cross-lingual Open-retrieval Question Answering System
Language: Python - Size: 12.1 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 3
subhadarship/nlp4if-2021
Cross-lingual misinformation detection
Language: Jupyter Notebook - Size: 13.8 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 3 - Forks: 1
honghanhh/ate-2022
Can Cross-domain Term Extraction Benefit from Cross-lingual Transfer?
Language: Python - Size: 130 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0
GeorgeVern/smala
Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".
Language: Python - Size: 278 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 11 - Forks: 0
krystalan/ClidSum
EMNLP 2022: ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization
Language: Python - Size: 1.44 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 28 - Forks: 2
Akshayanti/cross-lingual-tagging 📦
Cross-Lingual tagging, with single/multiple sources and parameter estimation to improve projection quality
Language: Python - Size: 2.39 GB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
ymcui/Cross-Lingual-MRC
Cross-Lingual Machine Reading Comprehension (EMNLP 2019)
Language: Python - Size: 20.3 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 65 - Forks: 17
manojsukhavasi/Unsupervised-Cross-Lingual-Embeddings
cross-lingual word embeddings with unsupervised learning
Language: Jupyter Notebook - Size: 5.86 KB - Last synced: over 1 year ago - Pushed: over 6 years ago - Stars: 3 - Forks: 2
livc/cross-crfae
Code for the paper "Unsupervised Cross-Lingual Adaptation of Dependency Parsers Using CRF Autoencoders" in the Findings of EMNLP 2020.
Language: Python - Size: 30.3 KB - Last synced: over 1 year ago - Pushed: about 3 years ago - Stars: 3 - Forks: 1
jhliu17/xlingual-mrc
Code for the paper "Cross-lingual Machine Reading Comprehension with Language Branch Knowledge Distillation" (COLING 2020)
Language: Python - Size: 43.9 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 8 - Forks: 0
BobXWu/CNPMI
Cross-lingual Normalized Pointwise Mutual Information for cross-lingual topic evaluation.
Language: Python - Size: 42.1 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0
peter-yh-wu/cross-lingual
Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity
Size: 2.31 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 1 - Forks: 1
yahah100/cross_lingual_summarization
This repository contains an exploration of cross-lingual summarization using two datasets.
Language: Jupyter Notebook - Size: 90.8 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
SapienzaNLP/unify-srl
Unifying Cross-Lingual Semantic Role Labeling with Heterogeneous Linguistic Resources (NAACL-2021).
Language: Python - Size: 76.2 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 16 - Forks: 1
murali1996/nlp-notes
A curated list of papers and experiments in the field of Natural Language Processing (NLP)
Size: 547 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 5 - Forks: 3
umanlp/ZusammenQA (fork of chiachienhung/ZusammenQA)
ZusammenQA: Data Augmentation with Specialized Models for Cross-lingual Open-retrieval Question Answering System
Language: Python - Size: 12.1 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 2 - Forks: 2
sedflix/unsacmt
Unsupervised Sentiment Analysis for Code-mixed Data
Language: Jupyter Notebook - Size: 2.81 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 8 - Forks: 4
alexandra-chron/relm_unmt
Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".
Language: Python - Size: 2.24 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 27 - Forks: 2
Blarc/cross-lingual-question-answering
Implementation of multilingual models for question answering using English and Slovene corpora.
Language: Jupyter Notebook - Size: 11 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0
chaitanyamalaviya/NeuralFactorGraph
This repo contains the code for the paper Neural Factor Graph Models for Cross-lingual Morphological Tagging.
Language: Python - Size: 1.5 MB - Last synced: over 1 year ago - Pushed: over 5 years ago - Stars: 51 - Forks: 8
thunlp/CLSP
Code and data for EMNLP 2018 paper "Cross-lingual Lexical Sememe Prediction"
Language: C - Size: 7.42 MB - Last synced: over 1 year ago - Pushed: over 5 years ago - Stars: 19 - Forks: 7
osainz59/XLREMed
Code for the Cross-Lingual Transfer Learning for Medical Relation Extraction
Language: Python - Size: 93.8 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 4 - Forks: 0
Akshayanti/cross-lingual-tools
Tools for working with cross-lingual data.
Language: Python - Size: 53.3 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
deterministic-algorithms-lab/Large-XLM
XLM implementation with utilities to process and train on large multilingual datasets when RAM is limited.
Language: Python - Size: 310 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0
jiminsun/multi-oli
PyTorch code for "Multilingual Offensive Language Identification"
Language: Python - Size: 3.61 MB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 2 - Forks: 0
anlausch/XWEAT
Cross-lingual version of WEAT
Language: Python - Size: 84 KB - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 6 - Forks: 4
jxhe/cross-lingual-struct-flow
PyTorch implementation of ACL paper https://arxiv.org/abs/1906.02656
Language: Python - Size: 70.9 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 23 - Forks: 3
JiaWu-Repository/AKE (fork of IIEdm/AKE)
Guiding Cross-Lingual Entity Alignment via Adversarial Knowledge Embedding (ICDM 2019)
Size: 19.3 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 3 - Forks: 3
jerbarnes/crosslingual_reordering
Reordering for embedding-based cross-lingual sentiment analysis
Language: TeX - Size: 606 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 4 - Forks: 0