Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: cross-lingual

artitw/text2text

Text2Text: Crosslingual NLP/G toolkit

Language: Python - Size: 686 KB - Last synced: 4 days ago - Pushed: 3 months ago - Stars: 276 - Forks: 33

shimo-lab/Universal-Geometry-with-ICA

Discovering Universal Geometry in Embeddings with ICA

Language: Python - Size: 10.8 MB - Last synced: 12 days ago - Pushed: 12 days ago - Stars: 15 - Forks: 0

Separius/awesome-sentence-embedding 📦

A curated list of pretrained sentence and word embedding models

Language: Python - Size: 282 KB - Last synced: 14 days ago - Pushed: about 3 years ago - Stars: 2,193 - Forks: 259

ictnlp/BayLing

“百聆”是一个基于LLaMA的语言对齐增强的英语/中文大语言模型,具有优越的英语/中文能力,在多语言和通用任务等多项测试中取得ChatGPT 90%的性能。BayLing is an English/Chinese LLM equipped with advanced language alignment, showing superior capability in English/Chinese generation, instruction following and multi-turn interaction.

Language: Python - Size: 66.6 MB - Last synced: 2 months ago - Pushed: 6 months ago - Stars: 268 - Forks: 15

sheng-z/cross-lingual-open-ie

MT/IE: Cross-lingual Open Information Extraction with Neural Sequence-to-Sequence Models

Language: Python - Size: 26.2 MB - Last synced: 3 months ago - Pushed: almost 6 years ago - Stars: 22 - Forks: 9

BobXWu/InfoCTM

Code for InfoCTM: A Mutual Information Maximization Perspective of Cross-lingual Topic Modeling (AAAI2023)

Language: Python - Size: 25.2 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 20 - Forks: 0

nutcrtnk/DHGNet

Code for paper "Cross-lingual Transfer for Text Classification with Dictionary-based Heterogeneous Graph", EMNLP 2021 - findings.

Language: Python - Size: 37.4 MB - Last synced: 2 months ago - Pushed: over 2 years ago - Stars: 12 - Forks: 4

salesforce/FewXC

Official code and data release for Efficiently Aligned Cross-Lingual Transfer Learning for Conversational Tasks using Prompt-Tuning, accepted by findings of EACL 2024.

Language: Python - Size: 42 KB - Last synced: about 2 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

mhardalov/exams-qa

A Multi-subject High School Examinations Dataset for Cross-lingual and Multilingual Question Answering

Language: Python - Size: 441 MB - Last synced: about 1 month ago - Pushed: about 2 years ago - Stars: 35 - Forks: 4

Darth-Kronos/Cross-lingual-SER

A cross-lingual SER that can learn language invariant representations without requiring target-language data labels

Language: Python - Size: 13.7 KB - Last synced: 22 days ago - Pushed: about 1 year ago - Stars: 1 - Forks: 1

google-research-datasets/swim-ir

SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 languages, generated using PaLM 2 and summarize-then-ask prompting.

Size: 201 KB - Last synced: 4 months ago - Pushed: 7 months ago - Stars: 32 - Forks: 2

checkstep/senti-stance

Few-Shot Cross-Lingual Stance Detection with Sentiment-Based Pre-Training

Language: Python - Size: 43.7 MB - Last synced: 2 months ago - Pushed: about 2 years ago - Stars: 16 - Forks: 2

shyamupa/xling-el

pytorch model for cross-lingual entity linking.

Language: Python - Size: 101 KB - Last synced: about 1 month ago - Pushed: about 5 years ago - Stars: 16 - Forks: 3

princeton-nlp/MultilingualAnalysis

Repository for the paper titled: "When is BERT Multilingual? Isolating Crucial Ingredients for Cross-lingual Transfer"

Language: Python - Size: 47.9 MB - Last synced: 12 days ago - Pushed: over 2 years ago - Stars: 13 - Forks: 0

ConsistencyVC/ConsistencyVC-voive-conversion

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Language: Python - Size: 1.72 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 83 - Forks: 12

CZWin32768/XNLG

AAAI-20 paper: Cross-Lingual Natural Language Generation via Pre-Training

Language: Python - Size: 149 KB - Last synced: 7 months ago - Pushed: almost 3 years ago - Stars: 126 - Forks: 18

harisbinzia/ZeroshotCrosslingualHateSpeech

Improving Zero-Shot Cross-Lingual Hate Speech Detection with Pseudo-Label Fine-Tuning of Transformer Language Models

Size: 4.88 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

maum-ai/sane-tts

SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech

Size: 57 MB - Last synced: 10 months ago - Pushed: 11 months ago - Stars: 5 - Forks: 0

andreabac3/cross-lingual-neural-databases

Codebase of Cross-Lingual Neural Databases

Language: Python - Size: 9.38 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

AmanDaVinci/X-Lingual-Transfer-Learning

Evolution of Representations during Cross Lingual Transfer Learning

Language: Jupyter Notebook - Size: 7.31 MB - Last synced: 10 months ago - Pushed: about 4 years ago - Stars: 0 - Forks: 1

g-laz77/Cross-Lingual-Word-Embeddings

Learn a shared embedding space between words in multiple languages.

Language: Python - Size: 91.8 KB - Last synced: 10 months ago - Pushed: about 6 years ago - Stars: 4 - Forks: 2

zliucr/mixed-language-training

Attention-Informed Mixed-Language Training for Zero-shot Cross-lingual Task-oriented Dialogue Systems (AAAI-2020)

Language: Python - Size: 14.5 MB - Last synced: 10 months ago - Pushed: over 4 years ago - Stars: 31 - Forks: 5

zliucr/crosslingual-slu

EMNLP-2020: Cross-lingual Spoken Language Understanding with Regularized Representation Alignment

Language: Python - Size: 17.9 MB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 18 - Forks: 1

zliucr/Crosslingual-NLU

Zero-shot Cross-lingual Task-Oriented Dialogue Systems (EMNLP 2019)

Language: Python - Size: 50.2 MB - Last synced: 10 months ago - Pushed: over 4 years ago - Stars: 23 - Forks: 5

pauli31/czech-subjectivity-dataset

This is the repository for the newly created Czech Subjectivity Dataset (Subj-CS) and our paper:

Size: 12.7 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

Youggls/ACROSS-ACL23

Official code repo for paper: ACROSS: An Alignment-based Framework for Low-Resource Many-to-One Cross-Lingual Summarization

Size: 2.93 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 11 - Forks: 0

seahore/PPG-GradVC

A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis

Language: Python - Size: 204 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 35 - Forks: 6

ikergarcia1996/MVM-Embeddings

A monolingual and cross-lingual meta-embedding generation framework

Language: Python - Size: 10.4 MB - Last synced: 11 months ago - Pushed: over 4 years ago - Stars: 4 - Forks: 0

ramiyappan/X-Gear

Reproducing baseline model from ACL-2022 paper X-GEAR for Zero-shot Cross-Lingual EAE

Language: Python - Size: 10.3 MB - Last synced: 9 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

GeekDream-x/SemEval2022-Task8-TonyX

Deep-learning system proposed by HFL for SemEval-2022 Task 8: Multilingual News Similarity

Language: Python - Size: 2.82 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 50 - Forks: 5

Xiefeng69/Awesome-Entity-Alignment

Awesome Entity Alignment is a collection of EA techniques, including papers, codes, and datasets.

Size: 32.2 KB - Last synced: about 2 months ago - Pushed: over 1 year ago - Stars: 7 - Forks: 2

yaushian/mSimCSE

mSimCSE: Multilingual SimCSE

Language: Python - Size: 2.62 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 14 - Forks: 1

Pzoom522/xANLG

Data and code for "Understanding Linearity of Cross-Lingual Word Embedding Mappings" (TMLR 2022)

Language: Python - Size: 31.3 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 11 - Forks: 0

chiachienhung/ZusammenQA

ZusammenQA: Data Augmentation with Specialized Models for Cross-lingual Open-retrieval Question Answering System

Language: Python - Size: 12.1 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 3

subhadarship/nlp4if-2021

Cross-lingual misinformation detection

Language: Jupyter Notebook - Size: 13.8 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 3 - Forks: 1

honghanhh/ate-2022

Can Cross-domain Term Extraction Benefit from Cross-lingual Transfer?

Language: Python - Size: 130 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0

GeorgeVern/smala

Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".

Language: Python - Size: 278 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 11 - Forks: 0

krystalan/ClidSum

EMNLP 2022: ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization

Language: Python - Size: 1.44 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 28 - Forks: 2

Akshayanti/cross-lingual-tagging 📦

Cross-Lingual tagging, with single/multiple sources and parameter estimation to improve projection quality

Language: Python - Size: 2.39 GB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

ymcui/Cross-Lingual-MRC

Cross-Lingual Machine Reading Comprehension (EMNLP 2019)

Language: Python - Size: 20.3 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 65 - Forks: 17

manojsukhavasi/Unsupervised-Cross-Lingual-Embeddings

cross-lingual word embeddings with unsupervised learning

Language: Jupyter Notebook - Size: 5.86 KB - Last synced: over 1 year ago - Pushed: over 6 years ago - Stars: 3 - Forks: 2

livc/cross-crfae

Code for the paper `Unsupervised Cross-Lingual Adaptation of Dependency Parsers Using CRF Autoencoders` in the findings of EMNLP 2020.

Language: Python - Size: 30.3 KB - Last synced: over 1 year ago - Pushed: about 3 years ago - Stars: 3 - Forks: 1

jhliu17/xlingual-mrc

Code for the paper "Cross-lingual Machine Reading Comprehension with Language Branch Knowledge Distillation" (COLING 2020)

Language: Python - Size: 43.9 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 8 - Forks: 0

BobXWu/CNPMI

Cross-lingual Normalized Pointwise Mutual Information for cross-lingual topic evaluation.

Language: Python - Size: 42.1 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

peter-yh-wu/cross-lingual

Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity

Size: 2.31 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 1 - Forks: 1

yahah100/cross_lingual_summarization

This repository contains a exploration of cross-lingual summarization using two datasets.

Language: Jupyter Notebook - Size: 90.8 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

SapienzaNLP/unify-srl

Unifying Cross-Lingual Semantic Role Labeling with Heterogeneous Linguistic Resources (NAACL-2021).

Language: Python - Size: 76.2 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 16 - Forks: 1

murali1996/nlp-notes

A curated list of papers and experiments in the field of Natural Language Processing (NLP)

Size: 547 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 5 - Forks: 3

umanlp/ZusammenQA Fork of chiachienhung/ZusammenQA

ZusammenQA: Data Augmentation with Specialized Models for Cross-lingual Open-retrieval Question Answering System

Language: Python - Size: 12.1 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 2 - Forks: 2

sedflix/unsacmt

Unsupervised Sentiment Analysis for Code-mixed Data

Language: Jupyter Notebook - Size: 2.81 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 8 - Forks: 4

alexandra-chron/relm_unmt

Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".

Language: Python - Size: 2.24 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 27 - Forks: 2

Blarc/cross-lingual-question-answering

Implementation of multilingual models for question answering using English and Slovene corpora.

Language: Jupyter Notebook - Size: 11 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

chaitanyamalaviya/NeuralFactorGraph

This repo contains the code for the paper Neural Factor Graph Models for Cross-lingual Morphological Tagging.

Language: Python - Size: 1.5 MB - Last synced: over 1 year ago - Pushed: over 5 years ago - Stars: 51 - Forks: 8

thunlp/CLSP

Code and data for EMNLP 2018 paper "Cross-lingual Lexical Sememe Prediction"

Language: C - Size: 7.42 MB - Last synced: over 1 year ago - Pushed: over 5 years ago - Stars: 19 - Forks: 7

osainz59/XLREMed

Code for the Cross-Lingual Transfer Learning for Medical Relation Extraction

Language: Python - Size: 93.8 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 4 - Forks: 0

Akshayanti/cross-lingual-tools

Tools for working with cross-lingual data.

Language: Python - Size: 53.3 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

deterministic-algorithms-lab/Large-XLM

XLM implementation with utilities to process and train on large multi-lingual datasets, with not enough RAM.

Language: Python - Size: 310 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

jiminsun/multi-oli

Pytorch code for "Multilingual Offensive Language Identification"

Language: Python - Size: 3.61 MB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 2 - Forks: 0

anlausch/XWEAT

Cross-lingual version of WEAT

Language: Python - Size: 84 KB - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 6 - Forks: 4

jxhe/cross-lingual-struct-flow

PyTorch implementation of ACL paper https://arxiv.org/abs/1906.02656

Language: Python - Size: 70.9 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 23 - Forks: 3

JiaWu-Repository/AKE Fork of IIEdm/AKE

Guiding Cross-Lingual Entity Alignment via Adversarial Knowledge Embedding (ICDM 2019)

Size: 19.3 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 3 - Forks: 3

jerbarnes/crosslingual_reordering

Reordering for embedding-based cross-lingual sentiment analysis

Language: TeX - Size: 606 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 4 - Forks: 0

Related Keywords
cross-lingual 62 nlp 17 multilingual 13 natural-language-processing 7 embeddings 6 pytorch 5 word-embeddings 5 language-model 5 multi-lingual 5 bert 5 question-answering 5 cross-lingual-embeddings 4 task-oriented-dialogue 4 crosslingual 4 zero-shot 4 deep-learning 4 transfer-learning 4 sentiment-analysis 3 unsupervised-learning 3 low-resource 3 zero-shot-learning 3 sentence-embeddings 3 data-augmentation 3 information-retrieval 3 machine-learning 3 machine-translation 3 transformers 3 python 3 summarization 3 multilingual-topic-models 2 multilingual-bert 2 dataset 2 translation 2 transformer 2 topic-modeling 2 text-classification 2 pretraining 2 large-language-models 2 voice-conversion 2 paper 2 emnlp 2 few-shot 2 reading-comprehension 2 xlm-roberta 2 entity-alignment 2 knowledge-graph 2 dialogue 2 nlu 2 awesome 2 open-retrieval 2 natural-language 2 specialization 2 chatgpt 2 pretrained-models 2 telugu 1 telugu-language 1 parsing 1 multi-source-tagging 1 language-similarity 1 text-summarization 1 cross-lingual-tagging 1 language 1 backtranslation 1 meta-embeddings 1 monolingual 1 information-extraction 1 mt5 1 nlp-machine-learning 1 computational-linguistics 1 semantic-similarity 1 semeval-2022 1 word-analogy 1 word-embedding 1 misinformation 1 acter 1 ate 1 cross-domain 1 rsdo5 1 term-extraction 1 xnli 1 cross-lingual-summarization 1 dialogue-summarization 1 low-resource-languages 1 residual-adapters 1 unsupervised-machine-translation 1 graphical-models 1 morphology 1 structured-prediction 1 sememe 1 ehealth-kd 1 medical 1 relation-extraction 1 bpe-codes 1 nmt 1 vocabulary 1 xlm 1 offensive-language-detection 1 bias 1 dependency-parsing 1 invertible-transformation 1