Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: code-switching
vcyrot/Frenglish-Benchmark
A Centralized Frenglish Benchmark from Naturally Occurring Code-Switching and Code-Mixing
Size: 105 KB - Last synced: about 15 hours ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
Lidan0241/language-detection
language detection in code-switching for es/en/zh speakers
Language: Jupyter Notebook - Size: 4.6 MB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 1 - Forks: 0
Tomiinek/Multilingual_Text_to_Speech 📦
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Language: Python - Size: 42.5 MB - Last synced: 21 days ago - Pushed: 8 months ago - Stars: 810 - Forks: 155
PPPI/POSIT
POSIT aims to segment and tag mixed-text that contains English and C-like code, such that the user both knows what a token is, and within the language it's used in, what role, such as an AST tag or PoS tag, it serves.
Language: Python - Size: 51.5 MB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 6 - Forks: 2
97arushisharma/Hindi-English-Code-Switching
A simple UI to translate a text written in romanised hindi form to fully english sentence
Language: Lex - Size: 10.2 MB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 2 - Forks: 0
ctarnold/jpLLM
working on llm research
Language: Python - Size: 4.41 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
microsoft/CodeMixed-Text-Generator
This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.
Language: Jupyter Notebook - Size: 3.79 MB - Last synced: about 2 months ago - Pushed: about 1 year ago - Stars: 47 - Forks: 12
sagorbrur/codeswitch
CodeSwitch is a NLP tool, can use for language identification, pos tagging, name entity recognition, sentiment analysis of code mixed data.
Language: Jupyter Notebook - Size: 23.4 KB - Last synced: 14 days ago - Pushed: over 3 years ago - Stars: 30 - Forks: 6
audioku/meta-transfer-learning
Implementation of meta-transfer-learning for ASR and LM (ACL 2020)
Language: Python - Size: 6 MB - Last synced: 4 months ago - Pushed: almost 4 years ago - Stars: 47 - Forks: 10
ishan00/translation-for-code-switching-acl
Official repository for the paper titled "From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text" accepted at ACL 2021
Language: Python - Size: 8.59 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 3 - Forks: 2
microsoft/LID-tool
This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The text that includes words from two languages such as Hindi written in roman script, mixed with English.
Language: Python - Size: 2.16 MB - Last synced: about 2 months ago - Pushed: almost 4 years ago - Stars: 45 - Forks: 11
Anwarvic/truel_bilingual_nmt
The official code for the "True Bilingual NMT" paper
Language: Python - Size: 3.59 MB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
AddisonDP/OTMTextDataAnalysis
Data Analysis Toolkit for On the Margins, LLC
Language: Python - Size: 28.3 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
RaiBP/translation-detection
Python program for detecting unintentional bilingual and translation instances in NLP datasets.
Language: Python - Size: 26.4 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
Nativeatom/NaturalLanguageProcessing
Natural Language Procesing
Size: 165 KB - Last synced: 6 months ago - Pushed: about 3 years ago - Stars: 34 - Forks: 9
umar1997/propaganda-codeswitched-text
[EMNLP 2023] Official repository of paper titled "Detecting Propaganda Techniques in Code-Switched Social Media Text"
Language: Jupyter Notebook - Size: 47 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 1 - Forks: 1
gentaiscool/code-switching-papers
A curated list of research papers and resources on code-switching
Size: 302 KB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 262 - Forks: 33
coolEphemeroptera/Foreign_Pronunciation_Generator_for_Code-Switch_ASR
a socket script to obtain chinese phones-sequence for any english word
Language: Python - Size: 33.2 KB - Last synced: 8 months ago - Pushed: over 2 years ago - Stars: 5 - Forks: 0
Nexdata-AI/207-Hours-Japanese-Speaking-English-Speech-Data-by-Mobile-Phone
Japanese Speaking English Speech Dataset
Size: 347 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1 - Forks: 0
Nexdata-AI/300-Hours-Mixed-Speech-with-Korean-and-English-Data-by-Mobile-Phone
Mixed Speech with Korean and English Dataset
Size: 442 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 2 - Forks: 0
mvidaldp/cs_catalan_spanish
Catalan-Spanish code-switching web-based online experiment including a Bilingual Language Profile building questionnaire.
Language: JavaScript - Size: 1.78 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 1 - Forks: 0
gentaiscool/meta-emb
Multilingual Meta-Embeddings for Named Entity Recognition (RepL4NLP & EMNLP 2019)
Language: Python - Size: 3.02 MB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 31 - Forks: 4
javadr/PyTorch-Detect-Code-Switching
Implementation of a deep learning model (BiLSTM) to detect code-switching
Language: Python - Size: 9.55 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 5 - Forks: 0
dieuthu/sequencetagging
A sequence tagging model with active learning
Language: Python - Size: 123 MB - Last synced: 10 months ago - Pushed: about 6 years ago - Stars: 7 - Forks: 0
carexl8/code-mixed-tweets
Tweet ids for code-mixed Russian-German and Russian-Hebrew tweets
Size: 20.5 KB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 0 - Forks: 0
kjgpta/NSC-Code-Switch-Analysis
Code-switching analysis based on categories like Age, Gender and part-of-speech
Language: Jupyter Notebook - Size: 57.6 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
kjgpta/Code-Switch-Language-Modeling-for-English-and-Malay
Code-Switched Data generation based on Part-of-speech and Language Modeling of the generated text.
Language: Jupyter Notebook - Size: 153 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
ash-shar/Code-Switching-and-Swearing-Patterns-on-Twitter
Repository containing Abusive Tweet Detection, Location Detection and Gender Detection codes
Language: Python - Size: 1.97 MB - Last synced: 9 months ago - Pushed: over 6 years ago - Stars: 6 - Forks: 2
mmaguero/josa-corpus
Jopara (Guarani-dominant mixed with Spanish) sentiment analysis corpus
Size: 8.79 KB - Last synced: over 1 year ago - Pushed: about 2 years ago - Stars: 6 - Forks: 0
feyzaakyurek/newsframing
Code repository for ACL2020 paper Multi-label and Multilingual News Framing Analysis
Language: Jupyter Notebook - Size: 2.7 MB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 6 - Forks: 2
jonathandunn/pacific_CodeSwitch
Code-switching detection for Pacific languages
Language: Python - Size: 219 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
andi611/CS-Tacotron-Pytorch
Pytorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TTS model.
Language: Python - Size: 155 MB - Last synced: over 1 year ago - Pushed: about 5 years ago - Stars: 21 - Forks: 7
kolloqe/react-kbi-si-en
Kolloqe Input Component with code-switching support between Sinhala and English
Size: 694 KB - Last synced: 25 days ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
kolloqe/react-kbi-si-en-html
Kolloqe Input Component with code-switching support between Sinhala and English attachable via <script> tags
Language: JavaScript - Size: 849 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
gentaiscool/multi-task-cs-lm
Code-Switching Language Modeling using Syntax-Aware Multi-Task Learning (CALCS 2018, ACL)
Language: Python - Size: 978 KB - Last synced: 10 months ago - Pushed: almost 5 years ago - Stars: 9 - Forks: 3
vincenthuang75025/chinglish
Chrome extension for translating highlighted English text into Chinglish (a chinese + english hybrid)
Language: Python - Size: 128 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 2 - Forks: 0
sedflix/unsacmt
Unsupervised Sentiment Analysis for Code-mixed Data
Language: Jupyter Notebook - Size: 2.81 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 8 - Forks: 4
amsuhane/ACL20-Code-switching-patterns
Code-switching patterns can be an effective route to improve performance of downstream NLP applications: A case study of humour, sarcasm and hate speech detection
Language: Python - Size: 4.72 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 10 - Forks: 1
vsoto/crowdsourced_bangor
This repository contains crowdsourced universal part-of-speech tags for the Miami Bangor corpus.
Size: 2.33 MB - Last synced: over 1 year ago - Pushed: about 5 years ago - Stars: 1 - Forks: 0
kmi-linguistics/Code-mixing
Size: 3.91 KB - Last synced: 12 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0