Topic: "code-mixed"
gentaiscool/code-switching-papers
A curated list of research papers and resources on code-switching
Size: 178 KB - Last synced at: 18 days ago - Pushed at: 6 months ago - Stars: 315 - Forks: 39

sagorbrur/codeswitch
CodeSwitch is a NLP tool, can use for language identification, pos tagging, name entity recognition, sentiment analysis of code mixed data.
Language: Jupyter Notebook - Size: 23.4 KB - Last synced at: 23 days ago - Pushed at: over 4 years ago - Stars: 35 - Forks: 6

aditeyabaral/calbert
CalBERT - Code-mixed Adaptive Language representations using BERT, published at AAAI-MAKE 2022
Language: Python - Size: 118 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 13 - Forks: 3

sakshidgoel/Bilingual-Sentiment-Analysis
The main aim of the project is to develop a sentiment analyzer that can be used on twitter data to classify it as positive or negative. Our project takes care of the challenge of bilingual comments, where people tweet in two languages, in this case Hindi and English, in the Latin Alphabet.
Language: Jupyter Notebook - Size: 13.6 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 11 - Forks: 10

sedflix/unsacmt
Unsupervised Sentiment Analysis for Code-mixed Data
Language: Jupyter Notebook - Size: 2.81 MB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 8 - Forks: 4

fokhruli/CM-seti-anlysis
Implementation for the paper titled, " Data-Augmentation for Bangla-English Code-Mixed Sentiment Analysis: Enhancing Cross Linguistic Contextual Understanding", IEEE Access, 2023
Language: Python - Size: 2.33 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

manashb21/HomeAutomationLLM
This project focuses on fine-tuning Llama 3.2 (1B) to generate structured YAML automation commands in Nepali-English, optimized for low-resource deployment.
Language: Jupyter Notebook - Size: 76.5 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

eftekhar-hossain/CUET_NLP-EACL_2021
This repository contains the system description and the codes that we implemented for participating in EACL-2021 shared tasks.
Language: Jupyter Notebook - Size: 8.64 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 3

jessicasaikia/hidden-markov-model-HMM
This repository implements a Hidden Markov Model (HMM) for performing Parts of Speech (POS) Tagging on Assamese-English code-mixed texts.
Language: Python - Size: 358 KB - Last synced at: 13 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

jessicasaikia/conditional-random-field-CRF
This repository implements a Conditional Random Field (CRF) for performing Parts-of-Speech (POS) Tagging on Assamese-English code-mixed texts.
Language: Python - Size: 10.7 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

jessicasaikia/long-short-term-memory-LSTM
This repository implements a Long Short Term Memory (LSTM) for performing Parts-of-Speech (POS) Tagging on Assamese-English code-mixed texts.
Language: Python - Size: 16.6 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

jessicasaikia/bidirectional-long-short-term-memory-BiLSTM
This repository implements a Bidirectional Long Short Term Memory (BiLSTM) for performing Parts-of-Speech (POS) Tagging on Assamese-English code-mixed texts.
Language: Python - Size: 11.7 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

jessicasaikia/multilingual-BERT-mBERT
This repository implements a Multilingual BERT (mBERT) model for performing Parts-of-Speech (POS) Tagging on Assamese-English code-mixed texts.
Language: Python - Size: 11.7 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

jessicasaikia/rule-based
This repository contains a simple Rule-Based Model for Parts-of-Speech tagging in Assamese-English code mixed texts.
Language: Python - Size: 352 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Nexdata-AI/302-Person-Hindi-and-English-Bilingual-Spontaneous-Monologue-smartphone-speech-dataset
302-Person-Hindi-and-English-Bilingual-Spontaneous-Monologue-smartphone-speech-dataset
Size: 2.93 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

carexl8/code-mixed-tweets
Tweet ids for code-mixed Russian-German and Russian-Hebrew tweets
Size: 20.5 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Shubh-Nisar/commentator
A code-mixed annotation tool aimed at increasing the annotation quality whilst reducing the annotation time and various overheads associated with code-mixed data.
Language: JavaScript - Size: 3.06 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 3

dharun-narayanan/Code-Mixed-Data
Language: Jupyter Notebook - Size: 8 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 1

ShreyPandit/Abuse-detection-using-Federated-Learning
The code for SOP project done for the topic of Abuse detection in multilingual code-switched and code-mixed language using federated learning
Language: Jupyter Notebook - Size: 55.7 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

krishhrana/Sentiment_analysis
Language: Jupyter Notebook - Size: 3.47 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0
