An open API service providing repository metadata for many open source software ecosystems.

Topic: "code-mixed"

gentaiscool/code-switching-papers

A curated list of research papers and resources on code-switching

Size: 178 KB - Last synced at: 18 days ago - Pushed at: 6 months ago - Stars: 315 - Forks: 39

sagorbrur/codeswitch

CodeSwitch is a NLP tool, can use for language identification, pos tagging, name entity recognition, sentiment analysis of code mixed data.

Language: Jupyter Notebook - Size: 23.4 KB - Last synced at: 23 days ago - Pushed at: over 4 years ago - Stars: 35 - Forks: 6

aditeyabaral/calbert

CalBERT - Code-mixed Adaptive Language representations using BERT, published at AAAI-MAKE 2022

Language: Python - Size: 118 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 13 - Forks: 3

sakshidgoel/Bilingual-Sentiment-Analysis

The main aim of the project is to develop a sentiment analyzer that can be used on twitter data to classify it as positive or negative. Our project takes care of the challenge of bilingual comments, where people tweet in two languages, in this case Hindi and English, in the Latin Alphabet.

Language: Jupyter Notebook - Size: 13.6 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 11 - Forks: 10

sedflix/unsacmt

Unsupervised Sentiment Analysis for Code-mixed Data

Language: Jupyter Notebook - Size: 2.81 MB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 8 - Forks: 4

fokhruli/CM-seti-anlysis

Implementation for the paper titled, " Data-Augmentation for Bangla-English Code-Mixed Sentiment Analysis: Enhancing Cross Linguistic Contextual Understanding", IEEE Access, 2023

Language: Python - Size: 2.33 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

manashb21/HomeAutomationLLM

This project focuses on fine-tuning Llama 3.2 (1B) to generate structured YAML automation commands in Nepali-English, optimized for low-resource deployment.

Language: Jupyter Notebook - Size: 76.5 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

eftekhar-hossain/CUET_NLP-EACL_2021

This repository contains the system description and the codes that we implemented for participating in EACL-2021 shared tasks.

Language: Jupyter Notebook - Size: 8.64 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 3

jessicasaikia/hidden-markov-model-HMM

This repository implements a Hidden Markov Model (HMM) for performing Parts of Speech (POS) Tagging on Assamese-English code-mixed texts.

Language: Python - Size: 358 KB - Last synced at: 13 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

jessicasaikia/conditional-random-field-CRF

This repository implements a Conditional Random Field (CRF) for performing Parts-of-Speech (POS) Tagging on Assamese-English code-mixed texts.

Language: Python - Size: 10.7 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

jessicasaikia/long-short-term-memory-LSTM

This repository implements a Long Short Term Memory (LSTM) for performing Parts-of-Speech (POS) Tagging on Assamese-English code-mixed texts.

Language: Python - Size: 16.6 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

jessicasaikia/bidirectional-long-short-term-memory-BiLSTM

This repository implements a Bidirectional Long Short Term Memory (BiLSTM) for performing Parts-of-Speech (POS) Tagging on Assamese-English code-mixed texts.

Language: Python - Size: 11.7 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

jessicasaikia/multilingual-BERT-mBERT

This repository implements a Multilingual BERT (mBERT) model for performing Parts-of-Speech (POS) Tagging on Assamese-English code-mixed texts.

Language: Python - Size: 11.7 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

jessicasaikia/rule-based

This repository contains a simple Rule-Based Model for Parts-of-Speech tagging in Assamese-English code mixed texts.

Language: Python - Size: 352 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Nexdata-AI/302-Person-Hindi-and-English-Bilingual-Spontaneous-Monologue-smartphone-speech-dataset

302-Person-Hindi-and-English-Bilingual-Spontaneous-Monologue-smartphone-speech-dataset

Size: 2.93 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

carexl8/code-mixed-tweets

Tweet ids for code-mixed Russian-German and Russian-Hebrew tweets

Size: 20.5 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Shubh-Nisar/commentator

A code-mixed annotation tool aimed at increasing the annotation quality whilst reducing the annotation time and various overheads associated with code-mixed data.

Language: JavaScript - Size: 3.06 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 3

dharun-narayanan/Code-Mixed-Data

Language: Jupyter Notebook - Size: 8 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 1

ShreyPandit/Abuse-detection-using-Federated-Learning

The code for SOP project done for the topic of Abuse detection in multilingual code-switched and code-mixed language using federated learning

Language: Jupyter Notebook - Size: 55.7 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

krishhrana/Sentiment_analysis

Language: Jupyter Notebook - Size: 3.47 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

Related Topics
nlp 10 code-mixing 8 english 7 pos-tagging 7 assamese 6 assamese-text 6 english-language 6 nlp-machine-learning 6 parts-of-speech 6 pos-tagger 6 sentiment-analysis 5 parts-of-speech-tagging 5 code-switching 4 hindi-english 3 natural-language-processing 3 multilingual 2 code-switch 2 transformers 2 deep-learning 2 embeddings 1 multi-lingual 1 hmm-viterbi-algorithm 1 unsupervised 1 zero-shot-learning 1 light 1 cross-lingual 1 social-media 1 federated-learning 1 abuse-detection 1 abuse 1 spontaneous-speech-recognition 1 speech-recognition 1 asr 1 low-resource-languages 1 spanish-english 1 pos 1 ner 1 language-identification 1 huggingface 1 hmm-model 1 hmm 1 hidden-markov-model 1 speech 1 research 1 papers 1 language 1 bilingual 1 transformer 1 machine-learning 1 bert 1 bilstm-model 1 bilstm 1 bidirectional-lstm 1 bidirectional-long-short-term-memory-network 1 multilingual-bert 1 mbert 1 nepali-english 1 llm 1 light-automation 1 rule-based 1 assamese-language 1 assamese-english 1 text-classification 1 offensive-language 1 hope-speech-detection 1 machinelearning-python 1 twitter 1 tweets 1 russian 1 hebrew 1 german 1 twitter-data 1 kaggle-dataset 1 hybrid-model 1 ensemble-models 1 classifiers 1 bilingual-comments 1 tamil 1 python 1 codeswitch 1 reactjs 1 kappa 1 hinglish 1 flask 1 docker 1 data 1 annotations 1 part-of-speech-tagging 1 lstm-neural-networks 1 lstm-model 1 lstm 1 long-short-term-memory-models 1 long-short-term-memory 1 crfsuite 1 crf-model 1 crf 1 conditional-random-field 1 rule-based-nlp 1 rule-based-modeling 1