Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: vietnamese-nlp
undertheseanlp/underthesea
Underthesea - Vietnamese NLP Toolkit
Language: Python - Size: 166 MB - Last synced: 2 days ago - Pushed: 3 days ago - Stars: 1,336 - Forks: 270
VinAIResearch/PhoBERT
PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)
Size: 168 KB - Last synced: 4 days ago - Pushed: 4 months ago - Stars: 632 - Forks: 91
nguyenvulebinh/vietnamese-electra
Electra pre-trained model using Vietnamese corpus
Language: Jupyter Notebook - Size: 64.8 MB - Last synced: 4 days ago - Pushed: almost 1 year ago - Stars: 64 - Forks: 11
dnanhkhoa/python-vncorenlp
A Python wrapper for VnCoreNLP using a bidirectional communication channel.
Language: Python - Size: 40 KB - Last synced: 10 days ago - Pushed: almost 6 years ago - Stars: 54 - Forks: 17
tarudesu/ViHateT5
Repository for the paper "ViHateT5: Enhancing Hate Speech Detection in Vietnamese with A Unified Text-to-Text Transformer Model" (ACL'2024)
Language: Python - Size: 201 KB - Last synced: 25 days ago - Pushed: 26 days ago - Stars: 0 - Forks: 0
vntk/dictionary
Vietnamese Dictionary for Node
Language: JavaScript - Size: 2.64 MB - Last synced: 16 days ago - Pushed: over 6 years ago - Stars: 17 - Forks: 7
anti-aii/RagE
RagE (RAG Engine) - A tool supporting the construction and training of components of the Retrieval-Augmented-Generation (RAG) model. It also facilitates the rapid development of Q&A systems and chatbots following the RAG model.
Language: Python - Size: 5.63 MB - Last synced: 13 days ago - Pushed: about 1 month ago - Stars: 3 - Forks: 1
Oztobuzz/Vista
This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations and images
Language: Python - Size: 1.79 MB - Last synced: 28 days ago - Pushed: 29 days ago - Stars: 13 - Forks: 0
chauminhnguyen/Dual-Transformer
Dual-Transformer for Image to Poem
Language: Python - Size: 26.6 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 2 - Forks: 0
lupanh/VietnameseMDS
Size: 121 KB - Last synced: about 2 months ago - Pushed: over 10 years ago - Stars: 31 - Forks: 19
undertheseanlp/automatic_speech_recognition
Vietnamese Automatic Speech Recognition
Language: Python - Size: 131 MB - Last synced: about 2 months ago - Pushed: over 5 years ago - Stars: 61 - Forks: 37
ngoanpv/llama2_vietnamese
A fine-tuned Large Language Model (LLM) for the Vietnamese language based on the Llama 2 model.
Language: Python - Size: 508 KB - Last synced: about 2 months ago - Pushed: 9 months ago - Stars: 9 - Forks: 1
yeuai/yeuai-sdk-nodejs
Node.js SDK for yeu.ai
Language: JavaScript - Size: 15.6 KB - Last synced: about 2 months ago - Pushed: over 6 years ago - Stars: 2 - Forks: 1
vTuanpham/Vietnamese_QA_System
Vietnamese long form question answering system with documents retrieval.
Language: Python - Size: 444 KB - Last synced: about 2 months ago - Pushed: 3 months ago - Stars: 17 - Forks: 6
VinAIResearch/PhoNLP
PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)
Language: Python - Size: 589 KB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 131 - Forks: 16
vunb/vntk
Vietnamese NLP Toolkit for Node
Language: JavaScript - Size: 3.56 MB - Last synced: about 2 months ago - Pushed: 4 months ago - Stars: 207 - Forks: 61
ndthuan/vi-word-segmenter
HTTP wrapper of the VnCoreNLP library - A Vietnamese natural language processing toolkit
Language: Java - Size: 82 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 1 - Forks: 0
behitek/vncorenlp-wrapper 📦
A python wrapper for VnCoreNLP
Language: Python - Size: 135 MB - Last synced: 2 months ago - Pushed: almost 4 years ago - Stars: 1 - Forks: 0
behitek/vietnam-sensitive-words 📦
Vietnamese sensitive words (including teencode) was created by ML algorithm
Size: 48.8 KB - Last synced: 2 months ago - Pushed: over 3 years ago - Stars: 60 - Forks: 22
vncorenlp/VnCoreNLP
A Vietnamese natural language processing toolkit (NAACL 2018)
Language: Java - Size: 232 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 549 - Forks: 140
vntk/vntk-tagger
Experiments of Basic Vietnamese NLP Problems and Named Entity Recognition Tool
Language: JavaScript - Size: 1.44 MB - Last synced: 3 months ago - Pushed: about 4 years ago - Stars: 0 - Forks: 2
vntk/vntk-cli
CLI for VNTK Applications
Size: 1000 Bytes - Last synced: 3 months ago - Pushed: about 6 years ago - Stars: 0 - Forks: 1
vntk/preprocess
Corpus preprocessing
Size: 105 KB - Last synced: 3 months ago - Pushed: about 11 years ago - Stars: 0 - Forks: 0
nguyenvulebinh/vietnamese-roberta
A Robustly Optimized BERT Pretraining Approach for Vietnamese
Language: Python - Size: 10.7 KB - Last synced: 4 days ago - Pushed: almost 3 years ago - Stars: 27 - Forks: 5
undertheseanlp/chatbot
Vietnamese Chatbot
Language: C - Size: 267 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 88 - Forks: 47
huyhoang8909/CoreNLP Fork of stanfordnlp/CoreNLP
Stanford CoreNLP: A Java suite of core NLP tools.
Language: Java - Size: 282 MB - Last synced: 3 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0
HySonLab/ViDeBERTa
ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023
Language: Jupyter Notebook - Size: 388 MB - Last synced: 29 days ago - Pushed: 8 months ago - Stars: 51 - Forks: 9
VuBacktracking/Deep-Neural-Network-Vietnamese-Student-Feedback-Sentiment-Analysis
Vietnamese Student Feedback Sentiment Analysis
Language: Jupyter Notebook - Size: 34.4 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0
magizbox/scraper
Scraper
Language: Python - Size: 74.8 MB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 13 - Forks: 7
ngxtnhi/ViLexNorm
A Lexical Normalization Corpus for Vietnamese Social Media Text
Size: 485 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 4 - Forks: 0
QuaCau-TheSphere/tranky-test
Language: TypeScript - Size: 1.22 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
Nguyendat-bit/qa_information_utt
My university graduation thesis with the topic of building an automatic information question and answer system for the University of Transport Technology (UTT)
Language: Python - Size: 27.2 MB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0
Anshler/vietnamese-poem-classifier
Classify genre and score Vietnamese poems 📜🔍
Language: Python - Size: 59.6 KB - Last synced: 11 days ago - Pushed: 5 months ago - Stars: 4 - Forks: 0
Anshler/poem_generator
Generate Vietnamese poem with natural language prompts 📜🖋️
Language: Jupyter Notebook - Size: 59.8 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0
duongkstn/albert-vi-as-service
albert-vi-as-service: A Fork of bert-as-service to deploy albert_vi
Language: Python - Size: 8.57 MB - Last synced: 6 months ago - Pushed: about 4 years ago - Stars: 12 - Forks: 3
VinAIResearch/PhoGPT
PhoGPT: Generative Pre-training for Vietnamese
Size: 22.5 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 370 - Forks: 27
Anshler/fake_news_detector
Fake news detection in English and Vietnamese 📰❌
Language: Python - Size: 130 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
datquocnguyen/VnDT
VnDT: A Vietnamese Dependency Treebank
Size: 2.05 MB - Last synced: 4 days ago - Pushed: over 2 years ago - Stars: 18 - Forks: 1
VinAIResearch/BARTpho
BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)
Size: 683 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 90 - Forks: 7
VinAIResearch/PhoNER_COVID19
COVID-19 Named Entity Recognition for Vietnamese (NAACL 2021)
Size: 3.53 MB - Last synced: 30 days ago - Pushed: almost 2 years ago - Stars: 59 - Forks: 16
phusroyal/ViHOS
Repository for the paper "ViHOS: Vietnamese Hate and Offensive Spans Detection" (EACL2023)
Language: Jupyter Notebook - Size: 5.68 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 26 - Forks: 6
VinAIResearch/VinAI_Translate
A Vietnamese-English Neural Machine Translation System (INTERSPEECH 2022)
Size: 56.6 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 101 - Forks: 14
undertheseanlp/NLP-Vietnamese-progress
Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for the most common Vietnamese NLP tasks.
Size: 353 KB - Last synced: 7 months ago - Pushed: almost 2 years ago - Stars: 297 - Forks: 70
vietai/aivivn-vn-diacritic
Vietnamese Diacritic Restoration using Transformer Sequence-to-Sequence Model
Language: Python - Size: 6.84 KB - Last synced: 8 months ago - Pushed: over 4 years ago - Stars: 2 - Forks: 0
pbcquoc/vietnamese_ocr
vietnamese OCR
Language: Python - Size: 93.8 KB - Last synced: 7 months ago - Pushed: about 5 years ago - Stars: 116 - Forks: 48
autobotasia/vitone
Tự động thêm dấu tiếng việt dùng Transformer model
Language: Python - Size: 7.62 MB - Last synced: 8 months ago - Pushed: almost 4 years ago - Stars: 3 - Forks: 1
Nguyendat-bit/VieTokenizer
Vietnamese Tokenizer package based on deeplearning methods
Language: Python - Size: 13.7 KB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 2 - Forks: 1
anhthuan1999/Vietnamese-News-Classification
We use LSTM, BiLSTM, BERT and SVM with TF-IDF, Word2vec and Bag-of-words to classify this documents to positive (labeled as 1), neutral (labeled as 0) and negative (labeled as 2)
Language: Jupyter Notebook - Size: 1.64 MB - Last synced: 8 months ago - Pushed: 9 months ago - Stars: 27 - Forks: 12
hieunguyen1053/named-entity-recognition-vietnamese
Thử nghiệm một số mô hình giải quyết bài toán nhận dạng thực thể tên tiếng Việt
Language: Python - Size: 78.8 MB - Last synced: 9 months ago - Pushed: almost 3 years ago - Stars: 2 - Forks: 0
jackNhat/classification Fork of HoangNamHai/underthesea.classification
Vietnamese Text Classification experiments
Language: Python - Size: 1.54 GB - Last synced: 9 months ago - Pushed: over 5 years ago - Stars: 2 - Forks: 5
jackNhat/named_entity_recognition
Vietnamese Named Entity Recognition Experiments
Language: Python - Size: 8.45 MB - Last synced: 9 months ago - Pushed: over 4 years ago - Stars: 2 - Forks: 1
VinAIResearch/PhoMT
PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)
Size: 6.84 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 33 - Forks: 3
vinhtran2611/KieuGPT
AI writes poetry
Language: Python - Size: 34.8 MB - Last synced: 10 months ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0
trietnm2/BkParser
BkParser - Vietnamese POS tagger and dependency parser
Language: Python - Size: 39.7 MB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 2 - Forks: 0
dangvansam/phoneme2grapheme-vietnamese
convert phoneme to grapheme vietnames
Language: Python - Size: 6.84 KB - Last synced: 10 months ago - Pushed: almost 4 years ago - Stars: 4 - Forks: 2
dangvansam/nvidia-nemo-jasper-quartznet-asr-vietnamese
Nhận dạng giọng nói Tiếng Việt sử dụng model Quartznet (Nvidia) + flask demo
Language: Python - Size: 925 MB - Last synced: 10 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0
dangvansam/viet-asr
VietASR - Vietnamese Automatic Speech Recognition
Language: Python - Size: 289 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 65 - Forks: 35
manleviet/Bat-loi-chinh-ta-tieng-Viet-dua-tren-phan-tich-ngu-phap 📦
A check-spelling application based on analyzing the grammar of Vietnamese phrases
Language: Visual Basic - Size: 27.5 MB - Last synced: 10 months ago - Pushed: almost 8 years ago - Stars: 2 - Forks: 3
daivuongktx13/VNSpellCorrection
Vietnamese Spelling Correction using Transformer with Tokenization Repair
Language: Python - Size: 26.7 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
Tuan-Lee-23/Vietnamese-corpus-search-and-analysis-Web-app
Vietnamese corpus search tools and statistical analysis
Language: Python - Size: 36.8 MB - Last synced: 10 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 1
toandokhanh/Text-BasedVideoSummarizer
This is an internship project of mine with the desire to help businesses have an automatic system of classifying and summarizing video content with natural language processing methods.
Language: Jupyter Notebook - Size: 203 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0
ndthuan/go-vi-wordseg-client
Go client library for the ndthuan/vi-word-segmenter service
Language: Go - Size: 19.5 KB - Last synced: 11 months ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0
Tuan-Lee-23/Vietnamese-News-Generative-Model
A Fine-tuned Vietnamese GPT2 model which can generate Vietnamese news based on context (category + headline)
Language: Jupyter Notebook - Size: 186 KB - Last synced: 10 months ago - Pushed: over 2 years ago - Stars: 1 - Forks: 2
duyvuleo/VNTC
A Large-scale Vietnamese News Text Classification Corpus
Size: 157 MB - Last synced: 12 months ago - Pushed: over 4 years ago - Stars: 87 - Forks: 53
nicolay-r/ViLongT5
LongT5-based model pre-trained on a large amount of unlabeled Vietnamese news texts and fine-tuned with ViMS and VMDS collections
Language: Python - Size: 3.37 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0
lupanh/Vietnamese-Person-Questions-Dataset
Tập dữ liệu câu hỏi về người trong tiếng Việt đã được gán nhãn
Size: 118 KB - Last synced: about 2 months ago - Pushed: almost 9 years ago - Stars: 14 - Forks: 8
ntdas/public_instructions_dataset
Public instruction dataset, put in one place.
Size: 0 Bytes - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
trungngv/web_scraping
Web scraping
Language: Python - Size: 1.6 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 5 - Forks: 3
baodv1001/TrendBot
Language: Python - Size: 156 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 4 - Forks: 2
HuuHuy227/Vietnamese-Chatbot-Transformer
Vietnamese Chatbot using Transformer Architecture
Language: Jupyter Notebook - Size: 30.3 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 0
kelvng/vietnamese-pos-tagging
Xây dựng chương trình (tool) gán nhãn từ loại (POS tagger) cho tiếng Việt.
Language: Jupyter Notebook - Size: 178 MB - Last synced: 11 months ago - Pushed: over 2 years ago - Stars: 3 - Forks: 0
vudaoanhtuan/vietnamese-tone-prediction
restore tone for missing tone sentences
Language: Python - Size: 454 KB - Last synced: over 1 year ago - Pushed: almost 5 years ago - Stars: 7 - Forks: 4
undertheseanlp/pos_tag 📦
Vietnamese POS Tagging
Language: Python - Size: 24.6 MB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 12 - Forks: 4
ngwgsang/vqda
vqda is to provide data augmentation methods for Vietnamese questions.
Language: Python - Size: 42 KB - Last synced: 2 months ago - Pushed: about 1 year ago - Stars: 5 - Forks: 0
undertheseanlp/word_tokenize 📦
Vietnamese Word Tokenize
Language: Python - Size: 28.5 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 45 - Forks: 24
kh4nh12/ViTASD
A novel dataset and method for Vietnamese Target-Aspect-Sentiment joint detection (ViTASD)
Size: 45.9 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 3 - Forks: 0
sonvx/VietSentiWordNet
[VietSentiWordNet] A quick and simple method to find Opinion for Vietnamese text.
Language: Java - Size: 41.1 MB - Last synced: over 1 year ago - Pushed: almost 7 years ago - Stars: 26 - Forks: 11
mailong25/bert-vietnamese-question-answering
Vietnamese question answering system with BERT
Language: Python - Size: 3.18 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 87 - Forks: 38
dinhanhx/VisualRoBERTa
The first public Vietnamese visual linguistic foundation model(s)
Language: Python - Size: 98.6 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 2 - Forks: 1
hoangks5/vietnamese-sentences
Language: Python - Size: 30 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 18 - Forks: 8
undertheseanlp/speech_classification
Vietnamese Speech Classification experiments
Language: Python - Size: 14.6 MB - Last synced: over 1 year ago - Pushed: over 5 years ago - Stars: 7 - Forks: 3
undertheseanlp/sentiment
Vietnamese Sentiment Analysis
Language: Python - Size: 952 MB - Last synced: over 1 year ago - Pushed: about 5 years ago - Stars: 17 - Forks: 12
datquocnguyen/RDRsegmenter
A Fast and Accurate Vietnamese Word Segmenter (LREC 2018)
Language: Java - Size: 420 KB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 69 - Forks: 9
tienthanhdhcn/Vietnamese-Accent-Prediction
A simple/fast/accurate accent prediction for non-accented Vietnamese text
Language: Java - Size: 40.2 MB - Last synced: over 1 year ago - Pushed: over 6 years ago - Stars: 32 - Forks: 9
undertheseanlp/corpus.viwiki
Vietnamese Wikipedia Corpus
Language: Python - Size: 51.4 MB - Last synced: over 1 year ago - Pushed: about 7 years ago - Stars: 15 - Forks: 7
209sontung/Vietnamese-stock-article-classification
Sentiment-based classification for stock article title using PhoBert
Language: Jupyter Notebook - Size: 311 KB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 16 - Forks: 4
thangntt2/pivi
My little Vietnamese NLP toolkit
Language: Python - Size: 5.24 MB - Last synced: over 1 year ago - Pushed: almost 7 years ago - Stars: 4 - Forks: 0
undertheseanlp/slp3-vietnamese
Speech and Language Processing 3rd edition Vietnamese Translation
Language: TeX - Size: 1.09 MB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 22 - Forks: 2
dinhanhx/VL-datasets
Some Python scripts to load Vietnamese visual linguistic data
Language: Python - Size: 1.95 KB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 1
datquocnguyen/VnMarMoT
A state-of-the-art pre-trained model for Vietnamese POS tagging (ALTA 2017)
Language: Python - Size: 27.3 MB - Last synced: over 1 year ago - Pushed: almost 5 years ago - Stars: 7 - Forks: 0
NeroYuki/yourchatstarterv2a
Chatbot service with vietnamese natural language processing support
Language: JavaScript - Size: 39.9 MB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 1
trongtuyen99/ViWikiSum
vietnamese multi doc-summarization dataset
Size: 37.3 MB - Last synced: over 1 year ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0
dangnam739/semantic-analysis-vietnamese
It is my project to build a simple web app that can sematic analysis Vietnamese comment or review.
Language: Jupyter Notebook - Size: 125 MB - Last synced: over 1 year ago - Pushed: about 4 years ago - Stars: 1 - Forks: 0
undertheseanlp/chunking
Vietnamese Chunking experiments
Language: Python - Size: 8.97 MB - Last synced: over 1 year ago - Pushed: over 5 years ago - Stars: 5 - Forks: 1
SCIMTA/Tap-News Fork of ztqsteve/Tap-News
A real-time news scraping and recommendation system
Language: Jupyter Notebook - Size: 35.9 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
hthoai/sentiment-analysis
Sentiment analysis of TheGioiDiDong's product reviews - a project of Advanced Data Mining course at FIT-HCMUS.
Language: Jupyter Notebook - Size: 14.3 MB - Last synced: over 1 year ago - Pushed: about 4 years ago - Stars: 2 - Forks: 0
undertheseanlp/lang_detect
Vietnamese Language Detection
Language: Python - Size: 742 KB - Last synced: over 1 year ago - Pushed: over 5 years ago - Stars: 2 - Forks: 0
dthung1602/DacosaTextGenerator
A text generator using GPT2, trained on Dacosa corpus
Language: Python - Size: 291 MB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
hiepnguyen034/Neural-machine-translator
a Vie-Eng NMT system using seq2seq model with attention
Language: Python - Size: 39.5 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
undertheseanlp/sent_tokenize
Vietnamese Sentence Boundary Detection
Language: Python - Size: 1.62 MB - Last synced: over 1 year ago - Pushed: over 5 years ago - Stars: 5 - Forks: 5