An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: grammatical-error-correction

nusnlp/moece

The official code of the "Efficient and Interpretable Grammatical Error Correction with Mixture of Experts" paper

Language: Python - Size: 4.83 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 5 - Forks: 0

spraakbanken/multigec-2025

Public repository for the MultiGEC dataset.

Language: Python - Size: 1.26 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 5 - Forks: 1

nusnlp/ALLECS

The official code of ALLECS: A Lightweight Language Error Correction System

Language: Python - Size: 3.4 MB - Last synced at: 4 days ago - Pushed at: about 2 years ago - Stars: 12 - Forks: 1

grammarly/ua-gec

UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language

Language: Macaulay2 - Size: 18 MB - Last synced at: 20 days ago - Pushed at: about 1 year ago - Stars: 259 - Forks: 22

bminixhofer/nlprule

A fast, low-resource Natural Language Processing and Text Correction library written in Rust.

Language: Rust - Size: 898 KB - Last synced at: 19 days ago - Pushed at: almost 2 years ago - Stars: 620 - Forks: 39

Hyprnx/Project-Athena

Source code and data for Project Athena

Size: 16.6 KB - Last synced at: 2 days ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

kakaobrain/helo-word

Team Kakao&Brain's Grammatical Error Correction System for the ACL 2019 BEA Shared Task

Language: Python - Size: 4.1 MB - Last synced at: 3 days ago - Pushed at: over 5 years ago - Stars: 92 - Forks: 22

nusnlp/greco

The official code for the "System Combination via Quality Estimation for Grammatical Error Correction" paper, published in EMNLP 2023.

Language: Macaulay2 - Size: 7.86 MB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 14 - Forks: 0

HillZhang1999/NaSGEC

Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)

Language: Python - Size: 497 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 83 - Forks: 8

michiyasunaga/LM-Critic

[EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction

Language: Python - Size: 3.35 MB - Last synced at: 13 days ago - Pushed at: over 3 years ago - Stars: 119 - Forks: 11

nusnlp/esc

The official code of the "Frustratingly Easy System Combination for Grammatical Error Correction" paper

Language: Macaulay2 - Size: 2.04 MB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 56 - Forks: 12

zhpmatrix/cged_tf

论文实现:《Chinese Grammatical Error Diagnosis with Long Short-Term Memory Networks》

Language: Python - Size: 82 KB - Last synced at: 14 days ago - Pushed at: about 6 years ago - Stars: 49 - Forks: 15

sagorbrur/shuddhi-app

Bangla text corrector app.

Size: 161 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 31 - Forks: 0

xlxwalex/FCGEC

The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型

Language: Python - Size: 12.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 108 - Forks: 12

grammarly/gector

Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)

Language: Python - Size: 669 KB - Last synced at: 5 months ago - Pushed at: 11 months ago - Stars: 907 - Forks: 216

HillZhang1999/MuCGEC

MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction"

Language: Python - Size: 5.08 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 510 - Forks: 64

tlu-dt-nlp/EstGEC-L2-Corpus

Estonian Grammatical Error Correction (GEC) test and development corpus that contains L2 learner texts error-annotated in the M2 format.

Language: Python - Size: 729 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

TedYeh/Chinese_spelling_Correction

Chinese Grammar Error and Spelling Error Correction System - 中文文法錯誤及錯別字校正系統

Language: Jupyter Notebook - Size: 99.9 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 5 - Forks: 1

CAMeL-Lab/arabic-gec

Code, models, and data for "Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation". EMNLP 2023.

Language: Python - Size: 252 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 13 - Forks: 2

shotakoyama/green

GREEN: n-gram F-score for Grammatical Error Correction

Language: Python - Size: 51.8 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

shotakoyama/gleu

Re-implementation of GLEU, evaluation metric of grammatical error correction

Language: Python - Size: 33.2 KB - Last synced at: 11 days ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

HillZhang1999/RobustGEC

Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023)

Language: Python - Size: 1.69 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 0

HillZhang1999/SynGEC

Code & data for our EMNLP2022 paper "SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser"

Language: Python - Size: 4.42 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 74 - Forks: 15

richard-peng-xia/KD-CGEC

Code for Chinese grammatical error correction based on knowledge distillation

Language: Python - Size: 29 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 1

GeorgeVern/lmcor

Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"

Language: Python - Size: 401 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

richard-peng-xia/Chinese-Noisy-Text

This repository stores the code of the data augmentation method from Chinese word and character levels, which adds noise to words and characters in redundant, missing, selection and ordering respectively.

Language: Python - Size: 68.4 KB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 17 - Forks: 3

team-langbot/conversationally

AI based conversational language tutor

Size: 109 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

tm4roon/pytorch-translm

An implementation of transformer-based language model for sentence rewriting tasks such as summarization, simplification, and grammatical error correction.

Language: Python - Size: 1.09 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 28 - Forks: 7

tlu-dt-nlp/M2-preprocessing Fork of kaisall/m2-preprocessing

Scripts used for the preprocessing of the EstGEC-L2 corpus that contains Estonian L2 learner texts error-annotated in the M2 format.

Language: Python - Size: 26.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

shotakoyama/arteraro

artificial error generation using various rules for neural grammatical error correction

Language: Python - Size: 322 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 0

swjtu-gec/zlyang-master-dissertation-code

Code of zlyang's master dissertation for Chinese grammatical error correction.

Language: Python - Size: 2.82 MB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 34 - Forks: 6

ZetangForward/CSA-GEC

This is the official code for ``Beyond Hard Samples: Robust and Effective Grammatical Error Correction with Simple Cycle Self-Augmenting``

Language: Python - Size: 8.78 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

nusnlp/geccl

Grammatical Error Correction with Contrastive Learning in Low Error Density Domains

Language: Python - Size: 3.75 MB - Last synced at: 4 days ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 1

osyvokon/unlp-2023-shared-task

UNLP 2023 Shared Task on Grammatical Error Correction for Ukrainian

Language: Macaulay2 - Size: 4.66 MB - Last synced at: 12 months ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 1

awasthiabhijeet/PIE

Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models for Local Sequence Transduction": www.aclweb.org/anthology/D19-1435.pdf (EMNLP-IJCNLP 2019)

Language: Macaulay2 - Size: 2.38 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 221 - Forks: 40

manzurola/errant4j

An unofficial Java port of ERRANT, the parallel text grammatical error annotator

Language: Java - Size: 699 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

lorafei/Explainable_GEC

The official code of the 2023 ACL paper "Enhancing Grammatical Error Correction Systems with Explanations"

Language: Python - Size: 5.31 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 21 - Forks: 1

butsugiri/gec-pseudodata

Repository of "An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction" (EMNLP-IJCNLP 2019)

Language: Python - Size: 707 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 67 - Forks: 8

aseifert/textshine

Textshine is a seq2seq model for grammatical error correction.

Language: Python - Size: 83 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 2

lilt/tec

Evaluation code and data for "Automatic Correction of Human Translations" [NAACL 2022].

Language: Perl - Size: 2.86 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 17 - Forks: 0

yuanxun-yx/eracond

The first high-quality, fine-grained error-correction conversation dataset between English second language learner and an educational chatbot.

Size: 228 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 1

simonepri/text2error

〰 Introduce errors in error free text

Language: Python - Size: 73.2 KB - Last synced at: 29 days ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 1

imdreamer2018/Grammatical-Error-Correction

Grammatical-Error-Correction is an NLP-based spelling and grammar correction tool that accepts articles as well as raw text and returns a corrected sentence. Grammatical-Error-Correction is built using Python, powered by data and makes use of core NLP techniques. It is mainly based on AllenNLP and transformers.

Language: Python - Size: 676 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

ricardojosehlima/VGSO

Verificador Gramatical Sociolinguisticamente Orientado - um corretor gramatical amigável, como foco no usuário

Language: Python - Size: 903 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 0

Mindful/m2data

A Python package for working with GEC data in .m2 files

Language: Python - Size: 27.3 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

shubhaguha/MSc

M.Sc. thesis project at University of Edinburgh, 2017.

Language: Jupyter Notebook - Size: 347 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Related Keywords
grammatical-error-correction 46 natural-language-processing 14 nlp 12 gec 11 deep-learning 8 dataset 7 pytorch 5 annotation 3 machine-translation 3 corpus 3 grammatical-error-detection 3 generation 2 nlp-machine-learning 2 large-language-models 2 estonian-language 2 text-simplification 2 sequence-labeling 2 bert 2 grammar 2 ukrainian-language 2 seq2seq 2 chinese-nlp 2 shared-task 2 nmt 2 evaluation-metrics 2 corpus-processing 1 ensemble-decoding 1 conll-u 1 convs2s 1 annotation-processing 1 text-summarization 1 language-modeling 1 retrieval-augmented-generation 1 rag 1 llms 1 fine-tuning 1 noise-generator 1 summarization 1 parameter-efficient-fine-tuning 1 data-to-text-generation 1 knowledge-distillation 1 chinese-grammar-error-diagnosis 1 sequence-to-sequence 1 neural-machine-translation 1 mt 1 data-loading 1 python 1 portugues 1 text 1 error 1 eracond 1 subwords 1 encoder-decoder 1 interpretability 1 language-learning 1 java 1 grammar-checker 1 error-annotator 1 errant 1 sequence-transduction 1 sequence-editing 1 post-editing 1 bert-models 1 bert-model 1 bert-embeddings 1 ukrainian-nlp 1 chatgpt 1 adversarial-attacks 1 reranking-mechanism 1 multi-channel-fusion 1 language-model 1 cross-domain 1 re-ranking 1 quality-estimation 1 ensemble-model 1 transformer 1 transfer-learning 1 pre-training 1 fairseq 1 tensorflow2 1 pytorch-implementation 1 jax 1 style-checker 1 spellcheck 1 rust 1 proofreading 1 machine-learning 1 nlp-datasets 1 corpus-tools 1 corpus-data 1 flask 1 bootstrap 1 sla 1 data 1 mixture-of-experts 1 emnlp-2022 1 robustness 1 ged 1 arabic-nlp 1 arabic 1