LLM-based Code-Switched Text Generation for Grammatical Error Correction. Authors: Tom Potter, Zheng Yuan. Conference: EMNLP. Abstract: With the rise of globalisation, code-switching ...
Keywords: GitHub, misspellings, atomic edits, language modeling. The lack of large-scale datasets has been a major hindrance to the development of NLP tasks such as spelling correction and grammatical error correction (GEC). As a complementary new resource for these tasks, we present the GitHub Typo Corpus, a large...
evaluation. We also summarize the approaches investigated by the participants of this task. Such approaches demonstrate the state of the art in Grammatical Error Correction for Mandarin Chinese. The data set and evaluation tool used by this task are available at
GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors. The lack of large-scale datasets has been a major hindrance to the development of NLP tasks such as spelling correction and grammatical error correction (G... M. Hagiwara, M. Mita - International Conference ...
Improving Sequence Tagging approach for Grammatical Error Correction task [paper] [code]; LM-Critic: Language Models for Unsupervised Grammatical Error Correction [paper] [code]. Citation: If you find this work useful for your research, please cite our papers: ...
The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC Chinese grammatical error correction corpus and STG model - xlxwalex/FCGEC
(EACL 2017): JFLEG: A Fluency Corpus and Benchmark for Grammatical Error Correction. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics. Valencia, Spain. April 03-07, 2017. Michael Heilman, Aoife Cahill, Nitin Madnani, Melissa Lopez, ...
Source code for paper: Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data - kanyun-inc/fairseq-gec
The Turkish version of ERRANT, an automatic evaluation toolkit for grammatical error correction tasks. - harunuz/erranttr