GitHub / microsoft / CodeMixed-Text-Generator
This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/microsoft%2FCodeMixed-Text-Generator
Stars: 54
Forks: 12
Open issues: 7
License: mit
Language: Jupyter Notebook
Size: 3.79 MB
Dependencies parsed at: Pending
Created at: about 4 years ago
Updated at: about 1 month ago
Pushed at: 10 months ago
Last synced at: about 14 hours ago
Topics: code-mixing, code-switching, data-generation, language-modeling, linguistics, natural-language-processing, python3, synthetic-data-generation