Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: bengali-text-normalization
csebuetnlp/normalizer
This python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batch filtering, and new datasets for Bengali-English machine translation". It is intended to be used for normalizing / cleaning Bengali and English text.
Language: Python - Size: 15.6 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 28 - Forks: 5