Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: bengali-text-normalization

csebuetnlp/normalizer

This Python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batch filtering, and new datasets for Bengali-English machine translation". It is intended for normalizing and cleaning Bengali and English text.

Language: Python - Size: 15.6 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 28 - Forks: 5
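
To make "normalizing / cleaning" concrete, here is a minimal sketch of what such a step can look like. It uses only Python's standard library and does not call the csebuetnlp/normalizer API; the function name `basic_normalize` is hypothetical, and the choice of NFKC plus whitespace collapsing is an assumption for illustration, not the paper's actual pipeline.

```python
import re
import unicodedata


def basic_normalize(text: str) -> str:
    """Illustrative cleanup only; NOT the csebuetnlp/normalizer implementation."""
    # Assumption: NFKC is used here to fold compatibility characters
    # (ligatures, no-break spaces) and to give code points one consistent form.
    text = unicodedata.normalize("NFKC", text)
    # Collapse any run of whitespace into a single ASCII space.
    text = re.sub(r"\s+", " ", text)
    return text.strip()


if __name__ == "__main__":
    # A ligature, repeated spaces, and a no-break space in one English sample.
    print(basic_normalize("\ufb01ne   text\u00a0here "))
    # A Bengali letter that Unicode normalization rewrites as base letter + nukta.
    print([hex(ord(ch)) for ch in basic_normalize("\u09df")])
```

Normalizing both sides of a Bengali-English corpus this way means that visually identical strings compare equal, which matters for deduplication and vocabulary building. A real pipeline would likely also handle punctuation, URLs, and emoji, which this sketch leaves out.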