An open API service providing repository metadata for many open source software ecosystems.

Topic: "text-cleaner"

VoxelCubes/PanelCleaner

An AI-powered tool to clean manga panels.

Language: Python - Size: 43.5 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 316 - Forks: 26

shivam5992/dupandas

:bar_chart: python package for performing deduplication using flexible text matching and cleaning in pandas dataframe

Language: Python - Size: 214 KB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 25 - Forks: 4

amansrivastava17/text-preprocess-python

Text preprocessing tools in python.

Language: Python - Size: 39.1 KB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 24 - Forks: 7

34j/mecab-text-cleaner

Simple Python package (CLI/Python API) for getting japanese readings (yomigana) and accents using MeCab.

Language: Python - Size: 167 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 7 - Forks: 0

Khodnevis-Research-Lab/khoshnevis

Khodnevis Normalizer: A Python library for Persian text preprocessing.

Language: Python - Size: 21.5 KB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 0

sagepublishing/text_cleaning

Corpora and scripts for cleaning political science texts. Scripts are translated into transformations that support SAGE Texti.

Language: Python - Size: 30.4 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 1

HamidRezaAttar/Per-Normalizer

Persian text cleaner with additional features on Parsivar package.

Language: Python - Size: 143 KB - Last synced at: 24 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

riteshkarmakar/auto-clipboard-cleaner

Automatically clean, format, and enhance your clipboard content. It simplifies text management by applying a variety of cleaning and formatting rules to your copied content, ensuring your clipboard stays organized and clutter-free.

Language: Python - Size: 5.09 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

r7-labs/R7.Webmaster 📦

Webmaster's desktop productivity tools

Language: C# - Size: 404 KB - Last synced at: 19 days ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

faisaltareque/BanglaLanguageToolkit

Bangla Language Processing Toolkit

Language: Python - Size: 26.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

showmik/TidyText

🖹 Offline Text Cleaner and Formatter

Language: C# - Size: 293 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

tcramm0nd/nlppre

Preprocessing for NLP applicaitons

Language: Python - Size: 32.2 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

PaleAlex/wordpy

Module and Class to extract from a text.txt file the "n" most common words

Language: Python - Size: 385 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0