An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: text-processing

Sumit-807/newsnow

NewsNow offers a clean and elegant interface for reading real-time trending news. 🌐 Dive into the latest updates and enjoy seamless access with GitHub OAuth integration! 🐙

Language: TypeScript - Size: 4.55 MB - Last synced at: about 20 hours ago - Pushed at: about 21 hours ago - Stars: 0 - Forks: 0

VitinDM/data-science-snippets

🧰 Essential EDA and Data Cleaning Helpers for Any DataFrame. This collection of functions is designed to accelerate exploratory data analysis (EDA), quickly surface data quality issues, and offer high-level insights into the structure and content of your dataset.

Language: Python - Size: 30.3 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

Moez-lab/parallel-keyword-scanner

High-performance keyword scanner for text and PDF files with multiprocessing and a modern React UI.

Language: TypeScript - Size: 80.1 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

rlayers/pawpaw

Text Processing & Segmentation Framework

Language: Python - Size: 2.52 MB - Last synced at: about 10 hours ago - Pushed at: 3 months ago - Stars: 23 - Forks: 4

KaizoKonpaku/Hush

AI-Powered Screenshot, Audio Transcription, and Text Processing for macOS, Hidden from Screen Sharing, Packed with Features, and Just 2MB

Language: Swift - Size: 12 MB - Last synced at: 1 day ago - Pushed at: 6 days ago - Stars: 24 - Forks: 5

linuxscout/pyarabic

pyarabic

Language: Python - Size: 1.23 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 459 - Forks: 88

Taha5125/DocxWriter-JSON

DocxWriter is a Python library for generating professional Word documents from JSON. Automate reports, add tables, lists, images, and apply custom styles — all from clean, structured data.

Language: Python - Size: 23.4 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

alexandersisco/kubun

Python-style slicing for paths and delimiter-separated strings, from your terminal.

Language: Go - Size: 29.3 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

BurntSushi/aho-corasick

A fast implementation of Aho-Corasick in Rust.

Language: Rust - Size: 4.71 MB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 1,116 - Forks: 102

arverma/HindiXlit Fork of AI4Bharat/IndicXlit

Transliteration models for Roman to Devanagari script

Language: Python - Size: 45.8 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

twardoch/wiktra2 Fork of kbatsuren/wiktra

Wiktra: transliteration tool using Wiktionary transliteration modules. Version 2 (fork)

Language: Lua - Size: 1.29 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 4 - Forks: 0

Mukeshthenraj/date-extraction-project

Extract and normalize dates from unstructured medical notes using Python and regular expressions.

Language: Python - Size: 40 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

loderunner/typelit

A type-safe string templating library for TypeScript

Language: TypeScript - Size: 381 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 1

digineo/texd

texd wraps TeX in a web API

Language: Go - Size: 818 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 11 - Forks: 1

rhiosutoyo/Teaching-Deep-Learning-and-Its-Applications

This course introduces the building blocks of deep learning and provides an overview of various deep learning architectures. It also demonstrates how to solve real-world problems using a practical approach.

Language: Jupyter Notebook - Size: 31.8 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

dewanakl/aman

🤬 Simple profanity filter using regex. Check, censor, and remove offensive words, including look-alike character patterns.

Language: PHP - Size: 85 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2 - Forks: 2

victoria217-bottino/google-news-scraper

📰 Google News Scraper: a Python tool to fetch, decode, and process Google News articles by keyword and time range. Extract clean article text, decode URLs, and perform NLP effortlessly. Perfect for news aggregation, analysis, or building bots. Includes progress tracking with `tqdm` and customizable features for advanced use cases. 🚀

Size: 1000 Bytes - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 1

12345far/metrics-calculation-precision-recall

Laboratory 7 - Information Retrieval

Size: 1.95 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

shama-llama/pdf-epub-converter

PDF to EPUB conversion using ML for layout detection

Language: Python - Size: 140 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

wenet-e2e/WeTextProcessing

Text Normalization & Inverse Text Normalization

Language: Python - Size: 957 KB - Last synced at: 4 days ago - Pushed at: 8 months ago - Stars: 602 - Forks: 83

weiwei/silabacion

Convert Spanish words into syllables

Language: TypeScript - Size: 1.62 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 8 - Forks: 0

ChenghaoMou/text-dedup

All-in-one text de-duplication

Language: Python - Size: 5.77 MB - Last synced at: 2 days ago - Pushed at: about 1 month ago - Stars: 690 - Forks: 74

pymupdf/PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Language: Python - Size: 331 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 7,444 - Forks: 621

yuvrajpandiya/Piero-EnDe-Coder

A powerful encryption and decryption tool that combines the Vigenère cipher, XOR encryption, and Base64 encoding to secure messages. This tool allows users to encode and decode messages using a secret key, ensuring an extra layer of security.

Size: 1000 Bytes - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

Goldziher/html-to-markdown

HTML to markdown converter

Language: Python - Size: 319 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 50 - Forks: 6

Puchaczov/Musoq

SQL Syntax without any database

Language: C# - Size: 15.7 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 482 - Forks: 21

rhaberkorn/sciteco

Advanced TECO dialect and interactive screen editor based on Scintilla

Language: C - Size: 3.48 MB - Last synced at: about 21 hours ago - Pushed at: about 22 hours ago - Stars: 51 - Forks: 6

YULINHEEE/NLP-text-preprocessing-and-classification

Starter code to solve real-world text data problems related to job advertisements. Includes: Word2Vec, phrase embeddings, Text Classification with Logistic Regression, simple text preprocessing, pre-trained embeddings and more.

Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

shohanur-shoron/bangla_normalizer

A Python library designed to convert various written forms of Bengali text elements (like numbers, dates, times, currency, percentages, distances, etc.) into their corresponding spoken word representations.

Language: Python - Size: 96.7 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

bithead21/parcel

Parser for C++ programs! Parcel is a simple language for parsing text and retrieving any data.

Language: C++ - Size: 1.2 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2 - Forks: 0

guillaumeast/mentorai

Turn any YouTube channel into a full Custom GPT (avatar, settings, transcripts)

Language: Shell - Size: 61.5 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

sdleffler/qp-trie-rs

An idiomatic and fast QP-trie implementation in pure Rust.

Language: Rust - Size: 80.1 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 101 - Forks: 25

mary-lev/mary-lev.github.io

Just another blog

Language: HTML - Size: 19.9 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

google/diff-match-patch 📦

Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.

Language: Python - Size: 659 KB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 7,794 - Forks: 1,145

sstadick/hck

A sharp cut(1) clone.

Language: Rust - Size: 494 KB - Last synced at: 3 days ago - Pushed at: 18 days ago - Stars: 714 - Forks: 18

phil65/docler

Abstractions & Tools for OCR / document processing

Language: Python - Size: 2.28 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2 - Forks: 0

ProfRandom/Excel-Lambda-Suite

Reusable Excel LAMBDA function library for modeling, simulation, statistics, and advanced spreadsheet design.

Size: 1.96 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

Lord-Memester/tagger-txt-to-XMP

A Python script to convert the .txt files generated by an automatic tagger plugin for Automatic1111's Stable Diffusion Web UI into XMP sidecar files interpretable by Immich.

Language: Python - Size: 105 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

DougLau/booky

A tool to analyze English text

Language: Rust - Size: 456 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

AlanSteinbarth/Audio2Tekst

Professional audio-to-text converter using OpenAI Whisper. Supports batch processing and export to multiple formats (TXT, DOCX, PDF). GUI with drag-and-drop, progress tracking, and transcription quality settings. Ideal for journalists, students, and content creators.

Language: Python - Size: 3.47 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

pyparsing/pyparsing

Python library for creating PEG parsers

Language: Python - Size: 7.58 MB - Last synced at: 3 days ago - Pushed at: 20 days ago - Stars: 2,348 - Forks: 291

DineshDhamodharan24/Data_Science_Final_Project

Customer Insights & Recommendation System: Harnessing Decision Tree, Logistic Regression, and Random Forest models for behavior analysis. Utilizing EasyOCR and Python Imaging Library for image information extraction. Employing NLTK for sentiment analysis on textual data

Language: Jupyter Notebook - Size: 21.1 MB - Last synced at: 3 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

open-korean-text/open-korean-text

Open Korean Text Processor - An Open-source Korean Text Processor

Language: Scala - Size: 32.7 MB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 632 - Forks: 98

fullscreen-triangle/kwasa-kwasa

Semantic computing framework with meta-cognitive orchestration and biomimetic principles

Language: Rust - Size: 9.57 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 2 - Forks: 0

samwega/obsidian-wordsmith Fork of chrisgrieser/obsidian-proofreader

AI-powered context-aware writing assistant for Obsidian. Instantly improve, translate, or generate new text with context-aware AI inline suggestions, custom prompts, and granular review. Supports ALL remote and local models. Enjoy a seamless, keyboard-first workflow for editing, refining, and creative writing—all within your notes.

Language: TypeScript - Size: 1.09 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0

MonikaBarget/atr-historical-research

Automated Text Recognition in Historical Research

Language: Jupyter Notebook - Size: 2.92 MB - Last synced at: 7 days ago - Pushed at: 21 days ago - Stars: 5 - Forks: 14

maqeel019/ATS

A powerful Python-based ATS that parses and ranks PDF resumes on recruiter-defined filters like skills, education, and experience. Handles scanned and complex resumes with detailed scoring and Excel output.

Language: Python - Size: 1.88 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

ZeroX-DG/vi-rs

Vietnamese Input Method library

Language: Rust - Size: 322 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 150 - Forks: 16

Automattic/go-search-replace

🚀 Search & replace URLs in WordPress SQL files.

Language: Go - Size: 104 KB - Last synced at: 6 days ago - Pushed at: 9 days ago - Stars: 97 - Forks: 19

Romelium/mpatch

A fuzzy patch tool in Rust for applying AI-generated diffs from markdown, ignoring line numbers.

Language: Rust - Size: 0 Bytes - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

vmenger/deduce

Deduce: de-identification method for Dutch medical text

Language: Python - Size: 7.25 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 57 - Forks: 24

teenu/gpu-text-search

Ultra-high-performance GPU-accelerated text search using Metal compute shaders

Language: Swift - Size: 61.5 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1 - Forks: 0

homeofhx/Text-Purifier

Simple Mac application that filters out specific characters in given text using regular expressions (regex)

Language: Swift - Size: 1.14 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

ga1az/pathdigest

A command-line tool written in Go that analyzes Git repositories, local directories, or individual files and generates a structured, LLM-friendly text digest of their content.

Language: Go - Size: 32.2 KB - Last synced at: about 14 hours ago - Pushed at: 18 days ago - Stars: 5 - Forks: 1

PyThaiNLP/pythainlp

Thai natural language processing in Python

Language: Python - Size: 65.6 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1,040 - Forks: 280

omicsNLP/Auto-CORPus

Auto-CORPus pipeline developed by a University of Nottingham and Imperial College London collaboration to standardize text and table data extracted from full text publications. See Open Access publication at: https://doi.org/10.3389/fdgth.2022.788124.

Language: HTML - Size: 57.2 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 21 - Forks: 8

himkt/konoha

🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.

Language: Python - Size: 1.35 MB - Last synced at: 11 days ago - Pushed at: about 2 months ago - Stars: 251 - Forks: 28

roshan-research/hazm

Persian NLP Toolkit

Language: Python - Size: 25.5 MB - Last synced at: 6 days ago - Pushed at: 12 months ago - Stars: 1,287 - Forks: 190

june1963/Alfred-GitHub-Models

An Alfred workflow for AI text processing with GitHub Models

Language: Shell - Size: 419 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 1 - Forks: 0

hyung-hwan/hawk

An AWK interpreter

Language: C - Size: 4.21 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 7 - Forks: 1

whitfin/bytelines

Read input lines as byte slices for high efficiency

Language: Rust - Size: 39.1 KB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 66 - Forks: 9

thombashi/humanreadable

humanreadable is a Python library to convert human-readable values to other units.

Language: Python - Size: 137 KB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 18 - Forks: 1

Thihasoehlaing/spelling-correction-system

A smart NLP-based spelling correction system for English language with a PySide6 GUI. Detects and highlights non-word and real-word errors using minimum edit distance, bigram/trigram models, and POS tagging.

Language: Python - Size: 144 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

bocaletto-luca/TextEditorQt

This program is a simple text editor with an intuitive user interface, created using the PyQt5 framework for developing desktop applications in Python. The text editor provides many basic features expected from an editor, along with advanced functionalities such as text formatting.

Language: Python - Size: 32.2 KB - Last synced at: 10 days ago - Pushed at: 21 days ago - Stars: 5 - Forks: 1

btwitskaif69/Pro-Text-Editor

The Pro Text Editor project is a web application built using React, offering features for text manipulation, including text-to-speech functionality.

Language: JavaScript - Size: 198 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

ds-modules/CUNEIF-102A

UC Berkeley CUNEIF 102A (Sumerian Text Analysis) Fall 2017

Language: Jupyter Notebook - Size: 40.8 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 6 - Forks: 0

cobanov/shakespeare-dataset

Complete works, plays, sonnets, and poems of Shakespeare

Size: 2.36 MB - Last synced at: 9 days ago - Pushed at: almost 3 years ago - Stars: 12 - Forks: 3

daac-tools/daachorse

🐎 A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure in Rust.

Language: Rust - Size: 3.71 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 218 - Forks: 15

dhopp1/nlp_pipeline

Collection of NLP tools for processing and analyzing text data.

Language: Python - Size: 148 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 2

MIT-LCP/bloatectomy

A Python package for removing duplicate text in clinical notes or other documents

Language: TeX - Size: 7.48 MB - Last synced at: 3 days ago - Pushed at: almost 5 years ago - Stars: 37 - Forks: 9

krzyzanowskim/CoreTextSwift

CoreText Swift bindings

Language: Swift - Size: 27.3 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 165 - Forks: 8

RAGnTeX/RAGnTeX

Creates LaTeX presentations on a given topic based on the provided documents

Language: Python - Size: 17.5 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 1 - Forks: 0

george-gca/ai_papers_cleaner

Extract text from paper PDFs and abstracts, and remove uninformative words.

Language: Python - Size: 390 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 5 - Forks: 0

fossology/atarashi

Atarashi scans for license statements in open source software, focusing on text statistics. Designed to work stand-alone and with FOSSology.

Language: Python - Size: 50.3 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 29 - Forks: 29

justinbt1/Akin

Python library for detecting near duplicate texts in a corpus at scale.

Language: Python - Size: 2.77 MB - Last synced at: about 17 hours ago - Pushed at: 19 days ago - Stars: 8 - Forks: 0

helix-editor/nucleo

A fast and convenient fuzzy matcher library for Rust

Language: Rust - Size: 232 KB - Last synced at: 19 days ago - Pushed at: about 1 month ago - Stars: 1,109 - Forks: 39

chmln/sd

Intuitive find & replace CLI (sed alternative)

Language: Rust - Size: 414 KB - Last synced at: 19 days ago - Pushed at: 2 months ago - Stars: 6,323 - Forks: 144

omerblau/language-flipper

Instantly fix text typed in the wrong keyboard layout with one hot-key (Win).

Language: C++ - Size: 1.52 MB - Last synced at: 13 days ago - Pushed at: 20 days ago - Stars: 3 - Forks: 0

SAGE-Rebirth/gemini-chatbot-mongodb

This project is a FastAPI and React-based chatbot system for querying PDF content using Google Gemini 2.0 Flash embeddings and MongoDB vector search. It features PDF upload, semantic search, a chat interface, and an admin panel for document management with Netligent branding. The system is production-ready, with robust error handling.

Language: TypeScript - Size: 317 KB - Last synced at: 19 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 1

mblucasm/lcmp

List Comparison - A fast and lightweight tool for comparing two lists, finding common elements, and identifying differences. Supports raw text files, Instagram data, and extracted HTML <div> elements.

Language: C++ - Size: 52.7 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

whitfin/s3-utils

Utilities and tools based around Amazon S3 to provide convenience APIs in a CLI

Language: Rust - Size: 43.9 KB - Last synced at: 14 days ago - Pushed at: over 4 years ago - Stars: 55 - Forks: 10

cloudflare/wildcard

Wildcard matching

Language: Rust - Size: 45.7 MB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 197 - Forks: 4

andyi95/reading-vue

Set of tools for text processing

Language: Vue - Size: 2.36 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

knaw-huc/textsurf

Webservice for efficiently serving multiple plain text documents or excerpts thereof (by unicode character offset), without loading everything into memory.

Language: Rust - Size: 78.1 KB - Last synced at: 4 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

TheophilusE/pystringbuilder

A lightweight and efficient Python string builder class for dynamic text construction, minimizing unnecessary string concatenations for better performance.

Language: Python - Size: 7.81 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

tsmdt/dygest

CLI tool to extract content insights from raw txt using LLMs and NER

Language: Python - Size: 7.96 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 3 - Forks: 0

IG-onGit/TexeT

TexeT is the tool you need to take your interaction and content control to the next level.

Language: Python - Size: 117 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

Lips7/Matcher

A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust.

Language: Rust - Size: 36.9 MB - Last synced at: 4 days ago - Pushed at: 22 days ago - Stars: 17 - Forks: 1

cbaziotis/ekphrasis

Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (English Wikipedia and Twitter, 330 million English tweets).

Language: Python - Size: 778 KB - Last synced at: 12 days ago - Pushed at: 25 days ago - Stars: 670 - Forks: 93

assafmo/xioc

Extract indicators of compromise from text, including "escaped" ones.

Language: Go - Size: 64.5 KB - Last synced at: 6 days ago - Pushed at: about 5 years ago - Stars: 160 - Forks: 11

ovuiproduction/Research-Assistant

AI-Powered Research Assistant – A smart tool that helps researchers find relevant papers, recommend journals, ask questions about content, humanize AI text, and detect AI-generated writing. Powered by Large Language Models for enhanced research productivity.

Language: JavaScript - Size: 1.15 MB - Last synced at: 24 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

desholmes/text-quest

Text Quest is a game engine for running text-based adventure games, using a low/no code approach to game design.

Language: JavaScript - Size: 414 KB - Last synced at: 24 days ago - Pushed at: 25 days ago - Stars: 14 - Forks: 3

hscspring/pnlp

NLP pre-/post-processing tools.

Language: Python - Size: 106 KB - Last synced at: 17 days ago - Pushed at: 3 months ago - Stars: 30 - Forks: 6

dnoice/textMan

Your ultimate text manipulation tool

Language: JavaScript - Size: 4.16 MB - Last synced at: 24 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

Lucifer88484/Natural-Language-Processing-API

Natural-Language-Processing-API is a RESTful API built with FastAPI that offers core NLP tasks like sentiment analysis, entity recognition, summarization, and language detection. It uses Hugging Face and spaCy models, supports Docker, and provides easy integration for NLP features.

Language: Python - Size: 9.77 KB - Last synced at: 24 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

benletchford/buup.io

A versatile text transformation toolkit in pure Rust with a dependency-free core. Encoding, decoding, formatting, cryptography, (de)compression, and more through CLI, web UI, or as a library.

Language: Rust - Size: 15 MB - Last synced at: 25 days ago - Pushed at: 26 days ago - Stars: 6 - Forks: 0

AmirAli104/Text2Excel

A GUI desktop application that can extract data from a text file and put them in an Excel or CSV file using regular expression (regex) patterns

Language: Python - Size: 208 KB - Last synced at: 10 days ago - Pushed at: 26 days ago - Stars: 4 - Forks: 0

milliorn/cli-password-generators

Simple command-line applications for generating passwords

Language: Go - Size: 6.85 MB - Last synced at: 26 days ago - Pushed at: 27 days ago - Stars: 2 - Forks: 0

elektito/finglish

A Finglish to Persian converter.

Language: Python - Size: 2.28 MB - Last synced at: 9 days ago - Pushed at: over 3 years ago - Stars: 84 - Forks: 21

ty70/news-summary-translator

Automatically fetches, translates, and summarizes world news from Yahoo! Japan using Google Cloud APIs. Output in JSON or terminal. 🇯🇵→🇺🇸

Language: Python - Size: 21.5 KB - Last synced at: 26 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0