An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: text-processing

DougLau/booky

A tool to analyze English text

Language: Rust - Size: 456 KB - Last synced at: about 11 hours ago - Pushed at: about 12 hours ago - Stars: 1 - Forks: 0

AlanSteinbarth/Audio2Tekst

Profesjonalny konwerter audio na tekst wykorzystujący OpenAI Whisper. Wspiera batch processing, eksport do różnych formatów (TXT, DOCX, PDF). GUI z drag&drop, progress tracking i opcjami konfiguracji jakości transkrypcji. Idealny dla dziennikarzy, studentów i twórców treści.

Language: Python - Size: 3.47 MB - Last synced at: about 17 hours ago - Pushed at: about 18 hours ago - Stars: 0 - Forks: 0

DineshDhamodharan24/Data_Science_Final_Project

Customer Insights & Recommendation System: Harnessing Decision Tree, Logistic Regression, and Random Forest models for behavior analysis. Utilizing EasyOCR and Python Imaging Library for image information extraction. Employing NLTK for sentiment analysis on textual data

Language: Jupyter Notebook - Size: 21.1 MB - Last synced at: about 21 hours ago - Pushed at: about 22 hours ago - Stars: 0 - Forks: 0

fullscreen-triangle/kwasa-kwasa

Semantic computing framework with meta-cognitive orchestration and biomimetic principles

Language: Rust - Size: 9.57 MB - Last synced at: about 23 hours ago - Pushed at: 1 day ago - Stars: 2 - Forks: 0

Sumit-807/newsnow

NewsNow offers a clean and elegant interface for reading real-time trending news. 🌐 Dive into the latest updates and enjoy seamless access with GitHub OAuth integration! 🐙

Language: TypeScript - Size: 4.55 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

samwega/obsidian-wordsmith Fork of chrisgrieser/obsidian-proofreader

AI-powered context-aware writing assistant for Obsidian. Instantly improve, translate, or generate new text with context-aware AI inline suggestions, custom prompts, and granular review. Supports ALL remote and local models. Enjoy a seamless, keyboard-first workflow for editing, refining, and creative writing—all within your notes.

Language: TypeScript - Size: 1.09 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 0

BurntSushi/aho-corasick

A fast implementation of Aho-Corasick in Rust.

Language: Rust - Size: 4.71 MB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 1,114 - Forks: 101

VitinDM/data-science-snippets

🧰 Essential EDA and Data Cleaning Helpers for Any DataFrame This collection of functions is designed to accelerate exploratory data analysis (EDA), quickly surface data quality issues, and offer high-level insights into the structure and content of your dataset.

Language: Python - Size: 30.3 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

MonikaBarget/atr-historical-research

Automated Text Recognition in Historical Research

Language: Jupyter Notebook - Size: 2.92 MB - Last synced at: about 8 hours ago - Pushed at: 15 days ago - Stars: 5 - Forks: 14

maqeel019/ATS

A powerful Python-based ATS that parses and ranks PDF resumes on recruiter-defined filters like skills, education, and experience. Handles scanned and complex resumes with detailed scoring and Excel output.

Language: Python - Size: 1.88 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

ZeroX-DG/vi-rs

Vietnamese Input Method library

Language: Rust - Size: 322 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 150 - Forks: 16

Taha5125/DocxWriter-JSON

DocxWriter is a Python library for generating professional Word documents from JSON. Automate reports, add tables, lists, images, and apply custom styles — all from clean, structured data.

Language: Python - Size: 23.4 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

Romelium/mpatch

A fuzzy patch tool in Rust for applying AI-generated diffs from markdown, ignoring line numbers.

Language: Rust - Size: 0 Bytes - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

vmenger/deduce

Deduce: de-identification method for Dutch medical text

Language: Python - Size: 7.25 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 57 - Forks: 24

pyparsing/pyparsing

Python library for creating PEG parsers

Language: Python - Size: 7.58 MB - Last synced at: 3 days ago - Pushed at: 13 days ago - Stars: 2,344 - Forks: 291

teenu/gpu-text-search

Ultra-high-performance GPU-accelerated text search using Metal compute shaders

Language: Swift - Size: 61.5 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

homeofhx/Text-Purifier

Simple Mac application that filters out specific characters in given text using regular expression (Regex)

Language: Swift - Size: 1.14 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

victoria217-bottino/google-news-scraper

# 📰 Google News Scraper A Python tool to fetch, decode, and process Google News articles by keyword and time range. Extract clean article text, decode URLs, and perform NLP effortlessly. Perfect for news aggregation, analysis, or building bots. Includes progress tracking with `tqdm` and customizable features for advanced use cases. 🚀

Size: 1000 Bytes - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 1

12345far/metrics-calculation-precision-recall

Laboratory 7 - Retrieval Information

Size: 1.95 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

ga1az/pathdigest

A command-line tool written in Go that analyzes Git repositories, local directories, or individual files and generates a structured, LLM-friendly text digest of their content.

Language: Go - Size: 32.2 KB - Last synced at: 1 day ago - Pushed at: 11 days ago - Stars: 5 - Forks: 1

PyThaiNLP/pythainlp

Thai natural language processing in Python

Language: Python - Size: 65.6 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1,040 - Forks: 280

ChenghaoMou/text-dedup

All-in-one text de-duplication

Language: Python - Size: 5.77 MB - Last synced at: 3 days ago - Pushed at: 26 days ago - Stars: 688 - Forks: 74

omicsNLP/Auto-CORPus

Auto-CORPus pipeline developed by a University of Nottingham and Imperial College London collaboration to standardize text and table data extracted from full text publications. See Open Access publication at: https://doi.org/10.3389/fdgth.2022.788124.

Language: HTML - Size: 57.2 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 21 - Forks: 8

pymupdf/PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Language: Python - Size: 332 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 7,394 - Forks: 619

yuvrajpandiya/Piero-EnDe-Coder

A powerful encryption and decryption tool that combines the Vigenère cipher, XOR encryption, and Base64 encoding to secure messages. This tool allows users to encode and decode messages using a secret key, ensuring an extra layer of security.

Size: 1000 Bytes - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

himkt/konoha

🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.

Language: Python - Size: 1.35 MB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 251 - Forks: 28

rhaberkorn/sciteco

Advanced TECO dialect and interactive screen editor based on Scintilla

Language: C - Size: 3.48 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 53 - Forks: 6

rhiosutoyo/Teaching-Deep-Learning-and-Its-Applications

This course introduces the building blocks of deep learning and provides overview of various deep learning architectures. It also demonstrates how to solve real-world problems using a practical approach.

Language: Jupyter Notebook - Size: 31.8 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0

KaizoKonpaku/Hush

AI-Powered Screenshot, Audio Transcription, and Text Processing for macOS, Hidden from Screen Sharing, Packed with Features, and Just 2MB

Language: Swift - Size: 12 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 12 - Forks: 3

june1963/Alfred-GitHub-Models

An Alfred workflow for AI text processing with GitHub Models

Language: Shell - Size: 419 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

wenet-e2e/WeTextProcessing

Text Normalization & Inverse Text Normalization

Language: Python - Size: 957 KB - Last synced at: 5 days ago - Pushed at: 7 months ago - Stars: 596 - Forks: 83

hyung-hwan/hawk

An AWK interpreter

Language: C - Size: 4.21 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 7 - Forks: 1

whitfin/bytelines

Read input lines as byte slices for high efficiency

Language: Rust - Size: 39.1 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 66 - Forks: 9

thombashi/humanreadable

humanreadable is a Python library to convert human-readable values to other units.

Language: Python - Size: 137 KB - Last synced at: about 4 hours ago - Pushed at: about 2 months ago - Stars: 18 - Forks: 1

Thihasoehlaing/spelling-correction-system

A smart NLP-based spelling correction system for English language with a PySide6 GUI. Detects and highlights non-word and real-word errors using minimum edit distance, bigram/trigram models, and POS tagging.

Language: Python - Size: 144 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

bocaletto-luca/TextEditorQt

This program is a simple text editor with an intuitive user interface, created using the PyQt5 framework for developing desktop applications in Python. The text editor provides many basic features expected from an editor, along with advanced functionalities such as text formatting.

Language: Python - Size: 32.2 KB - Last synced at: 3 days ago - Pushed at: 14 days ago - Stars: 5 - Forks: 1

phil65/docler

Abstractions & Tools for OCR / document processing

Language: Python - Size: 2.26 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 2 - Forks: 0

btwitskaif69/Pro-Text-Editor

The Pro Text Editor project is a web application built using React, offering features for text manipulation, including text-to-speech functionality.

Language: JavaScript - Size: 198 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

ds-modules/CUNEIF-102A

UC Berkeley CUNEIF 102A (Sumerian Text Analysis) Fall 2017

Language: Jupyter Notebook - Size: 40.8 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 6 - Forks: 0

cobanov/shakespeare-dataset

complete works, plays, sonnets and poems of shakespeare

Size: 2.36 MB - Last synced at: 2 days ago - Pushed at: almost 3 years ago - Stars: 12 - Forks: 3

daac-tools/daachorse

🐎 A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure in Rust.

Language: Rust - Size: 3.71 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 218 - Forks: 15

dhopp1/nlp_pipeline

Collection of NLP tools for processing and analyzing text data.

Language: Python - Size: 148 MB - Last synced at: about 4 hours ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 2

krzyzanowskim/CoreTextSwift

CoreText Swift bindings

Language: Swift - Size: 27.3 KB - Last synced at: 5 days ago - Pushed at: 12 months ago - Stars: 165 - Forks: 8

RAGnTeX/RAGnTeX

Creates latex presentations on the given topic based on the provided documents

Language: Python - Size: 17.5 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

sstadick/hck

A sharp cut(1) clone.

Language: Rust - Size: 494 KB - Last synced at: 4 days ago - Pushed at: 11 days ago - Stars: 712 - Forks: 18

open-korean-text/open-korean-text

Open Korean Text Processor - An Open-source Korean Text Processor

Language: Scala - Size: 32.7 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 630 - Forks: 98

linuxscout/pyarabic

pyarabic

Language: Python - Size: 1.23 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 457 - Forks: 88

george-gca/ai_papers_cleaner

Extract text from papers PDFs and abstracts, and remove uninformative words.

Language: Python - Size: 390 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 5 - Forks: 0

fossology/atarashi

Atarashi scans for license statements in open source software, focusing on text statistics. Designed to work stand-alone and with FOSSology.

Language: Python - Size: 50.3 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 29 - Forks: 29

dewanakl/aman

🤬 Filter kata kotor sederhana dengan regex. Cek, sensor, dan hapus kata kasar dengan pola karakter mirip.

Language: PHP - Size: 85 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1 - Forks: 2

justinbt1/Akin

Python library for detecting near duplicate texts in a corpus at scale.

Language: Python - Size: 2.77 MB - Last synced at: 2 days ago - Pushed at: 12 days ago - Stars: 8 - Forks: 0

helix-editor/nucleo

A fast and convenient fuzzy matcher library for rust

Language: Rust - Size: 232 KB - Last synced at: 12 days ago - Pushed at: 29 days ago - Stars: 1,109 - Forks: 39

chmln/sd

Intuitive find & replace CLI (sed alternative)

Language: Rust - Size: 414 KB - Last synced at: 12 days ago - Pushed at: 2 months ago - Stars: 6,323 - Forks: 144

omerblau/language-flipper

Instantly fix text typed in the wrong keyboard layout with one hot-key (Win).

Language: C++ - Size: 1.52 MB - Last synced at: 6 days ago - Pushed at: 13 days ago - Stars: 3 - Forks: 0

SAGE-Rebirth/gemini-chatbot-mongodb

This project is a FastAPI and React-based chatbot system for querying PDF content using Google Gemini 2.0 Flash embeddings and MongoDB vector search. It features PDF upload, semantic search, chat interface, and an admin panel for document management with Netligent branding. The system is production grade ready with robust error handling.

Language: TypeScript - Size: 317 KB - Last synced at: 12 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 1

mblucasm/lcmp

List Comparison - A fast and lightweight tool for comparing two lists, finding common elements, and identifying differences. Supports raw text files, Instagram data, and extracted HTML <div> elements.

Language: C++ - Size: 52.7 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

whitfin/s3-utils

Utilities and tools based around Amazon S3 to provide convenience APIs in a CLI

Language: Rust - Size: 43.9 KB - Last synced at: 7 days ago - Pushed at: over 4 years ago - Stars: 55 - Forks: 10

andyi95/reading-vue

Set of tools for text processing

Language: Vue - Size: 2.36 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

knaw-huc/textsurf

Webservice for efficiently serving multiple plain text documents or excerpts thereof (by unicode character offset), without loading everything into memory.

Language: Rust - Size: 78.1 KB - Last synced at: 5 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

TheophilusE/pystringbuilder

A lightweight and efficient Python string builder class for dynamic text construction, minimizing unnecessary string concatenations for better performance.

Language: Python - Size: 7.81 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

tsmdt/dygest

CLI tool to extract content insights from raw txt using LLMs and NER

Language: Python - Size: 7.96 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 3 - Forks: 0

IG-onGit/TexeT

TexeT is the tool you need to take your interaction and content control to the next level.

Language: Python - Size: 117 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

Lips7/Matcher

A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust.

Language: Rust - Size: 36.9 MB - Last synced at: 5 days ago - Pushed at: 15 days ago - Stars: 17 - Forks: 1

cbaziotis/ekphrasis

Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).

Language: Python - Size: 778 KB - Last synced at: 5 days ago - Pushed at: 19 days ago - Stars: 670 - Forks: 93

Goldziher/html-to-markdown

HTML to markdown converter

Language: Python - Size: 443 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 47 - Forks: 5

ovuiproduction/Research-Assistant

AI-Powered Research Assistant – A smart tool that helps researchers find relevant papers, recommend journals, ask questions about content, humanize AI text, and detect AI-generated writing. Powered by Large Language Models for enhanced research productivity.

Language: JavaScript - Size: 1.15 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

desholmes/text-quest

Text Quest is a game engine for running text-based adventure games, using a low/no code approach to game design.

Language: JavaScript - Size: 414 KB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 14 - Forks: 3

hscspring/pnlp

NLP预/后处理工具。

Language: Python - Size: 106 KB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 30 - Forks: 6

dnoice/textMan

Your ultimate text manipulation tool

Language: JavaScript - Size: 4.16 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

Lucifer88484/Natural-Language-Processing-API

Natural-Language-Processing-API is a RESTful API built with FastAPI that offers core NLP tasks like sentiment analysis, entity recognition, summarization, and language detection. It uses Hugging Face and spaCy models, supports Docker, and provides easy integration for NLP features.

Language: Python - Size: 9.77 KB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

benletchford/buup.io

A versatile text transformation toolkit in pure Rust with a dependency-free core. Encoding, decoding, formatting, cryptography, and (de)compression and more through CLI, web UI, or as a library.

Language: Rust - Size: 15 MB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 6 - Forks: 0

digineo/texd

texd wraps TeX in a web API

Language: Go - Size: 815 KB - Last synced at: 16 days ago - Pushed at: 19 days ago - Stars: 11 - Forks: 1

AmirAli104/Text2Excel

A GUI desktop application that can extract data from a text file and put them in an Excel or CSV file using regular expression (regex) patterns

Language: Python - Size: 208 KB - Last synced at: 3 days ago - Pushed at: 19 days ago - Stars: 4 - Forks: 0

milliorn/cli-password-generators

Simple command-line applications for generating passwords

Language: Go - Size: 6.85 MB - Last synced at: 19 days ago - Pushed at: 20 days ago - Stars: 2 - Forks: 0

elektito/finglish

A Finglish to Persian converter.

Language: Python - Size: 2.28 MB - Last synced at: 1 day ago - Pushed at: over 3 years ago - Stars: 84 - Forks: 21

ty70/news-summary-translator

Automatically fetches, translates, and summarizes world news from Yahoo! Japan using Google Cloud APIs. Output in JSON or terminal. 🇯🇵→🇺🇸

Language: Python - Size: 21.5 KB - Last synced at: 19 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

scripal-git/scripal

universal text processor

Language: C++ - Size: 3.02 MB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

finnjest/Realm

Advanced Text Processing Tool

Language: AutoHotkey - Size: 253 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

moltenib/md-to-html

Sed script that converts Markdown to HTML code.

Language: sed - Size: 106 KB - Last synced at: 17 days ago - Pushed at: over 4 years ago - Stars: 29 - Forks: 2

codingkush/ChatSense

ChatSense — A chat analyzer app that quickly summarizes and analyzes WhatsApp chat exports with a clean, easy-to-use interface.

Language: Python - Size: 17.6 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

Pranav-Patel-123/GenAI

Language: TypeScript - Size: 102 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

GateNLP/python-gatenlp

Python text processing, pattern matching, and NLP framework

Language: Jupyter Notebook - Size: 19.4 MB - Last synced at: 11 days ago - Pushed at: almost 2 years ago - Stars: 66 - Forks: 8

farhad-here/Persian_Text_Processing

It is Persian Text processing with parsivar library

Language: Python - Size: 9.77 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

Pranav-Patel-123/WaY-scrapping

web and youtube scrapping for the given input like a search engine that brings links and text from web and youtube.

Language: Python - Size: 6.84 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

Edopramudya/Sentiment-Text-Clustering

Proyek ini berfokus pada preprocessing dan clustering data teks dari dataset sentimen. Dataset yang digunakan berisi teks dan label sentimen (positif, negatif, netral), dan dilakukan pembersihan teks sebelum proses klastering.

Language: Jupyter Notebook - Size: 729 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

LucasGoncSilva/mosheh

Mosheh, a tool for creating docs for projects, from Python to Python.

Language: Python - Size: 1.46 MB - Last synced at: 12 days ago - Pushed at: 5 months ago - Stars: 8 - Forks: 1

airbnb/artificial-adversary

🗣️ Tool to generate adversarial text examples and test machine learning models against them

Language: Python - Size: 116 KB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 402 - Forks: 57

open-i18n/rust-unic

UNIC: Unicode and Internationalization Crates for Rust

Language: Rust - Size: 14.1 MB - Last synced at: 9 days ago - Pushed at: 16 days ago - Stars: 241 - Forks: 24

IoeCmcomc/chiecthuyenngoaixa

An utility library for processing Vietnamese texts

Language: Python - Size: 235 KB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 1

casics/nostril 📦

Nostril: Nonsense String Evaluator

Language: Python - Size: 143 MB - Last synced at: 5 days ago - Pushed at: about 3 years ago - Stars: 195 - Forks: 35

google/diff-match-patch 📦

Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.

Language: Python - Size: 659 KB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 7,755 - Forks: 1,139

PellaML/Markdown-Renderer

Enhanced Markdown Renderer: A versatile and extensible JavaScript-based Markdown rendering and parsing library, leveraging Abstract Syntax Trees (AST) for efficient processing and customizable output. Open-source and community-driven, with a focus on future improvements and contributions.

Language: JavaScript - Size: 33.2 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 1 - Forks: 0

aditiiprasad/WhatsStat

A fun and insightful WhatsApp chat analyzer that turns your conversations into beautiful stats, juicy graphs, and quirky insights.

Language: Python - Size: 1.27 MB - Last synced at: 6 days ago - Pushed at: 25 days ago - Stars: 2 - Forks: 0

ronnmabunga/scanpad

ScanPad is an OCR-powered notepad that extracts text from images and lets you edit, organize, and export documents. It features a rich text editor, multiple input methods, and a responsive user interface design.

Language: JavaScript - Size: 354 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 1 - Forks: 0

victoryosiobe/kingchop

Kingchop ⚔️ is a JavaScript English based library for tokenizing text (chopping text). It uses vast rules for tokenizing, and you can adjust them easily.

Language: JavaScript - Size: 85.9 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 1 - Forks: 0

yui-mhcp/data_processing

Data processing utilities in keras3

Language: Jupyter Notebook - Size: 86.2 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 5 - Forks: 1

ucd-dnp/ConTexto

Librería en Python para minería de texto y NLP

Language: Jupyter Notebook - Size: 34.1 MB - Last synced at: 16 days ago - Pushed at: about 1 year ago - Stars: 49 - Forks: 14

Automattic/go-search-replace

🚀 Search & replace URLs in WordPress SQL files.

Language: Go - Size: 101 KB - Last synced at: 8 days ago - Pushed at: 15 days ago - Stars: 97 - Forks: 19

Bikatr7/Kudasai

Streamlining Japanese-English Translation with Advanced Preprocessing and Integrated Translation Technologies

Language: Python - Size: 90.4 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 25 - Forks: 4

KashifMoin1410/Text-Sentiment-Analysis

This project analyzes tweet sentiments using both traditional machine learning (Logistic Regression, Ridge, XGBoost) and deep learning (LSTM) models. The workflow covers text preprocessing, feature engineering, model training, and evaluation. Logistic Regression achieved an R² score of 0.80, while the LSTM model reached ~76% validation accuracy.

Language: Jupyter Notebook - Size: 3.58 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0