GitHub topics: text-processing
DougLau/booky
A tool to analyze English text
Language: Rust - Size: 456 KB - Last synced at: about 11 hours ago - Pushed at: about 12 hours ago - Stars: 1 - Forks: 0

AlanSteinbarth/Audio2Tekst
Profesjonalny konwerter audio na tekst wykorzystujący OpenAI Whisper. Wspiera batch processing, eksport do różnych formatów (TXT, DOCX, PDF). GUI z drag&drop, progress tracking i opcjami konfiguracji jakości transkrypcji. Idealny dla dziennikarzy, studentów i twórców treści.
Language: Python - Size: 3.47 MB - Last synced at: about 17 hours ago - Pushed at: about 18 hours ago - Stars: 0 - Forks: 0

DineshDhamodharan24/Data_Science_Final_Project
Customer Insights & Recommendation System: Harnessing Decision Tree, Logistic Regression, and Random Forest models for behavior analysis. Utilizing EasyOCR and Python Imaging Library for image information extraction. Employing NLTK for sentiment analysis on textual data
Language: Jupyter Notebook - Size: 21.1 MB - Last synced at: about 21 hours ago - Pushed at: about 22 hours ago - Stars: 0 - Forks: 0

fullscreen-triangle/kwasa-kwasa
Semantic computing framework with meta-cognitive orchestration and biomimetic principles
Language: Rust - Size: 9.57 MB - Last synced at: about 23 hours ago - Pushed at: 1 day ago - Stars: 2 - Forks: 0

Sumit-807/newsnow
NewsNow offers a clean and elegant interface for reading real-time trending news. 🌐 Dive into the latest updates and enjoy seamless access with GitHub OAuth integration! 🐙
Language: TypeScript - Size: 4.55 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

samwega/obsidian-wordsmith Fork of chrisgrieser/obsidian-proofreader
AI-powered context-aware writing assistant for Obsidian. Instantly improve, translate, or generate new text with context-aware AI inline suggestions, custom prompts, and granular review. Supports ALL remote and local models. Enjoy a seamless, keyboard-first workflow for editing, refining, and creative writing—all within your notes.
Language: TypeScript - Size: 1.09 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 0

BurntSushi/aho-corasick
A fast implementation of Aho-Corasick in Rust.
Language: Rust - Size: 4.71 MB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 1,114 - Forks: 101

VitinDM/data-science-snippets
🧰 Essential EDA and Data Cleaning Helpers for Any DataFrame This collection of functions is designed to accelerate exploratory data analysis (EDA), quickly surface data quality issues, and offer high-level insights into the structure and content of your dataset.
Language: Python - Size: 30.3 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

MonikaBarget/atr-historical-research
Automated Text Recognition in Historical Research
Language: Jupyter Notebook - Size: 2.92 MB - Last synced at: about 8 hours ago - Pushed at: 15 days ago - Stars: 5 - Forks: 14

maqeel019/ATS
A powerful Python-based ATS that parses and ranks PDF resumes on recruiter-defined filters like skills, education, and experience. Handles scanned and complex resumes with detailed scoring and Excel output.
Language: Python - Size: 1.88 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

ZeroX-DG/vi-rs
Vietnamese Input Method library
Language: Rust - Size: 322 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 150 - Forks: 16

Taha5125/DocxWriter-JSON
DocxWriter is a Python library for generating professional Word documents from JSON. Automate reports, add tables, lists, images, and apply custom styles — all from clean, structured data.
Language: Python - Size: 23.4 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

Romelium/mpatch
A fuzzy patch tool in Rust for applying AI-generated diffs from markdown, ignoring line numbers.
Language: Rust - Size: 0 Bytes - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

vmenger/deduce
Deduce: de-identification method for Dutch medical text
Language: Python - Size: 7.25 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 57 - Forks: 24

pyparsing/pyparsing
Python library for creating PEG parsers
Language: Python - Size: 7.58 MB - Last synced at: 3 days ago - Pushed at: 13 days ago - Stars: 2,344 - Forks: 291

teenu/gpu-text-search
Ultra-high-performance GPU-accelerated text search using Metal compute shaders
Language: Swift - Size: 61.5 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

homeofhx/Text-Purifier
Simple Mac application that filters out specific characters in given text using regular expression (Regex)
Language: Swift - Size: 1.14 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

victoria217-bottino/google-news-scraper
# 📰 Google News Scraper A Python tool to fetch, decode, and process Google News articles by keyword and time range. Extract clean article text, decode URLs, and perform NLP effortlessly. Perfect for news aggregation, analysis, or building bots. Includes progress tracking with `tqdm` and customizable features for advanced use cases. 🚀
Size: 1000 Bytes - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 1

12345far/metrics-calculation-precision-recall
Laboratory 7 - Retrieval Information
Size: 1.95 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

ga1az/pathdigest
A command-line tool written in Go that analyzes Git repositories, local directories, or individual files and generates a structured, LLM-friendly text digest of their content.
Language: Go - Size: 32.2 KB - Last synced at: 1 day ago - Pushed at: 11 days ago - Stars: 5 - Forks: 1

PyThaiNLP/pythainlp
Thai natural language processing in Python
Language: Python - Size: 65.6 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1,040 - Forks: 280

ChenghaoMou/text-dedup
All-in-one text de-duplication
Language: Python - Size: 5.77 MB - Last synced at: 3 days ago - Pushed at: 26 days ago - Stars: 688 - Forks: 74

omicsNLP/Auto-CORPus
Auto-CORPus pipeline developed by a University of Nottingham and Imperial College London collaboration to standardize text and table data extracted from full text publications. See Open Access publication at: https://doi.org/10.3389/fdgth.2022.788124.
Language: HTML - Size: 57.2 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 21 - Forks: 8

pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Language: Python - Size: 332 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 7,394 - Forks: 619

yuvrajpandiya/Piero-EnDe-Coder
A powerful encryption and decryption tool that combines the Vigenère cipher, XOR encryption, and Base64 encoding to secure messages. This tool allows users to encode and decode messages using a secret key, ensuring an extra layer of security.
Size: 1000 Bytes - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

himkt/konoha
🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
Language: Python - Size: 1.35 MB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 251 - Forks: 28

rhaberkorn/sciteco
Advanced TECO dialect and interactive screen editor based on Scintilla
Language: C - Size: 3.48 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 53 - Forks: 6

rhiosutoyo/Teaching-Deep-Learning-and-Its-Applications
This course introduces the building blocks of deep learning and provides overview of various deep learning architectures. It also demonstrates how to solve real-world problems using a practical approach.
Language: Jupyter Notebook - Size: 31.8 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0

KaizoKonpaku/Hush
AI-Powered Screenshot, Audio Transcription, and Text Processing for macOS, Hidden from Screen Sharing, Packed with Features, and Just 2MB
Language: Swift - Size: 12 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 12 - Forks: 3

june1963/Alfred-GitHub-Models
An Alfred workflow for AI text processing with GitHub Models
Language: Shell - Size: 419 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

wenet-e2e/WeTextProcessing
Text Normalization & Inverse Text Normalization
Language: Python - Size: 957 KB - Last synced at: 5 days ago - Pushed at: 7 months ago - Stars: 596 - Forks: 83

hyung-hwan/hawk
An AWK interpreter
Language: C - Size: 4.21 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 7 - Forks: 1

whitfin/bytelines
Read input lines as byte slices for high efficiency
Language: Rust - Size: 39.1 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 66 - Forks: 9

thombashi/humanreadable
humanreadable is a Python library to convert human-readable values to other units.
Language: Python - Size: 137 KB - Last synced at: about 4 hours ago - Pushed at: about 2 months ago - Stars: 18 - Forks: 1

Thihasoehlaing/spelling-correction-system
A smart NLP-based spelling correction system for English language with a PySide6 GUI. Detects and highlights non-word and real-word errors using minimum edit distance, bigram/trigram models, and POS tagging.
Language: Python - Size: 144 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

bocaletto-luca/TextEditorQt
This program is a simple text editor with an intuitive user interface, created using the PyQt5 framework for developing desktop applications in Python. The text editor provides many basic features expected from an editor, along with advanced functionalities such as text formatting.
Language: Python - Size: 32.2 KB - Last synced at: 3 days ago - Pushed at: 14 days ago - Stars: 5 - Forks: 1

phil65/docler
Abstractions & Tools for OCR / document processing
Language: Python - Size: 2.26 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 2 - Forks: 0

btwitskaif69/Pro-Text-Editor
The Pro Text Editor project is a web application built using React, offering features for text manipulation, including text-to-speech functionality.
Language: JavaScript - Size: 198 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

ds-modules/CUNEIF-102A
UC Berkeley CUNEIF 102A (Sumerian Text Analysis) Fall 2017
Language: Jupyter Notebook - Size: 40.8 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 6 - Forks: 0

cobanov/shakespeare-dataset
complete works, plays, sonnets and poems of shakespeare
Size: 2.36 MB - Last synced at: 2 days ago - Pushed at: almost 3 years ago - Stars: 12 - Forks: 3

daac-tools/daachorse
🐎 A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure in Rust.
Language: Rust - Size: 3.71 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 218 - Forks: 15

dhopp1/nlp_pipeline
Collection of NLP tools for processing and analyzing text data.
Language: Python - Size: 148 MB - Last synced at: about 4 hours ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 2

krzyzanowskim/CoreTextSwift
CoreText Swift bindings
Language: Swift - Size: 27.3 KB - Last synced at: 5 days ago - Pushed at: 12 months ago - Stars: 165 - Forks: 8

RAGnTeX/RAGnTeX
Creates latex presentations on the given topic based on the provided documents
Language: Python - Size: 17.5 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

sstadick/hck
A sharp cut(1) clone.
Language: Rust - Size: 494 KB - Last synced at: 4 days ago - Pushed at: 11 days ago - Stars: 712 - Forks: 18

open-korean-text/open-korean-text
Open Korean Text Processor - An Open-source Korean Text Processor
Language: Scala - Size: 32.7 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 630 - Forks: 98

linuxscout/pyarabic
pyarabic
Language: Python - Size: 1.23 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 457 - Forks: 88

george-gca/ai_papers_cleaner
Extract text from papers PDFs and abstracts, and remove uninformative words.
Language: Python - Size: 390 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 5 - Forks: 0

fossology/atarashi
Atarashi scans for license statements in open source software, focusing on text statistics. Designed to work stand-alone and with FOSSology.
Language: Python - Size: 50.3 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 29 - Forks: 29

dewanakl/aman
🤬 Filter kata kotor sederhana dengan regex. Cek, sensor, dan hapus kata kasar dengan pola karakter mirip.
Language: PHP - Size: 85 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1 - Forks: 2

justinbt1/Akin
Python library for detecting near duplicate texts in a corpus at scale.
Language: Python - Size: 2.77 MB - Last synced at: 2 days ago - Pushed at: 12 days ago - Stars: 8 - Forks: 0

helix-editor/nucleo
A fast and convenient fuzzy matcher library for rust
Language: Rust - Size: 232 KB - Last synced at: 12 days ago - Pushed at: 29 days ago - Stars: 1,109 - Forks: 39

chmln/sd
Intuitive find & replace CLI (sed alternative)
Language: Rust - Size: 414 KB - Last synced at: 12 days ago - Pushed at: 2 months ago - Stars: 6,323 - Forks: 144

omerblau/language-flipper
Instantly fix text typed in the wrong keyboard layout with one hot-key (Win).
Language: C++ - Size: 1.52 MB - Last synced at: 6 days ago - Pushed at: 13 days ago - Stars: 3 - Forks: 0

SAGE-Rebirth/gemini-chatbot-mongodb
This project is a FastAPI and React-based chatbot system for querying PDF content using Google Gemini 2.0 Flash embeddings and MongoDB vector search. It features PDF upload, semantic search, chat interface, and an admin panel for document management with Netligent branding. The system is production grade ready with robust error handling.
Language: TypeScript - Size: 317 KB - Last synced at: 12 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 1

mblucasm/lcmp
List Comparison - A fast and lightweight tool for comparing two lists, finding common elements, and identifying differences. Supports raw text files, Instagram data, and extracted HTML <div> elements.
Language: C++ - Size: 52.7 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

whitfin/s3-utils
Utilities and tools based around Amazon S3 to provide convenience APIs in a CLI
Language: Rust - Size: 43.9 KB - Last synced at: 7 days ago - Pushed at: over 4 years ago - Stars: 55 - Forks: 10

andyi95/reading-vue
Set of tools for text processing
Language: Vue - Size: 2.36 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

knaw-huc/textsurf
Webservice for efficiently serving multiple plain text documents or excerpts thereof (by unicode character offset), without loading everything into memory.
Language: Rust - Size: 78.1 KB - Last synced at: 5 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

TheophilusE/pystringbuilder
A lightweight and efficient Python string builder class for dynamic text construction, minimizing unnecessary string concatenations for better performance.
Language: Python - Size: 7.81 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

tsmdt/dygest
CLI tool to extract content insights from raw txt using LLMs and NER
Language: Python - Size: 7.96 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 3 - Forks: 0

IG-onGit/TexeT
TexeT is the tool you need to take your interaction and content control to the next level.
Language: Python - Size: 117 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

Lips7/Matcher
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust.
Language: Rust - Size: 36.9 MB - Last synced at: 5 days ago - Pushed at: 15 days ago - Stars: 17 - Forks: 1

cbaziotis/ekphrasis
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Language: Python - Size: 778 KB - Last synced at: 5 days ago - Pushed at: 19 days ago - Stars: 670 - Forks: 93

Goldziher/html-to-markdown
HTML to markdown converter
Language: Python - Size: 443 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 47 - Forks: 5

ovuiproduction/Research-Assistant
AI-Powered Research Assistant – A smart tool that helps researchers find relevant papers, recommend journals, ask questions about content, humanize AI text, and detect AI-generated writing. Powered by Large Language Models for enhanced research productivity.
Language: JavaScript - Size: 1.15 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

desholmes/text-quest
Text Quest is a game engine for running text-based adventure games, using a low/no code approach to game design.
Language: JavaScript - Size: 414 KB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 14 - Forks: 3

hscspring/pnlp
NLP预/后处理工具。
Language: Python - Size: 106 KB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 30 - Forks: 6

dnoice/textMan
Your ultimate text manipulation tool
Language: JavaScript - Size: 4.16 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

Lucifer88484/Natural-Language-Processing-API
Natural-Language-Processing-API is a RESTful API built with FastAPI that offers core NLP tasks like sentiment analysis, entity recognition, summarization, and language detection. It uses Hugging Face and spaCy models, supports Docker, and provides easy integration for NLP features.
Language: Python - Size: 9.77 KB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

benletchford/buup.io
A versatile text transformation toolkit in pure Rust with a dependency-free core. Encoding, decoding, formatting, cryptography, and (de)compression and more through CLI, web UI, or as a library.
Language: Rust - Size: 15 MB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 6 - Forks: 0

digineo/texd
texd wraps TeX in a web API
Language: Go - Size: 815 KB - Last synced at: 16 days ago - Pushed at: 19 days ago - Stars: 11 - Forks: 1

AmirAli104/Text2Excel
A GUI desktop application that can extract data from a text file and put them in an Excel or CSV file using regular expression (regex) patterns
Language: Python - Size: 208 KB - Last synced at: 3 days ago - Pushed at: 19 days ago - Stars: 4 - Forks: 0

milliorn/cli-password-generators
Simple command-line applications for generating passwords
Language: Go - Size: 6.85 MB - Last synced at: 19 days ago - Pushed at: 20 days ago - Stars: 2 - Forks: 0

elektito/finglish
A Finglish to Persian converter.
Language: Python - Size: 2.28 MB - Last synced at: 1 day ago - Pushed at: over 3 years ago - Stars: 84 - Forks: 21

ty70/news-summary-translator
Automatically fetches, translates, and summarizes world news from Yahoo! Japan using Google Cloud APIs. Output in JSON or terminal. 🇯🇵→🇺🇸
Language: Python - Size: 21.5 KB - Last synced at: 19 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

scripal-git/scripal
universal text processor
Language: C++ - Size: 3.02 MB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

finnjest/Realm
Advanced Text Processing Tool
Language: AutoHotkey - Size: 253 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

moltenib/md-to-html
Sed script that converts Markdown to HTML code.
Language: sed - Size: 106 KB - Last synced at: 17 days ago - Pushed at: over 4 years ago - Stars: 29 - Forks: 2

codingkush/ChatSense
ChatSense — A chat analyzer app that quickly summarizes and analyzes WhatsApp chat exports with a clean, easy-to-use interface.
Language: Python - Size: 17.6 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

Pranav-Patel-123/GenAI
Language: TypeScript - Size: 102 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

GateNLP/python-gatenlp
Python text processing, pattern matching, and NLP framework
Language: Jupyter Notebook - Size: 19.4 MB - Last synced at: 11 days ago - Pushed at: almost 2 years ago - Stars: 66 - Forks: 8

farhad-here/Persian_Text_Processing
It is Persian Text processing with parsivar library
Language: Python - Size: 9.77 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

Pranav-Patel-123/WaY-scrapping
web and youtube scrapping for the given input like a search engine that brings links and text from web and youtube.
Language: Python - Size: 6.84 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

Edopramudya/Sentiment-Text-Clustering
Proyek ini berfokus pada preprocessing dan clustering data teks dari dataset sentimen. Dataset yang digunakan berisi teks dan label sentimen (positif, negatif, netral), dan dilakukan pembersihan teks sebelum proses klastering.
Language: Jupyter Notebook - Size: 729 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

LucasGoncSilva/mosheh
Mosheh, a tool for creating docs for projects, from Python to Python.
Language: Python - Size: 1.46 MB - Last synced at: 12 days ago - Pushed at: 5 months ago - Stars: 8 - Forks: 1

airbnb/artificial-adversary
🗣️ Tool to generate adversarial text examples and test machine learning models against them
Language: Python - Size: 116 KB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 402 - Forks: 57

open-i18n/rust-unic
UNIC: Unicode and Internationalization Crates for Rust
Language: Rust - Size: 14.1 MB - Last synced at: 9 days ago - Pushed at: 16 days ago - Stars: 241 - Forks: 24

IoeCmcomc/chiecthuyenngoaixa
An utility library for processing Vietnamese texts
Language: Python - Size: 235 KB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 1

casics/nostril 📦
Nostril: Nonsense String Evaluator
Language: Python - Size: 143 MB - Last synced at: 5 days ago - Pushed at: about 3 years ago - Stars: 195 - Forks: 35

google/diff-match-patch 📦
Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.
Language: Python - Size: 659 KB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 7,755 - Forks: 1,139

PellaML/Markdown-Renderer
Enhanced Markdown Renderer: A versatile and extensible JavaScript-based Markdown rendering and parsing library, leveraging Abstract Syntax Trees (AST) for efficient processing and customizable output. Open-source and community-driven, with a focus on future improvements and contributions.
Language: JavaScript - Size: 33.2 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 1 - Forks: 0

aditiiprasad/WhatsStat
A fun and insightful WhatsApp chat analyzer that turns your conversations into beautiful stats, juicy graphs, and quirky insights.
Language: Python - Size: 1.27 MB - Last synced at: 6 days ago - Pushed at: 25 days ago - Stars: 2 - Forks: 0

ronnmabunga/scanpad
ScanPad is an OCR-powered notepad that extracts text from images and lets you edit, organize, and export documents. It features a rich text editor, multiple input methods, and a responsive user interface design.
Language: JavaScript - Size: 354 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 1 - Forks: 0

victoryosiobe/kingchop
Kingchop ⚔️ is a JavaScript English based library for tokenizing text (chopping text). It uses vast rules for tokenizing, and you can adjust them easily.
Language: JavaScript - Size: 85.9 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 1 - Forks: 0

yui-mhcp/data_processing
Data processing utilities in keras3
Language: Jupyter Notebook - Size: 86.2 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 5 - Forks: 1

ucd-dnp/ConTexto
Librería en Python para minería de texto y NLP
Language: Jupyter Notebook - Size: 34.1 MB - Last synced at: 16 days ago - Pushed at: about 1 year ago - Stars: 49 - Forks: 14

Automattic/go-search-replace
🚀 Search & replace URLs in WordPress SQL files.
Language: Go - Size: 101 KB - Last synced at: 8 days ago - Pushed at: 15 days ago - Stars: 97 - Forks: 19

Bikatr7/Kudasai
Streamlining Japanese-English Translation with Advanced Preprocessing and Integrated Translation Technologies
Language: Python - Size: 90.4 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 25 - Forks: 4

KashifMoin1410/Text-Sentiment-Analysis
This project analyzes tweet sentiments using both traditional machine learning (Logistic Regression, Ridge, XGBoost) and deep learning (LSTM) models. The workflow covers text preprocessing, feature engineering, model training, and evaluation. Logistic Regression achieved an R² score of 0.80, while the LSTM model reached ~76% validation accuracy.
Language: Jupyter Notebook - Size: 3.58 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0
