An open API service providing repository metadata for many open source software ecosystems.

Topic: "textprocessing"

apache/opennlp

Apache OpenNLP

Language: Java - Size: 17.1 MB - Last synced at: 6 days ago - Pushed at: 11 days ago - Stars: 1,570 - Forks: 489

psemiletov/tea-qt

TEA text editor

Language: C++ - Size: 3.98 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 90 - Forks: 14

apache/opennlp-sandbox

Apache OpenNLP Sandbox

Language: Java - Size: 32.9 MB - Last synced at: about 20 hours ago - Pushed at: 2 days ago - Stars: 45 - Forks: 33

tupilabs/HumanNameParser.java

HumanNameParser.java, a Java port of HumanNameParser.php. Parser for human names in Java, all credit goes to @jasonpriem

Language: Java - Size: 756 KB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 27 - Forks: 24

atantos/TextAssociations.jl

Language: Julia - Size: 34.4 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 15 - Forks: 1

verachell/YeetWords

a domain-specific language for text substitution

Language: Ruby - Size: 136 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 13 - Forks: 0

apache/opennlp-models

Apache OpenNLP Models

Language: Shell - Size: 347 KB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 11 - Forks: 2

SilenZcience/cat_win

Advanced OS Independent Text-Processing and -Analytics Command Line Tool made in Python.

Language: Python - Size: 323 MB - Last synced at: 29 days ago - Pushed at: about 2 months ago - Stars: 9 - Forks: 0

J-TECH-bot/Blackcoffer_Data_Extraction_NLP

This repository showcases data-driven text analytics using NLP techniques. It combines text preprocessing, sentiment scoring, and structured data extraction to convert unstructured text into business-ready datasets.

Language: Jupyter Notebook - Size: 491 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 0

Doktorierer/WordAsLatexEditor

MS Word as a LaTeX editor/previewer

Size: 3.45 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

jyu0414/linelog2py

LINE Chat History Parser for Python

Language: Python - Size: 39.1 KB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

EdenOved/SMSTranslation

Android SMS translator app with offline machine learning models for automatic and on-demand translation.

Language: Java - Size: 7.12 MB - Last synced at: 2 days ago - Pushed at: 4 days ago - Stars: 2 - Forks: 0

mantzaris/KeemenaPreprocessing.jl

Preprocessing for text data: cleaning, normalization, vectorization, tokenization and more

Language: Julia - Size: 630 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2 - Forks: 0

jkecb/pdf-to-epub-ai

Python scripts pipeline for converting PDF files to clean, readable text format using openai api for post-OCR correction and text refinement.

Language: Python - Size: 76.2 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 1

Vikhram-S/IndianConstitution

A Python library for exploring the Constitution of India.

Language: Jupyter Notebook - Size: 201 KB - Last synced at: 29 days ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

HimalayaMinds/string-mastery-toolkit

A Comprehensive Toolbox for Mastery in String Operations Across Programming Paradigms ๐Ÿš€๐Ÿ”

Size: 85.9 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 2

absmj/texcel

Texel is a converting tool between text and colors. All symbols define hexadecimal values, convert to RGB values and set pixels.

Language: JavaScript - Size: 1.99 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

chigwell/promptify-summary

A new package would process text inputs, like video titles or descriptions, to generate structured summaries using an LLM. It would take a user-provided text string (e.g., a headline or query) and ret

Language: Python - Size: 4.88 KB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0

pauldolphin3/Acs.Core

Easy-to-use library to work with files and processes

Language: C# - Size: 27.3 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

its-arnavtech/Parser_Build-Arnav

This Project is currently working on extracting key data from a resume in order to enhance a candidate's profile

Language: Python - Size: 196 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Afnanksalal/TextWave

Empower your projects with high-quality, natural-sounding voices, crafted for diverse applications from interactive games to enhanced accessibility.

Language: Python - Size: 167 KB - Last synced at: 9 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

Chihuah/Process-Long-Subtitles

ไธ€ๅ€‹ Python ๅทฅๅ…ท๏ผŒ็”จไพ†ๅฐ‡ AI ็”ข็”Ÿ็š„้Ž้•ท SRT ๅญ—ๅน•ๆ‹†ๅˆ†ๆˆๅคš่กŒ๏ผŒไธฆ่‡ชๅ‹•ๅ‡ๅˆ†ๆ™‚้–“่ปŒ๏ผŒๆๅ‡ๅญ—ๅน•ๅฏ่ฎ€ๆ€งใ€‚

Language: Jupyter Notebook - Size: 1.95 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

bhadrakali79/NLP-Practicals

This repository contains practical implementations of NLP concepts including dependency grammar, text processing, normalization, and TF-IDF models to demonstrate key techniques in natural language processing.

Language: Jupyter Notebook - Size: 500 KB - Last synced at: 7 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

gopireddy99/Daily_AD_NLP_assignments

AD Training classes in NLP - Daily Assignments

Language: Jupyter Notebook - Size: 180 KB - Last synced at: 5 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

arya-io/LingoLens

A Streamlit app for language translation and sentiment analysis powered by GoogleTranslator and TextBlob.

Language: Python - Size: 14.6 KB - Last synced at: 10 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

arya-io/NER-EntityLinker

A Streamlit app that performs Named Entity Recognition (NER), links entities to Wikipedia, and handles disambiguation for ambiguous terms like "Apple," using NLP techniques.

Language: Python - Size: 34.5 MB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

arya-io/NLP-Explorer

NLP Explorer is an interactive Streamlit app that lets users explore various NLP techniques like Tokenization, POS Tagging, Stemming, Lemmatization, and NER. It provides real-time analysis of text, making it a great tool for learning and experimenting with NLP concepts.

Language: Python - Size: 113 MB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

arya-io/email-generator

A Cold Email Generator that uses Groq Cloud, ChromaDB, Langchain, and Streamlit to automatically create personalized cold emails for job opportunities by extracting details from job listings.

Language: Python - Size: 51.8 KB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Tech-Guyy/Text-processing-Information-retrieval-and-information-extraction-text-to-speech

The author implemented simple rule base solution and machine learning approach for information retrieval and information extraction after which the result were analyzed.

Language: Jupyter Notebook - Size: 17.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Tech-Guyy/Text-processing-sentiments-analysis

The author implemented support vector machine for sentiments analysis and applied two feature extractions, Bag-of-Words (CountVectorizer) and TF-IDF (TfidfVectorizer), after which the results for both methods were analysed. The accuracy obtained for both methods were (BoW = 87%) and (TF-IDF = 86%).

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

sowmyavi/Text-Analytics

Language: Python - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Huzaifa-367/Text-Preprocessing-with-NLTK

Text Preprocessing with NLTK

Language: Jupyter Notebook - Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

muhammadkhairiisufyaan/Analisis-Sentimen-Berita-Pilpres-pada-Platform-Twitter-

This project represents my team's contribution to the semi-final of Gelar Rasa 2023, a competition organized by HIMASADA UPN "Veteran" East Java. With enthusiasm and dedication, our team managed to secure the 2nd place in the competition.

Language: Jupyter Notebook - Size: 5.18 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

georgiyozhegov/eliquarre

Text filtering tool

Language: Rust - Size: 11.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

ansargit3/Sarcasm-Detection-Application

"Detect sarcasm effortlessly! This Python app uses NLP and ML to analyze text sentiment, distinguishing sarcastic tones. With a user-friendly interface, input any text for real-time sarcasm identification. Achieve accurate results through advanced sentiment analysis techniques and trained models."

Language: Python - Size: 1.79 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

rhandley1231/TextProcessing

The first project from my Advanced Data Structures Class. The goal is to take a normal english corpus - in the case of this project, Lewis Carroll's Alice's Adventures In Wonderland - and analyze the frequency at which certain words and symbols appear.

Language: Java - Size: 74.2 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

SaikatPhys/python-web-scraping

Web & social media scraping in Pythonian way

Language: Jupyter Notebook - Size: 206 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

ManuPer3z/Sentiment-Analysis-Project

A project that harnesses the Stanford NLP library to gauge sentiment from provided text via an intuitive graphical interface.

Language: Java - Size: 8.79 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

NagiPragalathan/TerminalDesigner

TerminalDesigner is a Python-based project aimed at enhancing text processing capabilities in the terminal. It provides a set of tools and functionalities to manipulate text appearance, create ASCII art, and modify terminal colors

Language: Python - Size: 661 KB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Abhikar12/AIL

This assignments focuses on implementing and applying various AI algorithms and techniques.

Language: Python - Size: 24.4 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ao9000/NTU-SC1015-Data-Science-Mini-Project

Mini project for NTU-SC1015 (Introduction to Data Science and Artificial Intelligence). Regarding fake news analysis & classification

Language: Jupyter Notebook - Size: 70.3 MB - Last synced at: almost 3 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

sanjay-06/Optical-Character-Recognition

The objective of the package is to use optical character recognition in order to read vehicle license plates and convert them to text.

Language: Jupyter Notebook - Size: 14.5 MB - Last synced at: 7 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

udaycruise2903/khasi-dienshonhi

Documentation and scripts of khasi-khasi Dictionary Digitalisation project

Language: Python - Size: 24.4 KB - Last synced at: 4 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Related Topics
nlp 15 python 9 machinelearning 6 ai 5 naturallanguageprocessing 4 machine-learning 4 sentiment-analysis 3 natural-language-processing 3 tokenization 3 sentimentanalysis 3 textanalysis 3 nltk 3 streamlit 3 apache 3 compling 3 languagetechnology 3 opennlp 3 text 3 logistic-regression 2 tf-idf 2 automation 2 webscraping 2 dataextraction 2 java 2 stemming 2 datascience 2 ner 1 image-processing 1 perceptron-learning-algorithm 1 textnormalizer 1 tfidf-text-analysis 1 gui 1 languagemodel 1 entitylinking 1 sentiment 1 stanfordanalysis 1 textanalytics 1 entityextraction 1 classicml 1 disambiguation 1 rust 1 nltk-python 1 crawling-python 1 stopwords 1 cleaning-text 1 postagging 1 treemap 1 lemmatization 1 pyqt5 1 sarcasmdetection 1 stringtokenizer 1 svmclassifier 1 wikipediaapi 1 editor 1 algorithms 1 regularexpression 1 spacy 1 algorithms-and-data-structures 1 artificial-intelligence 1 artificial-intelligence-algorithms 1 namedentityrecognition 1 expert-system 1 svm 1 module 1 pythonpackage 1 terminalart 1 terminalcreativity 1 terminaldesigner 1 terminalenvironment 1 terminalfun 1 terminalgraphics 1 terminalmagic 1 textart 1 textbaseddesign 1 textstyling 1 visualdesign 1 aiforbusiness 1 chromadb 1 coldemailgenerator 1 groqcloud 1 jobapplication 1 langchain 1 llama31 1 personalizedemails 1 portfoliointegration 1 techstack 1 twitter 1 decision-tree-classifier 1 random-forest-classifier 1 bag-of-words 1 sentiment-classification 1 support-vector-machines 1 tweepy 1 twint 1 twitter-api 1 datastructures-algorithms 1 languagetranslation 1 socialnetworkanalysis 1 information-extraction 1 information-retrieval 1