An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: text-analysis

quanteda/stopwords

Multilingual Stopword Lists in R

Language: R - Size: 995 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 115 - Forks: 9

biolab/orange3-text

🍊 📄 Text Mining add-on for Orange3

Language: Python - Size: 46.5 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 132 - Forks: 86

Princeton-CDH/corppa

Research software associated with the Princeton Prosody Archive (PPA) full-text corpus: OCR, text reuse, poetry detection

Language: Jupyter Notebook - Size: 32 MB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 1

DishnaKawindi/Stock_Price_Prediction_and_Text_Analysis

Stock price prediction and news sentiment analysis project for CSIS 4260 – Special Topics in Data Analytics (Winter 2025). Built by a team of 2. Includes automated data pipeline, ML models, web scraping, and an interactive Dash dashboard.

Language: Python - Size: 7.1 MB - Last synced at: 1 day ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

sergeyklay/clusterium

Text Clustering Toolkit for Bayesian Nonparametric Analysis

Language: Python - Size: 1.44 MB - Last synced at: 23 days ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

satyampandey1411/SAT-News-Analyser

SAT News Analyser is a web application offering in-depth news article analysis with features like sentiment analysis, genre detection, POS tagging, and reading time estimation. Supports URL-based content extraction, Google OAuth authentication, article history, and text-to-speech functionality. Built with Flask, PostgreSQL, and NLTK.

Language: HTML - Size: 445 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

convosense/email_signature_remover

Email signature remover: extracts the email body from the email text, using NLP, to get accurate sentiment results.

Language: Python - Size: 60.5 KB - Last synced at: 7 days ago - Pushed at: almost 2 years ago - Stars: 19 - Forks: 2

noduslabs/infranodus-obsidian-plugin

Advanced graph view for Obsidian: text analysis, topic modeling, and AI with InfraNodus AI text analysis tool: https://infranodus.com

Language: JavaScript - Size: 15.1 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 67 - Forks: 8

ChristopherAndrewTopalian/CATopalian_JavaScript_Text_Investigator

A JavaScript application that enables very deep text analysis. It also gets all Earthquake data from across the world with the push of a button.

Language: JavaScript - Size: 199 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

carpentries-incubator/python-text-analysis

Text Analysis with Python

Language: Python - Size: 57.7 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 11 - Forks: 16

hiDaDeng/cntext

Text analysis package supporting multiple methods, including word count, readability, document similarity, sentiment analysis, Word2Vec/GloVe, and Large Language Models (LLMs).

Language: Python - Size: 64 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 338 - Forks: 30

hiDaDeng/Chinese-Pretrained-Word-Embeddings

A collection of resources on Chinese text analysis tools, corpora, and pretrained models.

Size: 1.13 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 137 - Forks: 29

andrewtavis/kwx

BERT, LDA, and TFIDF based keyword extraction in Python

Language: Python - Size: 12.3 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 73 - Forks: 10

twardoch/split-markdown4gpt

A Python tool for splitting large Markdown files into smaller sections based on a specified token limit. This is particularly useful for processing large Markdown files with GPT models, as it allows the models to handle the data in manageable chunks.

Language: Python - Size: 78.1 KB - Last synced at: 19 days ago - Pushed at: about 2 months ago - Stars: 24 - Forks: 2

gengoai/gengoai

Mono Repository for GengoAI projects

Language: Java - Size: 14.4 MB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

pacian/Digital-Humanities-Toolkit

I created this repository to provide the DH Community a compilation of free, open-source tools for creating and developing digital humanities projects, along with relevant tutorials and examples of projects completed with those tools. Please contact me at [email protected], Richard Dennis, if you have any questions, comments or suggestions.

Size: 715 KB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 36 - Forks: 3

stepthom/text_mining_resources

Resources for learning about Text Mining and Natural Language Processing

Size: 707 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 577 - Forks: 199

wyounas/homer

Homer, a text analyser in Python, can help make your text more clear, simple and useful for your readers.

Language: Python - Size: 4.66 MB - Last synced at: 18 days ago - Pushed at: 3 months ago - Stars: 632 - Forks: 35

Kaiten-dev/quita_mini

Quita Mini is a text analysis tool designed to calculate various linguistic metrics from text data. It processes a collection of text files, computes statistics such as Type-Token Ratio (TTR), entropy, average token and type lengths, hapax legomena percentages, and more. The results are then saved in an Excel file for further analysis.

Language: Go - Size: 3.53 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

ropensci/jstor

Import journal data from DfR (JSTOR)

Language: R - Size: 6.14 MB - Last synced at: about 23 hours ago - Pushed at: about 2 months ago - Stars: 47 - Forks: 10

dinamohsin/Social-Media-Sentiment-Analysis-

Analyzing public sentiment from social media data using Natural Language Processing (NLP) techniques. It involves cleaning and preprocessing raw text, performing sentiment classification using TextBlob, balancing the dataset, training a machine learning model, and visualizing sentiment trends.

Language: Jupyter Notebook - Size: 526 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

CodeWithJazmine/bookbot

Python command-line tool that analyzes text files for word count and character statistics

Language: Python - Size: 9.77 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

K2/Stylometrics

Stylometric Stenography LLM Generation Attribution DRM/DLP

Language: TypeScript - Size: 65.7 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

kivanc57/quita_mini

Quita Mini is a text analysis tool designed to calculate various linguistic metrics from text data. It processes a collection of text files, computes statistics such as Type-Token Ratio (TTR), entropy, average token and type lengths, hapax legomena percentages, and more. The results are then saved in an Excel file for further analysis.

Language: Go - Size: 3.53 MB - Last synced at: 23 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

mnsoylemez/Sulay

PyTorch Transformer language model with rotary positional encoding and SentencePiece BPE for text generation, plus text analysis tools.

Language: Python - Size: 20.5 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

DavidOsipov/Keywords4CV

A Python tool to extract key skills and terms from job descriptions, optimizing resumes and LinkedIn profiles for ATS and recruiters.

Language: Python - Size: 225 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 2 - Forks: 0

jrrobison1/wordtangible

Python library for analyzing the concreteness and imageability of words and text. It provides tools to measure how abstract or concrete the language in a given text is, which can be useful for various natural language processing tasks, readability analysis, and linguistic research.

Language: Python - Size: 229 KB - Last synced at: 18 days ago - Pushed at: 10 months ago - Stars: 4 - Forks: 0

ichalkiad/datadescriptor_uselections2020

Code for collecting and cleaning speeches (text) of the US 2020 election campaign. Corresponding publication: "A text dataset of campaign speeches of the main tickets in the 2020 US presidential election", by Ioannis Chalkiadakis, Louise Anglès d’Auriac, Gareth W. Peters, and Divina Frau-Meigs

Language: Python - Size: 38.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ryanjgallagher/shifterator

Interpretable data visualizations for understanding how texts differ at the word level

Language: Python - Size: 40.1 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 275 - Forks: 29

Kledenai/stringwise

Efficient and elegant string similarity comparison using bigram analysis. Ideal for fuzzy matching, search optimization, and natural language tools.

Language: TypeScript - Size: 83 KB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

AbrSantiago/corpus-tfidf-analyzer

A Python tool for text analysis using TF-IDF, lemmatization, stopword filtering, and frequency visualization.

Language: Python - Size: 14.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

abilzerian/LLM-Prompt-Library

A playground of highly experimental prompts, tools & scripts for machine intelligence models from DeepSeek, OpenAI, Anthropic, Meta, Mistral, Google, xAI & others.

Language: Python - Size: 143 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 1,116 - Forks: 116

Nuraj250/LexiIQ

🚀 LexiIQ is an AI-powered NLP tool that processes text using spaCy and TextBlob to extract meaningful insights. It provides tokenization, named entity recognition, dependency parsing, sentiment analysis, and an interactive interface for easy use.

Language: TypeScript - Size: 707 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Inkdecker/Inktyping

Free tool for text exploration: analyze your favorite books and practice writing.

Language: Python - Size: 68.8 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

oneai-nlp/oneai-node

Natural language processing - summarization, sentiment analysis, topic detection and more.

Language: TypeScript - Size: 5.38 MB - Last synced at: 13 days ago - Pushed at: almost 2 years ago - Stars: 67 - Forks: 6

prakharrathi25/Text-Analytics-Tool

This is an application that automates the process of text analysis with a user-friendly GUI. 📱 It has been implemented using Python and deployed with the Streamlit package.

Language: Jupyter Notebook - Size: 56.5 MB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 36 - Forks: 9

hosu-kim/word_counter_by_hosu

A simple React app to count words, sentences, paragraphs, and estimate reading time in real time.

Language: TypeScript - Size: 18.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

textpipe/textpipe 📦

Textpipe: clean and extract metadata from text

Language: Python - Size: 340 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 301 - Forks: 25

trinker/qdap

Quantitative Discourse Analysis Package: Bridging the gap between qualitative data and quantitative analysis

Language: R - Size: 36.9 MB - Last synced at: 3 days ago - Pushed at: over 4 years ago - Stars: 177 - Forks: 44

hiDaDeng/hidadeng

GitHub profile page

Size: 34.2 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

troy-yu-cheng/final-project-proposal

final project for intro2ds gu 25spr

Language: HTML - Size: 5.78 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

danielvartan/iramuteqlike

💬⛏️ IRaMuTeQ Software Analyses in R

Language: R - Size: 3.37 MB - Last synced at: 2 months ago - Pushed at: 12 months ago - Stars: 7 - Forks: 2

DCS-training/BeyondSocialNetworks

This is a repository for the Beyond Social Networks: Advanced Uses of Gephi in Humanities Research course provided by Brian Wong for the CDCS. Within the repository there are files with sample datasets and a guide to building datasets. It will be updated before each section. Go to the Readme file

Size: 10.5 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 2 - Forks: 1

textvec/textvec

Text vectorization tool to outperform TFIDF for classification tasks

Language: Python - Size: 799 KB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 194 - Forks: 26

cosmoduende/r-holy-books-sentiment-data-analysis

What's the most positive or negative religion? Sentiment and Data Analysis of Holy Books with R. Analysis of religious dogmas by exploring their Holy Books (The Bible, The Quran, The Dhammapada, and The Book of Mormon) with R.

Language: R - Size: 1.42 MB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 4

Samya-S/TextUtils

TextUtils, a React.js application, is a text-based utility that can be used to manipulate your text in the way you want.

Language: JavaScript - Size: 675 KB - Last synced at: 19 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

gjtorikian/what_you_say

Natural language detection library. Written in Rust, wrapped in Ruby.

Language: Ruby - Size: 156 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 16 - Forks: 0

apache/uima-uimacpp

C++ support for Apache UIMA

Language: C++ - Size: 2.21 MB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 16 - Forks: 18

DCS-training/IntroNetworkAnalysis

This is a repository for the Introduction to Network Analysis course provided by Brian Wong for the CDCS. Within the repository there are files with sample datasets and a guide to building datasets. It will be updated before each section. Go to the Readme file

Size: 1.23 MB - Last synced at: 27 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

zahramh99/Video-Chaptering-Youtube

Video chaptering is the process of dividing a video into distinct segments, each labelled with a specific title or chapter name, to enhance navigation and user experience.

Language: Python - Size: 5.86 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

samdouble/textlinter

A GitHub Action that performs simple checks on text

Language: TypeScript - Size: 23.1 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Tsunami014/Phoenix-Engine

This is our TAS assessment task from a while ago

Language: Python - Size: 54.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

rosette-api/csharp

Babel Street Analytics Client Library for C#

Language: C# - Size: 12.6 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 6 - Forks: 16

codewithdark-git/DarkGPT

DarkGPT Chat Explorer is an interactive web application that allows users to engage in conversations with various GPT (Generative Pre-trained Transformer) models in real-time. This repository contains the source code for the application.

Language: Python - Size: 227 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 40 - Forks: 12

lmullen/legal-modernism

Law and legal practice modernized in the nineteenth-century United States. We are studying and visualizing the history of the modernization of American law.

Language: R - Size: 43.2 MB - Last synced at: 13 days ago - Pushed at: 2 months ago - Stars: 5 - Forks: 0

evelyncy96/Movie-Recommendation-System

We create a movie recommendation system through demographic filtering, content-based filtering, collaborative filtering, and a hybrid engine.

Language: Python - Size: 104 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

DCS-training/Digital-Method-of-the-Month

In this repository you will find the documents we produced to support the discussion in our Digital Methods of the Month. These documents will help you orient yourself if you want to pick up the method in your research. Go to the readme file.

Size: 446 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 5 - Forks: 4

Mathux/TEMOS

Official PyTorch implementation of the paper "TEMOS: Generating diverse human motions from textual descriptions", ECCV 2022 (Oral)

Language: Python - Size: 3.92 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 396 - Forks: 25

MycroftAI/padatious

A neural network intent parser

Language: Python - Size: 97.7 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 162 - Forks: 39

struktapp/strukt-owl

AI & things

Language: PHP - Size: 68.4 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

notesjor/corpusexplorer2.0

Corpus linguistics has never been so easy...

Language: C# - Size: 32.5 MB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 23 - Forks: 3

nlpie/biomedicus

BioMedICUS: A biomedical and clinical NLP engine.

Language: Java - Size: 6.75 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 18 - Forks: 7

kreeedit/TRACE

TRACE - Text Reuse Analysis and Comparison Engine

Language: Python - Size: 1.39 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 6 - Forks: 0

get-woke/woke

Detect non-inclusive language in your source code.

Language: Go - Size: 22 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 484 - Forks: 61

SGSSSonline/text-analysis

Learning and teaching materials for text analysis course run under the Practical Computational Methods for Social Scientists training series.

Language: Jupyter Notebook - Size: 305 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 2

graphbrain/graphbrain

Language, Knowledge, Cognition

Language: Python - Size: 103 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 598 - Forks: 69

tax-8974/location-analyzer

The Location Data Analyzer is a Spring Boot application that offers insights on location data, such as counting locations by type, calculating average ratings, and identifying the most reviewed and incomplete entries. It features a simple frontend (HTML, CSS, JavaScript) and is deployed on Render.

Language: Java - Size: 0 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Sameer051022/NLP_Text_Analysis

A detailed notebook demonstrating advanced NLP techniques for text analysis, including data preprocessing, feature extraction, and model implementation using Python libraries such as NLTK and sklearn.

Language: Jupyter Notebook - Size: 26.4 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

jwalsh/syntax-tree-streamer 📦

Syntax tree representation and processing system for streaming S-expression-based data from LLMs

Language: Python - Size: 51.8 KB - Last synced at: about 6 hours ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

irfanalidv/Toxic_Comment_Classification_using_Keras

Identify and classify toxic online comments

Language: Jupyter Notebook - Size: 52.6 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

sfu-discourse-lab/GenderGapTracker

Scrape news articles and analyze them using NLP to quantify the gender gap in Canadian mainstream media

Language: Python - Size: 9.34 MB - Last synced at: about 3 hours ago - Pushed at: about 1 year ago - Stars: 42 - Forks: 11

mkearney/googleapis

R client for accessing Google Cloud Natural Language APIs

Language: R - Size: 767 KB - Last synced at: 3 months ago - Pushed at: almost 8 years ago - Stars: 9 - Forks: 2

fendouai/Awesome-Text-Classification

Awesome-Text-Classification: projects, papers, tutorials.

Size: 7.81 KB - Last synced at: 17 days ago - Pushed at: over 7 years ago - Stars: 171 - Forks: 32

jonclayden/ore

An R interface to the Onigmo regular expression library

Language: C - Size: 1.1 MB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 57 - Forks: 2

nevmenandr/UniversitatesPodcastData

An R package for downloading, extracting, and analyzing interview transcripts from the Universitates podcast series. It provides tools for data processing, searching, and visualization

Language: R - Size: 55.7 KB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

finesse123/AutoText-GPT

A next-word prediction project that uses Transformers and GPT-2 for text generation. GPTTokenizer preprocesses input, and the model is fine-tuned. Evaluation measures accuracy, perplexity, and fluency.

Language: Jupyter Notebook - Size: 96.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

IBM-Cloud/code-engine-text-analysis

Text Analysis with Code Engine, Cloud Object Storage and Natural Language Understanding

Language: JavaScript - Size: 776 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 7 - Forks: 10

supriya811106/WhatsApp-Chat-Analyzer-App

Analyze WhatsApp chats with Python, Streamlit, and data visualization. Explore messaging patterns, content trends, and emoji usage to uncover insights from your conversations.

Language: Python - Size: 2.44 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

Software-Research-Lab/dropsuit

DropSuit - NLP & data manipulation library for JS & Node.js. Offers diverse functions for text analysis, language understanding & more. Open-source under Apache License 2.0.

Language: JavaScript - Size: 41 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

wzhou7/WTT

The Word-Text-Topic (WTT) extraction approach, implemented in Python and R.

Language: Python - Size: 37.1 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ontoligent-design/polo2

A revised version of Polo

Language: Jupyter Notebook - Size: 442 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 2

dmytrovoytko/SublimeText-Translate

🌐 Translation plugin (multi-engine, fast, flexible) for SublimeText 3 & 4, works without API keys, works in China

Language: Python - Size: 82 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 20 - Forks: 2

shubhamgoyal575/Fashionkart-NLP-Analysis

This project leverages machine learning to predict whether a customer will recommend a product based on their review. It also applies topic modeling to uncover key themes in customer feedback. Using NLP techniques, the model processes text data, builds a classifier, and visualizes insights. Built with Python, Scikit-Learn, NLTK, and Gensim.

Language: Jupyter Notebook - Size: 3.21 MB - Last synced at: 3 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

fialovy/different_from_me_should

Finish this sentence, reddit...

Language: Python - Size: 191 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

arnozeng98/nlp-sentiment-toxicity

A multilingual text analysis system that performs sentiment analysis and toxicity detection with detoxified text generation.

Language: Python - Size: 55.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Matissoss/mcw 📦

mcw/tan - text analysis

Language: Rust - Size: 1.13 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

rolme/ytscript

A versatile Node.js toolkit for extracting, processing, and analyzing YouTube video content through transcripts. It combines powerful transcript extraction with multi-provider AI analysis (ChatGPT, Claude, and Google AI) to generate intelligent summaries and insights from video content.

Language: TypeScript - Size: 814 KB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

qtalr/Lessons

Interactive R lessons

Language: HTML - Size: 12.1 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

forTEXT/gitma

Python package to access and process CATMA projects via the CATMA GitLab backend

Language: Python - Size: 18.9 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 8 - Forks: 4

apache/uima-ruta

Apache UIMA Ruta

Language: Java - Size: 19.7 MB - Last synced at: 8 days ago - Pushed at: 4 months ago - Stars: 18 - Forks: 5

eellak/nlpbuddy

A text analysis application for performing common NLP tasks through a web dashboard interface and an API

Language: HTML - Size: 929 KB - Last synced at: 2 months ago - Pushed at: over 6 years ago - Stars: 125 - Forks: 28

bit2r/bitNLP

Tools that support "Natural Language Processing" for Korean text analytics.

Language: R - Size: 45.8 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 10 - Forks: 3

rosette-api/python

Babel Street Analytics Client Library for Python

Language: Python - Size: 1.63 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 38 - Forks: 37

VishwaGauravIn/string-tools-pro

🤏 Tiny & versatile 🔥 Node.js library for in-depth text analysis, manipulation and data extraction.

Language: JavaScript - Size: 27.3 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 7 - Forks: 0

Halvani/Constituent-Treelib

A lightweight Python library for constructing, processing, and visualizing constituent trees.

Language: Jupyter Notebook - Size: 2.67 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 67 - Forks: 12

WasamiKirua/animeclick-db

An animeclick dataset creation tool for RAG, text, and sentiment analysis

Language: Python - Size: 402 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

anna-zam/Mass_text_API_text_ru

Automated text checking via the text.ru API 📄🔍 A helper for copywriters and content creators. The script bulk-checks texts for uniqueness, keyword spam, and filler ("water") using the text.ru API. Results are saved to a convenient Excel file.

Language: Python - Size: 12.7 KB - Last synced at: 17 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

arturom/search-analysis

A graphical user interface for the Elasticsearch Analyze API

Language: JavaScript - Size: 4.67 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 6 - Forks: 0

RickyJ99/RA-project

Explore my role as an RA in analyzing White House speeches' economic impact. Discover code, resources, data management, NLP analysis, and AI modeling insights. Uncover language's power in shaping economic narratives.

Language: Jupyter Notebook - Size: 108 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

shib1111111/WebText-Analyzer

It is a text analysis tool that performs linguistic analysis on a collection of web pages. It includes sentiment analysis, readability metrics, and other derived variables.

Language: Python - Size: 592 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Related Keywords
text-analysis (1,355), python (314), nlp (313), machine-learning (200), text-mining (189), natural-language-processing (174), text-processing (155), sentiment-analysis (150), text-classification (140), r (102), data-science (76), topic-modeling (63), text (61), nltk (57), data-analysis (57), nlp-machine-learning (56), python3 (54), data-visualization (53), deep-learning (42), classification (40), ai (35), digital-humanities (30), visualization (29), text-summarization (29), tf-idf (28), java (28), javascript (28), clustering (26), wordcloud (26), text-analytics (25), api (25), react (24), artificial-intelligence (24), sentiment-classification (24), word2vec (24), spacy (24), web-scraping (23), flask (22), pandas (22), twitter (21), network-analysis (21), webscraping (20), jupyter-notebook (20), text-generation (20), logistic-regression (19), azure (18), tensorflow (18), data-mining (17), lda (17), nodejs (16), news (16), regex (16), summarization (15), html (15), language-detection (15), linguistics (15), analysis (14), tfidf (14), statistics (14), word-embeddings (14), nltk-python (14), open-source (14), social-media (14), matplotlib (14), data-visualisation (14), embeddings (14), named-entity-recognition (13), tokenization (13), twitter-api (13), dataset (13), rstats (13), automation (13), data-cleaning (12), lemmatization (12), scikit-learn (12), unsupervised-learning (12), golang (12), typescript (12), machine-learning-algorithms (12), bag-of-words (11), ml (11), analytics (11), gensim (11), social-network-analysis (11), cognitive-services (11), uima (11), natural-language-understanding (11), text-editor (10), exploratory-data-analysis (10), css (10), go (10), neural-network (10), semantic-analysis (10), pytorch (10), docker (10), information-retrieval (10), neural-networks (10), algorithms (10), openai (10), streamlit (10)