Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: ngram-analysis

euskadi31/go-ngram

An n-gram is a contiguous sequence of n items from a given sequence of text or speech.

Language: Go - Size: 48.8 KB - Last synced: 21 days ago - Pushed: 21 days ago - Stars: 2 - Forks: 0
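
The n-gram definition in the entry above can be illustrated with a minimal sketch. It is written in Python for brevity and is not the API of euskadi31/go-ngram or of any other repository listed here; the function name `ngrams` and the sample sentence are assumptions chosen for illustration.

```python
from collections import Counter


def ngrams(items, n):
    """Return every contiguous window of n items as a tuple (hypothetical helper)."""
    if n < 1:
        raise ValueError("n must be at least 1")
    # A sequence of length L has L - n + 1 windows; none if it is shorter than n.
    return [tuple(items[i:i + n]) for i in range(len(items) - n + 1)]


# Illustrative input: word-level tokens from a short sentence.
tokens = "to be or not to be".split()
bigrams = ngrams(tokens, 2)

# Counting the windows gives an n-gram frequency table.
counts = Counter(bigrams)
```

Here `counts[("to", "be")]` is 2, because that bigram occurs at both the start and the end of the sentence. The same function works on characters if a string is passed instead of a token list.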

behitek/word-counter

Dynamic n-gram counter on large text corpus (including next and previous)

Language: Java - Size: 15 MB - Last synced: 2 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

wmentor/qgram

N-gram Go library

Language: Go - Size: 12.7 KB - Last synced: 17 days ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0

T22sri/Personality_Recognition_NLP

Personality recognition from text using NLP techniques

Language: Jupyter Notebook - Size: 6.14 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

michbur/biogram

N-Gram Analysis of Biological Sequences

Language: R - Size: 4.52 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 9 - Forks: 0

jamielaird/ngram-counter

A workflow using Alteryx, Python and Tableau to extract and analyse n-grams from a large set of raw email text.

Language: Python - Size: 53.2 MB - Last synced: 6 months ago - Pushed: almost 7 years ago - Stars: 2 - Forks: 2

nickduran/align-linguistic-alignment

Python library for extracting quantitative, reproducible metrics of multi-level alignment between two speakers in naturalistic language corpora.

Language: Python - Size: 54.8 MB - Last synced: about 1 month ago - Pushed: 5 months ago - Stars: 38 - Forks: 11

jzonthemtn/ngramdb 📦

Distributed storage and querying of N-grams.

Language: Java - Size: 27.3 KB - Last synced: about 1 month ago - Pushed: about 5 years ago - Stars: 1 - Forks: 0

KhaledAshrafH/Auto-Filling-Text

This project is an auto-filling text program implemented in Python using N-gram models. The program suggests the next word based on the input given by the user. It utilizes N-gram models, specifically Trigrams and Bigrams, to generate predictions.

Language: Python - Size: 27.1 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 1 - Forks: 0
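
The next-word suggestion described in the entry above (trigrams and bigrams) could look roughly like the sketch below. This is not the code of KhaledAshrafH/Auto-Filling-Text; the helper names `build_model` and `suggest`, the toy corpus, and the choice to fall back from trigrams to bigrams are all assumptions made for illustration.

```python
from collections import Counter, defaultdict


def build_model(tokens, n):
    """Map each (n-1)-word context to a Counter of the words that follow it."""
    model = defaultdict(Counter)
    for i in range(len(tokens) - n + 1):
        context = tuple(tokens[i:i + n - 1])
        model[context][tokens[i + n - 1]] += 1
    return model


def suggest(words, trigrams, bigrams):
    """Suggest the next word: try the two-word context first, then the one-word context."""
    for model, size in ((trigrams, 2), (bigrams, 1)):
        context = tuple(words[-size:])
        # Skip a model when the input is too short or the context was never seen.
        if len(context) == size and context in model:
            return model[context].most_common(1)[0][0]
    return None


# Toy corpus (assumed); a real program would train on a much larger text.
corpus = "i like green tea and i like black coffee and i like green apples".split()
trigrams = build_model(corpus, 3)
bigrams = build_model(corpus, 2)
```

With this corpus, `suggest(["i", "like"], trigrams, bigrams)` returns `"green"`, since "green" follows "i like" twice and "black" only once. An unseen two-word context such as `["really", "like"]` falls back to the bigram model, and a word that never appears in the corpus yields `None`.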

dibyasonu/Malware-Analysis

Malware Family Classification.

Language: Assembly - Size: 3.83 MB - Last synced: 10 months ago - Pushed: over 4 years ago - Stars: 3 - Forks: 2

krmbzds/turkish-presidents-in-books

🇹🇷 Occurrences of Turkish presidents in books (1920-2008)

Language: HTML - Size: 5.86 KB - Last synced: 10 months ago - Pushed: almost 6 years ago - Stars: 1 - Forks: 0

Koziev/WordRepresentations

Comparison of several word representation methods for building language models

Language: Python - Size: 128 MB - Last synced: 10 months ago - Pushed: over 6 years ago - Stars: 6 - Forks: 4

Shounak007/N-Gram-Distribution-and-TD-IDF-Analysis

We will do a basic textual analysis to study the n-gram distribution of different languages, and examine a "mystery" text to determine what language it is in. We will then perform a TF-IDF analysis on that dataset.

Language: Python - Size: 0 Bytes - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

ngrams-dev/general

NGRAMS is a search engine for the Google Books Ngram Dataset. This repository contains documentation, discussions, announcements, and issues.

Size: 31.3 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 5 - Forks: 0

Babar-Bashir/YouTubeAdultFilter

Restrict your child from watching adult content on YouTube using Android Accessibility.

Language: Java - Size: 167 KB - Last synced: over 1 year ago - Pushed: over 6 years ago - Stars: 12 - Forks: 7

elaad24/search-engine

full stack project - search engine mini project - Deloitte home task

Language: TypeScript - Size: 44.1 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

ZirvedaAytimur/Natural-Language-Processing-NLP-

A collection of examples I prepared while learning natural language processing topics.

Language: Jupyter Notebook - Size: 42.7 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 6 - Forks: 2

AmbarZaidi/Name-based-Gender-Prediction

GuessMyGender - A Name based Gender Predictor for Indian Names

Size: 354 KB - Last synced: over 1 year ago - Pushed: about 6 years ago - Stars: 5 - Forks: 0

toolforgeio/ngram-gap-tool

Compares keyword frequency analyses between two bodies of text

Language: Java - Size: 48.8 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

data-integrations/ngram-analytics 📦

NGram Analytics Transform Plugin: Transforms input features into n-grams

Language: Java - Size: 50.8 KB - Last synced: about 2 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 2

toolforgeio/ngrams-tool

Performs an ngram frequency analysis on a text corpus stored in a spreadsheet

Language: Java - Size: 104 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

FilipHalon/text_historical_authenticity_evaluation

A study on the historical authenticity of a text. The historical authenticity is evaluated by comparing the frequencies of unigrams, bigrams and trigrams of a given text to the frequencies of the ngrams of texts written in the period of +/- 5 years from the claimed date of the release of the given text and to the frequency of the ngrams of recent texts. A tool to visualise the findings made with pandas and matplotlib-pyplot is included.

Language: Jupyter Notebook - Size: 1.01 MB - Last synced: over 1 year ago - Pushed: almost 4 years ago - Stars: 1 - Forks: 1

shivangraikar/NLP-Patient-summary

Natural language processing project to calculate patient readmission probability and summary of notes.

Language: Jupyter Notebook - Size: 184 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 4 - Forks: 1

pikulet/language-model

language ngram model, information retrieval assignment

Language: Python - Size: 52.7 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

linguistic-dev/n-gram-extractor

A PHP Library to extract n-grams from a text. Simple preprocessing tools (cleaning, tokenizing) included.

Language: PHP - Size: 28.3 KB - Last synced: 13 days ago - Pushed: over 6 years ago - Stars: 3 - Forks: 0

jonathanrjpereira/Ngram-Analytica

📈 Gathers & Plots the Google Ngram Graph for any Ngram in Python

Language: Python - Size: 438 KB - Last synced: 4 months ago - Pushed: over 5 years ago - Stars: 3 - Forks: 1

GuruMulay/big-data-class

Some of the projects from my Big Data class

Language: Java - Size: 5.55 MB - Last synced: over 1 year ago - Pushed: over 5 years ago - Stars: 1 - Forks: 0

maedi/NGrammer

Creates ngrams from wordlists.

Language: Ruby - Size: 4.83 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 3 - Forks: 0

Related Keywords
ngram-analysis (28), ngrams (10), ngram (8), nlp (7), natural-language-processing (5), ngram-language-model (4), python (3), ngram-model (3), language-model (2), nltk (2), tokenizer (2), text-analysis (2), toolforge (2), golang-library (2), golang (2), go (2), cask-marketplace (1), naive-bayes-classifier (1), heuristics (1), gender-prediction (1), cdap (1), turkish-nlp (1), cdap-plugin (1), data-science (1), spacy (1), part-of-speech-tagging (1), lesk-algorithm (1), lesk (1), lemmatization (1), latent-semantic-analysis (1), dogal-dil-isleme (1), react (1), javascript (1), elasticsearch (1), dotnet-core (1), csharp (1), youtube (1), porn-filter (1), parental-control (1), filter (1), android-accessibility (1), android (1), map-reduce (1), hadoop-mapreduce (1), gutenberg (1), big-data (1), webscraping (1), matplotlib (1), graph (1), google (1), beautifulsoup (1), tokenized-sentences (1), tokenize (1), tokenization (1), php7 (1), php-library (1), php (1), information-retrieval (1), semantic-analyzer (1), readmission-probability (1), postagging (1), patient-summary (1), logistic-regression (1), pandas (1), matplotlib-pyplot (1), historical-texts (1), frequency-analysis (1), data-visualisation (1), authenticity (1), mllib (1), ml (1), adult-keywords (1), n-gram (1), bigrams (1), bigram-model (1), auto-filling (1), auto-complete-text (1), auto-complete (1), bag-of-words (1), word2vec (1), notebooks (1), linguistic-analysis (1), linguistic-alignment (1), corpus-tools (1), conversation-analysis (1), tableau (1), alteryx (1), r (1), biological-sequences (1), train-test-using-sklearn (1), tfidf-vectorizer (1), parameter-tuning (1), lexical-semantics (1), classification-algorithims (1), quadrigram (1), ngram-extraction (1), go-library (1), word-counter (1), word-count (1), machine-learning (1)