GitHub topics: trigrams
michurin/ngramindex
Golang ngram index implementation
Language: Go - Size: 13.7 KB - Last synced at: 10 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

StephanGeorg/trigram-similarity
Determining the similarity of alphanumeric text based on trigram matching.
Language: JavaScript - Size: 231 KB - Last synced at: 9 days ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 3

Casecommons/pg_search
pg_search builds ActiveRecord named scopes that take advantage of PostgreSQL’s full text search
Language: Ruby - Size: 1.19 MB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 1,452 - Forks: 372

KhaledAshrafH/Auto-Filling-Text
This project is an auto-filling text program implemented in Python using N-gram models. The program suggests the next word based on the input given by the user. It utilizes N-gram models, specifically Trigrams and Bigrams, to generate predictions.
Language: Python - Size: 27.1 MB - Last synced at: 19 days ago - Pushed at: 3 months ago - Stars: 16 - Forks: 4

cyb3rk0tik/pyfranc
Text language detection basic on trigrams.
Language: Python - Size: 382 KB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 13 - Forks: 3

pngo1997/N-gram-Language-Models
Builds N-gram language modes and applies text generation.
Language: Jupyter Notebook - Size: 4.73 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

mochi-co/ngrams
A Go n-gram indexer for natural language processing with modular tokenizers and data stores
Language: Go - Size: 396 KB - Last synced at: 3 months ago - Pushed at: almost 6 years ago - Stars: 7 - Forks: 0

remram44/ngram-search
Ngram-based indexing of strings into a binary file
Language: Rust - Size: 88.9 KB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

sumn2u/string-comparisons
A collection of string comparisons algorithms
Language: JavaScript - Size: 700 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 14 - Forks: 5

mideind/Icegrams
A fast, compact trigram library for Icelandic
Language: Python - Size: 207 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 11 - Forks: 3

pouriaSameti/NLP
The projects for the NLP course at the University of Isfahan.
Language: Jupyter Notebook - Size: 1.5 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

f1monkey/ngrams
Ngram generation library
Language: Go - Size: 9.77 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ocrim1996/BigramTrigramGenerator Fork of n3d1117/BigramTrigramGenerator
Parallel Computing - Measure the speedup gained when parallelizing n-grams generation using Java Threads and C++ Threads
Language: TeX - Size: 26.9 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

daverlon/ngram-wordgen
Word generator using n-gram probabilities
Language: Jupyter Notebook - Size: 29.4 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

jorzel/postgres-full-text-search
Postgres full text search options (tsearch, trigram, ilike) examples
Language: Python - Size: 5.04 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 111 - Forks: 6

luizanisio/Doc2VecFacil
Classe responsável por simplificar o processo de criação de um modelo Doc2Vec (gensim) com facilitadores para geração de um vocab personalizado e com a geração de arquivos de curadoria. Dicas usando elasticsearch e singlestore.
Language: Python - Size: 31.9 MB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

Mirith/Trigram-tagger
Python, tags trigrams! K-fold validation doesn't work yet.
Language: Python - Size: 1.43 MB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

akshataupadhye/News-articles-clustering-A-comparative-approach
A project featuring the use of various NLP techniques and ML algorithms like the topic modelling and paragraph embeddings, for document clustering. 📰📚
Language: Jupyter Notebook - Size: 186 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

mikethesilb/TrigramsE-M
A map-reduce implementation in Apache Hadoop (AWS EMR) for calculating the probabilities of trigrams in the Hebrew language. This project utilizes the deleted estimation two-way cross validation method to calculate trigram probabilities. The Google Hebrew Trigram database serves as this project's corpus.
Language: Java - Size: 46.9 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

CelineBoudier/jabberwocky
Written Language Detection and Gibberish Synthesis for NAO.
Language: Python - Size: 12.8 MB - Last synced at: 5 months ago - Pushed at: almost 8 years ago - Stars: 7 - Forks: 4

askeladdk/langdet
Language detection for Go.
Language: Go - Size: 197 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

gromag/Data-Science-Specialisation-Predict-Next-Word
Predicting next word with Natural Language Processing. Being able to predict what word comes next in a sentence is crucial when writing on portable devices that don't have a full size keyboard. However the same techniques used in texting application can be applied to a variety of other applications, for example: genomics by segmenting DNA, sequences speech recognition, automatic language translation or even as one student in the course suggested music sequence prediction.
Language: HTML - Size: 853 KB - Last synced at: 7 months ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 0

Garrett-Webb/trumptweets
analyze trump's nonsense, feed in a topic, and generate a new tweet based on a custom corpus.
Language: Python - Size: 6.67 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 6 - Forks: 1

mehdiye5/WordSegmentation
The task for this project is to segment a sequence of English characters into the most likely word sequence.
Language: Python - Size: 2.09 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

philbooth/surch 📦
MOVED TO GITLAB
Language: JavaScript - Size: 29.3 KB - Last synced at: about 1 year ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 0

susantabiswas/Word-Prediction-Ngram
Next Word Prediction using n-gram Probabilistic Model with various Smoothing Techniques
Language: Jupyter Notebook - Size: 49.8 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 30 - Forks: 12

Eatmeta/TextAnalysis
The Practice "Sentence Generator"
Language: C# - Size: 169 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Universemul/postgresql-search-engine
This application demonstrates how to use PostgreSQL as a full-text search and spatial engine
Language: Python - Size: 1.33 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

EUGINELETHAL/Docker-Nginx-Heroku
This is a simple search app built with Django, PostgreSQL, Docker, Gunicorn.Demo
Language: Python - Size: 58.6 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

urschrei/hexagrams
Procedural generation of I Ching hexagrams and trigrams using PIL and NumPy
Language: Python - Size: 50.8 KB - Last synced at: 2 months ago - Pushed at: almost 5 years ago - Stars: 10 - Forks: 3

msiebuhr/charcod.es
Small webpage for finding the odd unicode char code.
Language: JavaScript - Size: 3.53 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 18 - Forks: 3

himanish532/POS-Tagger-Model
This project implements HMM model on trigrams resulting a better accuracy in POS tagging
Language: Python - Size: 3.75 MB - Last synced at: 10 months ago - Pushed at: over 8 years ago - Stars: 7 - Forks: 1

sabareeswarans11/Russia_Ukraine_War_Twitter_Analysis
Semi-Structured Data Processing with NoSQL Database Server MongoDB Collecting Social Media Data from Twitter Real-time Data Stream and Storing and Retrieving to Process from a Semi-Structured Database Server MongoDB
Language: Python - Size: 5.01 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

samet-ozkan/German-English-Detector
This code detects whether the text input is in German or English.
Language: C - Size: 5.86 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

BartekCK/graphql-cricket-league
Apollo application with public playground
Language: TypeScript - Size: 470 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

arvindpj007/Enigma-M4-Cryptanalysis
Cryptanalysis of Enigma M4 in Go Lang with pure statistical means utilizing Index of Coincidence and Trigram analysis.
Language: Go - Size: 1.52 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 0

Adrianogba/bigrama-trigrama-python
Este é um programa de inteligência artificial simples para prever a próxima palavra baseada em uma string informado usando bigramas e trigramas baseados em um arquivo .txt. Existem dois códigos, um usando console e outro usando o tkinter.
Language: Python - Size: 123 KB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 0

AslanDevbrat/Computational-Linguistic
Assigmnents of CL
Language: Jupyter Notebook - Size: 25.9 MB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

luizmellodev/Ngrams
This project has the objective of creating ngrams based on text inputs, generating text output with the ngrams and your frequency.
Language: Python - Size: 36.1 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Dechrissen/LIN538-Final
Final project for LIN 538: Statistics for Linguists at Stony Brook University (Fall 2020)
Language: Jupyter Notebook - Size: 208 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

iglee/HMMs-and-PCFG
POS tagging by using ngram based hidden markov models.
Language: Python - Size: 35.2 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 1

ZNClub-PA-ML-AI/NLP-techniques
Testing & learning different nlp and lex techniques
Language: Jupyter Notebook - Size: 2.37 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

Adrianogba/bigram-trigram-python
This is an simple artificial intelligence program to predict the next word based on a informed string using bigrams and trigrams based on a .txt file. There are two codes, one using console and the other using tkinter.
Language: Python - Size: 1.66 MB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

albertusk95/nips-challenge-plagiarism-detection-vsm
Global NIPS Paper Implementation Challenge - Plagiarism Detection on Electronic Text Based Assignments Using Vector Space Model (iciafs14)
Language: Python - Size: 396 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 1

kaushikhande/Ensemble_sentiment
Language: Python - Size: 1.48 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

AlessandroBono/ProbabilisticPoSTagger
A Java implementation of different probabilistic part-of-speech tagging techniques.
Language: Java - Size: 3.11 MB - Last synced at: 4 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

DigitalTools/nlp-training
Natural Language Processing - Training
Language: Python - Size: 2.48 MB - Last synced at: over 2 years ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0

bfontaine/trigrams
Trigrams-based random text generation
Language: Python - Size: 15.6 KB - Last synced at: 2 days ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0
