An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: trigrams

michurin/ngramindex

Golang ngram index implementation

Language: Go - Size: 13.7 KB - Last synced at: 10 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

StephanGeorg/trigram-similarity

Determining the similarity of alphanumeric text based on trigram matching.

Language: JavaScript - Size: 231 KB - Last synced at: 9 days ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 3

Casecommons/pg_search

pg_search builds ActiveRecord named scopes that take advantage of PostgreSQL’s full text search

Language: Ruby - Size: 1.19 MB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 1,452 - Forks: 372

KhaledAshrafH/Auto-Filling-Text

This project is an auto-filling text program implemented in Python using N-gram models. The program suggests the next word based on the input given by the user. It utilizes N-gram models, specifically Trigrams and Bigrams, to generate predictions.

Language: Python - Size: 27.1 MB - Last synced at: 19 days ago - Pushed at: 3 months ago - Stars: 16 - Forks: 4

cyb3rk0tik/pyfranc

Text language detection basic on trigrams.

Language: Python - Size: 382 KB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 13 - Forks: 3

pngo1997/N-gram-Language-Models

Builds N-gram language modes and applies text generation.

Language: Jupyter Notebook - Size: 4.73 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

mochi-co/ngrams

A Go n-gram indexer for natural language processing with modular tokenizers and data stores

Language: Go - Size: 396 KB - Last synced at: 3 months ago - Pushed at: almost 6 years ago - Stars: 7 - Forks: 0

remram44/ngram-search

Ngram-based indexing of strings into a binary file

Language: Rust - Size: 88.9 KB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

sumn2u/string-comparisons

A collection of string comparisons algorithms

Language: JavaScript - Size: 700 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 14 - Forks: 5

mideind/Icegrams

A fast, compact trigram library for Icelandic

Language: Python - Size: 207 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 11 - Forks: 3

pouriaSameti/NLP

The projects for the NLP course at the University of Isfahan.

Language: Jupyter Notebook - Size: 1.5 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

f1monkey/ngrams

Ngram generation library

Language: Go - Size: 9.77 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ocrim1996/BigramTrigramGenerator Fork of n3d1117/BigramTrigramGenerator

Parallel Computing - Measure the speedup gained when parallelizing n-grams generation using Java Threads and C++ Threads

Language: TeX - Size: 26.9 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

daverlon/ngram-wordgen

Word generator using n-gram probabilities

Language: Jupyter Notebook - Size: 29.4 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

jorzel/postgres-full-text-search

Postgres full text search options (tsearch, trigram, ilike) examples

Language: Python - Size: 5.04 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 111 - Forks: 6

luizanisio/Doc2VecFacil

Classe responsável por simplificar o processo de criação de um modelo Doc2Vec (gensim) com facilitadores para geração de um vocab personalizado e com a geração de arquivos de curadoria. Dicas usando elasticsearch e singlestore.

Language: Python - Size: 31.9 MB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

Mirith/Trigram-tagger

Python, tags trigrams! K-fold validation doesn't work yet.

Language: Python - Size: 1.43 MB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

akshataupadhye/News-articles-clustering-A-comparative-approach

A project featuring the use of various NLP techniques and ML algorithms like the topic modelling and paragraph embeddings, for document clustering. 📰📚

Language: Jupyter Notebook - Size: 186 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

mikethesilb/TrigramsE-M

A map-reduce implementation in Apache Hadoop (AWS EMR) for calculating the probabilities of trigrams in the Hebrew language. This project utilizes the deleted estimation two-way cross validation method to calculate trigram probabilities. The Google Hebrew Trigram database serves as this project's corpus.

Language: Java - Size: 46.9 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

CelineBoudier/jabberwocky

Written Language Detection and Gibberish Synthesis for NAO.

Language: Python - Size: 12.8 MB - Last synced at: 5 months ago - Pushed at: almost 8 years ago - Stars: 7 - Forks: 4

askeladdk/langdet

Language detection for Go.

Language: Go - Size: 197 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

gromag/Data-Science-Specialisation-Predict-Next-Word

Predicting next word with Natural Language Processing. Being able to predict what word comes next in a sentence is crucial when writing on portable devices that don't have a full size keyboard. However the same techniques used in texting application can be applied to a variety of other applications, for example: genomics by segmenting DNA, sequences speech recognition, automatic language translation or even as one student in the course suggested music sequence prediction.

Language: HTML - Size: 853 KB - Last synced at: 7 months ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 0

Garrett-Webb/trumptweets

analyze trump's nonsense, feed in a topic, and generate a new tweet based on a custom corpus.

Language: Python - Size: 6.67 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 6 - Forks: 1

mehdiye5/WordSegmentation

The task for this project is to segment a sequence of English characters into the most likely word sequence.

Language: Python - Size: 2.09 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

philbooth/surch 📦

MOVED TO GITLAB

Language: JavaScript - Size: 29.3 KB - Last synced at: about 1 year ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 0

susantabiswas/Word-Prediction-Ngram

Next Word Prediction using n-gram Probabilistic Model with various Smoothing Techniques

Language: Jupyter Notebook - Size: 49.8 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 30 - Forks: 12

Eatmeta/TextAnalysis

The Practice "Sentence Generator"

Language: C# - Size: 169 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Universemul/postgresql-search-engine

This application demonstrates how to use PostgreSQL as a full-text search and spatial engine

Language: Python - Size: 1.33 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

EUGINELETHAL/Docker-Nginx-Heroku

This is a simple search app built with Django, PostgreSQL, Docker, Gunicorn.Demo

Language: Python - Size: 58.6 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

urschrei/hexagrams

Procedural generation of I Ching hexagrams and trigrams using PIL and NumPy

Language: Python - Size: 50.8 KB - Last synced at: 2 months ago - Pushed at: almost 5 years ago - Stars: 10 - Forks: 3

msiebuhr/charcod.es

Small webpage for finding the odd unicode char code.

Language: JavaScript - Size: 3.53 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 18 - Forks: 3

himanish532/POS-Tagger-Model

This project implements HMM model on trigrams resulting a better accuracy in POS tagging

Language: Python - Size: 3.75 MB - Last synced at: 10 months ago - Pushed at: over 8 years ago - Stars: 7 - Forks: 1

sabareeswarans11/Russia_Ukraine_War_Twitter_Analysis

Semi-Structured Data Processing with NoSQL Database Server MongoDB Collecting Social Media Data from Twitter Real-time Data Stream and Storing and Retrieving to Process from a Semi-Structured Database Server MongoDB

Language: Python - Size: 5.01 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

samet-ozkan/German-English-Detector

This code detects whether the text input is in German or English.

Language: C - Size: 5.86 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

BartekCK/graphql-cricket-league

Apollo application with public playground

Language: TypeScript - Size: 470 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

arvindpj007/Enigma-M4-Cryptanalysis

Cryptanalysis of Enigma M4 in Go Lang with pure statistical means utilizing Index of Coincidence and Trigram analysis.

Language: Go - Size: 1.52 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 0

Adrianogba/bigrama-trigrama-python

Este é um programa de inteligência artificial simples para prever a próxima palavra baseada em uma string informado usando bigramas e trigramas baseados em um arquivo .txt. Existem dois códigos, um usando console e outro usando o tkinter.

Language: Python - Size: 123 KB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 0

AslanDevbrat/Computational-Linguistic

Assigmnents of CL

Language: Jupyter Notebook - Size: 25.9 MB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

luizmellodev/Ngrams

This project has the objective of creating ngrams based on text inputs, generating text output with the ngrams and your frequency.

Language: Python - Size: 36.1 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Dechrissen/LIN538-Final

Final project for LIN 538: Statistics for Linguists at Stony Brook University (Fall 2020)

Language: Jupyter Notebook - Size: 208 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

iglee/HMMs-and-PCFG

POS tagging by using ngram based hidden markov models.

Language: Python - Size: 35.2 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 1

ZNClub-PA-ML-AI/NLP-techniques

Testing & learning different nlp and lex techniques

Language: Jupyter Notebook - Size: 2.37 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

Adrianogba/bigram-trigram-python

This is an simple artificial intelligence program to predict the next word based on a informed string using bigrams and trigrams based on a .txt file. There are two codes, one using console and the other using tkinter.

Language: Python - Size: 1.66 MB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

albertusk95/nips-challenge-plagiarism-detection-vsm

Global NIPS Paper Implementation Challenge - Plagiarism Detection on Electronic Text Based Assignments Using Vector Space Model (iciafs14)

Language: Python - Size: 396 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 1

kaushikhande/Ensemble_sentiment

Language: Python - Size: 1.48 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

AlessandroBono/ProbabilisticPoSTagger

A Java implementation of different probabilistic part-of-speech tagging techniques.

Language: Java - Size: 3.11 MB - Last synced at: 4 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

DigitalTools/nlp-training

Natural Language Processing - Training

Language: Python - Size: 2.48 MB - Last synced at: over 2 years ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0

bfontaine/trigrams

Trigrams-based random text generation

Language: Python - Size: 15.6 KB - Last synced at: 2 days ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0