GitHub topics: bigrams
skku-vault/skku-sp
23-2 시스템프로그램 (prof. 엄영익)
Language: C - Size: 2.8 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

KhaledAshrafH/Auto-Filling-Text
This project is an auto-filling text program implemented in Python using N-gram models. The program suggests the next word based on the input given by the user. It utilizes N-gram models, specifically Trigrams and Bigrams, to generate predictions.
Language: Python - Size: 27.1 MB - Last synced at: 14 days ago - Pushed at: about 1 month ago - Stars: 16 - Forks: 4

pngo1997/N-gram-Language-Models
Builds N-gram language modes and applies text generation.
Language: Jupyter Notebook - Size: 4.73 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

ReshiAdavan/Replica
data generation tool that generates new data based on data provided as input
Language: Jupyter Notebook - Size: 8.4 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mochi-co/ngrams
A Go n-gram indexer for natural language processing with modular tokenizers and data stores
Language: Go - Size: 396 KB - Last synced at: 21 days ago - Pushed at: over 5 years ago - Stars: 7 - Forks: 0

dohliam/hawaiian-corpus
Data from a corpus of written Hawaiian
Size: 22.2 MB - Last synced at: about 1 month ago - Pushed at: almost 9 years ago - Stars: 14 - Forks: 0

MishraSubash/SentimentAnalysisWith-NLP
Language: Jupyter Notebook - Size: 543 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

gaaniruddha/FIT5196-A1
This repository contains assignments #1 that was completed as a part of "FIT5196 Data Wrangling", taught at Monash Uni in S2 2020.
Language: Jupyter Notebook - Size: 17.3 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

bobbingwide/bigram
Simply because ... one word won't do
Language: PHP - Size: 2.4 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

PhonoGrams/soft_bigram
Soft Bigram distance in Go
Language: Go - Size: 230 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

pouriaSameti/NLP
The projects for the NLP course at the University of Isfahan.
Language: Jupyter Notebook - Size: 1.5 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

shj1081/skku_SP_pa3
23-2 / 시스템 프로그램 / 엄영익 prof.
Language: C - Size: 4.6 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

racheliee/skku_SP
Bigram analyzer & optimization
Language: C - Size: 1.99 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

natnaelhhaile/Sentiment-Analysis
Language: Jupyter Notebook - Size: 5.75 MB - Last synced at: 13 days ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

ocrim1996/TextFrequencyAnalysis
A Python algorithm for calculating the frequency of letters in a text and other things.
Language: Python - Size: 24.4 KB - Last synced at: 11 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

ocrim1996/BigramTrigramGenerator Fork of n3d1117/BigramTrigramGenerator
Parallel Computing - Measure the speedup gained when parallelizing n-grams generation using Java Threads and C++ Threads
Language: TeX - Size: 26.9 MB - Last synced at: 11 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

daverlon/ngram-wordgen
Word generator using n-gram probabilities
Language: Jupyter Notebook - Size: 29.4 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

langdonholmes/collocation-extractor
Visualizing dependency bigram filtering.
Language: Python - Size: 71.3 KB - Last synced at: 12 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

megantriplett/InformationRetrievalFinal-UEA
Final project for Information Retrieval Class at University of East Anglia
Language: Jupyter Notebook - Size: 18.6 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

iAmKankan/Natural-Language-Processing-NLP-Tutorial
NLP tutorials and guidelines to learn efficiently
Size: 123 KB - Last synced at: 26 days ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 1

VaasuDevanS/Natural-Language-Processing-Assignments
UNB Fall-2018 NLP Assignments 💬
Language: Python - Size: 23.5 MB - Last synced at: 24 days ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 1

sashakenjeeva/spell-corrector
A context-sensitive, one-edit distance spelling corrector
Language: Jupyter Notebook - Size: 3.91 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

Psmths/bigram-file-analysis
Proof of concept that leverages machine learning to classify files based on their bigram frequency distributions.
Language: Jupyter Notebook - Size: 1.03 MB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

jianleisun/NLP-project
Sentimental Analysis for Amazon Book Reviews
Language: Jupyter Notebook - Size: 1.4 MB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 1 - Forks: 0

roverbird/corpus_utils
Semantic word relations analysis and visualization for corpus linguistics and NLP
Language: R - Size: 28.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

motiurinfo/sentiment_classification
Performance evaluation of sentiment classification on movie reviews
Language: Python - Size: 20.9 MB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 0

iskorini/CPP-Bigrams
Final project for course of Parallel Computing @ UNIFI
Language: C++ - Size: 10.8 MB - Last synced at: over 1 year ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0

LinggarM/natural-language-processing
Collection of codes of Natural Language Processing college course
Language: Jupyter Notebook - Size: 34.5 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

stefanrer/CountBigramFreqInConlluCorpus
Count Bigram frequency in a conllu format corpus
Language: Python - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Lynn425/Text-Analysis-of-Simpson-Transcript
Visualize the results from Word Frequency Analysis, Sentiment Analysis, Bigram Analysis and Word Trend Analysis performed on The Simpsons1 Transcripts.
Language: R - Size: 0 Bytes - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

emanuelemorales/TextMining
Text mining techniques applied on Facebook comments and SMS spam detection.
Language: Jupyter Notebook - Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

luizanisio/Doc2VecFacil
Classe responsável por simplificar o processo de criação de um modelo Doc2Vec (gensim) com facilitadores para geração de um vocab personalizado e com a geração de arquivos de curadoria. Dicas usando elasticsearch e singlestore.
Language: Python - Size: 31.9 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 1

akshataupadhye/News-articles-clustering-A-comparative-approach
A project featuring the use of various NLP techniques and ML algorithms like the topic modelling and paragraph embeddings, for document clustering. 📰📚
Language: Jupyter Notebook - Size: 186 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

DorinK/Deep-Learning-Gradient-based-Learning
First assignment in ׳Deep Learning for Texts and Sequences' course (using NumPy only) by Prof. Yoav Goldberg at Bar-Ilan University
Language: Python - Size: 278 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 1

czig/nh19_sentiment
Sentiment Analysis for Facebook pages
Language: Python - Size: 182 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

taniyariar/Natural-Language-Processing
Natural-Language-Processing
Language: Python - Size: 213 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

gromag/Data-Science-Specialisation-Predict-Next-Word
Predicting next word with Natural Language Processing. Being able to predict what word comes next in a sentence is crucial when writing on portable devices that don't have a full size keyboard. However the same techniques used in texting application can be applied to a variety of other applications, for example: genomics by segmenting DNA, sequences speech recognition, automatic language translation or even as one student in the course suggested music sequence prediction.
Language: HTML - Size: 853 KB - Last synced at: 5 months ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 0

zoobereq/Richness-of-the-Stimulus
A replication of an experiment by Reali and Christiansen (2005) disputing the basic assumptions of Chomsky's Poverty of Stimulus theory.
Language: Python - Size: 463 KB - Last synced at: 20 days ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

giocoal/word-embedding-italian-literature
Using distibuctional semantics (word2vec family algorithms and the CADE framework) to learn word embeddings from the Italian literary corpuses we generated.
Language: Python - Size: 21.4 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 2

sohailahmedkhan/Sentence-Completion-using-Hidden-Markov-Models
The goal of this script is to implement three langauge models to perform sentence completion, i.e. given a sentence with a missing word to choose the correct one from a list of candidate words. The way to use a language model for this problem is to consider a possible candidate word for the sentence at a time and then ask the language model which version of the sentence is the most probable one.
Language: Python - Size: 5.86 KB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 6 - Forks: 0

starlordvk/Typing-Assistant
Typing Assistant provides the ability to autocomplete words and suggests predictions for the next word. This makes typing faster, more intelligent and reduces effort.
Language: CSS - Size: 1.14 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 44 - Forks: 13

mehdiye5/WordSegmentation
The task for this project is to segment a sequence of English characters into the most likely word sequence.
Language: Python - Size: 2.09 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

senya-ashukha/bigram-anchor-words 📦
An Implementation of Bigram Anchor Words algorithm
Language: Python - Size: 189 MB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 8 - Forks: 5

susantabiswas/Word-Prediction-Ngram
Next Word Prediction using n-gram Probabilistic Model with various Smoothing Techniques
Language: Jupyter Notebook - Size: 49.8 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 30 - Forks: 12

pbgnz/automatic-language-identification
a probabilistic language identification system that identifies the language of a sentence
Language: Python - Size: 8.62 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

DigitalTools/nltk-book
Jupyter Notebook for Natural Language Processing learning
Language: Jupyter Notebook - Size: 929 KB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 10 - Forks: 4

Eatmeta/TextAnalysis
The Practice "Sentence Generator"
Language: C# - Size: 169 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

octokami/news_stock_market
Predict stock price movements based on news articles. We used the BoW approach and sentiment analysis of titles of news articles.
Language: Jupyter Notebook - Size: 36.7 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

mhasegawa7045/Film_Movie_Text_Mining_Sentimental_Analysis_Machine_Learning
[Tokenization, Topic Modeling, Sentiment Analysis, Network of Bigrams] The purpose of this project is to see if text mining techniques can ease better analysis for categorizing movies with just the Descriptions while ignoring the Genre from the dataset, IMDB_movies.csv, which is stored under the data frame variable, movies_desc. Tokenization (TF-DF) was used to increase efficiency to analyze term frequencies in movie Descriptions so that the conceptual theme of a movie franchise would be determined even if a person has never watched any of the films. Create mixtures of terms that are correlated to every topic and the mixture of topics that distinguishes each document through Topic Modeling in the dataset, IMDB_movies.csv. Sentimental Analysis focused on Movies with Sentimental Clusters that were using bing and NRC lexicons to see how Sentiment affects Rating and Revenue. The network of bigrams for the Movies dataset help summarize how frequented Movie Description word-terms create term relationships and how they connect to other movies.
Language: HTML - Size: 7.4 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

sandeepsukumaran/BigramsAndTBL
Computation of various bigrams models, Naive Bayesian Part of Speech tagging, and Transformation Based Learner
Language: Python - Size: 7.81 KB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

burhanharoon/N-Gram-Language-Model
It's a python based n-gram langauage model which calculates bigrams, probability and smooth probability (laplace) of a sentence using bi-gram and perplexity of the model.
Language: Python - Size: 793 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 2

bobbingwide/sb
SB: Second Byte - Seriously Bonkers' experimental Full Site Editing theme
Language: HTML - Size: 854 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

sachin-bisht/YouTube-Sentiment-Analysis
(UNMAINTAINED)Fetch comments from the given video and determine sentiment towards the video is positive or negative
Language: Python - Size: 488 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 2

sabareeswarans11/Russia_Ukraine_War_Twitter_Analysis
Semi-Structured Data Processing with NoSQL Database Server MongoDB Collecting Social Media Data from Twitter Real-time Data Stream and Storing and Retrieving to Process from a Semi-Structured Database Server MongoDB
Language: Python - Size: 5.01 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

samet-ozkan/German-English-Detector
This code detects whether the text input is in German or English.
Language: C - Size: 5.86 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Adrianogba/bigrama-trigrama-python
Este é um programa de inteligência artificial simples para prever a próxima palavra baseada em uma string informado usando bigramas e trigramas baseados em um arquivo .txt. Existem dois códigos, um usando console e outro usando o tkinter.
Language: Python - Size: 123 KB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 0

AslanDevbrat/Computational-Linguistic
Assigmnents of CL
Language: Jupyter Notebook - Size: 25.9 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

kamilkubik89/the_bigram_parsing_problem
Application in python that can take sentence from extrenal text file and make uotput of bigrams in the text with counting numbers of bigrams.
Language: Jupyter Notebook - Size: 5.86 KB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

rohitthapliyal2000/Sentiment-Analysis-NLTK
Opinion mining for provided data from various NLTK corpus to test/enhance the accuracy of the NaiveBayesClassifier model.
Language: Python - Size: 18.6 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 14 - Forks: 3

danielroncel/twitter-analysis-per-cities
Tweet sentiment analysis per user location in Spain, showing average results per city in a map. Also find the most common words and bigrams per location in those tweets during the last minutes.
Language: Jupyter Notebook - Size: 500 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Premchand95/Sentiment-Analysis-of-Reviews-using-Machine-Learning-algorithms-on-Textual-data
CSCI 59000 BIG DATA ANALYTICS PROJECT
Language: Python - Size: 87.2 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 1

norgrenm/Bigram_Book-Build
Bigram build using no NLP Packages
Language: Jupyter Notebook - Size: 45 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

luizmellodev/Ngrams
This project has the objective of creating ngrams based on text inputs, generating text output with the ngrams and your frequency.
Language: Python - Size: 36.1 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

yvetteyyuan/plant_based_segmentation
Analyze how people perceive plant-based diets online and generate marketing insights on the plant-based products.
Language: R - Size: 2.89 MB - Last synced at: 4 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

faisalsyfl/IndoLangModel
Indonesian Language Model (Ngrams & Shannon Visualization) in CodeIgniter
Language: PHP - Size: 1.82 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 3

zq99/bigram_frequency_analysis
Extracts all the bigrams from a list of 9,000 of the most common words in English. Counts the frequency of each bigram at each position in a word.
Language: Python - Size: 23.3 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

maitree7/Tales_from_the_Cryptos_NLP
Sentiment Analysis on Cryptocurrency (BTC vs. ETH)
Language: Jupyter Notebook - Size: 3.34 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 1

tot98git/nlp-stuff
Some nlp stuff.
Language: Python - Size: 3.23 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

iglee/HMMs-and-PCFG
POS tagging by using ngram based hidden markov models.
Language: Python - Size: 35.2 KB - Last synced at: 2 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 1

zmuhls/LING78100-MP3
Machine programming assignment on bigrams and functions
Language: Python - Size: 35.2 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

garrrikkotua/text_generator
Simple text generator with bigrams
Language: Python - Size: 14.6 KB - Last synced at: about 1 month ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

ZNClub-PA-ML-AI/NLP-techniques
Testing & learning different nlp and lex techniques
Language: Jupyter Notebook - Size: 2.37 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

ilhamksyuriadi/Language-Modeling-Bigram
An example of n-gram with n = 2 (bigram) for language modelling with indonesian language
Language: Python - Size: 117 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

brentnd/FileAnalysis
Using MATLAB to analyze and visualize properties of binary files.
Language: Matlab - Size: 2.47 MB - Last synced at: 2 months ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0

fikrirazor/bigramindo
bigram using python language, menggunakan kalimat berbahasa indonesia
Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

sachin-bisht/Sentiment-Analysis-NLTK
Sentiment Analysis / Opinion Mining for provided data in NLTK corpus using NaiveBayesClassifier Algorithm
Language: Python - Size: 4.88 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 3

bobbingwide/genesis-SB
Genesis-SB - Specially Built for seriously bonkers.com / bigram.co.uk
Language: CSS - Size: 365 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

farhandzakyarvianto/BigramLanguageModeling
Bigram - Permodelan bahasa menggunakan Python
Language: Python - Size: 30.9 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 2

Adrianogba/bigram-trigram-python
This is an simple artificial intelligence program to predict the next word based on a informed string using bigrams and trigrams based on a .txt file. There are two codes, one using console and the other using tkinter.
Language: Python - Size: 1.66 MB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

vgratian/phon_bigrams
[draft] phonological unigrams and bigrams
Language: Python - Size: 2.93 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

kaushikhande/Ensemble_sentiment
Language: Python - Size: 1.48 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

AlessandroBono/ProbabilisticPoSTagger
A Java implementation of different probabilistic part-of-speech tagging techniques.
Language: Java - Size: 3.11 MB - Last synced at: 2 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

ChirravuriChaitanya/Language-Modelling
Language Modelling using N-Gram Technique for text sentences
Language: Python - Size: 1000 Bytes - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

cache117/cs453-bigrams-text-analysis
Text analysis to determine rank-frequency curves for words and bigrams, and vocabulary growth curves.
Language: Java - Size: 1.76 MB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

DigitalTools/nlp-training
Natural Language Processing - Training
Language: Python - Size: 2.48 MB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0

mameli/Bigrams
Bigrams using C++
Language: C++ - Size: 7.59 MB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0
