An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: bigrams

skku-vault/skku-sp

23-2 시스템프로그램 (prof. 엄영익)

Language: C - Size: 2.8 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

KhaledAshrafH/Auto-Filling-Text

This project is an auto-filling text program implemented in Python using N-gram models. The program suggests the next word based on the input given by the user. It utilizes N-gram models, specifically Trigrams and Bigrams, to generate predictions.

Language: Python - Size: 27.1 MB - Last synced at: 14 days ago - Pushed at: about 1 month ago - Stars: 16 - Forks: 4

pngo1997/N-gram-Language-Models

Builds N-gram language modes and applies text generation.

Language: Jupyter Notebook - Size: 4.73 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

ReshiAdavan/Replica

data generation tool that generates new data based on data provided as input

Language: Jupyter Notebook - Size: 8.4 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mochi-co/ngrams

A Go n-gram indexer for natural language processing with modular tokenizers and data stores

Language: Go - Size: 396 KB - Last synced at: 21 days ago - Pushed at: over 5 years ago - Stars: 7 - Forks: 0

dohliam/hawaiian-corpus

Data from a corpus of written Hawaiian

Size: 22.2 MB - Last synced at: about 1 month ago - Pushed at: almost 9 years ago - Stars: 14 - Forks: 0

MishraSubash/SentimentAnalysisWith-NLP

Language: Jupyter Notebook - Size: 543 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

gaaniruddha/FIT5196-A1

This repository contains assignments #1 that was completed as a part of "FIT5196 Data Wrangling", taught at Monash Uni in S2 2020.

Language: Jupyter Notebook - Size: 17.3 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

bobbingwide/bigram

Simply because ... one word won't do

Language: PHP - Size: 2.4 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

PhonoGrams/soft_bigram

Soft Bigram distance in Go

Language: Go - Size: 230 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

pouriaSameti/NLP

The projects for the NLP course at the University of Isfahan.

Language: Jupyter Notebook - Size: 1.5 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

shj1081/skku_SP_pa3

23-2 / 시스템 프로그램 / 엄영익 prof.

Language: C - Size: 4.6 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

racheliee/skku_SP

Bigram analyzer & optimization

Language: C - Size: 1.99 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

natnaelhhaile/Sentiment-Analysis

Language: Jupyter Notebook - Size: 5.75 MB - Last synced at: 13 days ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

ocrim1996/TextFrequencyAnalysis

A Python algorithm for calculating the frequency of letters in a text and other things.

Language: Python - Size: 24.4 KB - Last synced at: 11 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

ocrim1996/BigramTrigramGenerator Fork of n3d1117/BigramTrigramGenerator

Parallel Computing - Measure the speedup gained when parallelizing n-grams generation using Java Threads and C++ Threads

Language: TeX - Size: 26.9 MB - Last synced at: 11 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

daverlon/ngram-wordgen

Word generator using n-gram probabilities

Language: Jupyter Notebook - Size: 29.4 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

langdonholmes/collocation-extractor

Visualizing dependency bigram filtering.

Language: Python - Size: 71.3 KB - Last synced at: 12 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

megantriplett/InformationRetrievalFinal-UEA

Final project for Information Retrieval Class at University of East Anglia

Language: Jupyter Notebook - Size: 18.6 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

iAmKankan/Natural-Language-Processing-NLP-Tutorial

NLP tutorials and guidelines to learn efficiently

Size: 123 KB - Last synced at: 26 days ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 1

VaasuDevanS/Natural-Language-Processing-Assignments

UNB Fall-2018 NLP Assignments 💬

Language: Python - Size: 23.5 MB - Last synced at: 24 days ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 1

sashakenjeeva/spell-corrector

A context-sensitive, one-edit distance spelling corrector

Language: Jupyter Notebook - Size: 3.91 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

Psmths/bigram-file-analysis

Proof of concept that leverages machine learning to classify files based on their bigram frequency distributions.

Language: Jupyter Notebook - Size: 1.03 MB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

jianleisun/NLP-project

Sentimental Analysis for Amazon Book Reviews

Language: Jupyter Notebook - Size: 1.4 MB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 1 - Forks: 0

roverbird/corpus_utils

Semantic word relations analysis and visualization for corpus linguistics and NLP

Language: R - Size: 28.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

motiurinfo/sentiment_classification

Performance evaluation of sentiment classification on movie reviews

Language: Python - Size: 20.9 MB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 0

iskorini/CPP-Bigrams

Final project for course of Parallel Computing @ UNIFI

Language: C++ - Size: 10.8 MB - Last synced at: over 1 year ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0

LinggarM/natural-language-processing

Collection of codes of Natural Language Processing college course

Language: Jupyter Notebook - Size: 34.5 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

stefanrer/CountBigramFreqInConlluCorpus

Count Bigram frequency in a conllu format corpus

Language: Python - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Lynn425/Text-Analysis-of-Simpson-Transcript

Visualize the results from Word Frequency Analysis, Sentiment Analysis, Bigram Analysis and Word Trend Analysis performed on The Simpsons1 Transcripts.

Language: R - Size: 0 Bytes - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

emanuelemorales/TextMining

Text mining techniques applied on Facebook comments and SMS spam detection.

Language: Jupyter Notebook - Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

luizanisio/Doc2VecFacil

Classe responsável por simplificar o processo de criação de um modelo Doc2Vec (gensim) com facilitadores para geração de um vocab personalizado e com a geração de arquivos de curadoria. Dicas usando elasticsearch e singlestore.

Language: Python - Size: 31.9 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 1

akshataupadhye/News-articles-clustering-A-comparative-approach

A project featuring the use of various NLP techniques and ML algorithms like the topic modelling and paragraph embeddings, for document clustering. 📰📚

Language: Jupyter Notebook - Size: 186 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

DorinK/Deep-Learning-Gradient-based-Learning

First assignment in ׳Deep Learning for Texts and Sequences' course (using NumPy only) by Prof. Yoav Goldberg at Bar-Ilan University

Language: Python - Size: 278 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 1

czig/nh19_sentiment

Sentiment Analysis for Facebook pages

Language: Python - Size: 182 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

taniyariar/Natural-Language-Processing

Natural-Language-Processing

Language: Python - Size: 213 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

gromag/Data-Science-Specialisation-Predict-Next-Word

Predicting next word with Natural Language Processing. Being able to predict what word comes next in a sentence is crucial when writing on portable devices that don't have a full size keyboard. However the same techniques used in texting application can be applied to a variety of other applications, for example: genomics by segmenting DNA, sequences speech recognition, automatic language translation or even as one student in the course suggested music sequence prediction.

Language: HTML - Size: 853 KB - Last synced at: 5 months ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 0

zoobereq/Richness-of-the-Stimulus

A replication of an experiment by Reali and Christiansen (2005) disputing the basic assumptions of Chomsky's Poverty of Stimulus theory.

Language: Python - Size: 463 KB - Last synced at: 20 days ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

giocoal/word-embedding-italian-literature

Using distibuctional semantics (word2vec family algorithms and the CADE framework) to learn word embeddings from the Italian literary corpuses we generated.

Language: Python - Size: 21.4 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 2

sohailahmedkhan/Sentence-Completion-using-Hidden-Markov-Models

The goal of this script is to implement three langauge models to perform sentence completion, i.e. given a sentence with a missing word to choose the correct one from a list of candidate words. The way to use a language model for this problem is to consider a possible candidate word for the sentence at a time and then ask the language model which version of the sentence is the most probable one.

Language: Python - Size: 5.86 KB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 6 - Forks: 0

starlordvk/Typing-Assistant

Typing Assistant provides the ability to autocomplete words and suggests predictions for the next word. This makes typing faster, more intelligent and reduces effort.

Language: CSS - Size: 1.14 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 44 - Forks: 13

mehdiye5/WordSegmentation

The task for this project is to segment a sequence of English characters into the most likely word sequence.

Language: Python - Size: 2.09 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

senya-ashukha/bigram-anchor-words 📦

An Implementation of Bigram Anchor Words algorithm

Language: Python - Size: 189 MB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 8 - Forks: 5

susantabiswas/Word-Prediction-Ngram

Next Word Prediction using n-gram Probabilistic Model with various Smoothing Techniques

Language: Jupyter Notebook - Size: 49.8 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 30 - Forks: 12

pbgnz/automatic-language-identification

a probabilistic language identification system that identifies the language of a sentence

Language: Python - Size: 8.62 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

DigitalTools/nltk-book

Jupyter Notebook for Natural Language Processing learning

Language: Jupyter Notebook - Size: 929 KB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 10 - Forks: 4

Eatmeta/TextAnalysis

The Practice "Sentence Generator"

Language: C# - Size: 169 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

octokami/news_stock_market

Predict stock price movements based on news articles. We used the BoW approach and sentiment analysis of titles of news articles.

Language: Jupyter Notebook - Size: 36.7 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

mhasegawa7045/Film_Movie_Text_Mining_Sentimental_Analysis_Machine_Learning

[Tokenization, Topic Modeling, Sentiment Analysis, Network of Bigrams] The purpose of this project is to see if text mining techniques can ease better analysis for categorizing movies with just the Descriptions while ignoring the Genre from the dataset, IMDB_movies.csv, which is stored under the data frame variable, movies_desc. Tokenization (TF-DF) was used to increase efficiency to analyze term frequencies in movie Descriptions so that the conceptual theme of a movie franchise would be determined even if a person has never watched any of the films. Create mixtures of terms that are correlated to every topic and the mixture of topics that distinguishes each document through Topic Modeling in the dataset, IMDB_movies.csv. Sentimental Analysis focused on Movies with Sentimental Clusters that were using bing and NRC lexicons to see how Sentiment affects Rating and Revenue. The network of bigrams for the Movies dataset help summarize how frequented Movie Description word-terms create term relationships and how they connect to other movies.

Language: HTML - Size: 7.4 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

sandeepsukumaran/BigramsAndTBL

Computation of various bigrams models, Naive Bayesian Part of Speech tagging, and Transformation Based Learner

Language: Python - Size: 7.81 KB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

burhanharoon/N-Gram-Language-Model

It's a python based n-gram langauage model which calculates bigrams, probability and smooth probability (laplace) of a sentence using bi-gram and perplexity of the model.

Language: Python - Size: 793 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 2

bobbingwide/sb

SB: Second Byte - Seriously Bonkers' experimental Full Site Editing theme

Language: HTML - Size: 854 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

sachin-bisht/YouTube-Sentiment-Analysis

(UNMAINTAINED)Fetch comments from the given video and determine sentiment towards the video is positive or negative

Language: Python - Size: 488 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 2

sabareeswarans11/Russia_Ukraine_War_Twitter_Analysis

Semi-Structured Data Processing with NoSQL Database Server MongoDB Collecting Social Media Data from Twitter Real-time Data Stream and Storing and Retrieving to Process from a Semi-Structured Database Server MongoDB

Language: Python - Size: 5.01 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

samet-ozkan/German-English-Detector

This code detects whether the text input is in German or English.

Language: C - Size: 5.86 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Adrianogba/bigrama-trigrama-python

Este é um programa de inteligência artificial simples para prever a próxima palavra baseada em uma string informado usando bigramas e trigramas baseados em um arquivo .txt. Existem dois códigos, um usando console e outro usando o tkinter.

Language: Python - Size: 123 KB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 0

AslanDevbrat/Computational-Linguistic

Assigmnents of CL

Language: Jupyter Notebook - Size: 25.9 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

kamilkubik89/the_bigram_parsing_problem

Application in python that can take sentence from extrenal text file and make uotput of bigrams in the text with counting numbers of bigrams.

Language: Jupyter Notebook - Size: 5.86 KB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

rohitthapliyal2000/Sentiment-Analysis-NLTK

Opinion mining for provided data from various NLTK corpus to test/enhance the accuracy of the NaiveBayesClassifier model.

Language: Python - Size: 18.6 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 14 - Forks: 3

danielroncel/twitter-analysis-per-cities

Tweet sentiment analysis per user location in Spain, showing average results per city in a map. Also find the most common words and bigrams per location in those tweets during the last minutes.

Language: Jupyter Notebook - Size: 500 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Premchand95/Sentiment-Analysis-of-Reviews-using-Machine-Learning-algorithms-on-Textual-data

CSCI 59000 BIG DATA ANALYTICS PROJECT

Language: Python - Size: 87.2 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 1

norgrenm/Bigram_Book-Build

Bigram build using no NLP Packages

Language: Jupyter Notebook - Size: 45 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

luizmellodev/Ngrams

This project has the objective of creating ngrams based on text inputs, generating text output with the ngrams and your frequency.

Language: Python - Size: 36.1 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

yvetteyyuan/plant_based_segmentation

Analyze how people perceive plant-based diets online and generate marketing insights on the plant-based products.

Language: R - Size: 2.89 MB - Last synced at: 4 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

faisalsyfl/IndoLangModel

Indonesian Language Model (Ngrams & Shannon Visualization) in CodeIgniter

Language: PHP - Size: 1.82 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 3

zq99/bigram_frequency_analysis

Extracts all the bigrams from a list of 9,000 of the most common words in English. Counts the frequency of each bigram at each position in a word.

Language: Python - Size: 23.3 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

maitree7/Tales_from_the_Cryptos_NLP

Sentiment Analysis on Cryptocurrency (BTC vs. ETH)

Language: Jupyter Notebook - Size: 3.34 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 1

tot98git/nlp-stuff

Some nlp stuff.

Language: Python - Size: 3.23 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

iglee/HMMs-and-PCFG

POS tagging by using ngram based hidden markov models.

Language: Python - Size: 35.2 KB - Last synced at: 2 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 1

zmuhls/LING78100-MP3

Machine programming assignment on bigrams and functions

Language: Python - Size: 35.2 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

garrrikkotua/text_generator

Simple text generator with bigrams

Language: Python - Size: 14.6 KB - Last synced at: about 1 month ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

ZNClub-PA-ML-AI/NLP-techniques

Testing & learning different nlp and lex techniques

Language: Jupyter Notebook - Size: 2.37 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

ilhamksyuriadi/Language-Modeling-Bigram

An example of n-gram with n = 2 (bigram) for language modelling with indonesian language

Language: Python - Size: 117 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

brentnd/FileAnalysis

Using MATLAB to analyze and visualize properties of binary files.

Language: Matlab - Size: 2.47 MB - Last synced at: 2 months ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0

fikrirazor/bigramindo

bigram using python language, menggunakan kalimat berbahasa indonesia

Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

sachin-bisht/Sentiment-Analysis-NLTK

Sentiment Analysis / Opinion Mining for provided data in NLTK corpus using NaiveBayesClassifier Algorithm

Language: Python - Size: 4.88 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 3

bobbingwide/genesis-SB

Genesis-SB - Specially Built for seriously bonkers.com / bigram.co.uk

Language: CSS - Size: 365 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

farhandzakyarvianto/BigramLanguageModeling

Bigram - Permodelan bahasa menggunakan Python

Language: Python - Size: 30.9 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 2

Adrianogba/bigram-trigram-python

This is an simple artificial intelligence program to predict the next word based on a informed string using bigrams and trigrams based on a .txt file. There are two codes, one using console and the other using tkinter.

Language: Python - Size: 1.66 MB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

vgratian/phon_bigrams

[draft] phonological unigrams and bigrams

Language: Python - Size: 2.93 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

kaushikhande/Ensemble_sentiment

Language: Python - Size: 1.48 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

AlessandroBono/ProbabilisticPoSTagger

A Java implementation of different probabilistic part-of-speech tagging techniques.

Language: Java - Size: 3.11 MB - Last synced at: 2 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

ChirravuriChaitanya/Language-Modelling

Language Modelling using N-Gram Technique for text sentences

Language: Python - Size: 1000 Bytes - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

cache117/cs453-bigrams-text-analysis

Text analysis to determine rank-frequency curves for words and bigrams, and vocabulary growth curves.

Language: Java - Size: 1.76 MB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

DigitalTools/nlp-training

Natural Language Processing - Training

Language: Python - Size: 2.48 MB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0

mameli/Bigrams

Bigrams using C++

Language: C++ - Size: 7.59 MB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0

Related Keywords
bigrams 86 trigrams 23 nlp 21 python 20 ngrams 17 natural-language-processing 16 sentiment-analysis 12 nltk 11 unigram 10 python3 9 language-model 8 machine-learning 7 corpus 6 naive-bayes-classifier 5 topic-modeling 5 matplotlib 4 bigram-model 4 word2vec 4 tokenization 4 stopwords 4 unigrams 4 text-analysis 4 tf-idf 4 n-grams 4 jupyter-notebook 3 named-entity-recognition 3 sentiment-analysis-nltk 3 text-generation 3 perplexity 3 classification 3 lemmatization 3 ngram 3 corpus-linguistics 3 opinion-mining 3 bag-of-words 3 ngram-language-model 3 clustering 2 frequency 2 collocations 2 ngram-probabilistic-model 2 indonesian-language 2 newsapi 2 file-analysis 2 prediction 2 data-science 2 sentiment 2 optimization 2 part-of-speech 2 ngram-model 2 numpy 2 word-embeddings 2 wordcloud 2 trigram-model 2 laplace-smoothing 2 brillstagger 2 text-preprocessing 2 gru 2 lstm 2 corpora 2 pytorch 2 text-mining 2 rnn 2 parcing 1 data-analysis 1 dataset 1 probabilistic-models 1 hacktoberfest2022 1 hacktoberfest 1 threadsafe 1 knesser-ney-smoothing 1 csharp-basics-part1 1 transformation-based-learning 1 movies 1 html 1 films 1 quadgrams 1 prediction-ngram 1 interpolated-knesser-ney 1 language-acquisition 1 chomsky 1 bigram-modeling 1 r 1 kneser 1 data-science-specialisation 1 viterbi-algorithm 1 partofspeech-tagger 1 markov-chain 1 finite-state-machine 1 wordclouds 1 xor-problem 1 multi-layer-perceptron-classifier 1 log-linear-model 1 gradients 1 deep-learning 1 gprof 1 good-turing 1 backoff 1 typing-assistant 1 text-prediction 1 keyboard 1