GitHub topics: wordembedding
vyraun/Half-Size
Code for "Effective Dimensionality Reduction for Word Embeddings".
Language: Python - Size: 6.67 MB - Last synced at: 4 days ago - Pushed at: about 4 years ago - Stars: 131 - Forks: 24

Separius/awesome-sentence-embedding 📦
A curated list of pretrained sentence and word embedding models
Language: Python - Size: 282 KB - Last synced at: 7 days ago - Pushed at: about 4 years ago - Stars: 2,257 - Forks: 262

oborchers/Fast_Sentence_Embeddings
Compute Sentence Embeddings Fast!
Language: Jupyter Notebook - Size: 2.86 MB - Last synced at: about 23 hours ago - Pushed at: about 2 years ago - Stars: 623 - Forks: 84

woov2/Life_research_resources_AI_utilization_contest
[Competition] Development of an AI model for cancer type classification using mutation information from cancer patients' genomic data
Language: Jupyter Notebook - Size: 1.97 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

ekzhu/go-fasttext
Facebook fastText database in SQLite with Go API
Language: Go - Size: 60.5 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 34 - Forks: 4

gesistsa/sweater
👚 Speedy Word Embedding Association Test & Extras using R
Language: R - Size: 23.4 MB - Last synced at: about 23 hours ago - Pushed at: 3 months ago - Stars: 30 - Forks: 5

Geo-y20/Enhanced-Learning-Experience
IntelliLearn is a FastAPI-based application designed to process and transcribe audio and video files into text using the Whisper model. The application also supports processing PDF files to extract and summarize their content.
Language: Jupyter Notebook - Size: 41.2 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

Abdelrahman-Amen/Word-Embedding
This code showcases text preprocessing (tokenization, stopword removal, and standardization), training a Word2Vec model to generate word embeddings, and analyzing word relationships using metrics like cosine similarity and Jaccard index. It also visualizes high-dimensional embeddings in 2D using MDS, illustrating how similar words cluster together
Language: Jupyter Notebook - Size: 793 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

chaosgen/awesome-sentence-embedding
A curated list of pretrained sentence and word embedding models
Language: Python - Size: 213 KB - Last synced at: 13 days ago - Pushed at: 7 months ago - Stars: 4 - Forks: 0

FPT-ThaiTuan/Using-Word-Embeddings-for-Twitter-Sentiment-Analysis
The project studies sentiment analysis on Twitter, with the goal of classifying comments as positive, negative, or neutral. Using word embeddings, our model achieved an accuracy of 96.61%. The model was trained on Twitter data and tested on a dataset of comments from Binance.
Language: Jupyter Notebook - Size: 12.5 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 0

Mohana-Murugan/NLP
NLP
Language: Jupyter Notebook - Size: 4.44 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

DanielDaCosta/Text-Classification-Exploration
Explore text classification with Logistic Regression and Naive Bayes models. Implementing from scratch, we compare feature engineering techniques like Bag-of-Words, TF-IDF, and Word Embedding for accurate labeling
Language: Python - Size: 35.6 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

mgiorgi13/MITopics
Topic detection to identify the main topics on MIT management papers
Language: HTML - Size: 184 MB - Last synced at: 11 months ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 1

imraviagrawal/ReadingComprehension
Bi-Directional Attention Flow for Machine Comprehension
Language: Python - Size: 238 KB - Last synced at: 6 months ago - Pushed at: over 7 years ago - Stars: 10 - Forks: 2

RedWn/Information-Retrieval-Project
A search engine / data processor on two datasets from ir-datasets.com
Language: Jupyter Notebook - Size: 11.6 MB - Last synced at: 11 months ago - Pushed at: 12 months ago - Stars: 3 - Forks: 0

javason22/nlp_cnn_model
Showcase of Natural Language Processing (NLP) for sentiment analysis of survey text
Language: Python - Size: 3.72 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

seungheondoh/CER_Network
D3-Network Visualization with WordEmbedding Space
Language: JavaScript - Size: 7.59 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

lamia-datalover/Deep_Learning
This repository contains deep learning projects. The code for each project is provided, and the explanations can be found in the ReadMe.md file of each project!
Language: Jupyter Notebook - Size: 14.6 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

PrashantRanjan09/WordEmbeddings-Elmo-Fasttext-Word2Vec
Using pre-trained word embeddings (Fasttext, Word2Vec)
Language: Python - Size: 27.3 KB - Last synced at: 4 months ago - Pushed at: almost 7 years ago - Stars: 158 - Forks: 31

tca19/phd-thesis
My PhD thesis with all its source files, including all .tex files and images created, as well as the slides of my defense.
Language: TeX - Size: 3.82 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 1

tca19/dict2vec
Dict2vec is a framework to learn word embeddings using lexical dictionaries.
Language: Python - Size: 208 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 115 - Forks: 30

fann1993814/word2vecl
Labeled Word2Vec for Semi-Supervised Learning
Language: Python - Size: 43 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

karthikvarma247/Disastertweets
Natural Language Processing with Disaster Tweets using word embeddings.
Language: Jupyter Notebook - Size: 80.1 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

auriml/capstone
Language: Python - Size: 1.26 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 11 - Forks: 6

shaimaaK/arabic-word-embedding
Arabic word embedding models (Skip-gram and GloVe) are trained from scratch on the 2018 Arabic Wikipedia dump using the Gensim and GloVe Python libraries. The models are then evaluated on three NLP tasks, and the results are visualized with t-SNE.
Language: Jupyter Notebook - Size: 2.68 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

arleigh418/How-Much-News-Should-We-Extract-For-Stock-Price-Prediction
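How many articles to extract, and by what method, when using stock market news and past stock prices as features for stock price prediction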
Language: Python - Size: 63.2 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 7 - Forks: 1

r-sajal/DeepLearning-
Language: Jupyter Notebook - Size: 2.73 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 13 - Forks: 7

pAciFic132/Multi-Class-Text-Classification
Korean sentence classification using Bi-LSTM and BERT networks
Language: Jupyter Notebook - Size: 42.5 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

mohammadtavakoli78/Informations-Retrieval
This is the final project of an Information Retrieval course: an implementation of a search engine
Language: Python - Size: 3.14 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

Nada-Khater/Name-Verification-Model-with-Docker
Name verification model using TensorFlow v2 and word embeddings that classifies an input name as real or fake with 99% accuracy
Language: Jupyter Notebook - Size: 9.73 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

scarletcho/runWord2vec
Wrapper of Gensim word2vec along with T-SNE visualization
Language: Python - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 1

QiutingWang/NLP
Language: Jupyter Notebook - Size: 35.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

gabrielpondc/oovunderstand
In this project, the authors propose to use contextual Word2Vec model for understanding OOV (out of vocabulary). The OOV is extracted by using left-right entropy and point information entropy. They choose to use Word2Vec to construct the word vector space and CBOW (continuous bag of words) to obtain the contextual information of the words.
Language: Python - Size: 12.2 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ashalogic/Persian-Word-Embedding
Persian word embedding ( نشاننده واژه ها فارسی | تعبیه سازی کلمات فارسی )
Language: Python - Size: 439 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 9 - Forks: 2

AvivYaish/X2VEC
We have implemented, expanded and reviewed the paper "Sense2Vec - A Fast and Accurate Method For Word Sense Disambiguation In Neural Word Embeddings" by Andrew Trask, Phil Michalak and John Liu.
Language: Python - Size: 793 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

ojipadeson/Word-Embedding-SST
NLP word embedding course project
Language: Python - Size: 33.1 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

minji-mia/Cosmetics-analysis
Content-based recommendation system for cosmetics
Language: Jupyter Notebook - Size: 4 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

ilovemyminutes/Text2Hip
'AI Hiphop lyrics Generator🎙' project, which generates hip-hop lyrics from a few user-supplied keywords like 'love' and 'money'.
Language: Python - Size: 52.2 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

omarsar/data_mining_lab_fall_2
Data Mining Lab Session 2 (Fall 2017)
Language: Jupyter Notebook - Size: 5.26 MB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 19

bab2min/AdaGram-Cpp
Language: C++ - Size: 4.99 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 0

elbahaa/NLP-Word_Embedding-LSTM-PCA-TSNE
Classification of "BBC News" and a performance comparison of three model architectures, followed by 2D word embedding visualization using PCA and 3D word embedding visualization using t-SNE
Language: Jupyter Notebook - Size: 1.33 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

lascar-pacagi/word2vec
Accompanying code for word embedding video
Language: C++ - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

grecosalvatore/lstm-glove-binary-toxic-comment-classification
Language: Jupyter Notebook - Size: 112 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

palashmoon/word_embedding_using_keras
A word embedding is a learned representation for text where words that have the same meaning have a similar representation. Word embeddings are in fact a class of techniques where individual words are represented as real-valued vectors in a predefined vector space. Each word is mapped to one vector, and the vector values are learned in a way that resembles a neural network.
Language: Jupyter Notebook - Size: 4.88 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

suthagar23/fyp-polysemy-embedding
Polysemy Embedding - Iterative approach to address the sense based embedding
Language: Python - Size: 4.28 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

smafjal/PosTag-polyglot2-embedder
Bengali word embedding using Polyglot2
Language: Python - Size: 58.2 MB - Last synced at: about 2 years ago - Pushed at: almost 9 years ago - Stars: 0 - Forks: 0

Kevinwenya/Chinese-Word-Vectors
Chinese word vectors
Size: 1000 Bytes - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

anantSinghCross/poems_categorisation_neural_networks
A neural networks project that categorizes different poems in the dataset according to their genre
Language: Python - Size: 595 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

rajatdv/Kaggle-Toxic-classifier
This is part of a Kaggle competition, the Toxic Comment Classification Challenge by Jigsaw. This was a multilabel classification challenge. This code is an improved version of my submission to the competition.
Language: Jupyter Notebook - Size: 21.2 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

zyx954/RestFul-on-wordEmbeddings
A RESTful web service that provides word embedding functions
Language: Java - Size: 18.4 MB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 0

jerrygaoLondon/AdaGram.jl (fork of sbos/AdaGram.jl)
Adaptive Skip-gram implementation in Julia
Language: Julia - Size: 9.89 MB - Last synced at: 5 months ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0
