An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: wordembedding

vyraun/Half-Size

Code for "Effective Dimensionality Reduction for Word Embeddings".

Language: Python - Size: 6.67 MB - Last synced at: 4 days ago - Pushed at: about 4 years ago - Stars: 131 - Forks: 24

Separius/awesome-sentence-embedding 📦

A curated list of pretrained sentence and word embedding models

Language: Python - Size: 282 KB - Last synced at: 7 days ago - Pushed at: about 4 years ago - Stars: 2,257 - Forks: 262

oborchers/Fast_Sentence_Embeddings

Compute Sentence Embeddings Fast!

Language: Jupyter Notebook - Size: 2.86 MB - Last synced at: about 23 hours ago - Pushed at: about 2 years ago - Stars: 623 - Forks: 84

woov2/Life_research_resources_AI_utilization_contest

[경진대회] 암환자 유전체 데이터의 변이 정보를 활용한 암종 분류 AI 모델 개발

Language: Jupyter Notebook - Size: 1.97 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

ekzhu/go-fasttext

Facebook fastText database in SQLite with Go API

Language: Go - Size: 60.5 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 34 - Forks: 4

gesistsa/sweater

👚 Speedy Word Embedding Association Test & Extras using R

Language: R - Size: 23.4 MB - Last synced at: about 23 hours ago - Pushed at: 3 months ago - Stars: 30 - Forks: 5

Geo-y20/Enhanced-Learning-Experience

IntelliLearn is a FastAPI-based application designed to process and transcribe audio and video files into text using the Whisper model. The application also supports processing PDF files to extract and summarize their content.

Language: Jupyter Notebook - Size: 41.2 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

Abdelrahman-Amen/Word-Embedding

This code showcases text preprocessing (tokenization, stopword removal, and standardization), training a Word2Vec model to generate word embeddings, and analyzing word relationships using metrics like cosine similarity and Jaccard index. It also visualizes high-dimensional embeddings in 2D using MDS, illustrating how similar words cluster together

Language: Jupyter Notebook - Size: 793 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

chaosgen/awesome-sentence-embedding

A curated list of pretrained sentence and word embedding models

Language: Python - Size: 213 KB - Last synced at: 13 days ago - Pushed at: 7 months ago - Stars: 4 - Forks: 0

FPT-ThaiTuan/Using-Word-Embeddings-for-Twitter-Sentiment-Analysis

The project researches sentiment analysis on Twitter, with the goal of evaluating the positivity, negativity or neutrality of comments. Using Word Embeddings, an advanced method in natural language processing, our model achieved a high accuracy of 96.61%. The model was trained on Twitter data and tested on a data comment dataset from Binance.

Language: Jupyter Notebook - Size: 12.5 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 0

Mohana-Murugan/NLP

NLP

Language: Jupyter Notebook - Size: 4.44 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

DanielDaCosta/Text-Classification-Exploration

Explore text classification with Logistic Regression and Naive Bayes models. Implementing from scratch, we compare feature engineering techniques like Bag-of-Words, TF-IDF, and Word Embedding for accurate labeling

Language: Python - Size: 35.6 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

mgiorgi13/MITopics

Topic detection to identify the main topics on MIT management papers

Language: HTML - Size: 184 MB - Last synced at: 11 months ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 1

imraviagrawal/ReadingComprehension

Bi-Directional Attention Flow for Machine Comprehensions

Language: Python - Size: 238 KB - Last synced at: 6 months ago - Pushed at: over 7 years ago - Stars: 10 - Forks: 2

RedWn/Information-Retrieval-Project

A search engine / data processor on two datasets from ir-datasets.com

Language: Jupyter Notebook - Size: 11.6 MB - Last synced at: 11 months ago - Pushed at: 12 months ago - Stars: 3 - Forks: 0

javason22/nlp_cnn_model

Showcase of Natural Language Processing (NLP) on sentiment analysis of text in survey

Language: Python - Size: 3.72 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

seungheondoh/CER_Network

D3-Network Visualization with WordEmbedding Space

Language: JavaScript - Size: 7.59 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

lamia-datalover/Deep_Learning

This repository contains deep learning projects. The code for each project is provided, and the explanations can be found in the ReadMe.md file of each project !

Language: Jupyter Notebook - Size: 14.6 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

PrashantRanjan09/WordEmbeddings-Elmo-Fasttext-Word2Vec

Using pre trained word embeddings (Fasttext, Word2Vec)

Language: Python - Size: 27.3 KB - Last synced at: 4 months ago - Pushed at: almost 7 years ago - Stars: 158 - Forks: 31

tca19/phd-thesis

My PhD thesis with all its source files, including all .tex files and images created, as well as the slides of my defense.

Language: TeX - Size: 3.82 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 1

tca19/dict2vec

Dict2vec is a framework to learn word embeddings using lexical dictionaries.

Language: Python - Size: 208 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 115 - Forks: 30

fann1993814/word2vecl

Labeled Word2Vec for Semi-Supervised Learning

Language: Python - Size: 43 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

karthikvarma247/Disastertweets

Natural Language processing with Disaster Tweets using word embeddings.

Language: Jupyter Notebook - Size: 80.1 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

auriml/capstone

Language: Python - Size: 1.26 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 11 - Forks: 6

shaimaaK/arabic-word-embedding

Arabic Word Embedding models SkipGram, and GLoVE are trained over Arabic Wiki data Dump 2018 dataset from scratch using Gensim and GLoVE python libraries. Then the models are evaluated on three NLP tasks and its results are visualized in T-SNE

Language: Jupyter Notebook - Size: 2.68 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

arleigh418/How-Much-News-Should-We-Extract-For-Stock-Price-Prediction

股市新聞信息與過去股價作為預測股價特徵時應提取文章數量及方法

Language: Python - Size: 63.2 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 7 - Forks: 1

r-sajal/DeepLearning-

Language: Jupyter Notebook - Size: 2.73 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 13 - Forks: 7

pAciFic132/Multi-Class-Text-Classification

Bi-LSTM, BERT Network을 사용한 한국어 문장 분류

Language: Jupyter Notebook - Size: 42.5 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

mohammadtavakoli78/Informations-Retrieval

This is final project of Information Retrieval course which is implementation of a search engine

Language: Python - Size: 3.14 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

Nada-Khater/Name-Verification-Model-with-Docker

Name verification model using tensorflow v2 and word embedding that classifies the input name into real and fake name with accuracy of 99%

Language: Jupyter Notebook - Size: 9.73 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

scarletcho/runWord2vec

Wrapper of Gensim word2vec along with T-SNE visualization

Language: Python - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 1

QiutingWang/NLP

Language: Jupyter Notebook - Size: 35.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

gabrielpondc/oovunderstand

In this project, the authors propose to use contextual Word2Vec model for understanding OOV (out of vocabulary). The OOV is extracted by using left-right entropy and point information entropy. They choose to use Word2Vec to construct the word vector space and CBOW (continuous bag of words) to obtain the contextual information of the words.

Language: Python - Size: 12.2 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ashalogic/Persian-Word-Embedding

Persian word embedding ( نشاننده واژه ها فارسی | تعبیه سازی کلمات فارسی )

Language: Python - Size: 439 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 9 - Forks: 2

AvivYaish/X2VEC

We have implemented, expanded and reviewed the paper “Sense2Vec - A Fast and Accurate Method For Word Sense Disambiguation In Neural Word Embeddings" by Andrew Trask, Phil Michalak and John Liu.

Language: Python - Size: 793 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

ojipadeson/Word-Embedding-SST

NLP word embedding course project

Language: Python - Size: 33.1 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

minji-mia/Cosmetics-analysis

Content-based recommendation system for cosmetics

Language: Jupyter Notebook - Size: 4 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

ilovemyminutes/Text2Hip

'AI Hiphop lyrics Generator🎙' project which makes hiphop lyrics based on a few user's keywords liks 'love', 'money'.

Language: Python - Size: 52.2 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

omarsar/data_mining_lab_fall_2

Data Mining Lab Session 2 (Fall 2017)

Language: Jupyter Notebook - Size: 5.26 MB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 19

bab2min/AdaGram-Cpp

Language: C++ - Size: 4.99 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 0

elbahaa/NLP-Word_Embedding-LSTM-PCA-TSNE

Classification of "BBC News" and comparison of performance between 3 types of model's architectures. Then 2D word embedding visualization using PCA and 3D word embedding visualisation using T-SNE

Language: Jupyter Notebook - Size: 1.33 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

lascar-pacagi/word2vec

Accompanying code for word embedding video

Language: C++ - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

grecosalvatore/lstm-glove-binary-toxic-comment-classification

Language: Jupyter Notebook - Size: 112 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

palashmoon/word_embedding_using_keras

A word embedding is a learned representation for text where words that have the same meaning have a similar representation.Word embeddings are in fact a class of techniques where individual words are represented as real-valued vectors in a predefined vector space. Each word is mapped to one vector and the vector values are learned in a way that resembles a neural network

Language: Jupyter Notebook - Size: 4.88 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

suthagar23/fyp-polysemy-embedding

Polysemy Embedding - Iterative approach to address the sense based embedding

Language: Python - Size: 4.28 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

smafjal/PosTag-polyglot2-embedder

Bengali Word Embedding by using Polygot2

Language: Python - Size: 58.2 MB - Last synced at: about 2 years ago - Pushed at: almost 9 years ago - Stars: 0 - Forks: 0

Kevinwenya/Chinese-Word-Vectors

中文词向量

Size: 1000 Bytes - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

anantSinghCross/poems_categorisation_neural_networks

A neural networks project that categorizes different poems in the dataset according to their genre

Language: Python - Size: 595 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

rajatdv/Kaggle-Toxic-classifier

This a part of Kaggle Competion, Toxic Comment Classification Challenge by Jigsaw .This was a multilabel classification challenge.This code is a improved version of my submission in the competion.

Language: Jupyter Notebook - Size: 21.2 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

zyx954/RestFul-on-wordEmbeddings

A Restful Web service to provide wordembedding functions

Language: Java - Size: 18.4 MB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 0

jerrygaoLondon/AdaGram.jl Fork of sbos/AdaGram.jl

Adaptive Skip-gram implementation in Julia

Language: Julia - Size: 9.89 MB - Last synced at: 5 months ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0

Related Keywords
wordembedding 51 nlp 20 word2vec 16 fasttext 8 python 7 machine-learning 7 deep-learning 6 natural-language-processing 6 wordembeddings 6 bert 5 word-embeddings 5 glove 4 gensim 4 lstm 4 embedding-models 4 gensim-word2vec 3 sentence-embeddings 3 ai 3 transformer 3 glove-embeddings 3 nlp-machine-learning 3 embeddings 3 neural-networks 2 wordtovec 2 tokenization 2 word2vec-embeddinngs 2 numpy 2 sklearn 2 cosine-similarity 2 pandas 2 text-summarization 2 speech-to-text 2 knn-classification 2 sentiment-analysis 2 text-classification 2 lda 2 glove-vectors 2 gru 2 convolutional-neural-networks 2 information-retrieval 2 word-embedding 2 search-engine 2 neural-network 2 tensorflow 2 preprocessing 2 semantic-similarity 2 pca 2 natural-language 2 awesome-list 2 sentence-representation 2 word2vec-model 2 pretrained-models 2 cross-lingual 2 unsupervised-learning 2 awesome 2 ktrain 1 math-equation-solver 1 math-expression-evaluator 1 opencv 1 python3 1 bi-lstm 1 sentence-classification 1 inverted-index 1 kmeans 1 kmeans-algorithm 1 kmeans-clustering 1 knn 1 knn-algorithm 1 searchengine 1 tfidf 1 tfidf-vectorizer 1 word2vec-algorithm 1 datamining 1 phd-thesis 1 pretrained-embedding 1 c 1 spearman 1 cancer 1 clinical-trials 1 language-model 1 glove-python 1 skipgram 1 contextualized-representation 1 stock 1 stock-market 1 stock-price-prediction 1 vector 1 vectorspace 1 bert-fine-tuning 1 computer-vision 1 fine-tuning-bert 1 fine-tuning-nlp 1 hugging-face 1 image-processing 1 knn-image-classification 1 emotion-recognition 1 t-sne 1 visualization 1 toxic-comment-classification 1 keras 1