An open API service providing repository metadata for many open source software ecosystems.

Topic: "word-embeddings"

piskvorky/gensim

Topic Modelling for Humans

Language: Python - Size: 101 MB - Last synced at: about 18 hours ago - Pushed at: 3 months ago - Stars: 16,023 - Forks: 4,396

flairNLP/flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Language: Python - Size: 351 MB - Last synced at: 1 day ago - Pushed at: 6 days ago - Stars: 14,169 - Forks: 2,115

Embedding/Chinese-Word-Vectors

100+ Chinese Word Vectors 上百种预训练中文词向量

Language: Python - Size: 1.42 MB - Last synced at: about 9 hours ago - Pushed at: over 1 year ago - Stars: 12,010 - Forks: 2,329

srbhr/Resume-Matcher

Resume Matcher is an open source, free tool to improve your resume. It works by using AI, Reader LLMs, to compare and rank resumes with job descriptions.

Language: Python - Size: 100 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 8,763 - Forks: 3,317

bentrevett/pytorch-sentiment-analysis

Tutorials on getting started with PyTorch and TorchText for sentiment analysis.

Language: Jupyter Notebook - Size: 1.64 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 4,507 - Forks: 1,181

ddangelov/Top2Vec

Top2Vec learns jointly embedded topic, document and word vectors.

Language: Python - Size: 83.4 MB - Last synced at: about 17 hours ago - Pushed at: 6 months ago - Stars: 3,045 - Forks: 373

jbesomi/texthero

Text preprocessing, representation and visualization from zero to hero.

Language: Python - Size: 22.1 MB - Last synced at: 15 minutes ago - Pushed at: over 1 year ago - Stars: 2,904 - Forks: 240

JasonKessler/scattertext

Beautiful visualizations of how language differs among document types.

Language: Python - Size: 39.4 MB - Last synced at: about 3 hours ago - Pushed at: 22 days ago - Stars: 2,302 - Forks: 292

Separius/awesome-sentence-embedding 📦

A curated list of pretrained sentence and word embedding models

Language: Python - Size: 282 KB - Last synced at: 14 days ago - Pushed at: about 4 years ago - Stars: 2,257 - Forks: 262

MinishLab/model2vec

Fast State-of-the-Art Static Embeddings

Language: Python - Size: 3.62 MB - Last synced at: about 10 hours ago - Pushed at: 1 day ago - Stars: 1,670 - Forks: 83

plasticityai/magnitude

A fast, efficient universal vector embedding utility package.

Language: Python - Size: 70.7 MB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 1,647 - Forks: 119

omarsar/nlp_overview

Overview of Modern Deep Learning Techniques Applied to Natural Language Processing

Language: CSS - Size: 6.82 MB - Last synced at: 5 days ago - Pushed at: about 5 years ago - Stars: 1,332 - Forks: 198

nlptown/nlp-notebooks

A collection of notebooks for Natural Language Processing from NLP Town

Language: Jupyter Notebook - Size: 94.8 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 884 - Forks: 358

dselivanov/text2vec

Fast vectorization, topic modeling, distances and GloVe word embeddings in R.

Language: R - Size: 46.2 MB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 863 - Forks: 133

goru001/inltk

Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need

Language: Python - Size: 812 KB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 830 - Forks: 161

meta-toolkit/meta

A Modern C++ Data Sciences Toolkit

Language: C++ - Size: 30.4 MB - Last synced at: 10 months ago - Pushed at: about 2 years ago - Stars: 689 - Forks: 233

ncbi-nlp/BioSentVec

BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences

Language: Jupyter Notebook - Size: 28.3 KB - Last synced at: 10 months ago - Pushed at: almost 2 years ago - Stars: 567 - Forks: 97

ynqa/wego

Word Embeddings in Go!

Language: Go - Size: 6.98 MB - Last synced at: 3 days ago - Pushed at: about 2 years ago - Stars: 490 - Forks: 41

KristiyanVachev/Question-Generation

Generating multiple choice questions from text using Machine Learning.

Language: Jupyter Notebook - Size: 19.2 MB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 489 - Forks: 116

imgarylai/bert-embedding 📦

🔡 Token level embeddings from BERT model on mxnet and gluonnlp

Language: Python - Size: 120 KB - Last synced at: 7 days ago - Pushed at: over 5 years ago - Stars: 452 - Forks: 67

Tixierae/deep_learning_NLP

Keras, PyTorch, and NumPy Implementations of Deep Learning Architectures for NLP

Language: Jupyter Notebook - Size: 105 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 435 - Forks: 106

sunyilgdx/SIFRank_zh

Keyphrase or Keyword Extraction 基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)

Language: Python - Size: 2.38 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 404 - Forks: 78

kamalkraj/Named-Entity-Recognition-with-Bidirectional-LSTM-CNNs

Named-Entity-Recognition-with-Bidirectional-LSTM-CNNs

Language: Python - Size: 1.09 MB - Last synced at: 2 days ago - Pushed at: about 5 years ago - Stars: 365 - Forks: 141

dccuchile/spanish-word-embeddings

Spanish word embeddings computed with different methods and from different corpora

Size: 41 KB - Last synced at: 6 months ago - Pushed at: over 5 years ago - Stars: 356 - Forks: 82

amanchadha/coursera-natural-language-processing-specialization

Programming assignments from all courses in the Coursera Natural Language Processing Specialization offered by deeplearning.ai.

Language: Jupyter Notebook - Size: 178 MB - Last synced at: 4 days ago - Pushed at: almost 2 years ago - Stars: 348 - Forks: 334

sudharsan13296/Hands-On-Deep-Learning-Algorithms-with-Python

Master Deep Learning Algorithms with Extensive Math by Implementing them using TensorFlow

Language: Jupyter Notebook - Size: 206 MB - Last synced at: 2 days ago - Pushed at: over 4 years ago - Stars: 344 - Forks: 186

chakki-works/chakin

Simple downloader for pre-trained word vectors

Language: Python - Size: 172 KB - Last synced at: 4 days ago - Pushed at: almost 3 years ago - Stars: 334 - Forks: 48

explosion/floret Fork of facebookresearch/fastText

🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy

Language: C++ - Size: 4.4 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 310 - Forks: 12

malllabiisc/WordGCN

ACL 2019: Incorporating Syntactic and Semantic Information in Word Embeddings using Graph Convolutional Networks

Language: Python - Size: 5.07 MB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 291 - Forks: 64

gabrielspmoreira/chameleon_recsys

Source code of CHAMELEON - A Deep Learning Meta-Architecture for News Recommender Systems

Language: Python - Size: 715 KB - Last synced at: 1 day ago - Pushed at: about 2 years ago - Stars: 276 - Forks: 81

bloomberg/koan

A word2vec negative sampling implementation with correct CBOW update.

Language: C++ - Size: 378 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 260 - Forks: 18

vngrs-ai/vnlp

State-of-the-art, lightweight NLP tools for Turkish language. Developed by VNGRS.

Language: Python - Size: 392 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 259 - Forks: 17

tolga-b/debiaswe

Remove problematic gender bias from word embeddings.

Language: Jupyter Notebook - Size: 58.6 KB - Last synced at: 1 day ago - Pushed at: about 2 years ago - Stars: 247 - Forks: 90

devmount/GermanWordEmbeddings

Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tensorflow.

Language: Jupyter Notebook - Size: 911 KB - Last synced at: 2 days ago - Pushed at: 9 months ago - Stars: 238 - Forks: 51

vinhkhuc/JFastText

Java interface for fastText

Language: Java - Size: 57.6 KB - Last synced at: 1 day ago - Pushed at: almost 2 years ago - Stars: 236 - Forks: 98

lgalke/vec4ir

Word Embeddings for Information Retrieval

Language: Python - Size: 965 KB - Last synced at: 22 days ago - Pushed at: over 1 year ago - Stars: 225 - Forks: 42

alexandrainst/danlp 📦

DaNLP is a repository for Natural Language Processing resources for the Danish Language.

Language: Python - Size: 49.4 MB - Last synced at: 25 days ago - Pushed at: 3 months ago - Stars: 205 - Forks: 34

giacbrd/ShallowLearn

An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.

Language: Python - Size: 537 KB - Last synced at: 14 days ago - Pushed at: almost 8 years ago - Stars: 198 - Forks: 29

cbaziotis/datastories-semeval2017-task4

Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".

Language: Python - Size: 9.14 MB - Last synced at: 6 months ago - Pushed at: almost 7 years ago - Stars: 197 - Forks: 63

loretoparisi/fasttext.js

FastText for Node.js

Language: JavaScript - Size: 3.31 MB - Last synced at: 7 days ago - Pushed at: about 2 years ago - Stars: 195 - Forks: 29

YannDubs/Hash-Embeddings

PyTorch implementation of Hash Embeddings (NIPS 2017). Submission to the NIPS Implementation Challenge.

Language: Python - Size: 1.12 MB - Last synced at: about 2 months ago - Pushed at: over 6 years ago - Stars: 195 - Forks: 29

somosnlp/nlp-de-cero-a-cien

Curso práctico: NLP de cero a cien 🤗

Language: Jupyter Notebook - Size: 3.86 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 188 - Forks: 90

sebischair/Lbl2Vec

Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document corpus.

Language: Python - Size: 13.7 MB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 185 - Forks: 27

avidale/compress-fasttext

Tools for shrinking fastText models (in gensim format)

Language: Jupyter Notebook - Size: 30.9 MB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 178 - Forks: 13

dccuchile/wefe

WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings models. Please feel welcome to open an issue in case you have any questions or a pull request if you want to contribute to the project!

Language: Python - Size: 41.6 MB - Last synced at: 20 days ago - Pushed at: 11 months ago - Stars: 177 - Forks: 14

datquocnguyen/LFTM

Improving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)

Language: Java - Size: 9.02 MB - Last synced at: 18 days ago - Pushed at: about 8 years ago - Stars: 177 - Forks: 59

yumeng5/Spherical-Text-Embedding

[NeurIPS 2019] Spherical Text Embedding

Language: C - Size: 10.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 176 - Forks: 29

robrua/easy-bert

A Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)

Language: Java - Size: 44.9 KB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 171 - Forks: 44

zhongpeixiang/AI-NLP-Paper-Readings

This is my reading list for my PhD in AI, NLP, Deep Learning and more.

Size: 797 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 165 - Forks: 25

pnpnpn/dna2vec

dna2vec: Consistent vector representations of variable-length k-mers

Language: Python - Size: 32 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 164 - Forks: 59

PrashantRanjan09/Elmo-Tutorial

A short tutorial on Elmo training (Pre trained, Training on new data, Incremental training)

Language: Jupyter Notebook - Size: 396 KB - Last synced at: 4 months ago - Pushed at: almost 5 years ago - Stars: 155 - Forks: 38

yuvalpinter/Mimick

Code for Mimicking Word Embeddings using Subword RNNs (EMNLP 2017)

Language: Python - Size: 19.8 MB - Last synced at: 11 days ago - Pushed at: over 5 years ago - Stars: 153 - Forks: 34

augustwester/searchthearxiv

The code powering searchthearxiv.com, a simple semantic search engine for more than 300,000 ML papers on arXiv.

Language: Python - Size: 126 KB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 147 - Forks: 14

guenthermi/postgres-word2vec

utils to use word embedding models like word2vec vectors in a PostgreSQL database

Language: C - Size: 917 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 143 - Forks: 19

chatopera/wikidata-corpus

Train Wikidata with word2vec for word embedding tasks

Language: Python - Size: 74.6 MB - Last synced at: about 2 months ago - Pushed at: almost 7 years ago - Stars: 122 - Forks: 29

sunyilgdx/SIFRank

The code of our paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model"

Language: Python - Size: 5.81 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 120 - Forks: 20

tca19/dict2vec

Dict2vec is a framework to learn word embeddings using lexical dictionaries.

Language: Python - Size: 208 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 115 - Forks: 30

gaetangate/text-summarizer

Python Framework for Extractive Text Summarization

Language: Python - Size: 50.8 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 111 - Forks: 32

DmitryRyumin/EMNLP-2023-Papers

EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learning, deep learning, and natural language processing with code included. :star: support NLP!

Language: Python - Size: 6.43 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 107 - Forks: 7

loristns/Kadot 📦

Natural language processing using unsupervised vectors representation.

Language: Jupyter Notebook - Size: 942 KB - Last synced at: 7 days ago - Pushed at: over 5 years ago - Stars: 106 - Forks: 9

xiamx/fastText Fork of facebookresearch/fastText

Windows Build of fastText, library for text representation and classification.

Language: HTML - Size: 4.2 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 106 - Forks: 25

pommedeterresautee/fastrtext

R wrapper for fastText

Language: C++ - Size: 5.89 MB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 101 - Forks: 15

TharinduDR/Simple-Sentence-Similarity

Exploring the simple sentence similarity measurements using word embeddings

Language: Python - Size: 60.4 MB - Last synced at: 10 days ago - Pushed at: 9 months ago - Stars: 100 - Forks: 37

BobXWu/FASTopic

A Fast, Adaptive, Stable, and Transferable Topic Model (NeurIPS 2024)

Language: Python - Size: 1.68 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 97 - Forks: 6

Hellisotherpeople/Language-games

Dead simple games made with word vectors.

Language: Python - Size: 1.88 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 97 - Forks: 6

joisino/wordtour

Code for "Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem" (NAACL 2022)

Language: Python - Size: 629 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 94 - Forks: 4

gaohuang/S-WMD

Code for Supervised Word Mover's Distance (SWMD)

Language: Matlab - Size: 92.8 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 93 - Forks: 21

guillaume-chevalier/GloVe-as-a-TensorFlow-Embedding-Layer

Taking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.

Language: Jupyter Notebook - Size: 52.7 KB - Last synced at: 21 days ago - Pushed at: over 6 years ago - Stars: 90 - Forks: 19

adhaamehab/textblob-ar 📦

Arabic support for textblob

Language: Python - Size: 4.24 MB - Last synced at: 14 days ago - Pushed at: over 3 years ago - Stars: 85 - Forks: 25

kaushalshetty/Positional-Encoding

Encoding position with the word embeddings.

Language: Jupyter Notebook - Size: 154 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 79 - Forks: 13

jonsafari/clustercat

Fast Word Clustering Software

Language: C - Size: 463 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 78 - Forks: 11

msahamed/yelp_comments_classification_nlp

Yelp round-10 review comments classification using deep learning (LSTM and CNN) and natural language processing.

Language: Jupyter Notebook - Size: 2.08 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 73 - Forks: 55

sismetanin/word2vec-tsne

Google News and Leo Tolstoy: Visualizing Word2Vec Word Embeddings using t-SNE.

Language: Jupyter Notebook - Size: 23.1 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 71 - Forks: 32

jind11/word2vec-on-wikipedia

A pipeline for training word embeddings using word2vec on wikipedia corpus.

Language: Python - Size: 170 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 68 - Forks: 23

tofunlp/sister

SImple SenTence EmbeddeR

Language: Python - Size: 1.72 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 67 - Forks: 18

olivettigroup/materials-synthesis-generative-models

Public release of data and code for materials synthesis generation

Language: HTML - Size: 1.17 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 65 - Forks: 20

yohasebe/ruby-spacy

A wrapper module for using spaCy natural language processing library from the Ruby programming language via PyCall

Language: Ruby - Size: 508 KB - Last synced at: 3 days ago - Pushed at: 8 months ago - Stars: 64 - Forks: 6

mandarjoshi90/pair2vec

pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference

Language: Python - Size: 3.13 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 61 - Forks: 8

chaitjo/lstm-context-embeddings

Augmenting word embeddings with their surrounding context using bidirectional RNN

Language: Python - Size: 23.2 MB - Last synced at: 14 days ago - Pushed at: over 5 years ago - Stars: 60 - Forks: 18

JulianGerhard21/bert_spacy_rasa

Tutorial for BERT (and other transformer) embeddings with spaCy and Rasa

Language: Python - Size: 10.5 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 59 - Forks: 15

yumeng5/JoSH

[KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding

Language: C - Size: 187 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 58 - Forks: 7

itayzit/openai-async

A light-weight, asynchronous client for OpenAI API - text completion, image generation and embeddings.

Language: Python - Size: 15.6 KB - Last synced at: 17 days ago - Pushed at: about 2 years ago - Stars: 57 - Forks: 6

sismetanin/sentiment-analysis-of-tweets-in-russian

Sentiment analysis of tweets in Russian using Convolutional Neural Networks (CNN) with Word2Vec embeddings.

Language: Jupyter Notebook - Size: 439 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 55 - Forks: 33

sdimi/average-word2vec

🔤 Calculate average word embeddings (word2vec) from documents for transfer learning

Language: Jupyter Notebook - Size: 3.53 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 54 - Forks: 22

acbull/HiCE

Code for ACL'19 "Few-Shot Representation Learning for Out-Of-Vocabulary Words"

Language: Python - Size: 520 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 54 - Forks: 9

pesoto/Text-Analysis

Explaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.

Language: Jupyter Notebook - Size: 461 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 54 - Forks: 29

rguthrie3/MorphologicalPriorsForWordEmbeddings

Code for EMNLP 2016 paper: Morphological Priors for Probabilistic Word Embeddings

Language: Python - Size: 1.23 MB - Last synced at: 27 days ago - Pushed at: over 8 years ago - Stars: 52 - Forks: 12

bnosac/ETM

Topic Modelling in Semantic Embedding Spaces

Language: R - Size: 7.6 MB - Last synced at: 21 days ago - Pushed at: over 3 years ago - Stars: 51 - Forks: 3

HaniehP/PersianNER

Named-Entity Recognition in Persian Language

Size: 201 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 51 - Forks: 7

yumeng5/CatE

[WWW 2020] Discriminative Topic Mining via Category-Name Guided Text Embedding

Language: C - Size: 192 MB - Last synced at: 18 days ago - Pushed at: over 4 years ago - Stars: 50 - Forks: 15

Azure-Samples/MachineLearningSamples-BiomedicalEntityExtraction 📦

MachineLearningSamples-BiomedicalEntityExtraction

Language: Python - Size: 26.6 MB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 49 - Forks: 29

thomasahle/codenames

Codenames AI using Word Vectors

Language: JavaScript - Size: 13.4 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 48 - Forks: 7

harsh19/SPINE

Code for SPINE - Sparse Interpretable Neural Embeddings. Jhamtani H.*, Pruthi D.*, Subramanian A.*, Berg-Kirkpatrick T., Hovy E. AAAI 2018

Language: Python - Size: 62.5 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 48 - Forks: 11

Lambda-3/Indra

Indra is a Web Service which allows easy access to different distributional semantics models in several languages.

Language: Java - Size: 10.8 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 47 - Forks: 14

chao-ji/tf-word2vec

TensorFlow implementation of word2vec

Language: Python - Size: 576 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 47 - Forks: 19

zhangyafeikimi/word2vec-win32

A word2vec port for Windows.

Language: C - Size: 144 KB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 47 - Forks: 42

nlx-group/WordNetEmbeddings

Obtaining word embeddings from a WordNet ontology

Language: Python - Size: 49.8 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 46 - Forks: 5

vecto-ai/word-benchmarks

Benchmarks for intrinsic word embeddings evaluation.

Size: 2.89 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 46 - Forks: 25

tca19/near-lossless-binarization

This repository contains source code to binarize any real-value word embeddings into binary vectors.

Language: C - Size: 165 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 45 - Forks: 8

iamaziz/language-detection-fastText

Building a language detection classifier using fastText

Language: Jupyter Notebook - Size: 42.9 MB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 45 - Forks: 18

Related Topics
nlp 324 natural-language-processing 213 word2vec 202 machine-learning 194 deep-learning 149 python 113 sentiment-analysis 82 fasttext 66 lstm 66 tensorflow 64 pytorch 64 text-classification 62 glove 59 nlp-machine-learning 59 embeddings 58 keras 47 bert 42 neural-network 41 rnn 40 glove-embeddings 38 python3 34 gensim 33 word-vectors 33 cnn 29 topic-modeling 28 tf-idf 24 neural-networks 24 sentence-embeddings 24 word-embedding 23 classification 23 convolutional-neural-networks 23 recurrent-neural-networks 22 lstm-neural-networks 21 skipgram 21 text-mining 21 attention-mechanism 21 named-entity-recognition 21 spacy 21 artificial-intelligence 20 nltk 20 bag-of-words 20 clustering 19 transformer 19 data-science 19 language-model 19 computational-linguistics 17 fasttext-embeddings 17 deep-neural-networks 17 visualization 16 ai 16 transfer-learning 16 semantic-similarity 15 machine-translation 15 unsupervised-learning 14 transformers 14 cosine-similarity 14 keras-tensorflow 14 word2vec-model 14 text-analysis 14 deeplearning 14 bert-model 13 scikit-learn 13 cbow 13 data-visualization 13 natural-language 13 natural-language-understanding 13 sentiment-classification 13 text-generation 13 information-retrieval 13 pca 12 logistic-regression 12 gensim-word2vec 12 doc2vec 12 llm 12 gru 11 naive-bayes-classifier 11 preprocessing 11 search-engine 11 numpy 11 attention 10 language-modeling 10 evaluation 10 word-similarity 10 kaggle 9 tf-idf-vectorizer 9 tokenization 9 ml 9 lda 9 document-embeddings 9 flask 9 text-processing 9 lemmatization 9 character-embeddings 9 coursera 9 distributional-semantics 9 semantic-search 8 wordembeddings 8 latent-dirichlet-allocation 8 chatbot 8 tsne 8