GitHub topics: document-embeddings
mantzaris/TextSpace.jl
A Julia package for text embeddings and related NLP transformations
Language: Julia - Size: 8.61 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 1

sebischair/Lbl2Vec
Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document corpus.
Language: Python - Size: 13.7 MB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 186 - Forks: 28

dborrelli/chat-intents
Clustering sentence embeddings to extract message intent
Language: Jupyter Notebook - Size: 6.38 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 173 - Forks: 24

zbmed-semtec/doc2vec-doc-relevance
An approach exploring and assessing literature-based doc-2-doc recommendations using a doc2vec and applying to the RELISH dataset.
Language: Python - Size: 9.55 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

monish-prabhu/Intra-Search
A tool for performing semantic search within pdf documents leveraging sentence transformers.
Language: Python - Size: 747 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

JoramMillenaar/relate-text
A REST API and CLI tool for managing text embeddings and querying similarities, ideal for NLP and search applications.
Language: TypeScript - Size: 112 KB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

zbmed-semtec/word2doc2vec-doc-relevance
An approach exploring and assessing literature-based doc-2-doc recommendations using word2vec combined with doc2vec, and applying it to TREC and RELISH datasets
Language: Python - Size: 13.2 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

zbmed-semtec/hybrid-pre-doc2vec-doc-relevance
Hybrid approach combining dictionary-based NER and doc2vec
Language: Jupyter Notebook - Size: 23.9 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

ArikReuter/TNTM
This repository contains the code for the Transformer-Representation Neural Topic Model (TNTM) based on the paper "Probabilistic Topic Modelling with Transformer Representations" by Arik Reuter, Anton Thielmann, Christoph Weisser, Benjamin Säfken and Thomas Kneib
Language: Jupyter Notebook - Size: 31.8 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 7 - Forks: 1

ToxyBorg/llama_langchain_documents_embeddings
just testing langchain with llama cpp documents embeddings
Language: Python - Size: 60.5 KB - Last synced at: 10 days ago - Pushed at: 12 months ago - Stars: 14 - Forks: 1

ToxyBorg/Hugging-Face-Hub-Langchain-Document-Embeddings
Using Hugging Face Hub Embeddings with Langchain document loaders to do some query answering
Language: Python - Size: 16.6 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 26 - Forks: 6

Tixierae/deep_learning_NLP
Keras, PyTorch, and NumPy Implementations of Deep Learning Architectures for NLP
Language: Jupyter Notebook - Size: 105 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 435 - Forks: 106

vgupta123/P-SIF
Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging
Language: Python - Size: 73.8 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 34 - Forks: 10

tssujt/document-embedchain
A streamlit app for chat based on embedding documents using langchain
Language: Python - Size: 75.2 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

zbmed-semtec/protein-function-embeddings-thesis
Language: Python - Size: 64.5 KB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

yumeng5/Spherical-Text-Embedding
[NeurIPS 2019] Spherical Text Embedding
Language: C - Size: 10.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 176 - Forks: 29

skesiraju/BaySMM
Model for learning document embeddings along with their uncertainties
Language: Python - Size: 40 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 34 - Forks: 4

mathiasbruun/GeneralisedPoliticalScaling
A Generalised Approach to Scaling Political Actors with Embedding Representations
Language: Jupyter Notebook - Size: 53.3 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

mathiasbruun/DCPA
Data Collection, Processing, and Analysis
Language: Jupyter Notebook - Size: 165 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

fangrouli/Document-embedding-generation-models
Development and Application of Document Embedding for Semantic Text Retrieval
Language: Python - Size: 85.9 KB - Last synced at: 7 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

waingram/code-embeddings
A Comparative Study of Various Code Embeddings in Software Semantic Matching
Language: Jupyter Notebook - Size: 3.55 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 2
