An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: document-embeddings

mantzaris/TextSpace.jl

A Julia package for text embeddings and related NLP transformations

Language: Julia - Size: 8.61 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 1

sebischair/Lbl2Vec

Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document corpus.

Language: Python - Size: 13.7 MB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 186 - Forks: 28

dborrelli/chat-intents

Clustering sentence embeddings to extract message intent

Language: Jupyter Notebook - Size: 6.38 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 173 - Forks: 24

zbmed-semtec/doc2vec-doc-relevance

An approach exploring and assessing literature-based doc-2-doc recommendations using doc2vec, applied to the RELISH dataset.

Language: Python - Size: 9.55 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

monish-prabhu/Intra-Search

A tool for performing semantic search within PDF documents using sentence transformers.

Language: Python - Size: 747 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

JoramMillenaar/relate-text

A REST API and CLI tool for managing text embeddings and querying similarities, ideal for NLP and search applications.

Language: TypeScript - Size: 112 KB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

zbmed-semtec/word2doc2vec-doc-relevance

An approach exploring and assessing literature-based doc-2-doc recommendations using word2vec combined with doc2vec, applied to the TREC and RELISH datasets.

Language: Python - Size: 13.2 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

zbmed-semtec/hybrid-pre-doc2vec-doc-relevance

Hybrid approach combining dictionary-based NER and doc2vec

Language: Jupyter Notebook - Size: 23.9 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

ArikReuter/TNTM

This repository contains the code for the Transformer-Representation Neural Topic Model (TNTM) based on the paper "Probabilistic Topic Modelling with Transformer Representations" by Arik Reuter, Anton Thielmann, Christoph Weisser, Benjamin Säfken and Thomas Kneib

Language: Jupyter Notebook - Size: 31.8 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 7 - Forks: 1

ToxyBorg/llama_langchain_documents_embeddings

Just testing LangChain with llama.cpp document embeddings

Language: Python - Size: 60.5 KB - Last synced at: 10 days ago - Pushed at: 12 months ago - Stars: 14 - Forks: 1

ToxyBorg/Hugging-Face-Hub-Langchain-Document-Embeddings

Using Hugging Face Hub embeddings with LangChain document loaders for query answering

Language: Python - Size: 16.6 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 26 - Forks: 6

Tixierae/deep_learning_NLP

Keras, PyTorch, and NumPy Implementations of Deep Learning Architectures for NLP

Language: Jupyter Notebook - Size: 105 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 435 - Forks: 106

vgupta123/P-SIF

Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging

Language: Python - Size: 73.8 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 34 - Forks: 10

tssujt/document-embedchain

A Streamlit app for chat based on embedded documents, using LangChain

Language: Python - Size: 75.2 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

zbmed-semtec/protein-function-embeddings-thesis

Language: Python - Size: 64.5 KB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

yumeng5/Spherical-Text-Embedding

[NeurIPS 2019] Spherical Text Embedding

Language: C - Size: 10.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 176 - Forks: 29

skesiraju/BaySMM

Model for learning document embeddings along with their uncertainties

Language: Python - Size: 40 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 34 - Forks: 4

mathiasbruun/GeneralisedPoliticalScaling

A Generalised Approach to Scaling Political Actors with Embedding Representations

Language: Jupyter Notebook - Size: 53.3 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

mathiasbruun/DCPA

Data Collection, Processing, and Analysis

Language: Jupyter Notebook - Size: 165 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

fangrouli/Document-embedding-generation-models

Development and Application of Document Embedding for Semantic Text Retrieval

Language: Python - Size: 85.9 KB - Last synced at: 7 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

waingram/code-embeddings

A Comparative Study of Various Code Embeddings in Software Semantic Matching

Language: Jupyter Notebook - Size: 3.55 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 2