Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: text-similarity

shibing624/text2vec

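text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.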

Language: Python - Size: 15.4 MB - Last synced: about 2 months ago - Pushed: 4 months ago - Stars: 4,034 - Forks: 368

nlpodyssey/cybertron

Cybertron: the home planet of the Transformers in Go

Language: Go - Size: 1.17 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 265 - Forks: 25

srbhr/Resume-Matcher

Resume Matcher is an open source, free tool to improve your resume. It works by using language models to compare and rank resumes with job descriptions.

Language: Python - Size: 96.4 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 4,618 - Forks: 1,734

SeanLee97/AnglE

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard

Language: Python - Size: 744 KB - Last synced: 6 days ago - Pushed: 7 days ago - Stars: 380 - Forks: 29

sidphbot/Auto-Research

Generate a custom, detailed survey paper with topic-clustered sections and proper citations from a single query in just under 30 minutes.

Language: Python - Size: 429 KB - Last synced: 8 days ago - Pushed: 6 months ago - Stars: 50 - Forks: 6

raviagheda/ml-movie-recommendation

Learn ML

Language: Jupyter Notebook - Size: 8.67 MB - Last synced: 15 days ago - Pushed: 16 days ago - Stars: 0 - Forks: 0

Lipairui/textgo

Text preprocessing, representation, similarity calculation, text search and classification. Let's go and play with text!

Language: Python - Size: 532 KB - Last synced: 7 days ago - Pushed: about 2 years ago - Stars: 43 - Forks: 2

aldebran97/AC

Aho-Corasick (AC) automaton, text similarity retrieval, lexicon matching, tokenizer

Language: Java - Size: 1.05 MB - Last synced: 20 days ago - Pushed: 20 days ago - Stars: 17 - Forks: 0

IDEA-CCNL/GTS-Engine

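GTS Engine: A powerful NLU Training System. GTS-Engine is an out-of-the-box, high-performance natural language understanding engine focused on few-shot tasks; it can automatically produce NLP models from only a small number of samples.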

Language: Python - Size: 3.81 MB - Last synced: 14 days ago - Pushed: over 1 year ago - Stars: 89 - Forks: 9

VanekPetr/text-similarity-ranking

Algorithm to rank text similarity between a set of strings and given inputs

Language: Python - Size: 128 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

lonePatient/TorchBlocks

A PyTorch-based toolkit for natural language processing

Language: Python - Size: 481 KB - Last synced: 28 days ago - Pushed: over 1 year ago - Stars: 149 - Forks: 26

brianrisk/simphile-text-similarity-nlp

Python Text Similarity NLP Library

Language: Python - Size: 199 KB - Last synced: 24 days ago - Pushed: over 1 year ago - Stars: 29 - Forks: 2

NTMC-Community/awesome-neural-models-for-semantic-match

A curated list of papers dedicated to neural text (semantic) matching.

Language: HTML - Size: 158 KB - Last synced: 22 days ago - Pushed: 6 months ago - Stars: 771 - Forks: 125

awslabs/aws-ai-solution-kit

Machine Learning APIs for common use cases, including: General OCR (Simplified/Traditional Chinese), Custom OCR, Image Similarity, Object Recognition, Face Detection, Face Comparison, Human Image Segmentation, Human Attribute Recognition, Pornography Detection, Image Super Resolution, Text Similarity, Car License Plate, etc.

Language: Python - Size: 80.6 MB - Last synced: about 1 month ago - Pushed: 2 months ago - Stars: 137 - Forks: 24

giacbrd/python-dandelion-eu

A python client for connecting to all the services provided by https://dandelion.eu

Language: Python - Size: 71.3 KB - Last synced: about 1 month ago - Pushed: 10 months ago - Stars: 36 - Forks: 15

dodona-edu/dolos

Source code plagiarism detection

Language: TypeScript - Size: 39.5 MB - Last synced: about 2 months ago - Pushed: 2 months ago - Stars: 206 - Forks: 25

themaximalist/vectordb.js

Simple in-memory vector database for text similarity in Node.js

Language: HTML - Size: 299 KB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 5 - Forks: 0

tlatkowski/multihead-siamese-nets

Implementation of Siamese Neural Networks built upon multihead attention mechanism for text semantic similarity task.

Language: Jupyter Notebook - Size: 1.43 MB - Last synced: about 1 month ago - Pushed: about 1 year ago - Stars: 182 - Forks: 43

StephanGeorg/trigram-similarity

Determining the similarity of alphanumeric text based on trigram matching.

Language: JavaScript - Size: 231 KB - Last synced: 12 days ago - Pushed: over 2 years ago - Stars: 13 - Forks: 2

hellonlp/sentence-similarity

Text similarity, semantic vectors, text vectors, text-similarity, similarity, sentence-similarity, BERT, SimCSE, BERT-Whitening, Sentence-BERT, PromCSE, SBERT

Language: Python - Size: 221 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 37 - Forks: 11

yaoxiaoyuan/mimix

Mimix: A Text Generation Tool and Pretrained Chinese Models

Language: Python - Size: 6.2 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 144 - Forks: 16

Mohana-Murugan/NLP

NLP

Language: Jupyter Notebook - Size: 4.44 MB - Last synced: about 2 months ago - Pushed: 2 months ago - Stars: 0 - Forks: 0

behitek/VietNamTextSimilarity

Vietnamese text similarity using TF-IDF, written in Java

Language: Java - Size: 31.2 MB - Last synced: 2 months ago - Pushed: almost 7 years ago - Stars: 0 - Forks: 0

muhammedalikocabey/Text-Similarity-of-Products

Text similarity of products on e-commerce sites

Language: Python - Size: 5.11 MB - Last synced: 2 months ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

emarkou/Text-Similarity

A text similarity computation using minhashing and Jaccard distance on the Reuters dataset

Language: R - Size: 69.3 KB - Last synced: about 2 months ago - Pushed: about 6 years ago - Stars: 16 - Forks: 5

rhnvrm/textsimilarity

Go package that provides similarity between two string documents using cosine similarity and tf-idf, along with various other useful things.

Language: Go - Size: 57.6 KB - Last synced: about 1 month ago - Pushed: over 5 years ago - Stars: 6 - Forks: 4

prdowluri/Semantic-Similarity-between-Paragraphs-or-Sentences

Build an algorithm/model that can quantify the degree of similarity between two texts based on semantic similarity. Semantic Textual Similarity (STS) assesses the degree to which two sentences are semantically equivalent to each other.

Language: Jupyter Notebook - Size: 5.68 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

CLUEbenchmark/CLUEDatasetSearch

Search all Chinese NLP datasets, with commonly used English NLP datasets included

Language: Python - Size: 8.87 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 3,772 - Forks: 581

zake7749/CIKM-AnalytiCup-2018

[ACM-CIKM] 2nd place solution at CIKM AnalytiCup 2018, a task for determining short text similarities.

Language: Python - Size: 1.47 MB - Last synced: about 2 months ago - Pushed: about 5 years ago - Stars: 76 - Forks: 15

lprtk/text-summarize

Create your text summarizer with Python and Streamlit

Language: Python - Size: 1.17 MB - Last synced: 4 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

justinbt1/Akin

Python library for detecting near duplicate texts in a corpus at scale using Locality Sensitive Hashing, as described in chapter three of Mining Massive Datasets.

Language: Python - Size: 154 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 5 - Forks: 0

sljavi/text-sound-similarity

JavaScript library useful to find degrees of similarity between text's phonetics

Language: JavaScript - Size: 61.5 KB - Last synced: 27 days ago - Pushed: almost 7 years ago - Stars: 16 - Forks: 1

cr1m5onk1ng/text_similarity

An NLP library for text similarity based on Transformer models

Language: Python - Size: 12.3 MB - Last synced: 5 months ago - Pushed: almost 3 years ago - Stars: 1 - Forks: 1

cjymz886/sentence-similarity

Experiments with and comparison of four sentence/text similarity computation methods

Language: Python - Size: 50.7 MB - Last synced: 5 months ago - Pushed: almost 4 years ago - Stars: 277 - Forks: 60

anas-899/synthetic_text_images_generator

Language: Jupyter Notebook - Size: 1.48 MB - Last synced: 5 months ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

BairagiSaurabh/Project-I-Recommendation-system-Amazon

Recommending items for women's fashion wear using "Title similarity" and "Nearest Neighbours"

Language: Jupyter Notebook - Size: 6.84 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

MelaniaNitu/Detection-of-Romanian-Machine-Generated-Text

Language: Python - Size: 16.1 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

shubham16394/Text-Similarity

Text Similarity using BM25 algorithm and WordNet

Language: Python - Size: 74.2 KB - Last synced: 5 months ago - Pushed: over 6 years ago - Stars: 14 - Forks: 8

javadshoja/text-similarity-comparing

💡 A simple svelte app for text similarity comparing.

Language: Svelte - Size: 12.7 KB - Last synced: 28 days ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

jnferfer/text-magnet

Code for TextMagnet web app

Language: Python - Size: 55.8 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 2

VietHoang1512/KPA

Matching The Statements: A Simple and Accurate Model for Key Point Analysis (ArgMining | EMNLP 2021)

Language: Python - Size: 1.17 MB - Last synced: 26 days ago - Pushed: over 2 years ago - Stars: 12 - Forks: 0

akurizaldirv/essayscoring

Automatic Essay Scoring implements the Rabin-Karp Algorithm and Synonym Recognition

Language: Python - Size: 1.45 MB - Last synced: 5 months ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0

bstoilov/digitalowl-pysemantics

Free Python client, that utilizes the digitalowl.org NLP API.

Language: Python - Size: 30.3 KB - Last synced: 4 months ago - Pushed: about 2 years ago - Stars: 7 - Forks: 2

ywu94/python-text-distance

A python implementation of a variety of text/string distance and similarity metrics. No GPL!

Language: Python - Size: 62.5 KB - Last synced: about 2 months ago - Pushed: about 4 years ago - Stars: 7 - Forks: 2

soumyagautam/text-similarity

Software based on flask, used to check the most similar texts out of a list.

Language: HTML - Size: 40 KB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0

fanghon/antiplag

Assignment plagiarism-checking software that computes code similarity, text similarity, and image similarity for the code, documents, and images in assignments.

Language: Java - Size: 52.6 MB - Last synced: 7 months ago - Pushed: over 4 years ago - Stars: 322 - Forks: 61

murray-z/text_analysis_tools

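Chinese text analysis toolkit (including text classification, text clustering, text similarity, keyword extraction, key phrase extraction, sentiment analysis, text error correction, text summarization, topic keywords, synonyms and near-synonyms, and event triple extraction)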

Language: Python - Size: 9.98 MB - Last synced: 7 months ago - Pushed: 8 months ago - Stars: 533 - Forks: 114

vinmahajan/Plagiarism_Detection

Plagiarism Detection System, designed to identify similarities between a given text and existing online content.

Language: Python - Size: 20.5 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

padeoe/cail2019

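Second-place solution for the similar case matching task at CAIL 2019 (with dataset and documentation), from the champion team of the CAIL2020/2021 judicial examination track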

Language: Python - Size: 261 KB - Last synced: 7 months ago - Pushed: about 3 years ago - Stars: 234 - Forks: 37

siddgood/podcast-recommendation-engine

Building a content-based podcast recommender system using NLP

Language: Jupyter Notebook - Size: 143 MB - Last synced: 8 months ago - Pushed: over 1 year ago - Stars: 27 - Forks: 4

izikeros/sentence-plagiarism

Compare sentences from input document with all sentences from reference documents - find very similar ones.

Language: Python - Size: 9.77 KB - Last synced: 21 days ago - Pushed: 7 months ago - Stars: 2 - Forks: 0

Mstfakts/Text-Similarity-Towhee-MilvusDB

A Milvus Database and NLP project where you can perform text-based similarity searches on a dataset you upload. Milvus is a vector database, and Towhee provides several advantages such as ready-made pipelines.

Language: Jupyter Notebook - Size: 20.2 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

amansrivastava17/lstm-siamese-text-similarity

⚛️ A Keras-based implementation of a Siamese architecture using LSTM encoders to compute text similarity

Language: Python - Size: 69.3 KB - Last synced: 8 months ago - Pushed: about 1 year ago - Stars: 278 - Forks: 91

zhongbin1/bert_for_text_matching

Simplified fine-tuning and deployment code based on BERT for text matching.

Language: Python - Size: 6.23 MB - Last synced: 8 months ago - Pushed: almost 5 years ago - Stars: 14 - Forks: 4

IceFlameWorm/bert_fine_tune Fork of huggingface/transformers

Bert based siamese network for text similarity

Language: Jupyter Notebook - Size: 21.6 MB - Last synced: 8 months ago - Pushed: almost 5 years ago - Stars: 7 - Forks: 0

kvarun07/asag-gt

Multi-Relational Graph Transformer for Automatic Short Answer Grading (NAACL 2022)

Language: Jupyter Notebook - Size: 24.2 MB - Last synced: 8 months ago - Pushed: almost 2 years ago - Stars: 7 - Forks: 1

yanliang12/bert_text_embedding

Embedding a text to a vector by pre-trained BERT word embeddings and pooling layers, for the purpose of text similarity measuring

Language: Python - Size: 3.37 MB - Last synced: 5 months ago - Pushed: over 2 years ago - Stars: 6 - Forks: 0

yashchitre03/Fetch-Rewards

Django web application for calculating text similarity using naive algorithms.

Language: Python - Size: 210 KB - Last synced: 9 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

MagallanesFito/weheart

Meet people just like you

Language: Python - Size: 34.1 MB - Last synced: 9 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

CrownedHead06/AnimeOdyssey

Anime Odyssey: Embark on a Journey of Recommendations 🥷

Language: Jupyter Notebook - Size: 355 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 8 - Forks: 1

bysiber/text_similarity_tfidf

The project utilizes the TF-IDF (Term Frequency-Inverse Document Frequency) algorithm. The main objective of this project is to measure the similarity between text documents using the TF-IDF algorithm.

Language: Python - Size: 4.88 KB - Last synced: 10 months ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0

akulez/Text-Similarity_Phrase-Matching

Implementing text similarity for US patents using modern Word2Vec and USE (Universal Sentence Encoder), plus some classical algorithms such as Jaro-Winkler and Jaccard

Language: Jupyter Notebook - Size: 821 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 1 - Forks: 0

MaastrichtU-IDS/docona

DoConA (Document Content and Citation Analysis Pipeline) is an open source, configurable and extensible Python tool to analyse the level of agreement between the citation network of a set of textual documents and the textual similarity of these documents.

Language: Python - Size: 86.9 KB - Last synced: about 2 months ago - Pushed: about 3 years ago - Stars: 2 - Forks: 2

adhaamehab/textblob-ar 📦

Arabic support for textblob

Language: Python - Size: 4.24 MB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 82 - Forks: 25

YC-wind/bert_few_shot_learning

BERT for few-shot learning; provides a Siamese net, a match net, and a prototypical net

Language: Python - Size: 77.1 KB - Last synced: 5 months ago - Pushed: over 4 years ago - Stars: 12 - Forks: 0

eren23/semantic-code-searcher

Basic Python example for searching code semantically in GitHub profiles.

Language: Python - Size: 44 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

Ikram-Maulana/text-similarity-api

🚀 Text Similarity API is a free and open source API for comparing text similarity

Language: TypeScript - Size: 4.22 MB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 0 - Forks: 0

somjit101/NLP-Star-Trek-Scripts

Using digital form of the actual scripts of the 'Star Trek' science fiction series to perform interesting NLP tasks and answering some questions on Topic Modelling, Character properties and the plot as a whole.

Language: Jupyter Notebook - Size: 8.51 MB - Last synced: 11 months ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

nityansuman/marvin

Web app that automatically generates a subjective or objective test and evaluates user responses without human intervention, using machine learning and natural language processing.

Language: CSS - Size: 18.7 MB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 83 - Forks: 29

ycatsh/past-pilot

Manage school resources and navigate past papers with ease.

Language: Python - Size: 408 KB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 1 - Forks: 0

jarvis0/image-search

🌄 Search images through text by writing a caption or a description. You will be intelligently assisted while typing.

Language: Jupyter Notebook - Size: 55.9 MB - Last synced: 11 months ago - Pushed: over 2 years ago - Stars: 5 - Forks: 0

cja5553/attention-driven-imitation-in-consumer-reviews

Code for the manuscript titled "Attention-driven imitation in consumer reviews" by Charles Alba, Mikhail Spektor and Lukasz Walasek

Language: Jupyter Notebook - Size: 418 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

JaouadT/Sentence-similarity

Sentence similarity between sentences or paragraphs: flask API + streamlit app.

Language: Python - Size: 145 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

greg2451/aggregating-text-similarity-metrics

This repository consists of a benchmark of various text similarity measures for Natural Language Generation (NLG) evaluation, on multiple datasets, and provides a way to aggregate these measures.

Language: Jupyter Notebook - Size: 1.22 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 1 - Forks: 1

osainz59/AQGSAS

Automatic Question Generation and Short Answer Scoring system

Language: Python - Size: 298 KB - Last synced: over 1 year ago - Pushed: about 4 years ago - Stars: 6 - Forks: 1

ddangelov/RESTful-Top2Vec

Expose a Top2Vec model with a REST API.

Language: Python - Size: 243 KB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 78 - Forks: 19

ZhengZixiang/chip2019_task2_question_pairs_matching

Baseline for the CHIP 2019 Ping An Healthcare Technology disease question-answering transfer learning competition; ranked 7th

Language: Python - Size: 27.3 KB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 25 - Forks: 6

SakigamiYang/text-similarity-model-evaluation

Evaluation of several non-personally developed pre-trained text semantic similarity models on SimCLUE.

Language: Python - Size: 23.4 KB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

EmreKumas/Text_Similarity 📦

This C project is built to find text similarities between two files. What it actually does is it scans two files you give as input and outputs the similar words.

Language: C - Size: 2.93 KB - Last synced: about 1 year ago - Pushed: over 6 years ago - Stars: 1 - Forks: 0

talaatmagdyx/NLP-Assignment 📦

NLP (Natural language Processing) Assignment and small Task

Language: Python - Size: 1.95 KB - Last synced: over 1 year ago - Pushed: about 6 years ago - Stars: 0 - Forks: 0

KeremZaman/semantic-sh

semantic-sh is a SimHash implementation to detect and group similar texts by harnessing the power of word vectors and transformer-based language models (BERT).

Language: Python - Size: 40 KB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 23 - Forks: 3

mmaguero/guarani-tweets

Download guarani-dominant tweets

Language: Python - Size: 797 KB - Last synced: over 1 year ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0

FedericoCalonge/automatic_reading_of_CVs_using_text_similarity

Thesis project on the automatic reading of curricula vitae and their comparison with job descriptions, using text similarity techniques (Cosine Similarity and Word Mover's Distance, WMD) and vectorization algorithms (TF-IDF and Word Embeddings).

Language: Jupyter Notebook - Size: 311 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

maemresen/similarity-detection

An example project to detect cheating in an exam using similarity detection.

Language: Java - Size: 36.1 KB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

Abhishek-EE/Text-Similarity-Score-Generator

This is a Text Similarity Score Generator. It takes two different texts and measures how similar they are. To calculate the similarity score I use a vector space model. This model creates a vector space in which each dimension represents a single word, with words taken from all the texts under consideration. Each document is a single vector in that space, and each component of a document vector represents how often the corresponding word appears in the text. To compare two documents, cosine similarity is used. This generates a value between 0 and 1, with 0 meaning no similarity and 1 meaning a perfect match.

Language: HTML - Size: 15.6 KB - Last synced: 9 months ago - Pushed: about 3 years ago - Stars: 3 - Forks: 0
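
The vector space model described in the entry above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not code from the listed repository: the function names `term_counts` and `cosine_similarity` are hypothetical, and tokenization is assumed to be a plain lowercase whitespace split.

```python
import math
from collections import Counter


def term_counts(text):
    """Lowercase the text, split on whitespace, and count each word.

    Assumption: whitespace tokenization; a real tool may tokenize differently.
    """
    return Counter(text.lower().split())


def cosine_similarity(text_a, text_b):
    """Cosine similarity between the word-count vectors of two texts."""
    a, b = term_counts(text_a), term_counts(text_b)
    # Only words present in both texts contribute to the dot product.
    dot = sum(a[word] * b[word] for word in a.keys() & b.keys())
    norm_a = math.sqrt(sum(count * count for count in a.values()))
    norm_b = math.sqrt(sum(count * count for count in b.values()))
    if norm_a == 0 or norm_b == 0:
        # An empty text has no direction in the space; treat it as dissimilar.
        return 0.0
    return dot / (norm_a * norm_b)
```

Because word counts are never negative, the score stays between 0 and 1, which matches the range stated in the description.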

luminoso/news-keywords-searcher

Proof of concept project that implements a keyword search (text similarity) over a corpus

Language: Python - Size: 6.23 MB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 3 - Forks: 0

vikrantdeshpande09876/Masterize_Hospital_Entities

The goal was to maintain a ‘single version of truth’ for associated entities across the entire organization’s data sources. The RecordLinkage package is integrated with a recursive wrapper data pipeline for de-duplicating records and generating a master set. Similarity between two textual strings determines if they are a probabilistic match.

Language: Jupyter Notebook - Size: 38.5 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 2

ArkinDharawat/Web-of-Videos Fork of abagasra98/Web-of-Videos

Project to link scattered videos from various sources using concepts covered in video transcripts

Language: Python - Size: 9.78 MB - Last synced: over 1 year ago - Pushed: about 6 years ago - Stars: 1 - Forks: 0

howard-haowen/NLP-demos

NLP demos and talks made with Jupyter Notebook and reveal.js

Language: Jupyter Notebook - Size: 58.1 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 1 - Forks: 1

asgaardlab/21-markos-test_case_similarity_technique-code

Repository with the source code of our technique to analyze a test suite and find similar test cases written in natural language

Language: Jupyter Notebook - Size: 631 KB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 4 - Forks: 0

kaledhoshme123/Semantic-Similarity-using-TimeDistributed-LSTM

This notebook reviews the methodology for building a recurrent neural network that can analyze text sentences and determine whether they are congruent or contradictory in meaning

Language: Jupyter Notebook - Size: 150 KB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

TanyaChutani/Siamese-Network-On-Text-Data

Siamese Network On Quora Question Pairs Similarity Data Keras

Language: Jupyter Notebook - Size: 11 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 0 - Forks: 1

Aditya1001001/similarity-and-embedding-app

Learn about text similarity measures & text embedding methods.

Language: Python - Size: 6.75 MB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 1 - Forks: 1

pemagrg1/Magic-Of-TFIDF

TFIDF being the most basic and simple topic in NLP, there's a lot that can be done using TFIDF only! So, in this repo, I'll be adding the blog, TFIDF basics, wonders done using tfidf etc.

Language: Jupyter Notebook - Size: 465 KB - Last synced: over 1 year ago - Pushed: almost 4 years ago - Stars: 6 - Forks: 2

JoKerDii/nlp-projects

Built text classifiers by fine-tuning pre-trained BERT models

Language: Jupyter Notebook - Size: 2.15 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

kamilhan-karaismailoglu/Tversky-Text-Similarity-Calculation-Program

Program to Calculate Text Similarity ratio using Tversky Index, Sørensen–Dice coefficient and Jaccard Index. Made with C#. This program was written for the Algorithms and Programming lecture.

Language: C# - Size: 109 KB - Last synced: over 1 year ago - Pushed: almost 3 years ago - Stars: 2 - Forks: 0

the-black-knight-01/NLP-Models-Tensorflow Fork of huseinzol05/NLP-Models-Tensorflow

Gathers machine learning and Tensorflow deep learning models for NLP problems

Language: Jupyter Notebook - Size: 44.7 MB - Last synced: over 1 year ago - Pushed: about 3 years ago - Stars: 3 - Forks: 3

chris-santiago/stringcluster

A Scikit-Learn style deduper.

Language: Jupyter Notebook - Size: 345 KB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

joaquimgomez/BachelorsThesis-TextSimilarityMeasures

Code and models used in my Bachelor’s Degree Thesis about large text similarity measures are here. The similarities have been combined with machine learning based embeddings. This repository also contains raw results obtained from tasks/experiments.

Language: Python - Size: 509 MB - Last synced: over 1 year ago - Pushed: about 3 years ago - Stars: 2 - Forks: 0

kenneth-lange/java-nlp-text-similarity

Measure the similarity between different text documents.

Language: Java - Size: 18.6 KB - Last synced: over 1 year ago - Pushed: about 3 years ago - Stars: 2 - Forks: 1

Related Keywords
text-similarity (114), nlp (39), python (22), machine-learning (19), text-classification (18), natural-language-processing (17), bert (17), deep-learning (12), flask (9), word2vec (9), text-mining (7), cosine-similarity (7), sentiment-analysis (7), sentence-similarity (7), semantic-similarity (7), similarity (7), embeddings (7), pytorch (6), text-clustering (6), tf-idf (6), sentence-embeddings (6), text-embedding (5), text-processing (5), word-embeddings (5), nltk (5), transformer (5), nlp-machine-learning (5), quora-question-pairs (4), text-analysis (4), transformers (4), bert-embeddings (4), jaccard-similarity (4), levenshtein-distance (4), tensorflow (4), keras (4), java (3), text-search (3), clustering (3), topic-modeling (3), ocr (3), ner (3), speech-to-text (3), text-summarization (3), fasttext (3), bag-of-words (3), language-detection (3), question-answering (3), summarization (3), word-embedding (3), code-similarity (3), lstm (3), plagiarism (3), rest-api (3), plagiarism-checker (3), plagiarism-detection (3), tfidf-vectorizer (3), siamese-network (3), semantic-search (3), paraphrase-identification (3), python3 (3), spacy (2), gensim (2), openai (2), scraping (2), siamese-neural-network (2), attention (2), api-client (2), api (2), roberta (2), flask-application (2), string-similarity (2), similarity-analysis (2), semantic (2), glove (2), recommender-system (2), text-matching (2), django (2), tfidf (2), large-language-models (2), neural-network (2), bm25 (2), word-similarity (2), locality-sensitive-hashing (2), deduplication (2), data-mining (2), gensim-doc2vec (2), streamlit (2), natural-language-understanding (2), fastapi (2), text-to-speech (2), spelling-correction (2), question-generation (2), restful-api (2), word-vectors (2), neural-machine-translation (2), semantic-matching (2), scikit-learn (2), text-preprocessing (2), named-entity-recognition (2), machine-translation (2)