An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: text-similarity

SeanLee97/AnglE

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard

Language: Python - Size: 889 KB - Last synced at: 4 days ago - Pushed at: 3 months ago - Stars: 547 - Forks: 38

AbdoBakr20/resume

This repository serves as both a personal resume for Richard Adleta and a customizable resume template. It dynamically loads content from a JSON file, supports easy styling with CSS, and can be hosted on GitHub Pages for sharing or job applications.

Language: JavaScript - Size: 25.4 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

srbhr/Resume-Matcher

Improve your resumes with Resume Matcher. Get insights, keyword suggestions and tune your resumes to job descriptions.

Language: TypeScript - Size: 108 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 8,981 - Forks: 3,388

shibing624/text2vec

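text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.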

Language: Python - Size: 15.4 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 4,754 - Forks: 413

dodona-edu/dolos

🕵️ Source code plagiarism detection

Language: TypeScript - Size: 43.4 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 296 - Forks: 42

yongzhuo/near-synonym

near-synonym, an LLM-based toolkit for Chinese antonyms/synonyms. It can also compute word similarity, sentence similarity, and text similarity.

Language: Python - Size: 79 MB - Last synced at: 17 days ago - Pushed at: about 2 months ago - Stars: 25 - Forks: 1

justinbt1/Akin

Python library for detecting near duplicate texts in a corpus at scale.

Language: Python - Size: 2.77 MB - Last synced at: about 2 hours ago - Pushed at: 19 days ago - Stars: 8 - Forks: 0

Mintone-creators/string-proximity

String-proximity is a high-performance string comparison library built in Rust, offering efficient similarity and proximity functions.

Language: Rust - Size: 116 KB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 2 - Forks: 1

varunkhurana07/asag-gt

Multi-Relational Graph Transformer for Automatic Short Answer Grading (NAACL 2022)

Language: Jupyter Notebook - Size: 24.2 MB - Last synced at: 4 days ago - Pushed at: almost 3 years ago - Stars: 10 - Forks: 1

CLUEbenchmark/CLUEDatasetSearch

Search all Chinese NLP datasets, with commonly used English NLP datasets included.

Language: Python - Size: 8.87 MB - Last synced at: 30 days ago - Pushed at: over 2 years ago - Stars: 4,320 - Forks: 624

StephanGeorg/trigram-similarity

Determining the similarity of alphanumeric text based on trigram matching.

Language: JavaScript - Size: 231 KB - Last synced at: 3 days ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 3
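
As a rough illustration of trigram matching in general (not the code of the repository above), the sketch below compares two strings by the overlap of their character trigrams. The padding scheme, the lowercasing, and the function names `trigrams` and `trigram_similarity` are assumptions made for this example.

```python
def trigrams(text: str) -> set:
    """Return the set of character trigrams of a lowercased, space-padded string."""
    # Assumption: pad with two leading spaces and one trailing space so that
    # word boundaries contribute trigrams of their own.
    padded = f"  {text.lower()} "
    return {padded[i:i + 3] for i in range(len(padded) - 2)}


def trigram_similarity(a: str, b: str) -> float:
    """Share of trigrams the two strings have in common (intersection over union)."""
    set_a, set_b = trigrams(a), trigrams(b)
    return len(set_a & set_b) / len(set_a | set_b)


if __name__ == "__main__":
    print(trigram_similarity("night", "nacht"))
```

Identical strings score 1.0 and strings with no shared trigram score 0.0; other libraries may normalize differently.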

yongzhuo/char-similar

Chinese character glyph/pinyin/semantic similarity (single characters; can be used for data augmentation and for CSC typo detection and recognition tasks, such as building confusion sets)

Language: Python - Size: 4.66 MB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 13 - Forks: 3

izikeros/sentence-plagiarism

Compare sentences from input document with all sentences from reference documents - find very similar ones.

Language: Python - Size: 263 KB - Last synced at: 2 days ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

lonePatient/TorchBlocks

A PyTorch-based toolkit for natural language processing

Language: Python - Size: 481 KB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 158 - Forks: 26

nlpodyssey/cybertron

Cybertron: the home planet of the Transformers in Go

Language: Go - Size: 1.17 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 308 - Forks: 27

sidphbot/Auto-Research

Generate a custom, detailed survey paper with topic-clustered sections and proper citations from a single query in just under 30 minutes.

Language: Python - Size: 429 KB - Last synced at: 27 days ago - Pushed at: over 1 year ago - Stars: 58 - Forks: 7

themaximalist/vectordb.js

Simple in-memory vector database for text similarity in Node.js

Language: HTML - Size: 299 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 26 - Forks: 3

awslabs/aws-ai-solution-kit

Machine Learning APIs for common use cases, including: General OCR (Simplified/Traditional Chinese), Custom OCR, Image Similarity, Object Recognition, Face Detection, Face Comparison, Human Image Segmentation, Human Attribute Recognition, Pornography Detection, Image Super Resolution, Text Similarity, Car License Plate, etc.

Language: Python - Size: 80.7 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 175 - Forks: 25

ddangelov/RESTful-Top2Vec

Expose a Top2Vec model with a REST API.

Language: Python - Size: 243 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 90 - Forks: 20

NTMC-Community/awesome-neural-models-for-semantic-match

A curated list of papers dedicated to neural text (semantic) matching.

Language: HTML - Size: 158 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 779 - Forks: 121

Lipairui/textgo

Text preprocessing, representation, similarity calculation, text search and classification. Let's go and play with text!

Language: Python - Size: 532 KB - Last synced at: 12 days ago - Pushed at: over 3 years ago - Stars: 45 - Forks: 3

l1ght14/resume-screener-nlp

An AI-powered Resume Screener built with NLP and Streamlit. Upload multiple resumes and get similarity scores against a job description using TF-IDF and cosine similarity. Highlights missing keywords and suggests improvements.

Language: Python - Size: 266 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

howard-haowen/NLP-demos

NLP demos and talks made with Jupyter Notebook and reveal.js

Language: Jupyter Notebook - Size: 58.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

sljavi/text-sound-similarity

JavaScript library useful to find degrees of similarity between text's phonetics

Language: JavaScript - Size: 61.5 KB - Last synced at: 12 days ago - Pushed at: almost 8 years ago - Stars: 19 - Forks: 1

taimoorkhan-nlp/text_edit_distance_similarity

Compares two text samples for similarity/dissimilarity, measured as the edits needed to convert the source string into the target string.

Language: Jupyter Notebook - Size: 295 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 2
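
To make the idea of "edits needed to convert source to target" concrete, here is a minimal sketch of the classic Levenshtein distance (insertions, deletions, and substitutions, each costing 1). It is a generic illustration, not the code of the repository above; the function names and the length-based normalization in `edit_similarity` are assumptions.

```python
def edit_distance(source: str, target: str) -> int:
    """Levenshtein distance computed with a two-row dynamic programming table."""
    previous = list(range(len(target) + 1))  # distances from "" to each target prefix
    for i, s_char in enumerate(source, start=1):
        current = [i]  # deleting i characters turns the source prefix into ""
        for j, t_char in enumerate(target, start=1):
            cost = 0 if s_char == t_char else 1
            current.append(min(
                previous[j] + 1,         # delete s_char
                current[j - 1] + 1,      # insert t_char
                previous[j - 1] + cost,  # substitute, or keep if equal
            ))
        previous = current
    return previous[-1]


def edit_similarity(source: str, target: str) -> float:
    """Assumed normalization: 1 minus distance divided by the longer length."""
    longest = max(len(source), len(target))
    if longest == 0:
        return 1.0
    return 1.0 - edit_distance(source, target) / longest


if __name__ == "__main__":
    print(edit_distance("kitten", "sitting"))  # 3
```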

tomlin7/AI-research-assistant

Semantic document search system with pgvector and PGAI

Language: Python - Size: 50.8 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 2

kaoutarmi/Contextual-Text-Similarity

Contextual Text Similarity with Sentence-BERT is a project for measuring the similarity between sentences using Sentence-BERT and cosine similarity. It finds the sentences that are contextually closest to a given sentence in a dataset.

Language: Jupyter Notebook - Size: 294 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

khalidbelk/jaccard

🧬 Calculate the similarity index between two texts

Language: OCaml - Size: 11.7 KB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 1
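
For readers unfamiliar with the Jaccard index, the sketch below shows the usual definition, the size of the intersection divided by the size of the union, applied to word sets. It is written in Python for illustration and is not the repository's OCaml code; whitespace tokenization, lowercasing, and the name `jaccard_index` are assumptions.

```python
def jaccard_index(text_a: str, text_b: str) -> float:
    """Jaccard index of the two texts' word sets: |A & B| / |A | B|."""
    words_a = set(text_a.lower().split())
    words_b = set(text_b.lower().split())
    if not words_a and not words_b:
        return 1.0  # assumption: two empty texts count as identical
    return len(words_a & words_b) / len(words_a | words_b)


if __name__ == "__main__":
    print(jaccard_index("the cat sat", "the cat ran"))  # 0.5
```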

nityansuman/marvin

Web app that automatically generates subjective or objective tests and evaluates user responses without human intervention, using machine learning and natural language processing.

Language: CSS - Size: 18.7 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 111 - Forks: 33

sheriff1max/recs-searcher

Python library for correcting registry and spelling errors in user input when comparing with a database of texts.

Language: Python - Size: 1.97 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0

tlatkowski/multihead-siamese-nets

Implementation of Siamese Neural Networks built upon multihead attention mechanism for text semantic similarity task.

Language: Jupyter Notebook - Size: 1.43 MB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 182 - Forks: 43

Md-Emon-Hasan/DistilBERT-model-with-HF-Transformer

📝 DistilBERT, a lightweight Transformer model from Hugging Face, for various NLP tasks without requiring custom fine-tuning or datasets.

Language: Python - Size: 71.3 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

brianrisk/simphile-text-similarity-nlp

Python Text Similarity NLP Library

Language: Python - Size: 199 KB - Last synced at: 8 days ago - Pushed at: 12 months ago - Stars: 34 - Forks: 5

bstoilov/digitalowl-pysemantics

Free Python client that uses the digitalowl.org NLP API.

Language: Python - Size: 30.3 KB - Last synced at: 4 days ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 2

vinmahajan/Plagiarism_Detection

Plagiarism Detection System, designed to identify similarities between a given text and existing online content.

Language: Python - Size: 32.2 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0

IDEA-CCNL/GTS-Engine

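GTS Engine: A powerful NLU Training System. GTS-Engine is an out-of-the-box, high-performance natural language understanding engine focused on few-shot tasks; it can automatically produce NLP models from only a small number of samples.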

Language: Python - Size: 3.81 MB - Last synced at: 13 days ago - Pushed at: over 2 years ago - Stars: 91 - Forks: 10

fanghon/antiplag

Assignment plagiarism-checking software that computes code similarity, text similarity, and image similarity for the code, documents, and images in assignments.

Language: Java - Size: 52.6 MB - Last synced at: 6 months ago - Pushed at: 9 months ago - Stars: 379 - Forks: 61

adhaamehab/textblob-ar 📦

Arabic support for textblob

Language: Python - Size: 4.24 MB - Last synced at: about 18 hours ago - Pushed at: over 3 years ago - Stars: 85 - Forks: 24

xiaorancs/text-similarity

Compute similarity using different methods.

Language: Python - Size: 8.79 KB - Last synced at: 12 days ago - Pushed at: over 6 years ago - Stars: 42 - Forks: 9

atinyshrimp/TripAdvisor-Recommendation-ML-NLP

Machine Learning and NLP models for improving text-based recommendations on TripAdvisor, using BM25, TF-IDF, embeddings, and a Hybrid approach.

Language: Jupyter Notebook - Size: 489 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

ycatsh/past-pilot

Manage school resources and navigate past papers with ease.

Language: HTML - Size: 416 KB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

a-poor/jarowinkler

An implementation of the Jaro-Winkler string similarity algorithm in Go.

Language: Go - Size: 10.7 KB - Last synced at: 4 days ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

jnferfer/text-magnet

Code for TextMagnet web app

Language: Python - Size: 55.9 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 2

hyper35/Text-Similarity-Using-SBERT

A GUI-based tool to calculate cosine similarity between two texts. It uses SBERT models from the sentence-transformers library for text encoding and tkinter for the interface.

Language: Python - Size: 3.91 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

michellemashutian/NJUST-at-SMP

This repository contains some basic text similarity algorithms.

Language: Python - Size: 6.84 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

radismili/Mobile-games-recommender-system

Content-based mobile game recommender system using game similarities.

Language: Jupyter Notebook - Size: 527 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

eren23/semantic-code-searcher

Basic example of searching code semantically in GitHub profiles, written in Python.

Language: Python - Size: 44 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

rhnvrm/textsimilarity

Go package that computes the similarity between two string documents using cosine similarity and TF-IDF, along with various other useful things.

Language: Go - Size: 57.6 KB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 4

luminoso/news-keywords-searcher

Proof of concept project that implements a keyword search (text similarity) over a corpus

Language: Python - Size: 6.23 MB - Last synced at: 8 months ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 0

Mohana-Murugan/NLP

NLP

Language: Jupyter Notebook - Size: 4.44 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

abhishek21441/NLP-Assignments

Assignments of the course CSE 556 - Natural Language Processing

Language: Jupyter Notebook - Size: 22.2 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

siddgood/podcast-recommendation-engine

🎤 Building a content-based podcast recommender system using NLP

Language: Jupyter Notebook - Size: 143 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 30 - Forks: 4

ywu94/python-text-distance

A python implementation of a variety of text/string distance and similarity metrics. No GPL!

Language: Python - Size: 62.5 KB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 8 - Forks: 2

aldebran97/AC

Aho-Corasick (AC) automaton, similar-text retrieval, lexicon matching, tokenizer

Language: Java - Size: 923 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 17 - Forks: 0

UAlberta-NLP/SemEval2024-STR

This repository is for the paper UAlberta at SemEval-2024 Task 1: A Potpourri of Methods for Quantifying Multilingual Semantic Textual Relatedness and Similarity. In Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024). Association for Computational Linguistics.

Language: Jupyter Notebook - Size: 4.47 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 1

anikoloff/nlp

Natural Language Processing (NLP) exercises and projects.

Language: Jupyter Notebook - Size: 107 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

prdowluri/Text-Similarity-Analyzer

Build an algorithm/model that can quantify the degree of similarity between two texts based on semantic similarity. Semantic Textual Similarity (STS) assesses the degree to which two sentences are semantically equivalent to each other.

Language: Jupyter Notebook - Size: 5.68 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

raviagheda/ml-movie-recommendation

Learn ML

Language: Jupyter Notebook - Size: 8.67 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

chiragjn/short-text-similarity

Short Text Similarity as described in https://dl.acm.org/citation.cfm?id=2806475

Language: Python - Size: 8.79 KB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 16 - Forks: 1

VanekPetr/text-similarity-ranking

Algorithm to rank text similarity between set of strings and given inputs

Language: Python - Size: 128 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

giacbrd/python-dandelion-eu

A python client for connecting to all the services provided by https://dandelion.eu

Language: Python - Size: 71.3 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 36 - Forks: 15

hellonlp/sentence-similarity

Text similarity, semantic vectors, text vectors, text-similarity, similarity, sentence-similarity, BERT, SimCSE, BERT-Whitening, Sentence-BERT, PromCSE, SBERT

Language: Python - Size: 221 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 37 - Forks: 11

yaoxiaoyuan/mimix

Mimix: A Text Generation Tool and Pretrained Chinese Models

Language: Python - Size: 6.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 144 - Forks: 16

behitek/VietNamTextSimilarity

Vietnamese text similarity using TF-IDF, written in Java

Language: Java - Size: 31.2 MB - Last synced at: about 1 year ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

muhammedalikocabey/Text-Similarity-of-Products

Text similarity of products on e-commerce sites

Language: Python - Size: 5.11 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

emarkou/Text-Similarity

A text similarity computation using minhashing and Jaccard distance on the Reuters dataset

Language: R - Size: 69.3 KB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 16 - Forks: 5

zake7749/CIKM-AnalytiCup-2018

[ACM-CIKM] 2nd place solution at CIKM AnalytiCup 2018, a task for determining short text similarities.

Language: Python - Size: 1.47 MB - Last synced at: 7 months ago - Pushed at: about 6 years ago - Stars: 76 - Forks: 15

lprtk/text-summarize

Create your text summarizer with Python and Streamlit

Language: Python - Size: 1.17 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

cr1m5onk1ng/text_similarity

An NLP library for text similarity based on Transformer models

Language: Python - Size: 12.3 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 1

cjymz886/sentence-similarity

Experiments with and comparison of four sentence/text similarity calculation methods

Language: Python - Size: 50.7 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 277 - Forks: 60

anas-899/synthetic_text_images_generator

Language: Jupyter Notebook - Size: 1.48 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

BairagiSaurabh/Project-I-Recommendation-system-Amazon

Recommending items for women fashion wear using "Title similarity" and "Nearest Neighbours"

Language: Jupyter Notebook - Size: 6.84 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

MelaniaNitu/Detection-of-Romanian-Machine-Generated-Text

Language: Python - Size: 16.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

shubham16394/Text-Similarity

Text Similarity using BM25 algorithm and WordNet

Language: Python - Size: 74.2 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 14 - Forks: 8

javadshoja/text-similarity-comparing

💡 A simple Svelte app for comparing text similarity.

Language: Svelte - Size: 12.7 KB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

VietHoang1512/KPA

Matching The Statements: A Simple and Accurate Model for Key Point Analysis (ArgMining | EMNLP 2021)

Language: Python - Size: 1.17 MB - Last synced at: 3 days ago - Pushed at: over 3 years ago - Stars: 12 - Forks: 1

akurizaldirv/essayscoring

Automatic Essay Scoring implements the Rabin-Karp Algorithm and Synonym Recognition

Language: Python - Size: 1.45 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

murray-z/text_analysis_tools

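Chinese text analysis toolkit (including text classification, text clustering, text similarity, keyword extraction, key phrase extraction, sentiment analysis, text error correction, text summarization, topic keywords, synonyms and near-synonyms, and event triple extraction)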

Language: Python - Size: 9.98 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 533 - Forks: 114

padeoe/cail2019

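Second-place solution for the similar case matching task at CAIL 2019 (with dataset and documentation), from the champion team of the CAIL 2020/2021 judicial examination track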

Language: Python - Size: 261 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 234 - Forks: 37

Mstfakts/Text-Similarity-Towhee-MilvusDB

A Milvus database and NLP project for running text-based similarity searches on a dataset you upload. Milvus is a vector database, and Towhee provides several advantages such as ready-made pipelines.

Language: Jupyter Notebook - Size: 20.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

amansrivastava17/lstm-siamese-text-similarity

⚛️ A Keras-based implementation of a Siamese architecture using LSTM encoders to compute text similarity

Language: Python - Size: 69.3 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 278 - Forks: 91

zhongbin1/bert_for_text_matching

Simplified fine-tuning and deployment code based on BERT for text matching.

Language: Python - Size: 6.23 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 14 - Forks: 4

IceFlameWorm/bert_fine_tune Fork of huggingface/transformers

BERT-based Siamese network for text similarity

Language: Jupyter Notebook - Size: 21.6 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 7 - Forks: 0

yanliang12/bert_text_embedding

Embedding a text into a vector using pre-trained BERT word embeddings and pooling layers, for the purpose of measuring text similarity

Language: Python - Size: 3.37 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 0

yashchitre03/Fetch-Rewards

Django web application for calculating text similarity using naive algorithms.

Language: Python - Size: 210 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

osainz59/AQGSAS

Automatic Question Generation and Short Answer Scoring system

Language: Python - Size: 298 KB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 7 - Forks: 1

MagallanesFito/weheart

Meet people just like you

Language: Python - Size: 34.1 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

CrownedHead06/AnimeOdyssey

Anime Odyssey: Embark on a Journey of Recommendations 🥷

Language: Jupyter Notebook - Size: 355 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 8 - Forks: 1

bysiber/text_similarity_tfidf

Measures the similarity between text documents using the TF-IDF (Term Frequency-Inverse Document Frequency) algorithm.

Language: Python - Size: 4.88 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0
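
The general TF-IDF approach to document similarity can be sketched with the standard library alone. This is an illustration under stated assumptions, not the repository's code: tokenization is a plain whitespace split, the IDF uses a smoothed variant, ln((1 + N) / (1 + df)) + 1, and the names `tfidf_vectors` and `cosine_similarity` are chosen for this example.

```python
import math
from collections import Counter


def tfidf_vectors(documents):
    """Return one sparse {term: weight} TF-IDF vector per document."""
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            term: (count / len(tokens))
            * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(u, v):
    """Cosine of the angle between two sparse vectors; 0.0 if either is empty."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


if __name__ == "__main__":
    docs = ["the cat sat on the mat", "the cat lay on the rug", "stock prices fell"]
    vecs = tfidf_vectors(docs)
    print(cosine_similarity(vecs[0], vecs[1]), cosine_similarity(vecs[0], vecs[2]))
```

Documents that share no terms score 0.0, and identical documents score 1.0 up to floating-point rounding.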

akulez/Text-Similarity_Phrase-Matching

Implementing text similarity for US patents using modern Word2Vec and USE (Universal Sentence Encoder) models, plus some classical algorithms such as Jaro-Winkler and Jaccard

Language: Jupyter Notebook - Size: 821 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

MaastrichtU-IDS/docona

DoConA (Document Content and Citation Analysis Pipeline) is an open source, configurable and extensible Python tool to analyse the level of agreement between the citation network of a set of textual documents and the textual similarity of these documents.

Language: Python - Size: 87.9 KB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 2

YC-wind/bert_few_shot_learning

BERT for few-shot learning; provides Siamese, matching, and prototypical networks

Language: Python - Size: 77.1 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 12 - Forks: 0

Ikram-Maulana/text-similarity-api

🚀 Text Similarity API is a free and open source API for comparing text similarity

Language: TypeScript - Size: 4.22 MB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

somjit101/NLP-Star-Trek-Scripts

Using digital form of the actual scripts of the 'Star Trek' science fiction series to perform interesting NLP tasks and answering some questions on Topic Modelling, Character properties and the plot as a whole.

Language: Jupyter Notebook - Size: 8.51 MB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

jarvis0/image-search

🌄 Search images through text by writing a caption or a description. You will be intelligently assisted while typing.

Language: Jupyter Notebook - Size: 55.9 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

cja5553/attention-driven-imitation-in-consumer-reviews

Code for the manuscript titled "Attention-driven imitation in consumer reviews" by Charles Alba, Mikhail Spektor and Lukasz Walasek

Language: Jupyter Notebook - Size: 418 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

soumyagautam/text-similarity

Flask-based software for finding the most similar texts in a list.

Language: HTML - Size: 40 KB - Last synced at: 28 days ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

JaouadT/Sentence-similarity

Sentence similarity between sentences or paragraphs: flask API + streamlit app.

Language: Python - Size: 145 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

greg2451/aggregating-text-similarity-metrics

This repository consists of a benchmark of various text similarity measures for Natural Language Generation (NLG) evaluation, on multiple datasets, and provides a way to aggregate these measures.

Language: Jupyter Notebook - Size: 1.22 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

ZhengZixiang/chip2019_task2_question_pairs_matching

Baseline for the CHIP 2019 Ping An Healthcare Technology disease question-answering transfer learning competition; ranked 7th

Language: Python - Size: 27.3 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 25 - Forks: 6

Related Keywords
text-similarity (136), nlp (45), python (28), machine-learning (27), natural-language-processing (21), text-classification (19), bert (18), deep-learning (13), cosine-similarity (10), sentiment-analysis (9), similarity (9), word2vec (9), flask (9), embeddings (8), sentence-similarity (8), sentence-embeddings (8), text-mining (7), semantic-similarity (7), pytorch (6), text-embedding (6), transformer (6), tf-idf (6), text-clustering (6), nlp-machine-learning (6), levenshtein-distance (5), summarization (5), text-processing (5), text-summarization (5), word-embeddings (5), nltk (5), semantic-search (5), bert-embeddings (4), tensorflow (4), quora-question-pairs (4), jaccard-similarity (4), text-analysis (4), named-entity-recognition (4), keras (4), plagiarism (4), question-answering (4), transformers (4), topic-modeling (4), java (3), tfidf-vectorizer (3), lstm (3), paraphrase-identification (3), siamese-network (3), pretrained-models (3), python3 (3), rest-api (3), language-detection (3), word-similarity (3), speech-to-text (3), sentence-transformers (3), ner (3), data-science (3), clustering (3), word-embedding (3), text-search (3), ocr (3), bm25 (3), hacktoberfest (3), fasttext (3), machine-translation (3), spelling-correction (3), code-similarity (3), similarity-analysis (3), tfidf (3), bag-of-words (3), huggingface (3), streamlit (3), plagiarism-detection (3), api-client (2), roberta (2), search (2), semantic (2), postgres (2), huggingface-transformers (2), optical-character-recognition (2), api (2), text-generation (2), restful-api (2), kaggle-dataset (2), scraping (2), semantic-matching (2), jaro-winkler (2), text-preprocessing (2), triplet-loss (2), text (2), sbert (2), openai (2), content-based-recommendation (2), large-language-models (2), neural-network (2), fastapi (2), gensim (2), spacy (2), scikit-learn (2), sts (2), string-similarity (2)