An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: near-duplicate-detection

Luis-Varona/shadowseek

A CLI tool for near-duplicate detection in text files, written in Rust with no dependencies on runtime environments.

Language: Rust - Size: 30.3 KB - Last synced at: 23 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

Logan-Fouts/Thesis

Bachelor's Thesis on Near-Duplicate Image Detection. This repo contains all resources, code, and documentation developed during the process.

Language: Python - Size: 1.15 GB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

justinbt1/Akin

Python library for detecting near duplicate texts in a corpus at scale.

Language: Python - Size: 2.77 MB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 8 - Forks: 0

iscc/iscc-specs

ISCC: International Standard Content Code

Language: Python - Size: 6.74 MB - Last synced at: 12 days ago - Pushed at: about 1 year ago - Stars: 48 - Forks: 9

vitali-fedulov/images4

Image similarity in Golang. Version 4 (LATEST)

Language: Go - Size: 890 KB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 89 - Forks: 10

s-emanuilov/LangVec

Language of Vectors (LangVec) is a simple Python library designed for transforming numerical vector data into a language-like structure using a predefined set of words (lexicon).

Language: Python - Size: 1 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

SasheVuchkov/near-duplicate-docs

Simple library for finding duplicate and near-duplicate text documents in massive sets/libraries/databases

Language: TypeScript - Size: 2 MB - Last synced at: 6 days ago - Pushed at: over 3 years ago - Stars: 9 - Forks: 0

vitali-fedulov/imagehash2

Fast image similarity search with hash tables (Golang). Version 2 (LATEST)

Language: Go - Size: 33.2 KB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

vitali-fedulov/imagehash

Fast image similarity search with hash tables (Golang). Version 1

Language: Go - Size: 43 KB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1

kamil-sita/image-copy-finder

Multi module project focused on near-duplicate search for images.

Language: Java - Size: 176 KB - Last synced at: 10 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 1

MaviVestini/ADM-LT_HW1

First homework for the Advance Data Mining course

Language: HTML - Size: 5.91 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

LexCybermac/smlr

A Simple Image Clustering Script using CLIP and Hierarchial Clustering

Language: Python - Size: 21.5 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

santurini/Search-Engine-Evaluation-and-Near-Duplicate-Detection

Exploiting the PyTerrier library to perform Search Engine Evaluation and Near Duplicate Detection on different datasets.

Language: Jupyter Notebook - Size: 267 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

giulio-derasmo/Search-Engine-Evaluation-and-Near-Duplicate-Detection

Exploiting the PyTerrier library to build a Search Engine and resolve the Near Duplicate Detection tasks.

Language: Jupyter Notebook - Size: 547 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

sayakpaul/near-dup-parser

Holds code for near-duplicate image parser using optimized image classifiers.

Language: Jupyter Notebook - Size: 6.32 MB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1