Topic: "duplicates"
qarmin/czkawka
Multi functional app to find duplicates, empty folders, similar images etc.
Language: Rust - Size: 4.61 MB - Last synced at: 3 days ago - Pushed at: 12 days ago - Stars: 23,624 - Forks: 741

kucherenko/jscpd
Copy/paste detector for programming source code.
Language: TypeScript - Size: 9.13 MB - Last synced at: 3 days ago - Pushed at: 10 days ago - Stars: 4,870 - Forks: 211

sahib/rmlint
Extremely fast tool to remove duplicates and other lint from your filesystem
Language: C - Size: 12.4 MB - Last synced at: 1 day ago - Pushed at: 18 days ago - Stars: 2,076 - Forks: 137

scinos/yarn-deduplicate
Deduplication tool for yarn.lock files
Language: TypeScript - Size: 7.27 MB - Last synced at: 4 days ago - Pushed at: 16 days ago - Stars: 1,389 - Forks: 57

sean-public/python-hashes
Interesting (non-cryptographic) hashes implemented in pure Python.
Language: Python - Size: 29.3 KB - Last synced at: 10 months ago - Pushed at: over 3 years ago - Stars: 240 - Forks: 43

kristiankoskimaki/vidupe
Vidupe is a program that can find duplicate and similar video files. V1.211 released on 2019-09-18, Windows exe here:
Language: C++ - Size: 266 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 176 - Forks: 18

F483/dejavu
Quickly detect already witnessed data.
Language: Go - Size: 305 KB - Last synced at: 13 days ago - Pushed at: 10 months ago - Stars: 157 - Forks: 5

PJDude/dude
Duplicates Detector is a cross-platform GUI utility for finding duplicate files, allowing you to delete or link them to save space. Duplicate files are displayed and processed on two synchronized panels for efficient and convenient operation.
Language: Python - Size: 4.48 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 140 - Forks: 11

kouhin/redux-dataloader
Loads async data for Redux apps focusing on preventing duplicated requests and dealing with async dependencies.
Language: JavaScript - Size: 108 KB - Last synced at: 9 days ago - Pushed at: over 7 years ago - Stars: 139 - Forks: 3

Canop/backdown
A deduplicator
Language: Rust - Size: 541 KB - Last synced at: 4 days ago - Pushed at: 8 months ago - Stars: 132 - Forks: 7

scrubbbbs/cbird
Command-line program for Content-Based Image Retrieval of images and videos. Includes tools for general search and de-duplication.
Language: C++ - Size: 12.7 MB - Last synced at: 4 days ago - Pushed at: 22 days ago - Stars: 122 - Forks: 5

matteodelabre/mongoose-beautiful-unique-validation
Plugin for Mongoose that turns duplicate errors into regular Mongoose validation errors
Language: JavaScript - Size: 194 KB - Last synced at: 3 days ago - Pushed at: almost 3 years ago - Stars: 117 - Forks: 38

jvirkki/dupd
CLI utility to find duplicate files
Language: C - Size: 1.9 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 109 - Forks: 16

microsoft/near-duplicate-code-detector
A simple tool for detecting near-duplicate source code
Language: C# - Size: 38.1 KB - Last synced at: about 11 hours ago - Pushed at: 8 months ago - Stars: 100 - Forks: 31

cloud-py-api/mediadc
Nextcloud Media Duplicate Collector application
Language: PHP - Size: 91.1 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 99 - Forks: 9

eyalroz/removedupes
Remove Duplicate Messages
Language: JavaScript - Size: 8.56 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 93 - Forks: 7

StephaneCouturier/Katalog
Katalog is an application to manage catalogs of disks and files to search and get statistics.
Language: C++ - Size: 12.2 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 81 - Forks: 7

vuolter/deplicate 📦
Advanced Duplicate File Finder for Python
Language: Python - Size: 137 KB - Last synced at: 10 days ago - Pushed at: over 4 years ago - Stars: 77 - Forks: 17

twpayne/find-duplicates
Find duplicate files quickly.
Language: Go - Size: 82 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 57 - Forks: 1

src-d/gemini
Advanced similarity and duplicate source code at scale.
Language: Scala - Size: 7 MB - Last synced at: 17 days ago - Pushed at: almost 6 years ago - Stars: 55 - Forks: 16

deric/es-dedupe
Tool for removing duplicate documents from Elasticsearch
Language: Python - Size: 130 KB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 54 - Forks: 22

src-d/apollo
Advanced similarity and duplicate source code proof of concept for our research efforts.
Language: Python - Size: 197 KB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 52 - Forks: 17

shevchenkoartem/lastfm-smart-deduper
JS script that allows you to remove duplicates from your Last.fm scrobbles library.
Language: JavaScript - Size: 1.74 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 48 - Forks: 2

LyonSyonII/akin
Rust crate for writing repetitive code easier and faster.
Language: Rust - Size: 51.8 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 43 - Forks: 2

eliasfloreteng/bitwarden_find_duplicates
Find duplicate logins based on domain, from Bitwarden export. Open source for your safety.
Language: HTML - Size: 34.2 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 34 - Forks: 8

Bartozzz/potential-duplicates-bot
A configurable GitHub App which checks for potential issue duplicates using Damerau–Levenshtein distance algorithm.
Language: JavaScript - Size: 28.3 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 31 - Forks: 6

guicamest/GDuplicate-Finder
GDuplicate Finder - A Groovy way to find duplicates among your computer and network shares!
Language: Groovy - Size: 49.7 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 30 - Forks: 7

Navid2zp/dups
A CLI tool to find/remove duplicate files supporting multi-core and different algorithms (MD5, SHA256, and XXHash).
Language: Go - Size: 62.5 KB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 26 - Forks: 3

gacarrillor/AppendFeaturesToLayer
QGIS Processing plugin to add an algorithm for upserting features from a source vector layer to an existing target vector layer.
Language: Python - Size: 162 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 25 - Forks: 5

rasheedsulayman/DuplicateContactsRemover
📒 A simple app to optimize your address book and remove duplicate contacts.
Language: Kotlin - Size: 30.9 MB - Last synced at: 30 days ago - Pushed at: about 2 years ago - Stars: 22 - Forks: 10

arikw/outlook-duplicated-items-remover
A VBA script that finds and moves duplicated items in selected outlook folders
Language: VBA - Size: 151 KB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 22 - Forks: 2

caluml/finddups
Find duplicate files on your computer
Language: Java - Size: 431 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 22 - Forks: 5

rustomax/ndf
Duplicate file finder written in Nim
Language: Nim - Size: 22.5 KB - Last synced at: about 8 hours ago - Pushed at: over 4 years ago - Stars: 20 - Forks: 0

mkearney/funique
⌚️ A faster unique() function
Language: R - Size: 7.15 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 19 - Forks: 0

vladsmirnov/url-rewrites
Magento 1.x module to target the URL Rewrite issue
Language: PHP - Size: 23.4 KB - Last synced at: 10 months ago - Pushed at: over 7 years ago - Stars: 19 - Forks: 0

raspi/duplikaatti
Remove duplicate files.
Language: Go - Size: 33.2 KB - Last synced at: 26 days ago - Pushed at: over 3 years ago - Stars: 18 - Forks: 1

lmammino/indexed-string-variation
Experimental JavaScript module to generate all possible variations of strings over an alphabet using an n-ary virtual tree
Language: JavaScript - Size: 52.7 KB - Last synced at: 16 days ago - Pushed at: over 7 years ago - Stars: 18 - Forks: 4

kevinpollet/pocket-deduper
Remove duplicates from your Pocket list.
Language: Go - Size: 6.55 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 17 - Forks: 0

cemahseri/Duplica
A very fast duplicate file finder.
Language: C# - Size: 25.4 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 14 - Forks: 4

DeaDSouL/dugu 📦
Find, remove and avoid duplicates with dugu: The Duplicates Guru
Language: Python - Size: 85.9 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 4

rmdm/nodups
No dups, no doubts
Language: JavaScript - Size: 2.24 MB - Last synced at: 4 days ago - Pushed at: almost 5 years ago - Stars: 14 - Forks: 0

kotharan/LeetCode_Solutions
Solutions of LeetCode interview questions
Language: C++ - Size: 2.36 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 14 - Forks: 5

rsalmei/refine
Refine your file collections using Rust!
Language: Rust - Size: 628 KB - Last synced at: 7 days ago - Pushed at: 13 days ago - Stars: 13 - Forks: 0

NicolasBizzozzero/dupe_eraser
A command-line tool which automate the deletion of duplicate files based on their hash or perceptual-hash.
Language: Python - Size: 54.7 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 13 - Forks: 0

7room/aya
Disk Usage Analyzer & Duplicate File Finder
Size: 71.3 KB - Last synced at: 10 days ago - Pushed at: over 2 years ago - Stars: 12 - Forks: 0

DragonOfMath/dupe-images
Node.js package for finding and removing duplicate image files with extreme precision
Language: JavaScript - Size: 17.6 KB - Last synced at: 11 months ago - Pushed at: almost 7 years ago - Stars: 11 - Forks: 3

danielpclark/dfm
Duplicate File Manager
Language: Ruby - Size: 324 KB - Last synced at: about 1 month ago - Pushed at: over 8 years ago - Stars: 11 - Forks: 1

eh2k/fs-inspect
FS-Inspect is an easy to use tool designed to give you an overview about your files and directories (Disk Usage).
Language: C - Size: 1.94 MB - Last synced at: about 2 years ago - Pushed at: about 10 years ago - Stars: 11 - Forks: 5

prowide/prowide-integrator-examples
Source code examples for "Prowide Integrator"
Language: Java - Size: 8.55 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 10 - Forks: 17

mesqueeb/compare-anything
Compares objects and tells you which props are duplicate, and props are only present once.
Language: TypeScript - Size: 666 KB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 10 - Forks: 0

thomas694/finddupe
Enhanced version of finddupe, a duplicate file detector for Windows
Language: C - Size: 204 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 9 - Forks: 0

carlbeech/fast-duplicate-finder
A python program to locate duplicate files - and do it fast
Language: Python - Size: 74.4 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 9 - Forks: 3

rix4uni/unew
A tool combined of 2 commands features in 1 sort and tee for adding new lines to files, skipping duplicates
Language: Go - Size: 49.8 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 8 - Forks: 1

raspi/samanlainen
Delete duplicate files
Language: Rust - Size: 54.7 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 1

EllangoK/duplicate-image-remover
Uses SSIM and MSE to get rid of duplicates and near duplicates
Language: Python - Size: 21.5 KB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 8 - Forks: 2

CedricReichenbach/audiomerge
Merge multiple scattered music collections into one, taking only the best version of duplicates
Language: Java - Size: 33.5 MB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 8 - Forks: 1

artemanufrij/findfileconflicts
An elementary OS app
Language: Vala - Size: 6.87 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 7 - Forks: 5

redonkulus/dump-deps
Dump NPM package dependencies to display packages with multiple versions.
Language: JavaScript - Size: 15.6 KB - Last synced at: 9 days ago - Pushed at: almost 5 years ago - Stars: 7 - Forks: 0

Dmitriy-Vas/go-file-copies
A Go program to get duplicates from specified paths.
Language: Go - Size: 10.7 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 7 - Forks: 0

zx8754/StackOverflowNotes
SO [r] tag list of dupes, help, links, etc.
Size: 37.1 KB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 7 - Forks: 3

ant-js/compare-similarity
👁 Compare the similarity of two strings
Language: TypeScript - Size: 12.7 KB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

innovatrics/dedubcheck
dedubcheck - De-Duplicate Dependency Checker for Node.js monorepos
Language: JavaScript - Size: 29.3 KB - Last synced at: about 1 month ago - Pushed at: almost 6 years ago - Stars: 6 - Forks: 1

TSunny007/Document-Similarity
Using Jaccard-Similarity and Minhashing to determine similarity between two text documents
Language: Jupyter Notebook - Size: 26.4 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 6 - Forks: 3

tasleson/duplihere
Copy & Paste finder for structured text files.
Language: Rust - Size: 104 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 5 - Forks: 1

LibreTranslate/RemoveDup
Remove duplicates from parallel corpora
Language: Python - Size: 835 KB - Last synced at: 27 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

arasgungore/job-posting-duplicate-detection
A project aiming to leverage text embeddings and Milvus, a high-performance vector search engine, to detect duplicate job postings.
Language: Python - Size: 289 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

nav9/duplicateFileFinder
This program finds duplicate files in a folder and its subfolders. Duplicates are moved to a separate folder. A few other modes of operation are also planned/available.
Language: Python - Size: 104 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

pcraciunoiu/AndroidSMSBackupRestoreCleaner Fork of NumbGnat/AndroidSMSBackupRestoreCleaner
This cleans up duplicate SMS entries in a backup created by SMS Backup & Restore Android app.
Language: Python - Size: 4.78 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 2

SuryaMaulana/kitabisa.dev 📦
Kitabisa.com DUPLICATE.
Language: PHP - Size: 53 MB - Last synced at: 7 months ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 0

carlnewton/line-duplicates
Quickly review or remove duplicate lines from a body of text
Language: HTML - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 1

KeyWeeUsr/Bear
:bear: The decluttering deduplicator
Language: Python - Size: 135 KB - Last synced at: 29 days ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 1

PythonCoderAS/DuplicateBot 📦
Bot for duplicates
Language: Python - Size: 38.1 KB - Last synced at: 3 months ago - Pushed at: about 7 years ago - Stars: 4 - Forks: 2

tjcafferkey/removeduplicates
Function that removes duplicate items and objects based on a key from an array of objects.
Language: JavaScript - Size: 4.88 KB - Last synced at: 15 days ago - Pushed at: almost 8 years ago - Stars: 4 - Forks: 0

qwertz19281/dupion
Duplicate file/folder finder, can also scan in archives, HDD optimized
Language: Rust - Size: 365 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 1

GreenComfyTea/duplicate-emote-check-tool
Check if you have duplicate emotes across FrankerFaceZ, BetterTTV and 7TV for Twitch.
Language: JavaScript - Size: 414 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

ajmalshahabudeen/Bitwarden-Duplicate-remover
When Importing multiple CSV files Bitwarden creates Duplicate Entries. So this Python script will remove duplicate entries and keep ONE.
Language: Python - Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 2

marinebox/tab-killer
A Chrome extension. Close all duplicate tabs.
Language: JavaScript - Size: 738 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 1

molevol-ub/BacterialDuplicates
Identification of putative duplicated genes among bacterial genomes
Language: Perl - Size: 2.22 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 1

codecliff/FdupesAnalyzer
A script to analyze output of fdupes linux utility to find level of overlap between directories. Written in R
Language: R - Size: 237 KB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1

Supporterino/TextAnalyzer
Language: Python - Size: 83 KB - Last synced at: 3 days ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

RomaniukVadim/Ninety-Nine-Erlang-Problems
Ninety-Nine Prolog Problems - Erlang edition
Language: Erlang - Size: 24.2 MB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 0

r-darwish/dupfiles-cpp
Find duplicate files
Language: C++ - Size: 47.9 KB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

vuolter/deplicate-cli
Command Line Interface for deplicate
Language: Python - Size: 37.1 KB - Last synced at: 5 days ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 1

nbari/backup
Command line tool for creating encrypted backups avoiding duplicates
Language: Rust - Size: 75.2 KB - Last synced at: 15 days ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 1

softonus-io/prettier-plugin-duplicate-remover
A Prettier plugin that removes duplicate class names in class and className attributes, ensuring cleaner, more efficient code in frontend projects like React, Vue.js, and Angular.
Language: JavaScript - Size: 11.7 KB - Last synced at: 19 days ago - Pushed at: 6 months ago - Stars: 2 - Forks: 1

webis-de/sigir20-sampling-bias-due-to-near-duplicates-in-learning-to-rank
Sampling Bias Due to Near-Duplicates in Learning to Rank
Language: Kotlin - Size: 51.3 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 2 - Forks: 2

davidefiocco/dockerized-elasticsearch-duplicate-finder
Attempt to use MinHash to find duplicates in an Elasticsearch index
Language: Python - Size: 11.7 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

pouyakary/dup
a tiny and fast command line utility to find the duplicate files within a directory
Language: Go - Size: 16.6 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

Arkadiy-Garber/ParaHunter
Identification of gene paralogs in genomes, and calculation of dS and dN/dS values for paralogous gene pairs
Language: Python - Size: 73.2 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

MarciaBM/Sorting_Memories
Language: Java - Size: 139 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

0aub/duplicates-cleaner
simple script to delete duplicated files from specific directory
Language: Python - Size: 8.79 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

cobanov/easy-duplicate
Compares the files in a folder with md5 checksums and deletes duplicate files or moves them to the desired folder.
Language: Python - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

McKael/goduf
A simple (but fast) duplicate file finder written in Go [Mirror repository]
Language: Go - Size: 35.2 KB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

TheArkive/ConstScanner
C/C++ Constant Scanner - includes lists of constants from groups of headers. Check the docs for the repo that lists several Win10 APIs.
Language: AutoHotkey - Size: 22.5 MB - Last synced at: 10 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

PercentEquals/SIDF
Simple Image Duplicate Finder
Language: C# - Size: 230 KB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 1

bkb3/duplicate-bib-fix
Small python script to check and replace duplicated bib entries in your .tex files
Language: TeX - Size: 358 KB - Last synced at: 2 months ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

konstantin89/swift-duplicate-images-finder
Application that finds duplicate images.
Language: Python - Size: 598 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

JordiCorbilla/DuplicateChecker
☑️ .Net service that allows you to check duplicate rows on a sql table using Levenshtein distance
Language: C# - Size: 53.7 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

betaWeb/twicejs
Manage duplicates, count item occurences, dedupe an Array.
Language: JavaScript - Size: 349 KB - Last synced at: 2 months ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 0

Erdk/imgdedup
Show possible duplicates of photos/images.
Language: Go - Size: 26.4 KB - Last synced at: 4 months ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0
