An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: duplicates

StephaneCouturier/Katalog

Katalog is an application to manage catalogs of disks and files to search and get statistics.

Language: C++ - Size: 12.2 MB - Last synced at: about 15 hours ago - Pushed at: about 15 hours ago - Stars: 81 - Forks: 7

qarmin/czkawka

Multi functional app to find duplicates, empty folders, similar images etc.

Language: Rust - Size: 4.61 MB - Last synced at: 2 days ago - Pushed at: 4 days ago - Stars: 23,521 - Forks: 739

sahib/rmlint

Extremely fast tool to remove duplicates and other lint from your filesystem

Language: C - Size: 12.4 MB - Last synced at: about 18 hours ago - Pushed at: 10 days ago - Stars: 2,078 - Forks: 137

jaffster595/Duplinator

A tool to find duplicate images in a folder

Language: Python - Size: 88.9 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

kucherenko/jscpd

Copy/paste detector for programming source code.

Language: TypeScript - Size: 9.13 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 4,866 - Forks: 211

veltzer/pyunique

Pyunique helps you get rid of duplicate files

Language: Python - Size: 905 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

cloud-py-api/mediadc

Nextcloud Media Duplicate Collector application

Language: PHP - Size: 91.1 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 98 - Forks: 8

scinos/yarn-deduplicate

Deduplication tool for yarn.lock files

Language: TypeScript - Size: 7.27 MB - Last synced at: 4 days ago - Pushed at: 8 days ago - Stars: 1,389 - Forks: 57

mackgorski/ai-duplicate-detector

AI-Powered GitHub Issue Duplicates & Relations Detector

Language: Python - Size: 52.7 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

PJDude/dude

Duplicates Detector is a cross-platform GUI utility for finding duplicate files, allowing you to delete or link them to save space. Duplicate files are displayed and processed on two synchronized panels for efficient and convenient operation.

Language: Python - Size: 4.46 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 141 - Forks: 11

scrubbbbs/cbird

Command-line program for Content-Based Image Retrieval of images and videos. Includes tools for general search and de-duplication.

Language: C++ - Size: 12.7 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 119 - Forks: 5

7room/aya

Disk Usage Analyzer & Duplicate File Finder

Size: 71.3 KB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 12 - Forks: 0

microsoft/near-duplicate-code-detector

A simple tool for detecting near-duplicate source code

Language: C# - Size: 38.1 KB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 100 - Forks: 31

twpayne/find-duplicates

Find duplicate files quickly.

Language: Go - Size: 82 KB - Last synced at: about 19 hours ago - Pushed at: 3 months ago - Stars: 57 - Forks: 1

raspi/duplikaatti

Remove duplicate files.

Language: Go - Size: 33.2 KB - Last synced at: 19 days ago - Pushed at: over 3 years ago - Stars: 18 - Forks: 1

Canop/backdown

A deduplicator

Language: Rust - Size: 541 KB - Last synced at: 5 days ago - Pushed at: 8 months ago - Stars: 132 - Forks: 7

src-d/gemini

Advanced similarity and duplicate source code detection at scale.

Language: Scala - Size: 7 MB - Last synced at: 10 days ago - Pushed at: almost 6 years ago - Stars: 55 - Forks: 16

kristiankoskimaki/vidupe

Vidupe is a program that can find duplicate and similar video files. V1.211 released on 2019-09-18, Windows exe here:

Language: C++ - Size: 266 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 176 - Forks: 18

eyalroz/removedupes

Remove Duplicate Messages

Language: JavaScript - Size: 8.56 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 93 - Forks: 7

nbari/backup

Command line tool for creating encrypted backups avoiding duplicates

Language: Rust - Size: 75.2 KB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 1

DeaDSouL/dugu 📦

Find, remove and avoid duplicates with dugu: The Duplicates Guru

Language: Python - Size: 85.9 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 4

gacarrillor/AppendFeaturesToLayer

QGIS Processing plugin to add an algorithm for upserting features from a source vector layer to an existing target vector layer.

Language: Python - Size: 162 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 25 - Forks: 5

rsalmei/refine

Refine your file collections using Rust!

Language: Rust - Size: 783 KB - Last synced at: 7 days ago - Pushed at: 14 days ago - Stars: 13 - Forks: 0

F483/dejavu

Quickly detect already witnessed data.

Language: Go - Size: 305 KB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 157 - Forks: 5

arikw/outlook-duplicated-items-remover

A VBA script that finds and moves duplicated items in selected outlook folders

Language: VBA - Size: 151 KB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 22 - Forks: 2

milkmansson/plex-seeDuplicates

Find duplicates in your Plex library.

Language: Python - Size: 23.4 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

mkearney/funique

⌚️ A faster unique() function

Language: R - Size: 7.15 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 19 - Forks: 0

shafirahmad/pydeduper

Duplicate file finder - with % duplication of folders

Language: Python - Size: 30.3 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

rix4uni/unew

A tool that combines the features of two commands, sort and tee, in one: it adds new lines to files, skipping duplicates

Language: Go - Size: 49.8 KB - Last synced at: 30 days ago - Pushed at: 4 months ago - Stars: 8 - Forks: 1

vuolter/deplicate 📦

Advanced Duplicate File Finder for Python

Language: Python - Size: 137 KB - Last synced at: 3 days ago - Pushed at: over 4 years ago - Stars: 77 - Forks: 17

mesqueeb/compare-anything

Compares objects and tells you which props are duplicates and which props are present only once.

Language: TypeScript - Size: 666 KB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 10 - Forks: 0

Robb-Fr/fast-dupes-finder

This repository offers clean, fast, shell-based scripts for finding duplicate files in a folder.

Language: Shell - Size: 17.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

thomas694/finddupe

Enhanced version of finddupe, a duplicate file detector for Windows

Language: C - Size: 204 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 9 - Forks: 0

prowide/prowide-integrator-examples

Source code examples for "Prowide Integrator"

Language: Java - Size: 8.55 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 10 - Forks: 17

kevinpollet/pocket-deduper

Remove duplicates from your Pocket list.

Language: Go - Size: 6.55 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 17 - Forks: 0

rasheedsulayman/DuplicateContactsRemover

📒 A simple app to optimize your address book and remove duplicate contacts.

Language: Kotlin - Size: 30.9 MB - Last synced at: 22 days ago - Pushed at: about 2 years ago - Stars: 22 - Forks: 10

kouhin/redux-dataloader

Loads async data for Redux apps focusing on preventing duplicated requests and dealing with async dependencies.

Language: JavaScript - Size: 108 KB - Last synced at: 1 day ago - Pushed at: over 7 years ago - Stars: 139 - Forks: 3

raspi/samanlainen

Delete duplicate files

Language: Rust - Size: 54.7 KB - Last synced at: 29 days ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 1

NicolasBizzozzero/dupe_eraser

A command-line tool that automates the deletion of duplicate files based on their hash or perceptual hash.

Language: Python - Size: 54.7 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 13 - Forks: 0

Navid2zp/dups

A CLI tool to find/remove duplicate files supporting multi-core and different algorithms (MD5, SHA256, and XXHash).

Language: Go - Size: 62.5 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 25 - Forks: 3

tasleson/duplihere

Copy & Paste finder for structured text files.

Language: Rust - Size: 104 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 5 - Forks: 1

vuolter/deplicate-cli

Command Line Interface for deplicate

Language: Python - Size: 37.1 KB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 1

clement-berard/go-imap-backup

A collection of Go tools for managing IMAP emails, featuring backup capabilities and duplicate detection/cleanup.

Language: Go - Size: 17.6 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

FreddyFunk/ddk

DeDuplicationKit: Advanced File Storage Deduplication

Language: C++ - Size: 192 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

dgudim/qt_disk-deduper

A desktop app that will help you find and deal with file duplicates on your drive

Language: C++ - Size: 135 KB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

eliasfloreteng/bitwarden_find_duplicates

Find duplicate logins based on domain, from Bitwarden export. Open source for your safety.

Language: HTML - Size: 34.2 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 34 - Forks: 8

softonus-io/prettier-plugin-duplicate-remover

A Prettier plugin that removes duplicate class names in class and className attributes, ensuring cleaner, more efficient code in frontend projects like React, Vue.js, and Angular.

Language: JavaScript - Size: 11.7 KB - Last synced at: 12 days ago - Pushed at: 6 months ago - Stars: 2 - Forks: 1

bkb3/duplicate-bib-fix

Small Python script to check and replace duplicated bib entries in your .tex files

Language: TeX - Size: 358 KB - Last synced at: 2 months ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

webis-de/sigir20-sampling-bias-due-to-near-duplicates-in-learning-to-rank

Sampling Bias Due to Near-Duplicates in Learning to Rank

Language: Kotlin - Size: 51.3 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 2 - Forks: 2

SuperJMN/DeDup

Tool to detect duplicates and copy them to a curated directory (without duplicates)

Language: C# - Size: 2.81 MB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Intera/redmine_subject_autocomplete

Makes the new issue subject field show an autocomplete that lists existing issues to prevent duplicate tickets

Language: Ruby - Size: 57.6 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 0 - Forks: 1

VMC10/Simple-Duplicate-Cleaner

A simple app written in Python to delete duplicate files

Language: Python - Size: 2.93 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

guicamest/GDuplicate-Finder

GDuplicate Finder - A Groovy way to find duplicates among your computer and network shares!

Language: Groovy - Size: 49.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 30 - Forks: 7

LyonSyonII/akin

Rust crate for writing repetitive code easier and faster.

Language: Rust - Size: 51.8 KB - Last synced at: 29 days ago - Pushed at: over 2 years ago - Stars: 43 - Forks: 2

ruester/midnightdup

MidnightDup - Duplicate File Finder

Language: Perl - Size: 45.2 MB - Last synced at: 4 days ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

Firefox-1998/PhotoVideoOrganizer

How many of you have thousands of photos scattered everywhere (cloud, folders, external hard drives, USB sticks, etc. etc.)?

Language: C# - Size: 460 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

arasgungore/job-posting-duplicate-detection

A project aiming to leverage text embeddings and Milvus, a high-performance vector search engine, to detect duplicate job postings.

Language: Python - Size: 289 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

Prajjwol09/Data-Cleaning-Project

This project is dedicated to cleaning and standardizing a dataset and handling null values in a CSV file named "layoffs" using MySQL, with MySQL Workbench as the workspace environment. The goal is to prepare the data for analysis.

Size: 62.5 KB - Last synced at: 2 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

innovatrics/dedubcheck

dedubcheck - De-Duplicate Dependency Checker for Node.js monorepos

Language: JavaScript - Size: 29.3 KB - Last synced at: about 1 month ago - Pushed at: almost 6 years ago - Stars: 6 - Forks: 1

sean-public/python-hashes

Interesting (non-cryptographic) hashes implemented in pure Python.

Language: Python - Size: 29.3 KB - Last synced at: 9 months ago - Pushed at: over 3 years ago - Stars: 240 - Forks: 43

cemahseri/Duplica

A very fast duplicate file finder.

Language: C# - Size: 25.4 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 14 - Forks: 4

qwertz19281/dupion

Duplicate file/folder finder, can also scan in archives, HDD optimized

Language: Rust - Size: 365 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 1

DragonOfMath/dupe-images

Node.js package for finding and removing duplicate image files with extreme precision

Language: JavaScript - Size: 17.6 KB - Last synced at: 11 months ago - Pushed at: almost 7 years ago - Stars: 11 - Forks: 3

cwkingjr/find_duplicate_files

Find duplicate files on your system using inclusion and exclusion folder lists.

Language: Go - Size: 4.88 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

bob3000/dupcrawler

finds duplicate files

Language: Go - Size: 14.6 KB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

Sina1218/Text

text duplicate edit

Size: 2.93 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

TheArkive/ConstScanner

C/C++ Constant Scanner - includes lists of constants from groups of headers. Check the docs for the repo that lists several Win10 APIs.

Language: AutoHotkey - Size: 22.5 MB - Last synced at: 10 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

rustomax/ndf

Duplicate file finder written in Nim

Language: Nim - Size: 22.5 KB - Last synced at: 2 days ago - Pushed at: over 4 years ago - Stars: 20 - Forks: 0

LeonSteinbach/BitwardenTools

This repository is a collection of tools for the usage of Bitwarden

Language: Python - Size: 8.79 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

davidmalasek/awfulsorter

Awful Sorter is a tool that makes it easier to sort files based on their file types and extensions.

Language: Python - Size: 8.79 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

jvirkki/dupd

CLI utility to find duplicate files

Language: C - Size: 1.9 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 109 - Forks: 16

deric/es-dedupe

Tool for removing duplicate documents from Elasticsearch

Language: Python - Size: 130 KB - Last synced at: 22 days ago - Pushed at: over 1 year ago - Stars: 54 - Forks: 22

QuietWindUponTheMoor/Quiets-Duplicate-Manager

Quiet's Duplicate Manager is an Electron.js-based desktop application that is currently in the works. It will offer a range of basic features, such as identifying all duplicate files and giving the option to choose whether to delete or archive them. As more features roll out, more will be added here.

Language: JavaScript - Size: 93.8 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

matteodelabre/mongoose-beautiful-unique-validation

Plugin for Mongoose that turns duplicate errors into regular Mongoose validation errors

Language: JavaScript - Size: 194 KB - Last synced at: 7 days ago - Pushed at: almost 3 years ago - Stars: 117 - Forks: 38

pouyakary/dup

A tiny and fast command-line utility to find the duplicate files within a directory

Language: Go - Size: 16.6 KB - Last synced at: about 17 hours ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

Dmitriy-Vas/go-file-copies

A Go program to get duplicates from specified paths.

Language: Go - Size: 10.7 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 7 - Forks: 0

lachhabw/Duplicate-Images-Remover

Python tool for finding and removing duplicate images

Language: Python - Size: 8.79 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

ajmalshahabudeen/Bitwarden-Duplicate-remover

When importing multiple CSV files, Bitwarden creates duplicate entries. This Python script removes the duplicate entries and keeps one.

Language: Python - Size: 2.93 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 2

fiddyschmitt/udp_dedupe

Deduplicate UDP datagrams

Language: C# - Size: 36.1 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

hansalemaos/dropduplicatesplanb

Drops duplicates in DataFrames with tedious dtypes

Language: Python - Size: 23.4 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

quattroformaggi/Mindmap-mini-programs

A mindmap & programs so small they don't require their own repository.

Language: Go - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

ArtTorres/FileMatch

Find duplicate files in directories.

Language: C# - Size: 20.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

carlbeech/fast-duplicate-finder

A Python program to locate duplicate files - and do it fast

Language: Python - Size: 74.4 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 9 - Forks: 3

LibreTranslate/RemoveDup

Remove duplicates from parallel corpora

Language: Python - Size: 835 KB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

ikeratzakis/duplicate-detection

Algorithms for duplicate document and question detection/classification, implemented as part of a project

Language: Python - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

strong-roots-capital/remove-duplicates-from-sorted 📦

Remove duplicates from a sorted list

Language: TypeScript - Size: 8.79 KB - Last synced at: 24 days ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

GreenComfyTea/duplicate-emote-check-tool

Check if you have duplicate emotes across FrankerFaceZ, BetterTTV and 7TV for Twitch.

Language: JavaScript - Size: 414 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

hansalemaos/arrayhascher

Fast hash in 2D Arrays (Numpy/Pandas/lists/tuples)

Language: C - Size: 101 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

pcraciunoiu/AndroidSMSBackupRestoreCleaner (fork of NumbGnat/AndroidSMSBackupRestoreCleaner)

This cleans up duplicate SMS entries in a backup created by the SMS Backup & Restore Android app.

Language: Python - Size: 4.78 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 2

davidefiocco/dockerized-elasticsearch-duplicate-finder

Attempt to use MinHash to find duplicates in an Elasticsearch index

Language: Python - Size: 11.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

codecliff/FdupesAnalyzer

A script to analyze the output of the fdupes Linux utility to find the level of overlap between directories. Written in R.

Language: R - Size: 237 KB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1

TheProv1/Java-Codes

Java Codes

Language: Java - Size: 64.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

davdiv/hashfolder

Simple command line tool that can create/update an sqlite database that contains the hash (by default SHA256) of all files inside a specified root folder.

Language: TypeScript - Size: 80.1 KB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

AlexZasorin/delete-duplicates.py

Python script to find, filter, and delete duplicate files. Work in progress.

Language: Python - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

lmammino/indexed-string-variation

Experimental JavaScript module to generate all possible variations of strings over an alphabet using an n-ary virtual tree

Language: JavaScript - Size: 52.7 KB - Last synced at: 9 days ago - Pushed at: over 7 years ago - Stars: 18 - Forks: 4

tutts/react-single-image 📦

Centralise duplicate images in your React app, while maintaining a modular file system 🖼

Language: JavaScript - Size: 233 KB - Last synced at: 21 days ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

jonas054/dupfind

Duplication finder for source code and other text files

Language: C++ - Size: 202 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 2

hansalemaos/duplicateindexer

Find duplicates in multiple lists and return their indices and values.

Language: Python - Size: 3.91 KB - Last synced at: 16 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

hansalemaos/stridesduplicatefinder

Calculate overlapping values between two arrays and return the results as a DataFrame

Language: Python - Size: 24.4 KB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

AndreyKlychnikov/deduplicate-elasticsearch

Remove duplicate documents from Elasticsearch

Language: Python - Size: 2.93 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0