An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: duplicate-files

twpayne/find-duplicates

Find duplicate files quickly.

Language: Go - Size: 82 KB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 56 - Forks: 1

scrubbbbs/cbird

Command-line program for Content-Based Image Retrieval of images and videos. Includes tools for general search and de-duplication.

Language: C++ - Size: 12.7 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 118 - Forks: 5

thushan/smash

Smash through to find duplicate files super fast by slicing files intelligently!

Language: Go - Size: 7.71 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 16 - Forks: 0

diskoverdata/diskover-community

Diskover Community Edition - Open source file indexer, file search engine and data management and analytics powered by Elasticsearch

Language: PHP - Size: 33.6 MB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 1,554 - Forks: 166

pierrre/file-duplicate

Find duplicate files

Language: Go - Size: 241 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 0

sreedevk/deduplicator

Filter, Sort & Delete Duplicate Files Recursively

Language: Rust - Size: 334 KB - Last synced at: 15 days ago - Pushed at: 10 months ago - Stars: 326 - Forks: 16

DeaDSouL/dugu 📦

Find, remove and avoid duplicates with dugu: The Duplicates Guru

Language: Python - Size: 85.9 KB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 4

harshasrisri/dedup

Remove local files that are duplicates of files in another path

Language: Rust - Size: 97.7 KB - Last synced at: 4 days ago - Pushed at: 22 days ago - Stars: 1 - Forks: 0

m-manu/go-find-duplicates

Find duplicate files (photos, videos, music, documents) on your computer, portable hard drives etc.

Language: Go - Size: 59.6 KB - Last synced at: 10 days ago - Pushed at: 2 months ago - Stars: 280 - Forks: 24

LpCodes/Duplicate-Files-Remover

This Python script removes duplicate files from a given directory. It prompts the user to select a folder and then uses the MD5 hashing algorithm to check if any files have the same hash value. If it finds duplicate files, it deletes all but one of them.

Language: Python - Size: 6.84 KB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

milkmansson/plex-seeDuplicates

Find duplicates in your Plex library.

Language: Python - Size: 23.4 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 0

opencoff/go-progs

useful golang utilities for Unixish environments

Language: Go - Size: 51.8 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 3 - Forks: 0

Chinory/lndup

Hardlink duplicate files! 🤓

Language: JavaScript - Size: 163 KB - Last synced at: 9 days ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 0

Paulogb98/DupFile-Analyzer

Ferramenta de linha de comando em Rust para detectar arquivos duplicados em diretórios e subdiretórios. Utilizando hash SHA-256 e processamento paralelo com Rayon, garante alta performance e precisão na identificação de arquivos duplicados, independentemente de seus metadados, como nome etc.

Language: Rust - Size: 0 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

elmindreda/duff

Command-line utility for finding duplicate files

Language: C - Size: 241 KB - Last synced at: 20 days ago - Pushed at: about 1 year ago - Stars: 94 - Forks: 12

jurassicLizard/files-deduplicator

A command-line tool that identifies and removes duplicate files recursively from a specified folder. It uses cryptographic hashes (Blake2) for reliable file comparisons to ensure precise duplicate detection.

Language: C++ - Size: 46.9 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

shafirahmad/pydeduper

Duplicate file finder - with % duplication of folders

Language: Python - Size: 30.3 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

waived/duplicate-remover

Windows utility that checks folder and sub-folder(s) for all files that contain identical MD5 hash and prep them for deletion

Size: 0 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Vatshayan/Data-Duplication-Removal-using-Machine-Learning

Final Year Project as Deletion of Duplicated data using Machine learning project with source code and Report.

Language: Jupyter Notebook - Size: 37.1 KB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 44 - Forks: 6

akbarisajjad/Scanner-file-windows-

This code is a useful tool for checking for duplicate and corrupted files and can be very useful for managing files in a large directory. However, it could be made more robust and reliable by improving error handling and performance optimization.

Language: Python - Size: 19.5 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

vuolter/deplicate 📦

Advanced Duplicate File Finder for Python

Language: Python - Size: 137 KB - Last synced at: 12 days ago - Pushed at: over 4 years ago - Stars: 77 - Forks: 16

gechandesu/fdup

File duplicates finder

Language: V - Size: 17.6 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 7 - Forks: 0

plecos/dupefindr

A file duplicate detector utility written in Rust

Language: Rust - Size: 99.6 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

theohbrothers/Get-DuplicateItem

Gets duplicate or non-duplicate files.

Language: PowerShell - Size: 91.8 KB - Last synced at: 10 days ago - Pushed at: 2 months ago - Stars: 2 - Forks: 2

magnetikonline/identix

Python utility which will recursively scan one or more given directories for duplicate files.

Language: Python - Size: 48.8 KB - Last synced at: 16 days ago - Pushed at: 2 months ago - Stars: 4 - Forks: 2

Junsious/DupFinder

A simple desktop application to search for duplicate files in a specified directory. This application uses SHA-256 hashing to identify duplicates and provides a user-friendly interface with progress tracking.

Language: Rust - Size: 43 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

germabyte/obsidian-deduper

The Obsidian Duplicate Finder is a user-friendly tool designed to help users manage duplicate files within their Obsidian vaults. It identifies similar Markdown files based on their content, making it easier to organize and declutter note-taking environments.

Language: Python - Size: 11.7 KB - Last synced at: 17 days ago - Pushed at: 2 months ago - Stars: 4 - Forks: 0

Robb-Fr/fast-dupes-finder

This repository proposes clean, fast and shell based scripts for identifying finding duplicate files in a folder.

Language: Shell - Size: 17.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

raspi/duplikaatti

Remove duplicate files.

Language: Go - Size: 33.2 KB - Last synced at: 18 days ago - Pushed at: over 3 years ago - Stars: 17 - Forks: 1

Idrinth/duplication

A backup of backups solution for among other things s3 storages

Language: PHP - Size: 125 KB - Last synced at: 23 days ago - Pushed at: 3 months ago - Stars: 2 - Forks: 1

raspi/samanlainen

Delete duplicate files

Language: Rust - Size: 54.7 KB - Last synced at: 7 days ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 1

Jim-JMCD/DuplicateFF

A small Linux app that compares files using sha256. Input can be one or more directories with optional filters of maximum files size and parts of file names (e.g. ‘.mp4’ ‘IMG’ ) . Output is multiple CSV (spreadsheet) reports that can be used to move or delete duplicates. Can be used on Linux, Windows (WSL2, MSYS2 and Gitbash).

Size: 25.4 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Jim-JMCD/Duplicate-File-Finder

A bash script compares files using sha256. Input can be one or more directories with optional filters of maximum files size and parts of file names (e.g. ‘.mp4’ ‘IMG’ ) . Output is multiple CSV (spreadsheet) reports that can be used to move or delete duplicates. Can be used on Linux, Windows (WSL2, MSYS2 and Gitbash).

Language: Shell - Size: 47.9 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 6 - Forks: 0

NicolasBizzozzero/dupe_eraser

A command-line tool which automate the deletion of duplicate files based on their hash or perceptual-hash.

Language: Python - Size: 54.7 KB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 13 - Forks: 0

visiuun/Folder-duplicates-deleter

Directory Bulk Duplicate Files Deleter written in python.

Language: Python - Size: 5.86 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

unhammer/wshiml

📚 Word shingling for near duplicate document detection

Language: OCaml - Size: 29.3 KB - Last synced at: 2 months ago - Pushed at: almost 8 years ago - Stars: 4 - Forks: 1

ciphermike/tidypics

a tool to clean duplicate pictures

Size: 0 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

px86/dff 📦

Recursively finds and lists duplicate files in a directory. Uses C++17 filesystem library and SHA-256 hashing algorithm.

Language: C++ - Size: 17.6 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

vuolter/deplicate-cli

Command Line Interface for deplicate

Language: Python - Size: 37.1 KB - Last synced at: 17 days ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 1

FreddyFunk/ddk

DeDuplicationKit: Advanced File Storage Deduplication

Language: C++ - Size: 192 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

umstek/DupKiller

Slow, but more reliable duplicate files cleaner.

Language: C# - Size: 372 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 1

onder7/DuplicateFinder

Yinelenen Dosya Bulucu

Language: Python - Size: 9.77 KB - Last synced at: 18 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

guofei9987/fuzzy-hash

fuzzy hash in python (from ssdeep)

Language: C - Size: 24.4 KB - Last synced at: 11 days ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 0

NoCLin/duplicate-scanner

Ultra Fast Duplicate File Scanner Based on Everything (Windows and NTFS only)

Language: Python - Size: 104 KB - Last synced at: 17 days ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 2

VISWESWARAN1998/CyberGod-KSGMPRH

An open-source antivirus for windows

Language: C - Size: 42.3 MB - Last synced at: 18 days ago - Pushed at: over 5 years ago - Stars: 45 - Forks: 22

FabriceSalvaire/filewalker

A Python library to scan a file system, find duplicated file etc.

Language: Python - Size: 123 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

feishu0110/DupFileCleaner

为了有效地管理和清理重复文件,我用Cursor Ai开发了一款文件去重工具,利用多线程对比哈希值实现了高效的文件去重功能。

Language: TeX - Size: 13.5 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

fffaraz/QtHashSum

File Checksum Integrity Verifier & Duplicate File Finder written in C++ Qt

Language: C++ - Size: 210 KB - Last synced at: 12 days ago - Pushed at: 11 months ago - Stars: 45 - Forks: 10

cbismuth/fdupes-java

Advanced file duplicate finder with native MD5 support

Language: Java - Size: 248 KB - Last synced at: 8 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

shivakar/gdupes

A multithreaded tool for identifying duplicate files

Language: Go - Size: 22.5 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

ruester/midnightdup

MidnightDup - Duplicate File Finder

Language: Perl - Size: 45.2 MB - Last synced at: 8 days ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

hubertdungen/Duplicate_Deleter-EK-Script

Windows Batch script to selectively delete files within a directory, retaining only those with a specified extension. Ideal for organizing files and removing unwanted duplicates by extension.

Language: Batchfile - Size: 4.88 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

jeff-99/hashdex

A file indexer based on content hashes to quickly find duplicate files on your system.

Language: Python - Size: 106 KB - Last synced at: 3 days ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 2

lvntky/noditto

Noditto: AST Based Code Duplication Finder

Language: CMake - Size: 55.7 KB - Last synced at: 19 days ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

matteopic/hashdir

A fast file hashing tool designed to find duplicate files by performing checksums and indexing them.

Language: Go - Size: 4.88 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

kingzcheung/bdzer

A Small Tool for Finding Duplicate Files

Language: Rust - Size: 12.7 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

cemahseri/Duplica

A very fast duplicate file finder.

Language: C# - Size: 25.4 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 14 - Forks: 4

cwkingjr/find_duplicate_files

Find duplicate files on your system using inclusion and exclusion folder lists.

Language: Go - Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

niradar/duplicate_files_in_folders

Identifies and processes duplicate files between a source and target directory.

Language: Python - Size: 104 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

rustomax/ndf

Duplicate file finder written in Nim

Language: Nim - Size: 22.5 KB - Last synced at: 5 days ago - Pushed at: about 4 years ago - Stars: 20 - Forks: 0

PJDude/dude

Duplicates Detector is a cross-platform GUI utility for finding duplicate files, allowing you to delete or link them to save space. Duplicate files are displayed and processed on two synchronized panels for efficient and convenient operation.

Language: Python - Size: 4.71 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 57 - Forks: 6

israel-lugo/capidup

Quickly find duplicate files in directories

Language: Python - Size: 129 KB - Last synced at: 12 days ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 0

ltworf/pulputils 📦

not coreutils, pulputils

Size: 119 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

dfirsec/dup_file_finder

Search for duplicate files based on extension.

Language: Python - Size: 35.2 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

by321/Find-Duplicate-Files

Find Duplicate Files with Simple GUI

Language: Python - Size: 5.86 KB - Last synced at: 12 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

PostsDesert/copyfolderhierarchy

Copy Folder Hierarchy and Contents for Google Drive

Language: JavaScript - Size: 40 KB - Last synced at: 9 months ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

hekmon/deduper

Analyse 2 paths to found identical files and hard link them to save space

Language: Go - Size: 151 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

jvirkki/dupd

CLI utility to find duplicate files

Language: C - Size: 1.9 MB - Last synced at: 12 months ago - Pushed at: almost 2 years ago - Stars: 109 - Forks: 16

masgari/dup-fu 📦

Terminal application for finding duplicate files

Language: Go - Size: 281 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

JakubJanowski/File-Duplicate-Finder

A small desktop application to help you organize disk space and backup folders by searching duplicated files by their content.

Language: C# - Size: 8.26 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

QuietWindUponTheMoor/Quiets-Duplicate-Manager

Quiet's Duplicate Manager is an Electron.js-based desktop application that is currently in the works. It will offer a range of basic features like including any/all files that are duplicates, giving the option to choose whether to delete or archive duplicates, etc. As more features roll out, more will be added here.

Language: JavaScript - Size: 93.8 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

impoteto/goduper

rather fast duplicate file finder written in go

Language: Go - Size: 14.6 KB - Last synced at: 10 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

nagracks/py_duplicates

Find duplicates

Language: Python - Size: 16.6 KB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 4 - Forks: 3

pouyakary/dup

a tiny and fast command line utility to find the duplicate files within a directory

Language: Go - Size: 16.6 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

francois-le-ko4la/duplicate-file-finder

A duplicate file finder.

Language: Python - Size: 78.1 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

n0obit4/SFind

Un programa para buscar archivos duplicados

Language: Shell - Size: 182 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

pixolution/dupe-scanner

Detect duplicate images locally

Language: Python - Size: 130 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

NIR3X/hardlinkdeduplicator

HardLink Deduplicator - Detect and Manage Duplicate Files with Hard Links

Language: Go - Size: 24.4 KB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

nogibjj/pyli

Deduper

Language: Python - Size: 265 KB - Last synced at: 7 days ago - Pushed at: almost 8 years ago - Stars: 5 - Forks: 3

dipietro-salvatore/liten3 Fork of VarnaLab/liten3

Liten3 searches in a given directory the duplicate files, producing a report at the end of its execution.

Language: Python - Size: 58.6 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 1

Gartham/file-duplicate-checker

Program to scan and search for file duplicates. (~300MB/s)

Language: Java - Size: 23.4 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 3

tyler-tee/File-Deduplicator

Python app built to scan a directory, check for duplicate files, and send them to the trash.

Language: Python - Size: 5.86 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

NattyXO/Mega-Copy

This Windows Forms application allows users to copy, cut, and duplicate files from one location to another.

Language: C# - Size: 3.32 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

elcorto/findsame

Find duplicate files and directories based on file hashes.

Language: Python - Size: 249 KB - Last synced at: 8 days ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

mamaorha/compare-files

scanning for duplicate files by content

Language: Java - Size: 46.9 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

EmreCanKURAN/DuplicateFileRemover

Removes the duplicate files.

Language: Python - Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

knobs-dials/duppy

Duplicate file detector. Incremental-block style.

Language: Python - Size: 137 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 1

davdiv/hashfolder

Simple command line tool that can create/update an sqlite database that contains the hash (by default SHA256) of all files inside a specified root folder.

Language: TypeScript - Size: 80.1 KB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

neatdecisions/detwinner

Detwinner - duplicate file finder for the Linux desktop

Language: C++ - Size: 535 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 6

AlexZasorin/delete-duplicates.py

Python script to find, filter, and delete duplicate files. Work in progress.

Language: Python - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

MK-Ware/pyCleaner

Securely wipe files or folders and clean duplicated files

Language: Python - Size: 31.3 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 8 - Forks: 1

jer-irl/ffdup

freaking fast duplicate file detection

Language: C - Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

LoneStamp99/Foundouble-Mirror

Struggling to find duplicate images, try using this script.

Language: Python - Size: 296 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

giosali/dupeutil

A command-line program written in Python for detecting and removing duplicate files.

Language: Python - Size: 60.5 KB - Last synced at: about 23 hours ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

thomasmfish/Duplicate-File-Checker

A simple Python tool to help identify and delete duplicate files that may be distributed within subdirectories.

Language: Python - Size: 11.7 KB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 1

cbergoon/fdup

File and Directory Duplication Finder Utility

Language: Go - Size: 1.3 MB - Last synced at: 10 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

oscarsun72/delete_duplicate_files_from_the_source_directory

檔案總管汰重-WindowsFormsApplication1 delete duplicate files from the source directory

Language: C# - Size: 105 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

Demez/duplicate_file_finder

Language: Python - Size: 18.6 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

KowalskiThomas/Python-DuplicatesFinder

A useful tool to find duplicate files in a filesystem and to easily apply "fixes" on that. Will be translated to English soon.

Language: Python - Size: 16.6 KB - Last synced at: 4 days ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

dahead/DupeFiles

Dupe Files scans your disks for duplicate files.

Language: C# - Size: 673 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0