An open API service providing repository metadata for many open source software ecosystems.

Topic: "duplicate-detection"

tobbe84/FindDuplicates

Этот проект представляет собой мощный инструмент для поиска и анализа дублирующихся файлов в указанной директории. Программа позволяет эффективно выявлять одинаковые файлы на основе их содержимого, используя алгоритм хеширования SHA-256. Она поддерживает настройку параметров, таких как минимальный размер файла для проверки и игнорирование определен

Language: Python - Size: 12.7 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

nviewsweb/css-optimizer

A lightweight Node.js tool for optimizing, sorting, and merging CSS files by removing duplicate selectors, sorting CSS properties alphabetically, and preserving @media queries.

Language: JavaScript - Size: 17.6 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

programmfabrik/fylr-plugin-find-duplicate-field-values

Masksplitter that is configured for a text field and shows in the editor whether the field has the same value in the user's visibility range before.

Language: CoffeeScript - Size: 16.6 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Robb-Fr/fast-dupes-finder

This repository proposes clean, fast and shell based scripts for identifying finding duplicate files in a folder.

Language: Shell - Size: 17.6 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

jym272/duplexscan

DuplexScan is a high-performance contact deduplication tool written in Rust that helps identify potential duplicate contacts in large datasets.

Language: Rust - Size: 44.9 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

visiuun/Folder-duplicates-deleter

Directory Bulk Duplicate Files Deleter written in python.

Language: Python - Size: 5.86 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

fevieira27/DeezerAnalysisAI-R

An R script that uses AI for data analysis on Deezer playlists, like looking for fuzzy duplicates, rank of genre and artists.

Language: R - Size: 239 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

ciphermike/tidypics

a tool to clean duplicate pictures

Size: 0 Bytes - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

wsadqert/duplicate-file-finder

Language: Python - Size: 5.86 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

veronhoxha/duplicate-datasets

In this study, we examine duplicate medical imaging datasets on Kaggle, with a particular focus on The International Skin Imaging Collaboration (ISIC) dataset. The ISIC dataset is particularly noteworthy, as it contains numerous duplicates.

Language: Jupyter Notebook - Size: 37.3 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

InvincibleJuggernaut/DeDup

Identifies duplicates/near-similar images of a query image from the given set of pre-loaded images

Language: Jupyter Notebook - Size: 9.32 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

FreddyFunk/ddk

DeDuplicationKit: Advanced File Storage Deduplication

Language: C++ - Size: 192 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

jempe/shasums_duplicates

Shasums Duplicates A Bash and Golang utility for detecting and managing duplicate files by generating, comparing, and processing sorted hash lists.

Language: Go - Size: 9.77 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

opacicmarko/duplicate-question-identification

Project for the TAR 2024 course at FER, University of Zagreb

Language: Jupyter Notebook - Size: 2.39 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

Aniruddhakhedkar/EDA_to_Evaluate_Bank_Telemarketing_Campaign_for_Revenue_Enhancement

Exploratory_Data_Analysis_Python_Project_2

Language: Jupyter Notebook - Size: 14.5 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

Aniruddhakhedkar/EDA_for_Chinese_Automotive_Company_Teclov_Chinese

Exploratory_Data_Analysis_Python_Project_1

Language: Jupyter Notebook - Size: 1.89 MB - Last synced at: 7 days ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

morgant/claws-mail-find_all_copies

A Claws Mail 'Action' script to find all copies of a selected message within a locally synchronized IMAP account

Language: Shell - Size: 4.88 KB - Last synced at: 3 days ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

s-emanuilov/LangVec

Language of Vectors (LangVec) is a simple Python library designed for transforming numerical vector data into a language-like structure using a predefined set of words (lexicon).

Language: Python - Size: 1 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

ObieMunoz/scss-color-similarity

Quickly identify similar colors within a threshold for the purposes of removing duplicates

Language: JavaScript - Size: 4.88 KB - Last synced at: 20 days ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

elramidreju/Playnite-DuplicateFinder

DuplicateFinder is a Playnite extension that offers a simple way to find duplicate games in your library.

Language: C# - Size: 25.4 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

VMC10/Simple-Duplicate-Cleaner

A simple app written in Python to delete duplicate files

Language: Python - Size: 2.93 KB - Last synced at: 9 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

abdulmazidakash/m-22-javascript-simple-coding-problem-part-1

Language: JavaScript - Size: 4.88 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

StatAziz/Data-Cleaning-in-MS-Excel

This project focuses on applying data cleaning techniques in MS Excel to transform a raw dataset into a structured and reliable format, enabling precise and consistent data analysis.

Size: 1.91 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

ruester/midnightdup

MidnightDup - Duplicate File Finder

Language: Perl - Size: 45.2 MB - Last synced at: about 22 hours ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

cydesai/L2-Assignment-Sepaking-the-coding-Language

Word analysis and Pattern Check

Language: Python - Size: 6.84 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

prajeshshrestha/Natural-Language-Processing-Specialization

Assignment Files, Notes and Slides of DeepLearning.AI Natural Language Processing Specialization

Language: Jupyter Notebook - Size: 247 MB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

khushalvanani/Data-Cleaning-using-SQL-healthcare_dataset

This repository contains a project focused on data cleaning using SQL, applied to a healthcare dataset.

Size: 6.07 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

uziwhoavg/Module_2_UZINCU608_PTO2403_Group-C_UZI-NCUBE_JSL02 Fork of CodeSpace-Academy/Module_2_StudentNo_Classcode_Group_Name-Surname_JSL02

Module_1__JSL02

Language: JavaScript - Size: 874 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

barannmeisterr/ZoomDurationAnalyzer

Java program that reads, processes, and displays attendance records in a Zoom meeting report provided as a txt file.

Language: Java - Size: 14.6 KB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

megahelio/BackupUtilities

Python Utility Scripts (Duplicated hashes, Duplicated names, Backup)

Language: Python - Size: 29.3 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

niradar/duplicate_files_in_folders

Identifies and processes duplicate files between a source and target directory.

Language: Python - Size: 104 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

dfirsec/dup_file_finder

Search for duplicate files based on extension.

Language: Python - Size: 35.2 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

nemat-al/Advance-Natural-Language-Processing

Tasks for Advance Natural Language Processing Course @ ITMO University

Language: Jupyter Notebook - Size: 22.7 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

BurakKontas/Rabbit-Multiple-Consumer

This project aims to test how .NET and RabbitMQ behave with multiple consumers and one publisher, simulating a distributed system using the saga pattern to observe whether message duplication occurs.

Language: C# - Size: 9.77 KB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

tbarbette/refdup

Find and delete duplicate files in a folder using regex

Language: Python - Size: 7.81 KB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

hrishabht5/Top-Movies-analysis-

This project utilizes Python for data preprocessing and analysis, along with Power BI for creating an interactive dashboard, to analyze trends and insights within the movie industry. The project encompasses data collection, cleaning, exploration, visualization, and interpretation to provide valuable insights into various aspects of the industry.

Language: Jupyter Notebook - Size: 1.73 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

streanger/duplicate

files duplicate viewer

Language: Python - Size: 316 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dorryspears/rupe

Find duplicate files FAST

Language: Rust - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Robson-Teixeira/java-design-patterns-I-loja

Repositório do curso Jornada do Conhecimento de Back-End Java (Nível Intermediário) - Design Patterns em Java I: boas práticas de programação da plataforma Alura.

Language: Java - Size: 25.4 KB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

valeriatisch/transfer-learn-dup-detection

Project @HPI

Language: Python - Size: 204 KB - Last synced at: 10 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

francescopapaleo/duplicate-finder

Find duplicate files in a directory with a hashing function

Language: Python - Size: 28.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mr-eyes/duplicate_images_detector

Detect duplicate images and view the distinct in a web app

Language: Python - Size: 11.7 KB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

TheProv1/Java-Codes

Java Codes

Language: Java - Size: 64.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ayekaunic/Image-Duplication-and-Similarity-Detection

Detecting duplicity & similarity among images.

Language: Jupyter Notebook - Size: 6.02 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

kratikajain511/Excel-Dashboard-Sales-analysis

Built a dynamic sales dashboard in Microsoft Excel to visualize sales trends and key performance indicators (KPIs). Cleaned sales data by removing duplicates and replacing values, resulting in increased usage.

Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

heischichou/Sample-CDM-Tagger

A simple tool to compare new data to historical records. It will tag rows accordingly as duplicate or NULL. The team of interns I was in designed this tool using PySpark and Jupyter Notebook in Microsoft Fabric as a practice exercise within Lexmark Research and Development Corporation's Digital Transformation program.

Language: Python - Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Bisalkumar/Duplicate_Finder

This tool scans directories and identifies duplicate files based on their size and SHA-256 hash. The tool is optimized to handle a large number of files efficiently using parallel processing.

Language: Python - Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jer-irl/ffdup

freaking fast duplicate file detection

Language: C - Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

LoneStamp99/Foundouble-Mirror

Struggling to find duplicate images, try using this script.

Language: Python - Size: 296 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

SwathiRekhaM/CardioGoodFitnessProject

A Fitness Company wants to know the customer behavior towards the threadmill and want recommendations to increase its profits.

Language: Jupyter Notebook - Size: 1.46 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

BreakingPitt/mp3-duplicate-finder

MP3 Duplicate Finder: A command-line tool to identify and compare duplicate MP3 audio files, helping users efficiently manage their music collections.

Language: Python - Size: 16.6 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

TheNishishiro/HPages

An image/video web manager with tagging, filtering and duplication detection

Language: C# - Size: 3.21 MB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

dmatsanganis/Advanced_Techniques_for_Entity_Resolution_and_Duplicate_Detection

This repository is dedicated to advanced entity resolution and duplicate detection techniques. Learn how Token Blocking and Meta-Blocking enhance data accuracy.

Language: Jupyter Notebook - Size: 17.8 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

github-userx/Awesome-Duplication-Finders

Apps to find duplicate files including same/similar images & videos (with computer vision/AI)

Size: 26.4 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

udithperera-dev/flutter-extensions

Some of useful extension for manipulate text and data sets in flutter

Language: Dart - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

jodiambra/Yandex-Music-EDA

First data science project. EDA on music preferences of users in two cities.

Language: HTML - Size: 1.95 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

TayfunBasoglu/Helper_Scripts

Helper Scripts

Language: Python - Size: 57.6 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

itsayellow/finddup

Find duplicate files or directories in a list of paths.

Language: Python - Size: 144 KB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

dealfonso/searchdups

Search for duplicate files

Language: Python - Size: 9.77 KB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

iamkish0re/Gdrive-Duplicates-Remover

This notebook will remove duplicates in your google drive!

Language: Jupyter Notebook - Size: 69.3 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

lazycatcoder/common-python

Implementing various tasks using Python

Language: Python - Size: 6.84 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

alinematich/findLongestSharedSequence

Takes a directory address and a filename, then checks all Java files in the directory recursively to find duplicate code

Language: Python - Size: 43.9 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

hamzaerbay/find-duplicate

find-duplicate

Language: Rust - Size: 1.95 KB - Last synced at: 26 days ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

RaThorat/iterate-in-pandas

Following codes provide better ways to iterate in pandas dataframe over rows instead of nested loops

Language: Jupyter Notebook - Size: 14.6 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Bonniface/CleanData

New way of Cleaning Data in R.

Language: R - Size: 13 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Mike7154/DupCatch

This tool is built to find duplicates in anki cards that are not identified by the built in Anki 'find duplicates' function

Language: Python - Size: 24.6 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

richstokes/dupehunter

Python script to find (and optionally, delete) duplicate files

Language: Python - Size: 8.79 KB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

LMurphy001/DuplicateFiles

Find and delete duplicate files.

Language: Python - Size: 49.8 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

zeronyk/image_duplication 📦

Software that removes duplicated images from folder2 in respect to folder1

Language: Python - Size: 17.8 MB - Last synced at: 8 days ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

trinhbiendich/extract-image-from-video

extract image from video

Language: Python - Size: 7.81 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

AlexZasorin/delete-duplicates.py

Python script to find, filter, and delete duplicate files. Work in progress.

Language: Python - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

germanivanov0719/PyFile-Scripts Fork of Formak21/PyFile-Scripts

A simple set of Python scripts to work with a large number of files available under MIT License.

Language: Python - Size: 62.5 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Nemat-Allah-Aloush/Advance-Natural-Language-Processing

Tasks for Advance Natural Language Processing Course at ITMO University

Language: Jupyter Notebook - Size: 26.2 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

justinkunz/ArrayUSA

npm i unitedstatesofamerica

Language: JavaScript - Size: 24.4 KB - Last synced at: 7 days ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Unbewohnte/broom

🧹 A command line utility to locate and manage duplicate and empty files 🧹

Language: C++ - Size: 92.8 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

TrevorDArcyEvans/Similarasm

.NET duplicate code finder .NET duplicate code finder .NET duplicate code finder

Language: C# - Size: 127 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 1

TrevorDArcyEvans/Duplo.Net

Eliminating copypasta in your source code

Language: C# - Size: 138 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

giosali/dupeutil

A command-line program written in Python for detecting and removing duplicate files.

Language: Python - Size: 60.5 KB - Last synced at: 9 days ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Mehmet-Emre-Dogan/fileComparer

Toolset to determine duplicate files.

Language: Python - Size: 33.6 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

skyler-myers-db/Duplicate-File-Handler

Checks OS for duplicate files and cleans them up.

Language: Python - Size: 5.86 KB - Last synced at: 9 days ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

yolona-oss/rm-clones

Small dups remover based on sha256 algorithm

Language: C - Size: 47.9 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

rmbranto/Oceanlife-Data-Dashboard

R-Shiny dashboard offering visualisations of species occurrence data extracted from multiple open-access biodiversity information systems ...

Language: R - Size: 5.08 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

BhargavKadali39/Python_Data_Structure_Cheat_Sheet

Clean representation of how every datatype in python should be used.

Language: Python - Size: 10.7 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

cserajdeep/Efficient_remover_of_duplicate_images

Image Processing and Hashing-based duplicate image remover python code that can deal with different sizes, intensity, and grayscale.

Language: Jupyter Notebook - Size: 3.98 MB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

SSbit01/Duplicate-Remover

It's a simple Python program that searches and deletes all duplicate files in a folder

Language: Python - Size: 3.91 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

mheil/find-duplicates

CLI utilitiy implemened in Go to identify duplicated lines in texts.

Language: Go - Size: 3.91 KB - Last synced at: 5 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 1

aarmn/quidditch

🧹 A broom to crystal clean your file system, in CLI, fast and magical! Some Features: Duplicate finder (with music and picture different quality support in process of adding), Empty directory deletion, Music album cover fixer, and a bunch of other stuff

Size: 27.3 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

AmMoPy/Vendor_Master_File_Automated_Test_Procedures

Analysing Vendor Master File using automated test procedures

Language: Jupyter Notebook - Size: 896 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

StanTsky/LinkedList

Examples of various linked list node operations

Language: C++ - Size: 25.4 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

abhishekkr/dup

elixir cli to check for duplicate files at current directories

Language: Elixir - Size: 8.79 KB - Last synced at: 4 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

thomasmfish/Duplicate-File-Checker

A simple Python tool to help identify and delete duplicate files that may be distributed within subdirectories.

Language: Python - Size: 11.7 KB - Last synced at: 4 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 1

MarcinOrlowski/dhunter

Fast, content based duplicate file detector with cache and more!

Language: Python - Size: 349 KB - Last synced at: 24 days ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

apaz-cli/ML-ImageHash

A PyTorch implementation of a machine learning perceptual image hash algorithm for near-duplicate detection and fast content-based image retrieval.

Language: Jupyter Notebook - Size: 3.89 MB - Last synced at: 11 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

LaurentVeyssier/Siamese-Network-for-predicting-duplicate-questions

Use Trax Siamese deep Neural LSTM Network to predict pair of similar question (duplicates)

Language: Jupyter Notebook - Size: 19.4 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

yaiestura/sibur_challenge_2020

AI Community SIBUR Challenge 2020 - NLP task

Language: Jupyter Notebook - Size: 15.3 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

hamza1886/handle-duplicate-location

Handle duplication location (map markers) by moving them by some distance apart

Language: JavaScript - Size: 2.93 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Sogolumbo/Phonograph Fork of kabouzeid/Phonograph

A material designed music player for Android

Language: Java - Size: 17.2 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

murugeshmanthiramoorthi/Quora-Duplicate-Question-Classification

Using traditional methods like TF-IDF Vectorizer and logistic regression, I have built a linear model to classify duplicate questions in Quora platform.

Language: Jupyter Notebook - Size: 2.93 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

Pradnya1208/Data-Preprocessing

Basics of Data Preprocessing.

Language: Python - Size: 80.1 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

rajatdv/DIF

A simple and fast web app to remove duplicate images from your datasets.

Language: Python - Size: 191 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0