An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: min-hashing

Lefteris-Souflas/Movie-Rating-User-Similarity

Explored Jaccard distance, Min-Hashing, and LSH for user similarity in a movie rating dataset. Tasks involve dataset preprocessing, exact Jaccard Similarity computation, Min-Hash signatures, and LSH implementation. Results and observations are documented in code, output files, and a report

Language: Jupyter Notebook - Size: 1.22 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mark-antal-csizmadia/finding-similar-items-textually-similar-documents

Finding Similar Items: Textually Similar Documents

Language: Jupyter Notebook - Size: 451 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

shr1611/Data-mining-Plagiarism-Check

A Java program to check Plagiarisms between multiple documents using the method of Shingling, MinHashing and Locality Sensitive Hashing.

Language: Java - Size: 38.1 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 0