An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: jaccard-index

adrg/strutil

Go metrics for calculating string similarity and other string utility functions

Language: Go - Size: 109 KB - Last synced at: 28 days ago - Pushed at: about 2 months ago - Stars: 371 - Forks: 24

oertl/treeminhash

TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation

Language: C++ - Size: 2.62 MB - Last synced at: 27 days ago - Pushed at: over 2 years ago - Stars: 14 - Forks: 3

rekha-kandukuri/CentralisedInformationSystem

A platform for both students and instructors to browse courses in the MOOC world easily. The platform features a recommender system that predicts courses of users preference from past courses, a Student-Instructor Course enrollment and Real-Time Discussion Forum Systems.

Language: Jupyter Notebook - Size: 36.3 MB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Padhma/Liver-Disease-Prediction

This project aims to predict liver disease in Indian patients

Language: Jupyter Notebook - Size: 42 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 4

duttaprasanta/clustering

Different clustering and clustering metrics are implemented in this repository

Language: Python - Size: 37.1 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

youssefelmougy/jaccard-selector

Asynchronous Distributed Actor-based Approach to Jaccard Similarity for Genome Comparisons

Language: Fortran - Size: 112 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

MovieTone/JaccardDocumentComparison

Document Comparison web application based on Jaccard Similarity Index. The uploaded file is compared to all previously uploaded ones. Built with Java/JSP

Language: CSS - Size: 16.6 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

cissagatto/Generate-Partitions-Jaccard

This code is part of my doctoral research. The aim is to generate partitions from the Jaccard index for multilabel classification.

Language: R - Size: 18.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

samuel-bohman/jaccard-index

Function for calculating the Jaccard index and Jaccard distance for binary attributes

Language: R - Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

micts/jss

Fast Jaccard similarity search for abstract sets (documents, products, users, etc.) using MinHashing and Locality Sensitve Hashing

Language: Python - Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

fagnercarvalho/QuestionSimilarityTest

Testing Jaccard similarity and Cosine similarity techniques to calculate the similarity between two questions.

Language: C# - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

cissagatto/jaccard

This code generate partitions for a multilabel dataset using the Jaccard Index similarity measure. We use HCLUST with 6 linkage metrics to generate several partitions. You may build the partition with the highest coefficient. This code also provide an analysis about the partitioning.

Language: R - Size: 338 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Divya-Bhargavi/Kaggle_HomeDepot

Predict search relevance given a product name and its text attributes

Language: Jupyter Notebook - Size: 130 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 1

rimshasaeed/lesion-segmentation

Breast ultrasound (BUS) image segmentation using region-growing algorithm

Language: MATLAB - Size: 4.08 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

Amirreza-Mousavi/Aspartate_Racemase_Ligands_Simlarity_Score

An R script that uses MACCS166 chemical fingerprint and calculates Jaccard Index/Tanimoto Coefficient for a list of Aspartate Racemase Ligands

Language: R - Size: 32.2 KB - Last synced at: 7 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

esalini22/gene-hll

HyperLogLog en C++ y OpenMP para cálculo de similitud de genomas mediante índice de Jaccard

Language: C++ - Size: 185 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

FilobateerEssam/IBM-Machine-Learning-Project

build a classifier to predict whether a loan case will be paid off or not. in loan applications, clean the data, and apply different classification algorithm on the data. use the following algorithms to build your models: k-Nearest Neighbour Decision Tree Support Vector Machine Logistic Regression The results is reported as the accuracy of each classifier, using the following metrics when these are applicable: Jaccard index F1-score LogLoass

Language: Jupyter Notebook - Size: 32.2 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Jonas1312/dice-coefficient-scale-sensitivity-pitfall

The Dice Coefficient Is Scale Sensitive, Mathematical Proof.

Size: 19.5 KB - Last synced at: 1 day ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

faisal-irzal/CC_Fraud_Detector

Implementation of various machine learning techniques to detect credit card frauds based on a given dataset. This repo will guide you through the data analysis, viz and building predictive models

Language: Jupyter Notebook - Size: 2.78 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 1

elizabethshen/Machine-Learning-Project

Machine Learning with Python

Language: Jupyter Notebook - Size: 58.6 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

nikoshet/pyspark-movie-similarities

Using Spark In Python For Movie Similarities With Jaccard Index

Language: Python - Size: 863 KB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

xp-song/photo-classify

Classifying images into discrete categories based on keywords generated from the Google Cloud Vision API

Size: 1.49 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

Agisthemantobeat/Loan-Repay

We load a historical dataset from previous loan applications, clean the data, and apply different classification algorithms on the data.

Language: Jupyter Notebook - Size: 113 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

jas-haria/News-Recommendation-Reliability-Indicator-System

A Google Chrome Extension that estimates the Reliability, Polarity and Subjectivity of any news article on the web. It allows you to like/dislike any article and recommends you articles based on your choices.

Language: Jupyter Notebook - Size: 14.8 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

hbenbel/Thematisation

Pipeline that learns and recognize thematics

Language: Python - Size: 180 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

Related Keywords
jaccard-index 25 jaccard-similarity 10 jaccard-distance 7 logistic-regression 6 f1-score 5 jaccard 4 machine-learning 4 classification 4 r 3 jaccard-similarity-estimation 3 python 3 decision-trees 3 logloss 3 random-forest 2 decision-tree 2 svm-classifier 2 cosine-similarity 2 similarity-measures 2 minhash 2 locality-sensitive-hashing 2 machine-learning-algorithms 2 support-vector-machines 2 jaccard-coefficient 2 dice-coefficient 2 golang 1 tfidf-text-analysis 1 tokenizer 1 image-processing 1 image-segmentation 1 lesion-segmentation 1 matlab 1 region-growing 1 aspartate-racemase 1 chemoinformatics 1 dti 1 dti-prediction 1 ligand 1 maccs-fingerprint 1 tanimoto-coefficient 1 cardinality-estimation 1 webscraping 1 count-distinct 1 cosine 1 csharp 1 near-duplicate 1 shingles 1 multilabel-classification 1 multilabel-partition 1 partitioning 1 feature-engineering 1 home-depot-competition 1 thematic 1 kaggle 1 linear-regression 1 nlp-machine-learning 1 normalized-compression-distance 1 ngrams 1 stemming 1 text-analysis 1 k-nearest-neighbour 1 logloass 1 pythin3 1 photograph-classification 1 keyword-analysis 1 sorensen-dice-coefficient 1 knn 1 log-loss-score-metric 1 model-evaluation 1 svm 1 hierarchical-clustering 1 classification-algorithm 1 k-means 1 hierarchical-classification 1 movie-similarities 1 discrete-categories 1 pyspark 1 spark 1 cpp 1 genomics 1 hyperloglog 1 kmer 1 sentiment-analysis 1 naive-bayes-classifier 1 linear-svm 1 count-vectorizer 1 openmp 1 parallel-computing 1 parallel-programming 1 chrome-extension 1 nearest-neighbors 1 probabilistic-data-structures 1 stream-processing 1 streaming-algorithms 1 classification-algorithms 1 algorithms 1 minwise-hashing 1 minwise-hashing-algorithm 1 similarity-metric 1 similarity-search 1 sketching 1