An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: categorical-missing-value

grahman20/SiMI

SiMI imputes numerical and categorical missing values by making an educated guess based on records similar to the record that has the missing value. The missing values are then imputed using these similarities and correlations. To achieve higher imputation quality, some segments are merged together using a novel approach.

Language: Java - Size: 265 KB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0
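
The general idea in the description, imputing a missing value from the records most similar to the incomplete one, can be illustrated with a minimal sketch. This is not SiMI's actual algorithm: it does not model the correlation or segment-merging steps, and it swaps in a plain k-nearest-records approach. The class name `SimilarityImputer`, the methods `distance` and `impute`, and the parameter `k` are all hypothetical names chosen for illustration. Numerical values are imputed with the mean of the nearest records and categorical values with their most frequent value.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Illustrative only: a plain nearest-records imputer, NOT the SiMI implementation.
public class SimilarityImputer {

    // Distance over attributes present in both records (null marks a missing value).
    // Assumes numerical attributes are already scaled to comparable ranges.
    static double distance(Object[] a, Object[] b) {
        double sum = 0.0;
        int compared = 0;
        for (int i = 0; i < a.length; i++) {
            if (a[i] == null || b[i] == null) {
                continue; // skip attributes missing in either record
            }
            if (a[i] instanceof Number && b[i] instanceof Number) {
                double d = ((Number) a[i]).doubleValue() - ((Number) b[i]).doubleValue();
                sum += d * d;
            } else {
                sum += a[i].equals(b[i]) ? 0.0 : 1.0; // simple mismatch penalty
            }
            compared++;
        }
        return compared == 0 ? Double.MAX_VALUE : Math.sqrt(sum / compared);
    }

    // Imputes column `col` of `target` from the k most similar records that have a value there.
    // Returns a Double (mean) for numerical columns, the most frequent value for categorical
    // columns, or null when no record has a value in that column.
    static Object impute(List<Object[]> records, Object[] target, int col, int k) {
        List<Object[]> donors = new ArrayList<>();
        for (Object[] r : records) {
            if (r != target && r[col] != null) {
                donors.add(r);
            }
        }
        if (donors.isEmpty()) {
            return null;
        }
        donors.sort(Comparator.comparingDouble((Object[] r) -> distance(target, r)));
        List<Object[]> nearest = donors.subList(0, Math.min(k, donors.size()));

        if (nearest.get(0)[col] instanceof Number) {
            double total = 0.0;
            for (Object[] r : nearest) {
                total += ((Number) r[col]).doubleValue();
            }
            return total / nearest.size();
        }

        // Categorical column: pick the most frequent value among the nearest records.
        Map<Object, Integer> counts = new HashMap<>();
        Object best = null;
        int bestCount = 0;
        for (Object[] r : nearest) {
            int c = counts.merge(r[col], 1, Integer::sum);
            if (c > bestCount) {
                bestCount = c;
                best = r[col];
            }
        }
        return best;
    }

    public static void main(String[] args) {
        List<Object[]> records = Arrays.asList(
            new Object[] {1.0, "red", 10.0},
            new Object[] {1.1, "red", 12.0},
            new Object[] {9.0, "blue", 50.0});

        Object[] missingNumber = {1.05, "red", null};
        System.out.println(impute(records, missingNumber, 2, 2)); // prints 11.0

        Object[] missingCategory = {8.5, null, 48.0};
        System.out.println(impute(records, missingCategory, 1, 1)); // prints blue
    }
}
```

In the example, the record with the missing number is closest to the two "red" records, so it receives their mean, while the record with the missing category is closest to the "blue" record. A real implementation would also need attribute normalization and a principled way to choose `k`, which this sketch leaves out.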