GitHub topics: data-valuation
daviddao/awesome-data-valuation
💱 A curated list of data valuation (DV) to design your next data marketplace
Size: 51.8 KB - Last synced at: 15 days ago - Pushed at: 3 months ago - Stars: 118 - Forks: 14

opendataval/opendataval
OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)
Language: Python - Size: 23.4 MB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 96 - Forks: 7

aai-institute/pyDVL
pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation
Language: Python - Size: 435 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 124 - Forks: 7

uvanlp/valda
A Python Data Valuation Package
Language: Python - Size: 57.6 KB - Last synced at: 23 days ago - Pushed at: over 2 years ago - Stars: 30 - Forks: 4

datanovatrust/federated-data-valuation
Federated Learning implementation for Data Valuation and Differential Privacy, supporting Block-chain DP FL.
Language: Python - Size: 57.1 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 2

yzhang511/TimeInf
Time series data contribution via influence functions
Language: Python - Size: 146 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 8 - Forks: 0

vincecloutier/federated-banzhaf
The experiments accompanying my research project at EPFL.
Language: Python - Size: 26.4 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

reds-lab/LAVA
This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).
Language: Python - Size: 268 MB - Last synced at: 5 months ago - Pushed at: 12 months ago - Stars: 45 - Forks: 8

SJTU-DMTai/awesome-ml-data-quality-papers
Papers about training data quality management for ML models.
Size: 1.08 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 15 - Forks: 2

BokwaiHo/ITIPR
Code for our paper 'Interpretable Triplet Importance for Personalized Ranking' accepted by CIKM 2024.
Language: Python - Size: 13.6 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

aai-institute/re-classwise-shapley
Code for the reproduction of Class-wise Shapley paper from Schoch, Xu, Ji [2022].
Language: TeX - Size: 37 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

MadryLab/journey-TRAK
Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models"
Language: Python - Size: 13.4 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 0

aai-institute/tfl-pydata2024-pydvl
The pyDVL slides for pyData Berlin 2024
Language: Python - Size: 1.38 MB - Last synced at: 10 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1

reds-lab/2d-shapley
This is an official repository for "2D-Shapley: A Framework for Fragmented Data Valuation" (ICML2023).
Language: Jupyter Notebook - Size: 1.17 MB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 1

sail-sg/D-TRAK
Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)
Language: Jupyter Notebook - Size: 39.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 1

snoidetx/derdava
Supplementary programmes for DeRDaVa: Deletion-Robust Data Valuation for Machine Learning.
Language: Jupyter Notebook - Size: 95.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

aai-institute/mlrc22-like-shapley-love-the-core
Code for the submission to the ML Reproducibility Challenge 2022, reproducing "If you like Shapley then you'll love the core"
Language: Python - Size: 277 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

lfwa/datadynamics
Simulation environment for data collection dynamics.
Language: Python - Size: 7.74 MB - Last synced at: 16 days ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

ajsanjoaquin/Shapley_Valuation
PyTorch reimplementation of computing Shapley values via Truncated Monte Carlo sampling from "What is your data worth? Equitable Valuation of Data" by Amirata Ghorbani and James Zou [ICML 2019]
Language: Python - Size: 34.2 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 13 - Forks: 3
