An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-valuation

daviddao/awesome-data-valuation

💱 A curated list of data valuation (DV) resources to design your next data marketplace

Size: 51.8 KB - Last synced at: 15 days ago - Pushed at: 3 months ago - Stars: 118 - Forks: 14

opendataval/opendataval

OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)

Language: Python - Size: 23.4 MB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 96 - Forks: 7

aai-institute/pyDVL

pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation

Language: Python - Size: 435 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 124 - Forks: 7

uvanlp/valda

A Python Data Valuation Package

Language: Python - Size: 57.6 KB - Last synced at: 23 days ago - Pushed at: over 2 years ago - Stars: 30 - Forks: 4

datanovatrust/federated-data-valuation

Federated Learning implementation for Data Valuation and Differential Privacy, supporting blockchain DP FL.

Language: Python - Size: 57.1 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 2

yzhang511/TimeInf

Time series data contribution via influence functions

Language: Python - Size: 146 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 8 - Forks: 0

vincecloutier/federated-banzhaf

The experiments accompanying my research project at EPFL.

Language: Python - Size: 26.4 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

reds-lab/LAVA

This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR 2023).

Language: Python - Size: 268 MB - Last synced at: 5 months ago - Pushed at: 12 months ago - Stars: 45 - Forks: 8

SJTU-DMTai/awesome-ml-data-quality-papers

Papers about training data quality management for ML models.

Size: 1.08 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 15 - Forks: 2

BokwaiHo/ITIPR

Code for our paper 'Interpretable Triplet Importance for Personalized Ranking' accepted by CIKM 2024.

Language: Python - Size: 13.6 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

aai-institute/re-classwise-shapley

Code for the reproduction of Class-wise Shapley paper from Schoch, Xu, Ji [2022].

Language: TeX - Size: 37 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

MadryLab/journey-TRAK

Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models"

Language: Python - Size: 13.4 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 0

aai-institute/tfl-pydata2024-pydvl

The pyDVL slides for PyData Berlin 2024

Language: Python - Size: 1.38 MB - Last synced at: 10 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1

reds-lab/2d-shapley

This is an official repository for "2D-Shapley: A Framework for Fragmented Data Valuation" (ICML 2023).

Language: Jupyter Notebook - Size: 1.17 MB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 1

sail-sg/D-TRAK

Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)

Language: Jupyter Notebook - Size: 39.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 1

snoidetx/derdava

Supplementary programmes for DeRDaVa: Deletion-Robust Data Valuation for Machine Learning.

Language: Jupyter Notebook - Size: 95.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

aai-institute/mlrc22-like-shapley-love-the-core

Code for the submission to the ML Reproducibility Challenge 2022, reproducing "If you like Shapley then you'll love the core"

Language: Python - Size: 277 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

lfwa/datadynamics

Simulation environment for data collection dynamics.

Language: Python - Size: 7.74 MB - Last synced at: 16 days ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

ajsanjoaquin/Shapley_Valuation

PyTorch reimplementation of computing Shapley values via Truncated Monte Carlo sampling from "What is your data worth? Equitable Valuation of Data" by Amirata Ghorbani and James Zou [ICML 2019]

Language: Python - Size: 34.2 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 13 - Forks: 3