An open API service providing repository metadata for many open-source software ecosystems.

GitHub topics: data-pruning

aai-institute/pyDVL

pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation

Language: Python - Size: 435 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 124 - Forks: 7

Linxyhaha/DEALRec

Data-efficient Fine-tuning for LLM-based Recommendation (SIGIR'24)

Language: Python - Size: 52.8 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 25 - Forks: 3

naotoo1/BNSFRNPC

Code for the paper "Beyond Neural Scaling Laws for Fast Proven Robust Certification of Nearest Prototype Classifiers"

Language: Terra - Size: 2.21 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 2

CTCycle/DataExplorer

A simple GUI-wrapped app to perform data cleaning and preliminary analysis with a series of methods

Language: Python - Size: 89.8 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

minnesotanlp/infoVerse

Jaehyung Kim et al.'s ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-information"

Language: Python - Size: 8.9 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 16 - Forks: 1