GitHub topics: data-pruning
aai-institute/pyDVL
pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation
Language: Python - Size: 435 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 124 - Forks: 7

Linxyhaha/DEALRec
Data-efficient Fine-tuning for LLM-based Recommendation (SIGIR'24)
Language: Python - Size: 52.8 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 25 - Forks: 3

naotoo1/BNSFRNPC
Code for the paper "Beyond Neural Scaling Laws for Fast Proven Robust Certification of Nearest Prototype Classifiers"
Language: Terra - Size: 2.21 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 2

CTCycle/DataExplorer
A simple GUI-wrapped app to perform data cleaning and preliminary analysis with a series of methods
Language: Python - Size: 89.8 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

minnesotanlp/infoVerse
Jaehyung Kim et al.'s ACL 2023 paper "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-information"
Language: Python - Size: 8.9 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 16 - Forks: 1
