An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-efficient

luo-junyu/Awesome-Data-Efficient-LLM

A list of data-efficient and data-centric LLM (Large Language Model) papers. Our Survey Paper: Towards Efficient LLM Post Training: A Data-centric Perspective

Size: 884 KB - Last synced at: 5 days ago - Pushed at: 7 months ago - Stars: 36 - Forks: 4

mit-han-lab/data-efficient-gans

[NeurIPS 2020] Differentiable Augmentation for Data-Efficient GAN Training

Language: Python - Size: 33.9 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 1,307 - Forks: 176

MarvinLer/tcga_segmentation

Whole Slide Image segmentation with weakly supervised multiple instance learning on TCGA | MICCAI2020 https://arxiv.org/abs/2004.05024

Language: Python - Size: 63.1 MB - Last synced at: 8 days ago - Pushed at: about 4 years ago - Stars: 139 - Forks: 35

mahmoodlab/CLAM

Open source tools for computational pathology - Nature BME

Language: Python - Size: 45.8 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 1,332 - Forks: 422

hubtru/ASCDomain

Data loader and solution method for the DCASE 2024 Challenge Task1

Language: Python - Size: 1.01 GB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

google/lecam-gan

Regularizing Generative Adversarial Networks under Limited Data (CVPR 2021)

Language: Jupyter Notebook - Size: 4.31 MB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 164 - Forks: 16

Linxyhaha/DEALRec

Data-efficient Fine-tuning for LLM-based Recommendation (SIGIR'24)

Language: Python - Size: 52.8 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 25 - Forks: 3

divyakraman/ImPosterDiffusion2024

Codebase for the paper ImPoster: Text and Frequency Guidance for Subject Driven Action Personalization using Diffusion Models

Language: Python - Size: 16.1 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 4 - Forks: 0

VITA-Group/Data-Efficient-Scaling

[ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang

Language: Python - Size: 188 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 0

layumi/AdaBoost_Seg

TIP2022 Adaptive Boosting (AdaBoost) for Domain Adaptation ? :woman_shrugging: Why not ! :ok_woman:

Language: Python - Size: 671 KB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 47 - Forks: 3

zjunlp/RAP

[SIGIR 2023] Schema-aware Reference as Prompt Improves Data-Efficient Knowledge Graph Construction

Language: Python - Size: 17.1 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 42 - Forks: 3

VITA-Group/Ultra-Data-Efficient-GAN-Training

[NeurIPS'21] "Ultra-Data-Efficient GAN Training: Drawing A Lottery Ticket First, Then Training It Toughly", Tianlong Chen, Yu Cheng, Zhe Gan, Jingjing Liu, Zhangyang Wang

Language: Python - Size: 425 KB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 84 - Forks: 9

theolepage/ssl-for-slr

Collection of self-supervised models for speaker and language recognition tasks.

Language: Jupyter Notebook - Size: 4.67 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 19 - Forks: 2

parshakova/GAMS-for-Data-Efficient-Learning

Global Autoregressive Models (GAMs) for Data-Efficient Sequence Learning

Language: Jupyter Notebook - Size: 1000 KB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 1

4m4n5/CLIP-Lite

Pytorch Implementation of CLIP-Lite | Accepted at AISTATS 2023

Language: Python - Size: 104 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 1

lizhaoliu-Lec/CG-VLM

This is the official repo for Contrastive Vision-Language Alignment Makes Efficient Instruction Learner.

Size: 2.93 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 0

MichiganNLP/micromodels

Micromodels -- A framework for accurate, explainable, data efficient, and reusable NLP models.

Language: Python - Size: 24.1 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 4

GU-DataLab/misinformation-detection-DeMis

Resource for misinformation research on Twitter. Official resource of the paper "DeMis: Data-efficient Misinformation Detection using Reinforcement Learning", ECML-PKDD 2022

Language: Python - Size: 13.7 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

VITA-Group/Double-Win-LTH

[ICML 2022] "Data-Efficient Double-Win Lottery Tickets from Robust Pre-training" by Tianlong Chen, Zhenyu Zhang, Sijia Liu, Yang Zhang, Shiyu Chang, Zhangyang Wang

Language: Python - Size: 308 KB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 9 - Forks: 0

SahilC/knowledge-distill-fine-grained

Code for the Knowledge distillation work to enhance fine grained disease recognition.

Language: Python - Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 0

Related Keywords
data-efficient 20 large-language-models 4 deep-learning 4 pytorch 4 whole-slide-imaging 2 weakly-supervised-learning 2 tcga-data 2 nlp 2 natural-language-processing 2 pathology 2 lottery-ticket-hypothesis 2 histopathology 2 computational-pathology 2 tensorflow 2 gan 2 image-generation 2 generative-adversarial-network 2 domain-adaptation 2 data-efficient-gan-training 1 augmentation 1 language-recognition 1 webnlg 1 self-supervised-learning 1 speaker-recognition 1 autoregressive-neural-networks 1 triple-extraction 1 energy-based-model 1 sigir2023 1 retrieval 1 relational-triple-extraction 1 relation-extraction 1 prompt 1 nyt 1 low-resource-nlp 1 low-resource 1 knowledge-informed-prompt-learning 1 knowledge-graph 1 knowledge-base-population 1 kg 1 kbp 1 information-extraction 1 ie 1 few-shot 1 knowledge-distillation 1 isbi-2019 1 fine-grained-classification 1 disease-classification 1 transfer-learning 1 sparsity 1 robust-pretraining 1 pretraining 1 generalization 1 adversarial-robustness 1 twitter 1 reinforcement-learning 1 misinformation 1 machine-learning 1 fake-news 1 ecml-pkdd 1 reusable 1 explainability 1 vision-language-model 1 vision-language 1 instruction-tuning 1 instruction-following 1 data-efficient-learning 1 contrastive-learning 1 multimodal 1 clip 1 global-features 1 data-augmentation 1 audio-processing 1 adversarial-networks 1 acoustic-scene-classification 1 quantitative-pathology 1 mahmoodlab 1 digital-pathology 1 clam 1 camelyon17 1 camelyon16 1 bioimage-informatics 1 wsi 1 tumor-segmentation 1 tcga 1 slide-image-segmentation 1 segmentation 1 multiple-instance-learning 1 miccai 1 medical-imaging 1 image-segmentation 1 neurips-2020 1 gans 1 llm 1 efficient 1 data-centric-machine-learning 1 data-centric-ai 1 data-centric 1 event-extraction 1 tip 1 gta5 1