GitHub topics: reward-modeling
Jialuo-Li/Science-T2I
[CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis
Language: Python - Size: 189 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 50 - Forks: 1

VectorInstitute/vector-inference
Efficient LLM inference on Slurm clusters using vLLM.
Language: Python - Size: 2.59 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 58 - Forks: 10

sileod/tasksource
Dataset collection and preprocessing framework for extreme multitask learning in NLP
Language: Python - Size: 368 KB - Last synced at: 23 days ago - Pushed at: 4 months ago - Stars: 178 - Forks: 10

holarissun/RewardModelingBeyondBradleyTerry
Official implementation of the ICLR 2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives
Language: Python - Size: 365 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 41 - Forks: 3

YangLing0818/IterComp
[ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
Language: Python - Size: 32.8 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 161 - Forks: 10

allenai/hybrid-preferences
Learning to route instances for Human vs AI Feedback
Language: Python - Size: 273 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 18 - Forks: 2
