An open API service providing repository metadata for many open-source software ecosystems.

Topic: "preference-alignment"

zjukg/KnowPAT

[Paper][ACL 2024 Findings] Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering

Language: Python - Size: 9.03 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 193 - Forks: 17

junkangwu/beta-DPO

[NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$

Language: Python - Size: 43 KB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 43 - Forks: 2

Video-Bench/Video-Bench

Video Generation Benchmark

Language: Python - Size: 10.1 MB - Last synced at: 28 days ago - Pushed at: about 2 months ago - Stars: 39 - Forks: 3

Shentao-YANG/Dense_Reward_T2I

Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).

Language: Python - Size: 11.3 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 38 - Forks: 0

Meaquadddd/DPO-Shift

DPO-Shift: Shifting the Distribution of Direct Preference Optimization

Language: Python - Size: 216 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 1

MingjunPan/PO4COPs

[ICML 25] "Preference Optimization for Combinatorial Optimization Problems"

Language: Python - Size: 44.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

reshalfahsi/gpt2chat

Creating a GPT-2-Based Chatbot with Human Preferences

Language: Jupyter Notebook - Size: 10.3 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

junkangwu/Dr_DPO

Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization

Language: Python - Size: 24.4 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

BARUDA-AI/Awesome-Preference-Optimization

Survey of preference alignment algorithms

Size: 0 Bytes - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0