GitHub topics: rlaif-v
RLHF-V/RLAIF-V
[CVPR'25] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
Language: Python - Size: 60 MB - Last synced at: 27 days ago - Pushed at: about 2 months ago - Stars: 335 - Forks: 13

khurramHashmi/LLaVA-v1.6-Mistral-7b-Finetune-ORPO-RLAIF-V (fork of haotian-liu/LLaVA)
Align llava-v1.6-mistral-7b on the RLAIF-V dataset using ORPO
Language: Python - Size: 19.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0
