GitHub / techandy42 / LLM_Reward_Model
Developing a LLM response ranking reward model using HFRL except it's GPT-3.5 instead of human.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/techandy42%2FLLM_Reward_Model
Stars: 2
Forks: 0
Open issues: 0
License: None
Language: Jupyter Notebook
Size: 2 MB
Dependencies parsed at: Pending
Created at: over 1 year ago
Updated at: over 1 year ago
Pushed at: over 1 year ago
Last synced at: over 1 year ago
Topics: hfrl, language-model, reward-model
Loading...