AI & ML interests
None yet
Organizations
None yet
models
15
kemuxu/wikipedia-edit-reward-model
Text Classification
•
8B
•
Updated
•
2
kemuxu/mixed-ultra-wiki-reward-model
Updated
kemuxu/mixed-hh-wiki-reward-model
Updated
kemuxu/mixed-all-reward-model
Updated
kemuxu/mixed-hh-ultra-reward-model
Updated
kemuxu/triple-mixed-deepspeed-reward-model
Updated
kemuxu/mixed-rlhf-ultrafeedback-quad-a100-reward-model
Updated
kemuxu/ultrafeedback-reward-model
Text Classification
•
8B
•
Updated
•
4
kemuxu/hh-rlhf-lora-quad-a100-reward-model
Updated
kemuxu/hh-rlhf-lora-dual-h200-reward-model
Updated