MaxMin-RLHF: Alignment with Diverse Human Preferences
- Souradip Chakraborty
- , Jiahao Qiu
- , Hui Yuan
- , Alec Koppel
- , Dinesh Manocha
- , Furong Huang
- , Amrit Singh Bedi
- , Mengdi Wang
Research output: Contribution to journal › Conference article › peer-review
14
Link opens in a new tab
Scopus
citations