Lab members are shown in this color.
2025
Sara Rajaram, James R. Cotton, Fabian H. Sinz
Similarity as Reward Alignment: Robust and Versatile Preference-based Reinforcement Learning
arXiv
|