2026
|
Sara Rajaram, James R. Cotton, Fabian H. Sinz
Similarity as Reward Alignment: Robust and Versatile Preference-based Reinforcement Learning
RLBRew 3 Workshop
|
2025
|
Sara Rajaram, James R. Cotton, Fabian H. Sinz
Similarity as Reward Alignment: Robust and Versatile Preference-based Reinforcement Learning
arXiv
|