PersRM-R1: Enhance Personalized Reward Modeling with Reinforcement Learning
Mengdi Li*, Guanqiao Chen*, Xufeng Zhao, Haochen Wen, Shu Yang, Di Wang
August 2025
RL, LLMs, RM, Personalization