Xufeng Zhao
Home
Publications
Posts
Presentations
Experience
Awards & Grants
Contact
CV
Curriculum-RLAIF: Curriculum Alignment with Reinforcement Learning from AI Feedback
Mengdi Li*
,
Jiaye Lin*
,
Xufeng Zhao
,
Wenhao Lu
,
Peilin Zhao
,
Stefan Wermter
,
Di Wang
June 2025
Go to Project Site
RL
LLMs
Next
PersRM-R1: Enhance Personalized Reward Modeling with Reinforcement Learning
Previous
Mental Modeling of Reinforcement Learning Agents by Language Models
Cite
×