Xufeng Zhao
Home
Publications
Posts
Presentations
Experience
Awards & Grants
Contact
CV
Curriculum-RLAIF: Curriculum Alignment with Reinforcement Learning from AI Feedback
Mengdi Li*
,
Jiaye Lin*
,
Xufeng Zhao
,
Wenhao Lu
,
Peilin Zhao
,
Stefan Wermter
,
Di Wang
June 2025
RL
LLMs
Previous
Mental Modeling of Reinforcement Learning Agents by Language Models
Cite
×