Xufeng Zhao
Home
Publications
Posts
Experience
Awards & Grants
Contact
CV
Curriculum-RLAIF: Curriculum Alignment with Reinforcement Learning from AI Feedback
Mengdi Li*
,
Jiaye Lin*
,
Xufeng Zhao
,
Wenhao Lu
,
Peilin Zhao
,
Stefan Wermter
,
Di Wang
April 2026
Go to Project Site
RL
LLMs
Previous
Agentic Skill Discovery
Cite
×