Curriculum-RLAIF: Curriculum Alignment with Reinforcement Learning from AI Feedback
