Small Generalizable Prompt Predictive Models Can Steer Efficient RL Post-Training of Large Reasoning Models Paper • 2602.01970 • Published about 22 hours ago • 1
Can Prompt Difficulty be Online Predicted for Accelerating RL Finetuning of Reasoning Models? Paper • 2507.04632 • Published Jul 7, 2025 • 2