Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published Apr 8 • 326
jasonrqh/Math-CoT-44k-Qwen3-32b-n32-16384-with-logprob-and-entropy Viewer • Updated Apr 11 • 44.4k • 6.64k • 1