DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use Paper • 2603.11076 • Published about 1 month ago • 5
HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing Paper • 2601.21459 • Published Jan 29 • 10