SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization Paper • 2604.02268 • Published Apr 2 • 101
ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement Learning Paper • 2603.05863 • Published Mar 6 • 6
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 629
Grad2Reward: From Sparse Judgment to Dense Rewards for Improving Open-Ended LLM Reasoning Paper • 2602.01791 • Published Feb 2 • 2
InCoder-32B: Code Foundation Model for Industrial Scenarios Paper • 2603.16790 • Published Mar 17 • 311
OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models Paper • 2604.10866 • Published Apr 13 • 66
RADAR: Robust AI-Text Detection via Adversarial Learning Paper • 2307.03838 • Published Jul 7, 2023 • 2