MoZoo:Unleashing Video Diffusion power in animal fur and muscle simulation Paper • 2605.13857 • Published Apr 8 • 2
PRISM: A Multi-Dimensional Benchmark for Evaluating LLM Peer Reviewers Paper • 2605.26730 • Published May 27 • 16
RUBRIC-ARROW: Alternating Pointwise Rubric Reward Modeling for LLM Post-training in Non-verifiable Domains Paper • 2605.29156 • Published May 27 • 14
Reflective Prompt Tuning through Language Model Function-Calling Paper • 2605.21781 • Published May 20 • 9
Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses Paper • 2606.02373 • Published 27 days ago • 59