ClawGym: A Scalable Framework for Building Effective Claw Agents • Paper • 2604.26904 • Published 7 days ago • 47
Toward Scalable Terminal Task Synthesis via Skill Graphs • Paper • 2604.25727 • Published 8 days ago • 9
Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation • Paper • 2604.18168 • Published 16 days ago • 97
MultiWorld: Scalable Multi-Agent Multi-View Video World Models • Paper • 2604.18564 • Published 16 days ago • 45
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation • Paper • 2604.18486 • Published 16 days ago • 90
Toward Autonomous Long-Horizon Engineering for ML Research • Paper • 2604.13018 • Published 22 days ago • 34
Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs • Paper • 2604.10480 • Published 24 days ago • 20
Strips as Tokens: Artist Mesh Generation with Native UV Segmentation • Paper • 2604.09132 • Published 26 days ago • 55
Pseudo-Unification: Entropy Probing Reveals Divergent Information Patterns in Unified Multimodal Models • Paper • 2604.10949 • Published 23 days ago • 40
QuanBench+: A Unified Multi-Framework Benchmark for LLM-Based Quantum Code Generation • Paper • 2604.08570 • Published Mar 25 • 125
Uni-ViGU: Towards Unified Video Generation and Understanding via A Diffusion-Based Video Generator • Paper • 2604.08121 • Published 27 days ago • 43
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation • Paper • 2604.11804 • Published 23 days ago • 71
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation • Paper • 2604.10098 • Published 25 days ago • 79
Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation • Paper • 2603.12793 • Published Mar 13 • 38
HybridStitch: Pixel and Timestep Level Model Stitching for Diffusion Acceleration • Paper • 2603.07815 • Published Mar 8 • 10