From Passive Metric to Active Signal: The Evolving Role of Uncertainty Quantification in Large Language Models Paper • 2601.15690 • Published 3 days ago • 4
ActionMesh: Animated 3D Mesh Generation with Temporal 3D Diffusion Paper • 2601.16148 • Published 3 days ago • 9
VideoMaMa: Mask-Guided Video Matting via Generative Prior Paper • 2601.14255 • Published 5 days ago • 9
Numba-Accelerated 2D Diffusion-Limited Aggregation: Implementation and Fractal Characterization Paper • 2601.15440 • Published 4 days ago • 1
360Anything: Geometry-Free Lifting of Images and Videos to 360° Paper • 2601.16192 • Published 3 days ago • 6
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience Paper • 2601.15876 • Published 3 days ago • 72
HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding Paper • 2601.14724 • Published 4 days ago • 62
Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning Paper • 2601.16163 • Published 3 days ago • 11
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders Paper • 2601.16208 • Published 3 days ago • 45
OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation Paper • 2601.15369 • Published 4 days ago • 16
Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model Paper • 2601.15892 • Published 3 days ago • 44
The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models Paper • 2601.15165 • Published 4 days ago • 59
GutenOCR: A Grounded Vision-Language Front-End for Documents Paper • 2601.14490 • Published 4 days ago • 28
Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance Paper • 2601.14171 • Published 5 days ago • 44