COLLECTION - a scottrx11 Collection

updated about 1 hour ago

AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation

Paper • 2602.17100 • Published 21 days ago • 2
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant

Paper • 2603.01059 • Published 11 days ago • 1
Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models

Paper • 2603.00618 • Published 12 days ago
Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published 9 days ago • 163
Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory

Paper • 2603.04257 • Published 7 days ago • 18
InfinityStory: Unlimited Video Generation with World Consistency and Character-Aware Shot Transitions

Paper • 2603.03646 • Published 8 days ago • 8
TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization Methods

Paper • 2407.21630 • Published Jul 31, 2024 • 8
SageBwd: A Trainable Low-bit Attention

Paper • 2603.02170 • Published 9 days ago • 16
Experiential Reinforcement Learning

Paper • 2602.13949 • Published 25 days ago • 70
On-Policy Self-Distillation for Reasoning Compression

Paper • 2603.05433 • Published 6 days ago • 6
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published Feb 9 • 282
Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published 29 days ago • 241
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 109
LiteAttention: A Temporal Sparse Attention for Diffusion Transformers

Paper • 2511.11062 • Published Nov 14, 2025 • 32
KLASS: KL-Guided Fast Inference in Masked Diffusion Models

Paper • 2511.05664 • Published Nov 7, 2025 • 37
Evolve the Method, Not the Prompts: Evolutionary Synthesis of Jailbreak Attacks on LLMs

Paper • 2511.12710 • Published Nov 16, 2025 • 39
Virtual Width Networks

Paper • 2511.11238 • Published Nov 14, 2025 • 38
Distribution-Conditioned Transport

Paper • 2603.04736 • Published 7 days ago • 2
Truncated Step-Level Sampling with Process Rewards for Retrieval-Augmented Reasoning

Paper • 2602.23440 • Published 13 days ago • 3
Latent Particle World Models: Self-supervised Object-centric Stochastic Dynamics Modeling

Paper • 2603.04553 • Published 7 days ago • 3
Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model

Paper • 2603.05438 • Published 6 days ago • 33
Dynamic Chunking Diffusion Transformer

Paper • 2603.06351 • Published 6 days ago • 12
Beyond the Grid: Layout-Informed Multi-Vector Retrieval with Parsed Visual Document Representations

Paper • 2603.01666 • Published 10 days ago • 1
FlashPrefill: Instantaneous Pattern Discovery and Thresholding for Ultra-Fast Long-Context Prefilling

Paper • 2603.06199 • Published 6 days ago • 9
π-StepNFT: Wider Space Needs Finer Steps in Online RL for Flow-based VLAs

Paper • 2603.02083 • Published 9 days ago • 9
EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding

Paper • 2603.04254 • Published 7 days ago • 1
LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding

Paper • 2602.20913 • Published 16 days ago • 10
Efficient Continual Learning in Language Models via Thalamically Routed Cortical Columns

Paper • 2602.22479 • Published 14 days ago
VecGlypher: Unified Vector Glyph Generation with Language Models

Paper • 2602.21461 • Published 15 days ago • 11
Vectorizing the Trie: Efficient Constrained Decoding for LLM-based Generative Retrieval on Accelerators

Paper • 2602.22647 • Published 14 days ago • 3
Learning from Trials and Errors: Reflective Test-Time Planning for Embodied LLMs

Paper • 2602.21198 • Published 15 days ago • 4
Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs

Paper • 2603.09906 • Published 1 day ago • 44
InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Paper • 2603.09877 • Published 1 day ago • 28
Towards a Neural Debugger for Python

Paper • 2603.09951 • Published 1 day ago • 2
TALON: Test-time Adaptive Learning for On-the-Fly Category Discovery

Paper • 2603.08075 • Published 3 days ago
ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement Learning

Paper • 2603.05863 • Published 6 days ago • 2
Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control

Paper • 2603.09221 • Published 2 days ago
Multi-Head Low-Rank Attention

Paper • 2603.02188 • Published 9 days ago • 1
Compiler-First State Space Duality and Portable O(1) Autoregressive Caching for Inference

Paper • 2603.09555 • Published 1 day ago • 1