Wenxuan Song

Wenxuan123

https://songwxuan.github.io/

Songwxuan

AI & ML interests

None yet

Recent Activity

upvoted a paper 28 days ago

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

upvoted a paper 2 months ago

HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models

upvoted a paper 3 months ago

UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios

Organizations

None yet

commented a paper 3 months ago

Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process

Paper • 2511.01718 • Published Nov 3, 2025 • 7

commented 2 papers 4 months ago

Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model

Paper • 2510.12276 • Published Oct 14, 2025 • 147
