Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Wenxuan Song's picture
3 7

Wenxuan Song

Wenxuan123
Cyn0's profile picture lagom2333's profile picture
·
https://songwxuan.github.io/
  • Songwxuan

AI & ML interests

None yet

Recent Activity

upvoted a paper 28 days ago
Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning
upvoted a paper 2 months ago
HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models
upvoted a paper 3 months ago
UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios
View all activity

Organizations

None yet

commented a paper 3 months ago

Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process

Paper • 2511.01718 • Published Nov 3, 2025 • 7 •
1
commented 2 papers 4 months ago

Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model

Paper • 2510.12276 • Published Oct 14, 2025 • 147 •
4

Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model

Paper • 2510.12276 • Published Oct 14, 2025 • 147 •
4
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs