19 10

guoguoc PRO

woshichaoren123

AI & ML interests

None yet

Recent Activity

upvoted a paper about 11 hours ago

Cosmos 3: Omnimodal World Models for Physical AI

new activity 1 day ago

nvidia/LocateAnything-3B:Inference support for vLLM and SGLang OpenAI endpoints

liked a dataset 4 days ago

VCLab-PolyU/GGT-100K

View all activity

Organizations

None yet

upvoted a paper about 11 hours ago

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 4 days ago • 60

upvoted a paper 7 days ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 9 days ago • 87

upvoted 2 papers 9 days ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 10 days ago • 137

DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning

Paper • 2605.25604 • Published 11 days ago • 134

upvoted a paper 13 days ago

Sensor2Sensor: Cross-Embodiment Sensor Conversion for Autonomous Driving

Paper • 2605.22809 • Published 15 days ago • 27

upvoted a paper 16 days ago

LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation

Paper • 2605.18739 • Published 18 days ago • 112

upvoted 2 papers about 1 month ago

Video Analysis and Generation via a Semantic Progress Function

Paper • 2604.22554 • Published Apr 24 • 63

Seeing Fast and Slow: Learning the Flow of Time in Videos

Paper • 2604.21931 • Published Apr 23 • 19

upvoted 2 papers about 2 months ago

HiVLA: A Visual-Grounded-Centric Hierarchical Embodied Manipulation System

Paper • 2604.14125 • Published Apr 15 • 21

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published Apr 8 • 189

upvoted 2 papers 2 months ago

WorldAgents: Can Foundation Image Models be Agents for 3D World Models?

Paper • 2603.19708 • Published Mar 20 • 13

3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model

Paper • 2603.18524 • Published Mar 19 • 58

upvoted 5 papers 3 months ago

Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models

Paper • 2603.15618 • Published Mar 16 • 21

upvoted a paper 4 months ago

PhyCritic: Multimodal Critic Models for Physical AI

Paper • 2602.11124 • Published Feb 11 • 55

upvoted a paper 5 months ago

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

Paper • 2601.09708 • Published Jan 14 • 55

guoguoc PRO

AI & ML interests

Recent Activity

Organizations

woshichaoren123's activity