4 122 6

Donghao Zhou

donghao-zhou

https://correr-zhou.github.io

AI & ML interests

Generative AI

Recent Activity

upvoted a paper 1 day ago

Context Unrolling in Omni Models

upvoted a paper 4 days ago

CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation

upvoted a paper 9 days ago

OneHOI: Unifying Human-Object Interaction Generation and Editing

View all activity

Organizations

upvoted a paper 1 day ago

Context Unrolling in Omni Models

Paper • 2604.21921 • Published 3 days ago • 9

upvoted a paper 4 days ago

CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation

Paper • 2604.19636 • Published 5 days ago • 82

upvoted a paper 9 days ago

OneHOI: Unifying Human-Object Interaction Generation and Editing

Paper • 2604.14062 • Published 11 days ago • 8

upvoted a paper 10 days ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published 11 days ago • 152

upvoted a paper 12 days ago

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

Paper • 2604.11804 • Published 13 days ago • 70

upvoted a paper 14 days ago

LPM 1.0: Video-based Character Performance Model

Paper • 2604.07823 • Published 17 days ago • 76

upvoted 2 papers 19 days ago

AURA: Always-On Understanding and Real-Time Assistance via Video Streams

Paper • 2604.04184 • Published 21 days ago • 50

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published 20 days ago • 200

upvoted a paper 24 days ago

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

Paper • 2603.27460 • Published 28 days ago • 68

upvoted a paper 25 days ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published 28 days ago • 144

upvoted 3 papers about 1 month ago

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published Mar 24 • 62

VTC-Bench: Evaluating Agentic Multimodal Models via Compositional Visual Tool Chaining

Paper • 2603.15030 • Published Mar 16 • 21

MosaicMem: Hybrid Spatial Memory for Controllable Video World Models

Paper • 2603.17117 • Published Mar 17 • 87

upvoted 7 papers about 2 months ago

HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images

Paper • 2603.02210 • Published Mar 2 • 29

World Action Models are Zero-shot Policies

Paper • 2602.15922 • Published Feb 17 • 18

DSDR: Dual-Scale Diversity Regularization for Exploration in LLM Reasoning

Paper • 2602.19895 • Published Feb 23 • 14

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 519

Donghao Zhou

AI & ML interests

Recent Activity

Organizations

donghao-zhou's activity