arxiv:2601.16208
Jihan Yang PRO
jihanyang
AI & ML interests
Computer Vision, Multimodality, Embodied AI
Recent Activity
upvoted a paper 8 days ago
Beyond Language Modeling: An Exploration of Multimodal Pretraining upvoted a paper 13 days ago
Solaris: Building a Multiplayer Video World Model in Minecraft liked
a dataset 23 days ago
nyu-visionx/scale-rae-data