arxiv:2505.04620
Shengqiong Wu
ChocoWu
AI & ML interests
Large Language Model, Multimodal learning, Scene graph Generation
Recent Activity
upvoted a paper about 23 hours ago
Audio-Visual Intelligence in Large Foundation Models upvoted a paper 5 months ago
SemanticGen: Video Generation in Semantic Space upvoted a paper 5 months ago
Kling-Omni Technical Report