None defined yet.
TeamHOI: Learning a Unified Policy for Cooperative Human-Object Interactions with Any Team Size
Rethinking the Trust Region in LLM Reinforcement Learning