1 6

Zeyu Leo Liu

leo-liuzy

leo-liuzy

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

Learning from Language Feedback via Variational Policy Distillation

upvoted a paper 7 days ago

Harnessing LLM Agents with Skill Programs

upvoted a paper 8 days ago

CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows?

View all activity

Organizations

upvoted a paper 6 days ago

Learning from Language Feedback via Variational Policy Distillation

Paper • 2605.15113 • Published 9 days ago • 10

upvoted a paper 7 days ago

Harnessing LLM Agents with Skill Programs

Paper • 2605.17734 • Published 9 days ago • 34

upvoted a paper 8 days ago

CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows?

Paper • 2605.16679 • Published 12 days ago • 52

upvoted a paper 2 months ago

CREATE: Testing LLMs for Associative Creativity

Paper • 2603.09970 • Published Mar 10 • 15

upvoted a paper 3 months ago

SkillOrchestra: Learning to Route Agents via Skill Transfer

Paper • 2602.19672 • Published Feb 23 • 58

updated a model 9 months ago

structured-reasoning/exp_3218_135_ep5_step-annotation

8B • Updated Sep 1, 2025 • 1

published a model 9 months ago

structured-reasoning/exp_3218_135_ep5_step-annotation

8B • Updated Sep 1, 2025 • 1

New activity in leo-liuzy/Controlled-RippleEdit 12 months ago

Update README.md

#1 opened 12 months ago by

leo-liuzy

updated a dataset 12 months ago

leo-liuzy/Controlled-RippleEdit

Viewer • Updated Jun 15, 2025 • 6.05k • 21

published a dataset 12 months ago

leo-liuzy/Controlled-RippleEdit

Viewer • Updated Jun 15, 2025 • 6.05k • 21

upvoted a paper about 1 year ago

ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

Paper • 2505.13444 • Published May 19, 2025 • 16

authored a paper about 1 year ago

ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

Paper • 2505.13444 • Published May 19, 2025 • 16

updated a dataset about 1 year ago

leo-liuzy/CodeUpdateArena

Viewer • Updated Mar 20, 2025 • 670 • 71 • 1

updated a dataset almost 2 years ago