arxiv:2503.18991
Cheng
RosyCheng
ยท
AI & ML interests
LLM Alignment&Security
Recent Activity
upvoted a paper 4 days ago
Internal Safety Collapse in Frontier Large Language Models upvoted a paper 27 days ago
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens authored a paper 9 months ago
Inverse Reinforcement Learning with Dynamic Reward Scaling for LLM
Alignment