arxiv:2602.07796
Changdae Oh
changdae
AI & ML interests
Distribution Shift; Uncertainty Quantification
Recent Activity
upvoted an article about 13 hours ago
A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond
upvoted a paper 6 days ago
UniGame: Turning a Unified Multimodal Model Into Its Own Adversary
liked a model 13 days ago
Qwen/Qwen3.5-27B