dailypaper
updated
Paper
•
2511.22475
•
Published
•
23
DiP: Taming Diffusion Models in Pixel Space
Paper
•
2511.18822
•
Published
•
29
Asking like Socrates: Socrates helps VLMs understand remote sensing images
Paper
•
2511.22396
•
Published
•
5
Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning
Paper
•
2512.05591
•
Published
•
17
RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards
Paper
•
2512.00473
•
Published
•
26
SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning
Paper
•
2512.03244
•
Published
•
17
TreeGRPO: Tree-Advantage GRPO for Online RL Post-Training of Diffusion Models
Paper
•
2512.08153
•
Published
•
8
SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder
Paper
•
2512.11749
•
Published
•
39
Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models
Paper
•
2512.13607
•
Published
•
34
REGLUE Your Latents with Global and Local Semantics for Entangled Diffusion
Paper
•
2512.16636
•
Published
•
26
Rethinking Sample Polarity in Reinforcement Learning with Verifiable Rewards
Paper
•
2512.21625
•
Published
•
4
Self-Evaluation Unlocks Any-Step Text-to-Image Generation
Paper
•
2512.22374
•
Published
•
17
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Paper
•
2601.05242
•
Published
•
222
GARDO: Reinforcing Diffusion Models without Reward Hacking
Paper
•
2512.24138
•
Published
•
29
Boosting Latent Diffusion Models via Disentangled Representation Alignment
Paper
•
2601.05823
•
Published
•
17
Your Group-Relative Advantage Is Biased
Paper
•
2601.08521
•
Published
•
150
Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders
Paper
•
2601.10332
•
Published
•
28