TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration Paper • 2606.04743 • Published 6 days ago • 39
OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources Paper • 2605.29250 • Published 12 days ago • 77
Self-Improving Language Models with Bidirectional Evolutionary Search Paper • 2605.28814 • Published 13 days ago • 59
Learn from Weaknesses: Automated Domain Specialization for Small Computer-Use Agents Paper • 2605.28775 • Published 13 days ago • 38
Agent Explorative Policy Optimization for Multimodal Agentic Reasoning Paper • 2605.28774 • Published 13 days ago • 90
HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents Paper • 2605.17873 • Published 22 days ago • 12
It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs Paper • 2605.20258 • Published 22 days ago • 30
Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR Paper • 2605.15726 • Published 25 days ago • 34
wgcyeo/ci-feedback_weighted_asym_bi_kl_fixed_ema_Qwen3-4B-Instruct-2507_bw0p5_fw0p5_ema0p9999_ep30 Text Generation • Updated May 5 • 2
wgcyeo/ci-feedback_weighted_asym_bi_kl_fixed_ema_Qwen3-4B-Instruct-2507_bw0p5_fw0p5_ema0p99_ep30 Text Generation • Updated May 3 • 3
wgcyeo/ci-feedback_weighted_asym_bi_kl_fixed_ema_Qwen3-4B_bw0p5_fw0p5_ema0p999_ep30_rgcomq0p5 Text Generation • Updated May 1 • 3
wgcyeo/ci-feedback_asym_bi_kl_hybrid_fixed_ema_Qwen3-14B_bw0p5_fw0p5_ema0p999_ep30 Text Generation • Updated Apr 30 • 4