Efficient Agentic Reasoning Through Self-Regulated Simulative Planning Paper • 2605.22138 • Published 4 days ago • 7
Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling Paper • 2605.13301 • Published 12 days ago • 156
SpecDrift Collection Models released as a part of Attention-Drift Paper, trained for deployment on production • 2 items • Updated 15 days ago • 2
view article Article Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents nvidia • 27 days ago • 57
Fixed Chat Templates for Qwen 3.5 & 3.6 Collection Rewritten Jinja templates fixing 5 bugs in official Qwen 3.5/3.6. Works in LM Studio, llama.cpp, MLX, vLLM. • 1 item • Updated 25 days ago • 4
Proven REAPs Collection Benchmarked REAP checkpoints with >=500 all-time downloads. GLM/Qwen/MiniMax/DeepSeek/Kimi/gemma. • 21 items • Updated 8 days ago • 8
view article Article Mixture of Experts (MoEs) in Transformers +5 ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap • Feb 26 • 161
Learning to Continually Learn via Meta-learning Agentic Memory Designs Paper • 2602.07755 • Published Feb 8 • 8
Nemotron-Cascade 2 Collection Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 5 days ago • 50
TRACER: Trace-Based Adaptive Cost-Efficient Routing for LLM Classification Paper • 2604.14531 • Published Apr 16 • 7
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 18 items • Updated 5 days ago • 295
Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 28 items • Updated Apr 22 • 192