-
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy
Paper • 2507.01352 • Published • 56 -
A Data-Centric Framework for Addressing Phonetic and Prosodic Challenges in Russian Speech Generative Models
Paper • 2507.13563 • Published • 53 -
Scaling Laws for Optimal Data Mixtures
Paper • 2507.09404 • Published • 37 -
Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation
Paper • 2511.14993 • Published • 231
Collections
Discover the best community collections!
Collections including paper arxiv:2512.16676
-
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 219 -
Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
Paper • 2512.19673 • Published • 64 -
Region-Constraint In-Context Generation for Instructional Video Editing
Paper • 2512.17650 • Published • 51 -
SpatialTree: How Spatial Abilities Branch Out in MLLMs
Paper • 2512.20617 • Published • 43
-
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
Paper • 2512.16093 • Published • 95 -
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer
Paper • 2511.22699 • Published • 238 -
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 219 -
Sharp Monocular View Synthesis in Less Than a Second
Paper • 2512.10685 • Published • 28
-
MMGR: Multi-Modal Generative Reasoning
Paper • 2512.14691 • Published • 119 -
KlingAvatar 2.0 Technical Report
Paper • 2512.13313 • Published • 43 -
SemanticGen: Video Generation in Semantic Space
Paper • 2512.20619 • Published • 93 -
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 219
-
UFO^3: Weaving the Digital Agent Galaxy
Paper • 2511.11332 • Published • 19 -
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 219 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 136
-
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 219 -
Qwen3-VL Technical Report
Paper • 2511.21631 • Published • 156 -
Step-DeepResearch Technical Report
Paper • 2512.20491 • Published • 86 -
Deep Research: A Systematic Survey
Paper • 2512.02038 • Published • 72
-
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 219 -
Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels
Paper • 2510.06499 • Published • 33 -
FLAMES: Improving LLM Math Reasoning via a Fine-Grained Analysis of the Data Synthesis Pipeline
Paper • 2508.16514 • Published • 1 -
Seed-Coder: Let the Code Model Curate Data for Itself
Paper • 2506.03524 • Published • 6
-
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
Paper • 2511.16334 • Published • 93 -
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
Paper • 2509.07980 • Published • 104 -
ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute
Paper • 2509.04475 • Published • 3 -
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
Paper • 2512.01374 • Published • 105
-
The Art of Scaling Reinforcement Learning Compute for LLMs
Paper • 2510.13786 • Published • 32 -
Attention Is All You Need for KV Cache in Diffusion LLMs
Paper • 2510.14973 • Published • 42 -
BitNet Distillation
Paper • 2510.13998 • Published • 59 -
GigaBrain-0: A World Model-Powered Vision-Language-Action Model
Paper • 2510.19430 • Published • 52
-
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy
Paper • 2507.01352 • Published • 56 -
A Data-Centric Framework for Addressing Phonetic and Prosodic Challenges in Russian Speech Generative Models
Paper • 2507.13563 • Published • 53 -
Scaling Laws for Optimal Data Mixtures
Paper • 2507.09404 • Published • 37 -
Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation
Paper • 2511.14993 • Published • 231
-
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 219 -
Qwen3-VL Technical Report
Paper • 2511.21631 • Published • 156 -
Step-DeepResearch Technical Report
Paper • 2512.20491 • Published • 86 -
Deep Research: A Systematic Survey
Paper • 2512.02038 • Published • 72
-
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 219 -
Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
Paper • 2512.19673 • Published • 64 -
Region-Constraint In-Context Generation for Instructional Video Editing
Paper • 2512.17650 • Published • 51 -
SpatialTree: How Spatial Abilities Branch Out in MLLMs
Paper • 2512.20617 • Published • 43
-
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
Paper • 2512.16093 • Published • 95 -
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer
Paper • 2511.22699 • Published • 238 -
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 219 -
Sharp Monocular View Synthesis in Less Than a Second
Paper • 2512.10685 • Published • 28
-
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 219 -
Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels
Paper • 2510.06499 • Published • 33 -
FLAMES: Improving LLM Math Reasoning via a Fine-Grained Analysis of the Data Synthesis Pipeline
Paper • 2508.16514 • Published • 1 -
Seed-Coder: Let the Code Model Curate Data for Itself
Paper • 2506.03524 • Published • 6
-
MMGR: Multi-Modal Generative Reasoning
Paper • 2512.14691 • Published • 119 -
KlingAvatar 2.0 Technical Report
Paper • 2512.13313 • Published • 43 -
SemanticGen: Video Generation in Semantic Space
Paper • 2512.20619 • Published • 93 -
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 219
-
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
Paper • 2511.16334 • Published • 93 -
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
Paper • 2509.07980 • Published • 104 -
ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute
Paper • 2509.04475 • Published • 3 -
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
Paper • 2512.01374 • Published • 105
-
UFO^3: Weaving the Digital Agent Galaxy
Paper • 2511.11332 • Published • 19 -
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 219 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 136
-
The Art of Scaling Reinforcement Learning Compute for LLMs
Paper • 2510.13786 • Published • 32 -
Attention Is All You Need for KV Cache in Diffusion LLMs
Paper • 2510.14973 • Published • 42 -
BitNet Distillation
Paper • 2510.13998 • Published • 59 -
GigaBrain-0: A World Model-Powered Vision-Language-Action Model
Paper • 2510.19430 • Published • 52