-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 24 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 85 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 152 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 25
Collections
Discover the best community collections!
Collections including paper arxiv:2602.08222
-
AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation
Paper • 2602.17100 • Published • 2 -
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant
Paper • 2603.01059 • Published • 1 -
Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models
Paper • 2603.00618 • Published -
Heterogeneous Agent Collaborative Reinforcement Learning
Paper • 2603.02604 • Published • 171
-
The Trinity of Consistency as a Defining Principle for General World Models
Paper • 2602.23152 • Published • 196 -
From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models
Paper • 2602.22859 • Published • 148 -
OmniGAIA: Towards Native Omni-Modal AI Agents
Paper • 2602.22897 • Published • 52 -
Imagination Helps Visual Reasoning, But Not Yet in Latent Space
Paper • 2602.22766 • Published • 40
-
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published • 106 -
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper • 2310.11511 • Published • 78 -
In-Context Learning Creates Task Vectors
Paper • 2310.15916 • Published • 44 -
Matryoshka Diffusion Models
Paper • 2310.15111 • Published • 45
-
GLM-5: from Vibe Coding to Agentic Engineering
Paper • 2602.15763 • Published • 108 -
Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning
Paper • 2602.07845 • Published • 69 -
LLaDA2.1: Speeding Up Text Diffusion via Token Editing
Paper • 2602.08676 • Published • 70 -
MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents
Paper • 2602.02474 • Published • 59
-
Agentic Reasoning for Large Language Models
Paper • 2601.12538 • Published • 201 -
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence
Paper • 2511.18538 • Published • 301 -
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 273 -
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger
Paper • 2602.08222 • Published • 282
-
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger
Paper • 2602.08222 • Published • 282 -
Rethinking Generative Recommender Tokenizer: Recsys-Native Encoding and Semantic Quantization Beyond LLMs
Paper • 2602.02338 • Published • 41 -
Scaling Embeddings Outperforms Scaling Experts in Language Models
Paper • 2601.21204 • Published • 102 -
Agentic Reasoning for Large Language Models
Paper • 2601.12538 • Published • 201
-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 24 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 85 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 152 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 25
-
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published • 106 -
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper • 2310.11511 • Published • 78 -
In-Context Learning Creates Task Vectors
Paper • 2310.15916 • Published • 44 -
Matryoshka Diffusion Models
Paper • 2310.15111 • Published • 45
-
AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation
Paper • 2602.17100 • Published • 2 -
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant
Paper • 2603.01059 • Published • 1 -
Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models
Paper • 2603.00618 • Published -
Heterogeneous Agent Collaborative Reinforcement Learning
Paper • 2603.02604 • Published • 171
-
GLM-5: from Vibe Coding to Agentic Engineering
Paper • 2602.15763 • Published • 108 -
Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning
Paper • 2602.07845 • Published • 69 -
LLaDA2.1: Speeding Up Text Diffusion via Token Editing
Paper • 2602.08676 • Published • 70 -
MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents
Paper • 2602.02474 • Published • 59
-
The Trinity of Consistency as a Defining Principle for General World Models
Paper • 2602.23152 • Published • 196 -
From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models
Paper • 2602.22859 • Published • 148 -
OmniGAIA: Towards Native Omni-Modal AI Agents
Paper • 2602.22897 • Published • 52 -
Imagination Helps Visual Reasoning, But Not Yet in Latent Space
Paper • 2602.22766 • Published • 40
-
Agentic Reasoning for Large Language Models
Paper • 2601.12538 • Published • 201 -
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence
Paper • 2511.18538 • Published • 301 -
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 273 -
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger
Paper • 2602.08222 • Published • 282
-
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger
Paper • 2602.08222 • Published • 282 -
Rethinking Generative Recommender Tokenizer: Recsys-Native Encoding and Semantic Quantization Beyond LLMs
Paper • 2602.02338 • Published • 41 -
Scaling Embeddings Outperforms Scaling Experts in Language Models
Paper • 2601.21204 • Published • 102 -
Agentic Reasoning for Large Language Models
Paper • 2601.12538 • Published • 201