-
SAM 3: Segment Anything with Concepts
Paper • 2511.16719 • Published • 129 -
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance
Paper • 2512.08765 • Published • 132 -
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length
Paper • 2512.04677 • Published • 170 -
LongCat-Image Technical Report
Paper • 2512.07584 • Published • 23
Collections
Collections including paper arxiv:2507.04009
-
Step-Audio-R1 Technical Report
Paper • 2511.15848 • Published • 58 -
OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation
Paper • 2410.17799 • Published • 9 -
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents
Paper • 2507.04009 • Published • 53
-
Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents
Paper • 2509.06917 • Published • 43 -
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents
Paper • 2507.04009 • Published • 53 -
WebDancer: Towards Autonomous Information Seeking Agency
Paper • 2505.22648 • Published • 33
-
DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning
Paper • 2504.07128 • Published • 87 -
BM25S: Orders of magnitude faster lexical search via eager sparse scoring
Paper • 2407.03618 • Published • 14 -
Deep Think with Confidence
Paper • 2508.15260 • Published • 90 -
R-Zero: Self-Evolving Reasoning LLM from Zero Data
Paper • 2508.05004 • Published • 130
-
A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code
Paper • 2508.18106 • Published • 348 -
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems
Paper • 2411.02959 • Published • 71 -
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents
Paper • 2507.04009 • Published • 53 -
MonkeyOCR: Document Parsing with a Structure-Recognition-Relation Triplet Paradigm
Paper • 2506.05218 • Published • 2