-
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs
Paper • 2501.18585 • Published • 61 -
RWKV-7 "Goose" with Expressive Dynamic State Evolution
Paper • 2503.14456 • Published • 153 -
DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
Paper • 2503.15265 • Published • 46 -
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning
Paper • 2503.15558 • Published • 50
Collections
Discover the best community collections!
Collections including paper arxiv:2506.07044
-
FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models
Paper • 2411.18672 • Published -
CoMT: Chain-of-Medical-Thought Reduces Hallucination in Medical Report Generation
Paper • 2406.11451 • Published -
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
Paper • 2506.07044 • Published • 113 -
μ^2Tokenizer: Differentiable Multi-Scale Multi-Modal Tokenizer for Radiology Report Generation
Paper • 2507.00316 • Published • 15
-
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
Paper • 2506.07044 • Published • 113 -
mlx-community/Lingshu-7B-4bit
Image-Text-to-Text • 1B • Updated • 1 -
mlx-community/Lingshu-7B-6bit
Image-Text-to-Text • Updated -
mlx-community/Lingshu-7B-8bit
Image-Text-to-Text • Updated • 1
-
lingshu-medical-mllm/Lingshu-7B
Image-Text-to-Text • Updated • 7.31k • 71 -
lingshu-medical-mllm/Lingshu-32B
Image-Text-to-Text • 33B • Updated • 1.49k • 74 -
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
Paper • 2506.07044 • Published • 113 -
Mungert/Lingshu-32B-GGUF
Image-Text-to-Text • 33B • Updated • 166 • 6
-
CoRAG: Collaborative Retrieval-Augmented Generation
Paper • 2504.01883 • Published • 9 -
SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning
Paper • 2504.08600 • Published • 32 -
Reasoning-SQL: Reinforcement Learning with SQL Tailored Partial Rewards for Reasoning-Enhanced Text-to-SQL
Paper • 2503.23157 • Published • 10 -
AI Agents: Evolution, Architecture, and Real-World Applications
Paper • 2503.12687 • Published • 2
-
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
Paper • 2506.07044 • Published • 113 -
Current Pathology Foundation Models are unrobust to Medical Center Differences
Paper • 2501.18055 • Published • 3 -
A Multicenter Benchmark of Multiple Instance Learning Models for Lymphoma Subtyping from HE-stained Whole Slide Images
Paper • 2512.14640 • Published • 1 -
Atlas 2 -- Foundation models for clinical deployment
Paper • 2601.05148 • Published • 3
-
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
Paper • 2505.02567 • Published • 80 -
TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations
Paper • 2505.18125 • Published • 112 -
Distilling LLM Agent into Small Models with Retrieval and Code Tools
Paper • 2505.17612 • Published • 81 -
One RL to See Them All: Visual Triple Unified Reinforcement Learning
Paper • 2505.18129 • Published • 62
-
Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline
Paper • 2411.12814 • Published • 23 -
SegBook: A Simple Baseline and Cookbook for Volumetric Medical Image Segmentation
Paper • 2411.14525 • Published • 19 -
MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities
Paper • 2412.04106 • Published • 5 -
PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion
Paper • 2412.17780 • Published • 5
-
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs
Paper • 2501.18585 • Published • 61 -
RWKV-7 "Goose" with Expressive Dynamic State Evolution
Paper • 2503.14456 • Published • 153 -
DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
Paper • 2503.15265 • Published • 46 -
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning
Paper • 2503.15558 • Published • 50
-
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
Paper • 2506.07044 • Published • 113 -
Current Pathology Foundation Models are unrobust to Medical Center Differences
Paper • 2501.18055 • Published • 3 -
A Multicenter Benchmark of Multiple Instance Learning Models for Lymphoma Subtyping from HE-stained Whole Slide Images
Paper • 2512.14640 • Published • 1 -
Atlas 2 -- Foundation models for clinical deployment
Paper • 2601.05148 • Published • 3
-
FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models
Paper • 2411.18672 • Published -
CoMT: Chain-of-Medical-Thought Reduces Hallucination in Medical Report Generation
Paper • 2406.11451 • Published -
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
Paper • 2506.07044 • Published • 113 -
μ^2Tokenizer: Differentiable Multi-Scale Multi-Modal Tokenizer for Radiology Report Generation
Paper • 2507.00316 • Published • 15
-
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
Paper • 2506.07044 • Published • 113 -
mlx-community/Lingshu-7B-4bit
Image-Text-to-Text • 1B • Updated • 1 -
mlx-community/Lingshu-7B-6bit
Image-Text-to-Text • Updated -
mlx-community/Lingshu-7B-8bit
Image-Text-to-Text • Updated • 1
-
lingshu-medical-mllm/Lingshu-7B
Image-Text-to-Text • Updated • 7.31k • 71 -
lingshu-medical-mllm/Lingshu-32B
Image-Text-to-Text • 33B • Updated • 1.49k • 74 -
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
Paper • 2506.07044 • Published • 113 -
Mungert/Lingshu-32B-GGUF
Image-Text-to-Text • 33B • Updated • 166 • 6
-
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
Paper • 2505.02567 • Published • 80 -
TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations
Paper • 2505.18125 • Published • 112 -
Distilling LLM Agent into Small Models with Retrieval and Code Tools
Paper • 2505.17612 • Published • 81 -
One RL to See Them All: Visual Triple Unified Reinforcement Learning
Paper • 2505.18129 • Published • 62
-
CoRAG: Collaborative Retrieval-Augmented Generation
Paper • 2504.01883 • Published • 9 -
SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning
Paper • 2504.08600 • Published • 32 -
Reasoning-SQL: Reinforcement Learning with SQL Tailored Partial Rewards for Reasoning-Enhanced Text-to-SQL
Paper • 2503.23157 • Published • 10 -
AI Agents: Evolution, Architecture, and Real-World Applications
Paper • 2503.12687 • Published • 2
-
Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline
Paper • 2411.12814 • Published • 23 -
SegBook: A Simple Baseline and Cookbook for Volumetric Medical Image Segmentation
Paper • 2411.14525 • Published • 19 -
MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities
Paper • 2412.04106 • Published • 5 -
PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion
Paper • 2412.17780 • Published • 5