luanafelbarros 's Collections Medical VLMs
updated
Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in
Vision-Language Models
Paper
• 2503.13939
• Published
• 5
Med-Flamingo: a Multimodal Medical Few-shot Learner
Paper
• 2307.15189
• Published
• 24
MedFuzz: Exploring the Robustness of Large Language Models in Medical
Question Answering
Paper
• 2406.06573
• Published
• 11
BenchX: A Unified Benchmark Framework for Medical Vision-Language
Pretraining on Chest X-Rays
Paper
• 2410.21969
• Published
• 10
GEMeX-ThinkVG: Towards Thinking with Visual Grounding in Medical VQA via
Reinforcement Learning
Paper
• 2506.17939
• Published
• 3
Medical large language models are easily distracted
Paper
• 2504.01201
• Published
• 3
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via
Knowledge Graphs
Paper
• 2504.00993
• Published
• 3
Kvasir-VQA-x1: A Multimodal Dataset for Medical Reasoning and Robust
MedVQA in Gastrointestinal Endoscopy
Paper
• 2506.09958
• Published
• 1
MMedAgent-RL: Optimizing Multi-Agent Collaboration for Multimodal
Medical Reasoning
Paper
• 2506.00555
• Published
• 1
GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark
for Chest X-ray Diagnosis
Paper
• 2411.16778
• Published
• 1
PathVQA: 30000+ Questions for Medical Visual Question Answering
Paper
• 2003.10286
• Published
• 1
Overcoming Data Limitation in Medical Visual Question Answering
Paper
• 1909.11867
• Published
RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language
Models
Paper
• 2407.05131
• Published
• 26
Hierarchical Modeling for Medical Visual Question Answering with
Cross-Attention Fusion
Paper
• 2504.03135
• Published
• 1
OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for
Medical LVLM
Paper
• 2402.09181
• Published
• 1
Gemini Goes to Med School: Exploring the Capabilities of Multimodal
Large Language Models on Medical Challenge Problems & Hallucinations
Paper
• 2402.07023
• Published
• 4
MedGemma Technical Report
Paper
• 2507.05201
• Published
• 16
A Survey of Medical Vision-and-Language Applications and Their
Techniques
Paper
• 2411.12195
• Published