Multi-Task Reinforcement Learning for Enhanced Multimodal LLM-as-a-Judge
Abstract
Multi-Task Reinforcement Learning framework improves multimodal large language models' judgment consistency and generalization across diverse visual tasks.
Multimodal Large Language Models (MLLMs) have been widely adopted as MLLM-as-a-Judges due to their strong alignment with human judgment across various visual tasks. However, most existing judge models are optimized for single-task scenarios and struggle to generalize to diverse contexts, which is a critical requirement for reliable evaluation. To address this limitation, we propose Multi-Task Reinforcement Learning for MLLM-as-a-Judge (MT-RL-Judge), a framework that jointly optimizes the judge model across multiple tasks, leveraging the generalization capabilities of RL. Experimental results against several strong baselines demonstrate that MT-RL-Judge outperforms strong baselines in both judgment consistency and correlation with human preferences. Furthermore, our approach exhibits robust generalization on out-of-distribution tasks, further validating its effectiveness.
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Bi-Level Prompt Optimization for Multimodal LLM-as-a-Judge (2026)
- Omni-RRM: Advancing Omni Reward Modeling via Automatic Rubric-Grounded Preference Synthesis (2026)
- Endogenous Reprompting: Self-Evolving Cognitive Alignment for Unified Multimodal Models (2026)
- TPRU: Advancing Temporal and Procedural Understanding in Large Multimodal Models (2026)
- EMO-R3: Reflective Reinforcement Learning for Emotional Reasoning in Multimodal Large Language Models (2026)
- V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval (2026)
- PaLMR: Towards Faithful Visual Reasoning via Multimodal Process Alignment (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper