LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding Paper • 2605.27365 • Published 6 days ago • 128
Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models Paper • 2403.17589 • Published Mar 26, 2024 • 1
AdaNeg: Adaptive Negative Proxy Guided OOD Detection with Vision-Language Models Paper • 2410.20149 • Published Oct 26, 2024 • 1
ANTS: Adaptive Negative Textual Space Shaping for OOD Detection via Test-Time MLLM Understanding and Reasoning Paper • 2509.03951 • Published Mar 17 • 1
VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding Paper • 2507.13353 • Published Jul 17, 2025 • 2
Knowledge Regularized Negative Feature Tuning of Vision-Language Models for Out-of-Distribution Detection Paper • 2507.19847 • Published Jul 29, 2025 • 1