Vision - a neopolita Collection

Vision

updated Jan 23, 2025

liuhaotian/llava-v1.6-34b

Image-Text-to-Text • 35B • Updated May 9, 2024 • 21.3k • 363
deepseek-ai/deepseek-vl-7b-base

7B • Updated Mar 15, 2024 • 94 • 65
deepseek-ai/deepseek-vl-7b-chat

Image-Text-to-Text • 7B • Updated Mar 15, 2024 • 2.61k • 270
HuggingFaceM4/idefics2-8b

Image-Text-to-Text • 8B • Updated Oct 14, 2024 • 99k • 621
HuggingFaceM4/idefics2-8b-chatty

Image-Text-to-Text • 8B • Updated Jul 30, 2024 • 152 • 95
HuggingFaceM4/idefics2-8b-base

Image-Text-to-Text • 8B • Updated Jul 30, 2024 • 1.29k • 28
google/paligemma-3b-pt-896

Image-Text-to-Text • 3B • Updated Jun 22, 2025 • 575 • 124
microsoft/Phi-3-vision-128k-instruct

Text Generation • Updated Dec 10, 2025 • 160k • 971
facebook/chameleon-7b

Image-Text-to-Text • 7B • Updated Jul 23, 2024 • 143k • 200
microsoft/Phi-3.5-vision-instruct

Image-Text-to-Text • Updated Dec 10, 2025 • 1.65M • 733
meta-llama/Llama-3.2-11B-Vision

Image-Text-to-Text • 11B • Updated Sep 27, 2024 • 13k • 586
meta-llama/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 132k • 1.59k
meta-llama/Llama-3.2-90B-Vision

Image-Text-to-Text • 89B • Updated Sep 27, 2024 • 2.69k • 134
meta-llama/Llama-3.2-90B-Vision-Instruct

Image-Text-to-Text • 89B • Updated Mar 4, 2025 • 12.5k • 356
meta-llama/Llama-Guard-3-11B-Vision

Image-Text-to-Text • 11B • Updated Nov 18, 2024 • 5.56k • 72
allenai/Molmo-72B-0924

Image-Text-to-Text • 73B • Updated Oct 9, 2025 • 3.7k • 298
allenai/Molmo-7B-D-0924

Image-Text-to-Text • 8B • Updated Dec 15, 2025 • 39.9k • 566
allenai/Molmo-7B-O-0924

Image-Text-to-Text • 8B • Updated Oct 9, 2025 • 675 • 163
allenai/MolmoE-1B-0924

Image-Text-to-Text • Updated Apr 24, 2025 • 6.41k • 157
genmo/mochi-1-preview

Text-to-Video • Updated Sep 4, 2025 • 8.47k • 1.32k
microsoft/OmniParser

Image-Text-to-Text • Updated Dec 2, 2024 • 259 • 1.71k
Lightricks/LTX-Video

Image-to-Video • Updated Jul 16, 2025 • 473k • 2.17k