Open-Models - a th3nolo Collection

th3nolo 's Collections

Open-Models

updated 12 days ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 4.69M • • 4.78k
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Paper • 2512.20605 • Published Dec 23, 2025 • 62
Nested Browser-Use Learning for Agentic Information Seeking

Paper • 2512.23647 • Published Dec 29, 2025 • 19
TimeBill: Time-Budgeted Inference for Large Language Models

Paper • 2512.21859 • Published Dec 26, 2025 • 25
ResembleAI/chatterbox-turbo

Text-to-Speech • Updated Dec 15, 2025 • • 648
mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 326
GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models

Paper • 2512.15560 • Published Dec 17, 2025 • 25
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone

Paper • 2512.22615 • Published Dec 27, 2025 • 50
tencent/HY-Motion-1.0

Text-to-3D • Updated Dec 31, 2025 • 430 • 409
Lightricks/LTX-2

Image-to-Video • Updated Mar 2 • 683k • • 1.71k
LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR

Paper • 2601.14251 • Published Jan 20 • 28
DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

Paper • 2601.22153 • Published Jan 29 • 75
tencent/Youtu-VL-4B-Instruct

Image-Text-to-Text • 5B • Updated Feb 10 • 493 • 155
Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation Generation

Paper • 2601.21406 • Published Jan 29 • 6
Reinforcement Learning via Self-Distillation

Paper • 2601.20802 • Published Jan 28 • 44
DeepSeek-OCR 2: Visual Causal Flow

Paper • 2601.20552 • Published Jan 28 • 69
zai-org/GLM-OCR

Image-to-Text • Updated Apr 14 • 7.17M • • 1.75k
unsloth/Qwen3-Coder-Next-FP8-Dynamic

Text Generation • 80B • Updated Feb 3 • 12.6k • 42
Qwen/Qwen3-Coder-Next

Text Generation • 80B • Updated Feb 3 • 1.16M • • 1.37k
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Paper • 2602.12099 • Published Feb 12 • 62
Lightricks/LTX-2.3

Image-to-Video • Updated Apr 13 • 2.04M • 1.19k
mistralai/Leanstral-2603

Updated 26 days ago • 123 • 156
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence

Paper • 2603.13398 • Published Mar 11 • 154
Jackrong/Qwen3.5-4B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF

Image-Text-to-Text • 4B • Updated Apr 6 • 28.3k • 121
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Paper • 2603.20278 • Published Mar 17 • 98
YTan2000/Qwen3.5-27B-TQ3_1S

Image-Text-to-Text • 27B • Updated 24 days ago • 868 • 37
bartowski/arcee-ai_Trinity-Large-Thinking-GGUF

Text Generation • 399B • Updated Apr 1 • 2.66k • 11
zed-industries/zeta-2

Text Generation • 8B • Updated Mar 23 • 4.03k • 178
mudler/Qwen3.5-35B-A3B-APEX-GGUF

Text Generation • 35B • Updated 20 days ago • 65.7k • 93
Jackrong/Qwopus3.5-27B-v3

Image-Text-to-Text • 27B • Updated Apr 16 • 7.04k • 229
Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding

Paper • 2604.00528 • Published Apr 1 • 12
0xSero/gemma-4-21b-a4b-it-REAP

Text Generation • 21B • Updated about 1 month ago • 638 • 93
datalab-to/chandra-ocr-2

Image-Text-to-Text • 5B • Updated Mar 18 • 1.1M • 338
lightonai/LightOnOCR-2-1B

Image-Text-to-Text • 1B • Updated 14 days ago • 561k • 683
selimaktas/MiniMax-M2.75-460B-A20B

Text Generation • 453B • Updated 26 days ago • 910 • 26
Qwen/Qwen3.6-27B

Image-Text-to-Text • 28B • Updated 24 days ago • 3.41M • • 1.32k
XiaomiMiMo/MiMo-V2-Flash

Text Generation • 310B • Updated 27 days ago • 114k • • 728
openai/privacy-filter

Token Classification • 1B • Updated 25 days ago • 248k • • 1.46k
concavity-ai/superlinear-exp-v0.1

Text Generation • 32B • Updated Feb 6 • 38 • 22
openbmb/InfLLM-V2-Long-Sparse-Base

8B • Updated Dec 1, 2025 • 14 • 6
deepseek-ai/DeepSeek-V3.2-Exp

Text Generation • 685B • Updated Nov 18, 2025 • 202k • • 994
HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention

Paper • 2603.28458 • Published Mar 30 • 44