Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Inference Providers
·
Metrics for top trending models
Browse all models
Learn more
Reset
Model
Provider
Input $/1M
Output $/1M
Context
Latency(s)
Throughput(t/s)
Tools
Structured
Qwen/Qwen3-32B
Qwen3-32B
groq
fastest
$0.29
$0.59
131,072
0.27
302
Yes
No
Qwen/Qwen3-32B
Qwen3-32B
novita
$0.10
$0.45
40,960
0.67
16
No
No
Qwen/Qwen3-32B
Qwen3-32B
sambanova
$0.40
$0.80
32,768
1.92
191
Yes
Yes
Qwen/Qwen3-32B
Qwen3-32B
nscale
cheapest
$0.08
$0.25
40,960
0.85
28
Yes
Yes
Qwen/Qwen3-32B
Qwen3-32B
ovhcloud
$0.09
$0.25
32,768
0.40
39
Yes
No