Inference Providers
Active filters: torchao
Text Generation
• Updated • 3
Image-Text-to-Text
• Updated • 3
jerryzh168/gemma3-4b-it-float8dq
Image-Text-to-Text
• Updated • 8
vymenets/yv-llama-quantized
Text Generation
• Updated • 4
jerryzh168/gemma3-4b-it-int4wo
Image-Text-to-Text
• Updated • 4
jerryzh168/gemma3-4b-it-int4wo-hqq
Image-Text-to-Text
• Updated • 3
Text Generation
• Updated • 3
medmekk/Llama-3.2-1B-torchao-int8wo-gs128
medmekk/Llama-3.2-1B-ao-autoquant
medmekk/Llama-3.2-1B-ao-int8wo-gs128
medmekk/Llama-3.2-1B-ao-int8wo
Text Generation
• Updated • 2
medmekk/Llama-3.2-1B-ao-int8da8w
Text Generation
• Updated • 1
medmekk/Llama-3.2-1B-ao-int8wo-gs16
Text Generation
• Updated • 2
medmekk/Llama-3.2-1B-ao-int8wo-gs32
Text Generation
• Updated • 6
medmekk/Qwen2.5-0.5B-Instruct-ao-int8wo-gs128
Text Generation
• Updated • 2
jerryzh168/phi4-int4wo-gptq
Text Generation
• Updated • 2
medmekk/Qwen2.5-0.5B-Instruct-ao-int8da8w
Text Generation
• Updated • 5
jerryzh168/phi4-mini-8da4w
Text Generation
• Updated • 3
RoadToNowhere/Qwen2.5-QwQ-35B-Eureka-Cubed-abliterated-uncensored-int8wo-g128
Text Generation
• Updated • 2
jerryzh168/phi4-int4wo-hqq
Text Generation
• Updated • 7
jerryzh168/phi4-torchao-gguf-q4_k
Text Generation
• Updated • 3
Novaciano/SEX_ROLEPLAY-3.2-1B-ao-int4wo-gs128
Text Generation
• Updated • 51
Novaciano/SEX_ROLEPLAY-3.2-1B-ao-int8wo-gs128
Text Generation
• Updated • 53
Novaciano/SEX_ROLEPLAY-3.2-1B-ao-int8da8w
Text Generation
• Updated • 167
• 1
pytorch/Phi-4-mini-instruct-INT8-INT4
Text Generation
• Updated • 5.17k
• 2
jerryzh168/phi4-mini-torchao-gguf-q4_k
Text Generation
• Updated • 4
pytorch/Phi-4-mini-instruct-INT4
Text Generation
• Updated • 50
pytorch/Phi-4-mini-instruct-FP8
Text Generation
• Updated • 11.3k
• 1
jerryzh168/phi4-mini-torchao-ar-gguf-q4_k
Text Generation
• Updated • 5