-
-
-
-
-
-
Inference Providers
Active filters: hf_jobs
Paradoxis/Qwen2.5-VL-3B-Instruct-GRPO-VanillaTransformer
Updated
quablab/SmolLM3-Custom-SFT
Text Generation
• 3B • Updated
• 1
mradermacher/CORE2-llama-3.2-3b-MATH-GGUF
3B • Updated
• 1
leom21/Qwen3-0.6B-SFT-20250923145207
Text Generation
• 0.6B • Updated
leom21/Qwen3-0.6B-SFT-20250924080059
Text Generation
• 0.6B • Updated
• 1
Text Generation
• 3B • Updated
• 56
Text Generation
• 3B • Updated
malanevans/SmolLM3-3B-Jobs-SFT
Text Generation
• 3B • Updated
• 1
communistrigger/b63b960f-79e4-47f6-895e-791fb0c8f1ca
Text Generation
• Updated
• 1
Arjun4707/smollm3-lora-sft
Updated
msquaredd/smollm3-dpo-aligned-202509291110
Text Generation
• 3B • Updated
• 1
burtenshaw/Llama-3.2-1B-Tulu3-LoRA
Updated
burtenshaw/Qwen2-0.5B-SFT-LoRA
Updated
burtenshaw/Qwen2.5-1.5B-Math-GRPO-LoRA
Updated
burtenshaw/grpo-Qwen2.5-VL-3B-Instruct-LoRA
Updated
burtenshaw/Qwen2.5-3B-OpenThoughts-LoRA
Updated
pmakiela/SmolLM3-3B-SFT-v1_1
Text Generation
• 3B • Updated
• 1
pmakiela/SmolLM3-3B-SFT-v1_2-peft
Updated
pmakiela/SmolLM3-3B-SFT-v1_3-peft
Updated
kshitijthakkar/loggenix-moe-0.3B-A0.1B-e3-lr7e5-b16-4090-v7-sft-v3-dpo-v0
Text Generation
• 0.3B • Updated
• 1
burtenshaw/lora-no-regret-grpo
Text Generation
• 0.5B • Updated
• 1
pmakiela/SmolLM3-3B-SFT-v1_4-peft
Updated
geradeluxer/smollm3-lora-sft
Updated
Text Generation
• 3B • Updated
• 2
bandiang2/smollm3-lora-sft
Updated
patrickfleith/smollm3-sft
Text Generation
• 3B • Updated
• 2
h-d-h/smollm3-dpo-aligned
Text Generation
• 3B • Updated
• 1
jweston/smollm3-dpo-aligned
Text Generation
• 3B • Updated
• 1
pmakiela/SmolLM3-3B-dpo-v0_1
Text Generation
• 3B • Updated
• 1