danielhanchen posted an update 9 days ago

Fits almost perfectly into an A6000!

·

Hopefully it runs fast for you! :)

I run it on a Threadripper 3970X with 256 GB of system RAM, offloading compute layers to a GTX 1660 with 6 GB of VRAM. I use llama.cpp with -nkvo and -kvu and keep all the MoE layers on the CPU. The generation speed is amazing: 14 t/s with Q8_0. I'm amazed.
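For anyone who wants to reproduce a setup along these lines, here is a hedged sketch of a llama.cpp invocation (assuming a recent build; the model filename and prompt are placeholders, and the `-ot` regex is the common pattern for pinning MoE expert tensors to the CPU, not something stated in the comment above):

```shell
# Sketch, not a verified command. Flags used:
#   -ngl 99                   offload the non-expert layers to the GPU
#   -ot ".ffn_.*_exps.=CPU"   keep the MoE expert tensors in system RAM
#   -nkvo                     do not offload the KV cache to VRAM
#   -kvu                      use a unified KV cache buffer
./llama-cli -m ./model-Q8_0.gguf -ngl 99 -ot ".ffn_.*_exps.=CPU" -nkvo -kvu -p "Hello"
```

With only 6 GB of VRAM, the combination of `-nkvo` and keeping experts on the CPU is what makes a large MoE model fit at all; the GPU then accelerates just the dense layers.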

·

Awesome to hear, thanks for trying them out!

Awesome!