Parakeet TDT 0.6B V2 (MLX, BF16)

NVIDIA Parakeet-TDT 0.6B V2 converted to MLX SafeTensors format for Apple Silicon inference. This is the reference BF16 checkpoint โ€” see quantized variants for reduced memory:

Performance (M3 Max, 64GB)

Metric Value
WER (LibriSpeech test-clean) 1.67%
RTFx 73x realtime
Peak memory ~3GB
Parameters 627M
Format BF16 SafeTensors

Usage

from parakeet import from_pretrained

model = from_pretrained("sonic-speech/parakeet-tdt-0.6b-v2")
result = model.transcribe("audio.wav")
print(result.text)

Origin

Weights converted from nvidia/parakeet-tdt-0.6b-v2 via the mlx-community conversion pipeline. Hosted by Sonic Speech for the Sonic voice AI project.

License

CC-BY-4.0 (following NVIDIA's original license)

Downloads last month
47
MLX
Hardware compatibility
Log In to add your hardware

Quantized

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for sonic-speech/parakeet-tdt-0.6b-v2

Finetuned
(28)
this model
Finetunes
2 models