Parakeet TDT 0.6B V2 (MLX, BF16)

NVIDIA Parakeet-TDT 0.6B V2 converted to MLX SafeTensors format for Apple Silicon inference. This is the reference BF16 checkpoint โ€” see quantized variants for reduced memory:

Performance (M3 Max, 64GB)

Metric Value
WER (LibriSpeech test-clean) 1.67%
RTFx 73x realtime
Peak memory ~3GB
Parameters 627M
Format BF16 SafeTensors

Usage

from parakeet import from_pretrained

model = from_pretrained("sonic-speech/parakeet-tdt-0.6b-v2")
result = model.transcribe("audio.wav")
print(result.text)

Origin

Weights converted from nvidia/parakeet-tdt-0.6b-v2 via the mlx-community conversion pipeline. Hosted by Sonic Speech for the Sonic voice AI project.

License

CC-BY-4.0 (following NVIDIA's original license)

Downloads last month
311
Safetensors
Model size
0.6B params
Tensor type
F32
ยท
MLX
Hardware compatibility
Log In to add your hardware

Quantized

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for sonic-speech/parakeet-tdt-0.6b-v2

Finetuned
(32)
this model
Finetunes
2 models