March 23, 2026
v26.03
Voxtral TTS
Our state-of-the-art text-to-speech model with zero-shot voice cloning. It supports 9 languages and streaming with ~100 ms time-to-first-audio, and voice prompts require no transcript.
Price: $0 /M chars, $16 /M chars