March 23, 2026
v26.03

Voxtral TTS

Our state-of-the-art text-to-speech model with zero-shot voice cloning. It supports 9 languages, streams audio with ~90 ms time-to-first-audio, and requires no transcript for voice prompts.

Speed
Performance
Modalities
Price
$0 /M Chars
$16 /M Chars
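
To make the per-character pricing concrete, here is a minimal sketch of a cost estimate. It assumes that the $16 per million characters rate applies to the text being synthesized and that characters are counted the way Python's `len` counts them (Unicode code points); the card confirms neither, and the function name `estimate_cost_usd` is purely illustrative.

```python
from decimal import Decimal

# Assumption: the $16 /M Chars figure from the card is the rate charged
# for the characters of text sent for synthesis.
PRICE_PER_MILLION_CHARS = Decimal("16")


def estimate_cost_usd(text: str,
                      price_per_million: Decimal = PRICE_PER_MILLION_CHARS) -> Decimal:
    """Estimate the USD cost of synthesizing `text`.

    Characters are counted with len(), i.e. as Unicode code points.
    Actual billing may count characters differently.
    """
    return Decimal(len(text)) * price_per_million / Decimal(1_000_000)


if __name__ == "__main__":
    # A hypothetical 250,000-character script.
    script = "a" * 250_000
    print(f"Estimated cost: ${estimate_cost_usd(script)}")
```

Using `Decimal` rather than floats keeps the arithmetic exact for currency amounts; pass a different `price_per_million` if another rate applies to your usage.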


Features

Chat Completions
Function Calling
Agents & Conversations
Built-In Tools
Structured Outputs
Predicted Outputs
Prefix
OCR
Annotations - Structured
BBox Extraction
Document QnA
FIM
Embeddings
Moderations
Chat Moderations
Transcriptions
Text to Speech
Timestamps
Batching
Other Models

Mistral Small 4 (v26.03)
Leanstral (v26.03)
Mistral Large 3 (v25.12)
