Known limitations

This page documents current limitations of the Mistral platform. We actively work to address these. Check the changelogs for updates.

Context window

For the full and up-to-date list, see the model cards.

Requests exceeding the model's context window return a 400 Bad Request error.
Token counts include both input and output tokens. Plan your max_tokens accordingly.

Rate limits

Rate limits vary by subscription tier and model. When exceeded, the API returns 429 Too Many Requests.

Tip

Check the X-RateLimit-Remaining response header to monitor your usage before hitting the limit.

File uploads

Batch processing

Maximum batch file size: 512 MB.
Maximum requests per batch: 100,000.
Batch jobs are processed asynchronously; completion time depends on queue depth and request complexity.
Batch results are available for download for 24 hours after completion.

Streaming

Streaming connections time out after 10 minutes of inactivity.
stream_options.include_usage must be explicitly set to receive token usage in stream events.
Some client HTTP libraries may buffer streamed responses; ensure chunked transfer encoding is handled correctly.

Function calling

Maximum number of tools per request: 128.
Tool descriptions are included in the token count. Long descriptions reduce available context for messages.
Parallel function calls are supported but may return calls in any order.
tool_choice: "any" forces a tool call but does not guarantee which tool is selected.

JSON mode

When response_format: {"type": "json_object"} is set, the model always returns valid JSON.
You must include "JSON" in the system or user prompt. Otherwise the model may produce an infinite whitespace stream.
JSON mode does not guarantee adherence to a specific schema. Use function calling for structured outputs.

Vision

Audio transcription

Supported formats: WAV, MP3, FLAC, OGG, WEBM.
Maximum audio duration: 60 minutes.
Maximum file size: 500 MB.
Transcription is optimized for clear speech; heavy background noise reduces accuracy.

Regional availability

The Mistral API is served from EU data centers by default.
Some models may not be available in all regions. Check the models page for details.