Migration guides
These guides cover the specific code changes needed to switch from another provider to the Mistral API. The Mistral Chat Completions API follows the same request structure as OpenAI, so for most applications migration requires changing three things: the client import, the initialization call, and the model name. Choose your source platform below to see the exact steps and a working code example.
Migrate from OpenAI
Update the client
Before (OpenAI):
from openai import OpenAI
client = OpenAI(api_key="sk-...")
response = client.chat.completions.create(
model="gpt-4o",
messages=[{"role": "user", "content": "Hello"}],
)
After (Mistral):
from mistralai import Mistral
client = Mistral(api_key="your_mistral_api_key")
response = client.chat.complete(
model="mistral-large-latest",
messages=[{"role": "user", "content": "Hello"}],
)
Key differences
|  | OpenAI | Mistral |
|---|---|---|
| Python client | openai.OpenAI | mistralai.Mistral |
| Chat method | client.chat.completions.create | client.chat.complete |
| Streaming method | client.chat.completions.create(stream=True) | client.chat.stream |
| Base URL | https://api.openai.com/v1 | https://api.mistral.ai/v1 |
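Because the request structure is shared, the message payload itself carries over unchanged; only the model name (and the client sending it) differs. A minimal sketch, using a hypothetical helper that is not part of either SDK:

```python
# The Chat Completions request shape is the same for the OpenAI and
# Mistral APIs: a model name plus a list of role/content messages.
# Migrating a payload therefore only means swapping the model value.
def build_chat_request(model: str, user_text: str) -> dict:
    return {
        "model": model,
        "messages": [{"role": "user", "content": user_text}],
    }

openai_request = build_chat_request("gpt-4o", "Hello")
mistral_request = build_chat_request("mistral-large-latest", "Hello")

# Identical except for the model field.
assert openai_request["messages"] == mistral_request["messages"]
```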
Model name mapping
| OpenAI model | Mistral equivalent |
|---|---|
| gpt-4o | mistral-large-latest |
| gpt-4o-mini | mistral-small-latest |
| text-embedding-3-small | mistral-embed |
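For scripted migrations, the table above can be encoded as a lookup. The helper name and the raise-on-unknown behavior here are illustrative choices, not part of either SDK:

```python
# Translate an OpenAI model name to its Mistral equivalent, per the
# mapping table above. Unknown names raise immediately so unmapped
# models surface during migration rather than at request time.
OPENAI_TO_MISTRAL = {
    "gpt-4o": "mistral-large-latest",
    "gpt-4o-mini": "mistral-small-latest",
    "text-embedding-3-small": "mistral-embed",
}

def map_model(openai_model: str) -> str:
    try:
        return OPENAI_TO_MISTRAL[openai_model]
    except KeyError:
        raise ValueError(f"no Mistral mapping for {openai_model!r}") from None
```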
Use the OpenAI-compatible base URL
If your application uses an OpenAI-compatible client (LangChain, LlamaIndex, or any other third-party library), point it at the Mistral API by changing only the base URL and model name. No library swap required.
from openai import OpenAI
client = OpenAI(
api_key="your_mistral_api_key",
base_url="https://api.mistral.ai/v1",
)
response = client.chat.completions.create(
model="mistral-large-latest",
messages=[{"role": "user", "content": "Hello"}],
)
Migrate from self-hosted Llama
Tokenizer
Mistral models use a different tokenizer than Llama. If you compute token counts manually or handle raw tokenization, update your tooling.
Install the official Mistral tokenizer:
pip install mistral-common
Use it to tokenize text:
from mistral_common.tokens.tokenizers.mistral import MistralTokenizer
tokenizer = MistralTokenizer.v3()
result = tokenizer.encode_chat_completion(
messages=[{"role": "user", "content": "Hello, world!"}]
)
print(result.tokens)
Mistral models on Hugging Face are also compatible with the transformers library. Use apply_chat_template to handle formatting automatically.
Prompt format
Mistral's instruct models use their own chat template, which differs from the Llama 2 prompt format (for example, Llama 2's <<SYS>> system blocks are not supported). Passing raw Llama 2-formatted strings to Mistral models produces degraded output. Update your prompt templates before testing.
Use apply_chat_template to format prompts correctly:
from transformers import AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-Instruct-v0.3")
messages = [{"role": "user", "content": "Hello"}]
formatted = tokenizer.apply_chat_template(messages, tokenize=False)
Model selection
| Self-hosted Llama model | Mistral equivalent |
|---|---|
| Llama 3 8B Instruct | Mistral 7B Instruct (open-weight) |
| Llama 3 70B Instruct | Mixtral 8x22B Instruct (open-weight) |
| Any self-hosted | API: mistral-large-latest (managed) |
All Mistral open-weight models are available on Hugging Face under the Mistral license.
Migrate to Python SDK v2
Version 2.0.0 of the Python SDK introduces a small number of breaking changes, limited to import paths and the Azure and Google Cloud client constructors. All other APIs (chat, streaming, embeddings, agents, function calling, batch) are unchanged.
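If a codebase must support both SDK generations during the transition, a version check can route between the V1 and V2 import paths. This sketch assumes the installed version is readable through the standard library's importlib.metadata:

```python
import importlib.metadata

def sdk_major(version: str) -> int:
    # Parse the major component from a version string like "2.0.0".
    return int(version.split(".")[0])

def installed_sdk_is_v2() -> bool:
    # Reads the installed mistralai version from package metadata;
    # raises PackageNotFoundError if the SDK is not installed.
    return sdk_major(importlib.metadata.version("mistralai")) >= 2
```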
Update the package
SDK V1 corresponds to mistralai<2 and V2 to mistralai>=2. Run:
pip install "mistralai>=2"
Update the import path
Before (V1):
from mistralai import Mistral
After (V2):
from mistralai.client import Mistral
Azure AI (if applicable)
Before (V1):
# pip install mistralai-azure>=1.0.0
from mistralai_azure import MistralAzure
client = MistralAzure(
azure_endpoint=os.environ["AZUREAI_ENDPOINT"],
azure_api_key=os.environ["AZUREAI_API_KEY"],
)
After (V2):
# pip install mistralai>=2.0.0
from mistralai.azure.client import MistralAzure
client = MistralAzure(
server_url=os.environ["AZUREAI_ENDPOINT"],
api_key=os.environ["AZUREAI_API_KEY"],
)
Google Cloud / Vertex AI (if applicable)
Before (V1):
# pip install mistralai[gcp]
from mistralai_gcp import MistralGoogleCloud
client = MistralGoogleCloud(
region=os.environ["GOOGLE_CLOUD_REGION"],
project_id=os.environ["GOOGLE_CLOUD_PROJECT_ID"],
)
After (V2):
# pip install mistralai>=2.0.0 (no separate package needed)
from mistralai.gcp.client import MistralGCP
# Auth is handled automatically via google.auth.default()
# Region defaults to "europe-west4"; override if needed:
# client = MistralGCP(region="us-central1", project_id="my-project")
client = MistralGCP()
Summary
| Area | V1 | V2 |
|---|---|---|
| Package | mistralai<2 | mistralai>=2 |
| Core import | from mistralai import Mistral | from mistralai.client import Mistral |
| Azure import | from mistralai_azure import MistralAzure | from mistralai.azure.client import MistralAzure |
| Azure constructor | azure_endpoint=, azure_api_key= | server_url=, api_key= |
| GCP import | from mistralai_gcp import MistralGoogleCloud | from mistralai.gcp.client import MistralGCP |
| GCP auth | GOOGLE_CLOUD_REGION + GOOGLE_CLOUD_PROJECT_ID env vars | automatic via google.auth.default() |
| All other APIs | — | unchanged |