Mistral technology

AI models

We have shipped the most capable open models to accelerate AI innovation, and drives the field forward with frontier models.

Developer platform

Our portable developer platform serves our open and optimized models for building fast and intelligent applications. We offer flexible access options!

AI models La Plateforme

Open source models

We’re committed to empower the AI community with open technology. Our open models sets the bar for efficiency, and are available for free under Apache 2.0, a fully permissive license, that allows to use the models anywhere without any restriction.

Mistral 7B

Our very first. A 7B transformer model, fast-deployed and easily customisable. Small, yet very powerful for a variety of use cases.

Performant in English and code
32k context window

Mixtral 8x7B

A 7B sparse Mixture-of-Experts (SMoE). Uses 12.9B active parameters out of 45B total.

Fluent in English, French, Italian, German, Spanish, and strong in code
32k context window

Mixtral 8x22B

Mixtral 8x22B is currently the most performant open model. A 22B sparse Mixture-of-Experts (SMoE). Uses only 39B active parameters out of 141B.

Fluent in English, French, Italian, German, Spanish, and strong in code
64k context window
Native function calling capacities
Function calling and json mode available on our API endpoint

Try on la Plateforme Download

Optimized models

Our optimized commercial models are designed for performance and are available via our flexible deployment options.

Mistral Small

Cost-efficient reasoning for low-latency workloads.

Fluent in English, French, Italian, German, Spanish, and strong in code
Context window of 32k tokens, with excellent recall for retrieval augmentation
Native function calling capacities, JSON outputs
Concise, useful, unopinionated, with fully modular

Mistral Large

Top-tier reasoning for high-complexity tasks. The most powerful model of the Mistral AI family.

Fluent in English, French, Italian, German, Spanish, and strong in code
Context window of 32k tokens, with excellent recall for retrieval augmentation
Native function calling capacities, JSON outputs
Concise, useful, unopinionated, with fully modular

Mistral Embed

State-of-the-art semantic for extracting representation of text extracts.

English only for now
Achieves a retrieval score of 55.26 on the Massive Text Embedding Benchmark (MTEB)

Try on la Plateforme

We have the best Open source models - all under Apache 2.0

We’re constantly innovating to provide the most capable and efficient models.

Cost vs Performance

We build models that are highly efficient

We build models that offer unparalleled cost efficiency for their respective sizes, delivering the best performance-to-cost ratio available on the market. Mixtral 8x22B is the most powerful open source model with significant fewer parameters than its competition:

Much faster and performant than any 70B model
Outperforms Command R+ by being more than 2.5 times smaller
Mixtral 8x7B outperforms Llama 2 70B on most benchmarks with 6x faster inference

Under Apache 2.0

Our open models are truly open source, licensed under Apache 2.0, a fully permissive license that allows for unrestricted use in any context.

La Plateforme

Access our latest products via our developer platform, hosted in Europe

from mistralai.client import MistralClient
from mistralai.models.chat_completion import ChatMessage

api_key = os.environ["MISTRAL_API_KEY"]
model = "mistral-tiny"

client = MistralClient(api_key=api_key)

messages = [
    ChatMessage(role="user",
    content="Who is the most renowned French painter?")
]

Built for developers

La Plateforme is developers’ preferred way to access all Mistral Al’s models. Hosted and served on Mistral Al infrastructure, in Europe.

Our best models at the best price : Get access to our models at an unmatched price/performance point
Guides & community : Use our guides and community forums to build your own application and services
Secure by design : Your data are encrypted at rest (AES256) and in transit (TLS 1.2+); our servers are in the EU

Access la Plateforme Documentation

from mistralai.client import MistralClient
from mistralai.models.chat_completion import ChatMessage

api_key = os.environ["MISTRAL_API_KEY"]
model = "mistral-tiny"

client = MistralClient(api_key=api_key)

messages = [
    ChatMessage(role="user",
    content="Who is the most renowned French painter?")
]

Deploy anywhere

La Plateforme

Get started with Mistral models in a few clicks via our developer platform hosted on Mistral’s infrastructure and build your own applications and services. Our servers are hosted in EU.

Access la Plateforme

Cloud platforms

Access our models via your preferred cloud provider and use your cloud credits. Our open models are currently available via our cloud partners (GCP, AWS, Azure, Snowflake, NVIDIA).
Mistral Large is available on Azure AI Studio, AWS Bedrock and Snowflake.

Mistral on Azure AI

Self-deployment

Deploy Mistral models on virtual cloud or on-prem. Self-deployment offers more advanced levels of customisation and control. Your data stays within your walls. Try deploying our open models, and contact our team to deploy our optimized models similarly.

Contact our team

Pay-as-you-go pricing

Price in $ Price in €

Price in $

Price in €

Open source models

		Input	Output
open-mistral-7b	A 7B transformer model, fast-deployed and easily customisable.	$0.25/1M tokens	$0.25/1M tokens
open-mixtral-8x7b	A 7B sparse Mixture-of-Experts (SMoE). Uses 12.9B active parameters out of 45B total.	$0.7/1M tokens	$0.7/1M tokens
open-mixtral-8x22b	Mixtral 8x22B is currently the most performant open model. A 22B sparse Mixture-of-Experts (SMoE). Uses only 39B active parameters out of 141B.	$2/1M tokens	$6/1M tokens

		Input	Output
open-mistral-7b	A 7B transformer model, fast-deployed and easily customisable.	0.2€/1M tokens	0.2€/1M tokens
open-mixtral-8x7b	A 7B sparse Mixture-of-Experts (SMoE). Uses 12.9B active parameters out of 45B total.	0.65€/1M tokens	0.65€/1M tokens
open-mixtral-8x22b	Mixtral 8x22B is currently the most performant open model. A 22B sparse Mixture-of-Experts (SMoE). Uses only 39B active parameters out of 141B.	1.9€/1M tokens	5.6€/1M tokens

Optimized models

		Input	Output
mistral-small	Cost-efficient reasoning for low-latency workloads.	$2/1M tokens	$6/1M tokens
mistral-medium	Will soon be deprecated	$2.7/1M tokens	$8.1/1M tokens
mistral-large	Top-tier reasoning for high-complexity tasks. The most powerful model of the Mistral AI family.	$8/1M tokens	$24/1M tokens

		Input	Output
mistral-small	Cost-efficient reasoning for low-latency workloads.	1.85€/1M tokens	5.55€/1M tokens
mistral-medium	Will soon be deprecated	2.5€/1M tokens	7.5€/1M tokens
mistral-large	Top-tier reasoning for high-complexity tasks. The most powerful model of the Mistral AI family.	7.3€/1M tokens	22€/1M tokens

Embeddings

		Input	Output
mistral-embed	State-of-the-art semantic for extracting representation of text extracts.	$0.1/1M tokens	$0.1/1M tokens

		Input	Output
mistral-embed	State-of-the-art semantic for extracting representation of text extracts.	0.1€/1M tokens	0.1€/1M tokens