
Image Generation

Image Generation is a built-in connector tool that enables agents to generate images. Once this tool is enabled, the model can create images on demand at any point.

Create an Image Generation Agent

You can create an agent with access to image generation by including it in the agent's list of tools. Note that you can still add other tools to the agent; the model decides on its own when to create images.

import os
from mistralai import Mistral

client = Mistral(api_key=os.environ["MISTRAL_API_KEY"])

image_agent = client.beta.agents.create(
    model="mistral-medium-2505",
    name="Image Generation Agent",
    description="Agent used to generate images.",
    instructions="Use the image generation tool when you have to create images.",
    tools=[{"type": "image_generation"}],
    completion_args={
        "temperature": 0.3,
        "top_p": 0.95,
    }
)
Output
model='mistral-medium-2505' name='Image Generation Agent' description='Agent used to generate images.' id='ag_068359b1d997713480003c77113b8119' version=0 created_at=datetime.datetime(2025, 5, 27, 10, 59, 41, 602844, tzinfo=TzInfo(UTC)) updated_at=datetime.datetime(2025, 5, 27, 10, 59, 41, 602846, tzinfo=TzInfo(UTC)) instructions='Use the image generation tool when you have to create images.' tools=[ImageGenerationTool(type='image_generation')] completion_args=CompletionArgs(stop=None, presence_penalty=None, frequency_penalty=None, temperature=0.3, top_p=0.95, max_tokens=None, random_seed=None, prediction=None, response_format=None, tool_choice='auto') handoffs=None object='agent'

As with other agents, creating one returns an agent ID. You can use this ID to start a conversation.

How It Works

Now that we have our image generation agent ready, we can create images on demand at any point.

Conversations with Image Generation

response = client.beta.conversations.start(
    agent_id=image_agent.id, inputs="Generate an orange cat in an office."
)
Output
conversation_id='conv_068359b1dc6f74658000000a358b2357' outputs=[ToolExecutionEntry(name='image_generation', object='entry', type='tool.execution', created_at=datetime.datetime(2025, 5, 27, 10, 59, 53, 92347, tzinfo=TzInfo(UTC)), completed_at=datetime.datetime(2025, 5, 27, 10, 59, 56, 436333, tzinfo=TzInfo(UTC)), id='tool_exec_068359b2917a7117800018b42bf8dc39', info={}), MessageOutputEntry(content=[TextChunk(text='Here is your image: an orange cat in an office.\n\n', type='text'), ToolFileChunk(tool='image_generation', file_id='933c5b5a-1c47-4cdd-84f6-f32526bd161b', type='tool_file', file_name='image_generated_0', file_type='png')], object='entry', type='message.output', created_at=datetime.datetime(2025, 5, 27, 10, 59, 57, 718377, tzinfo=TzInfo(UTC)), completed_at=datetime.datetime(2025, 5, 27, 10, 59, 58, 818205, tzinfo=TzInfo(UTC)), id='msg_068359b2db7e74eb8000d11444e03eb8', agent_id='ag_068359b1d997713480003c77113b8119', model='mistral-medium-2505', role='assistant')] usage=ConversationUsageInfo(prompt_tokens=129, completion_tokens=94, total_tokens=292, connector_tokens=69, connectors={'image_generation': 1}) object='conversation.response'

For explanation purposes, let's restructure the previous output in a more readable JSON format.
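
If you want to produce a similar dump yourself, a minimal sketch is shown below. It assumes the response object returned by the SDK is a Pydantic model, so `model_dump_json` is available; the exact fields printed may be more verbose than the trimmed version shown here.

# Pretty-print the conversation response (assumes the response object
# is a Pydantic model exposing model_dump_json).
print(response.model_dump_json(indent=2))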

{
  "conversation_id": "conv_068359b1dc6f74658000000a358b2357",
  "outputs": [
    {
      "name": "image_generation",
      "object": "entry",
      "type": "tool.execution",
      "created_at": "2025-05-27T10:59:53.092347Z",
      "completed_at": "2025-05-27T10:59:56.436333Z",
      "id": "tool_exec_068359b2917a7117800018b42bf8dc39"
    },
    {
      "content": [
        {
          "text": "Here is your image: an orange cat in an office.\n\n",
          "type": "text"
        },
        {
          "tool": "image_generation",
          "file_id": "933c5b5a-1c47-4cdd-84f6-f32526bd161b",
          "type": "tool_file",
          "file_name": "image_generated_0",
          "file_type": "png"
        }
      ],
      "object": "entry",
      "type": "message.output",
      "created_at": "2025-05-27T10:59:57.718377Z",
      "completed_at": "2025-05-27T10:59:58.818205Z",
      "id": "msg_068359b2db7e74eb8000d11444e03eb8",
      "agent_id": "ag_068359b1d997713480003c77113b8119",
      "model": "mistral-medium-2505",
      "role": "assistant"
    }
  ],
  "usage": {
    "prompt_tokens": 129,
    "completion_tokens": 94,
    "total_tokens": 292,
    "connector_tokens": 69,
    "connectors": {
      "image_generation": 1
    }
  },
  "object": "conversation.response"
}

Explanation of the Outputs

There are two main entries in the outputs of our request (a short sketch showing how to read them programmatically follows this list):

  • tool.execution: This entry corresponds to the execution of the image generation tool. It includes metadata about the execution, such as:

    • name: The name of the tool, which in this case is image_generation.
    • object: The type of object, which is entry.
    • type: The type of entry, which is tool.execution.
    • created_at and completed_at: Timestamps indicating when the tool execution started and finished.
    • id: A unique identifier for the tool execution.
  • message.output: This entry corresponds to the generated answer from our agent. It includes metadata about the message, such as:

    • content: The actual content of the message, which is a list of chunks. Chunks can be of different types, and the model can interleave them, mixing text chunks with other kinds to complete the message. In this case, we got two chunks: a text chunk and a tool_file chunk, which represents the generated file, here the generated image. The tool_file chunk includes:
      • tool: The name of the tool used for generating the file, which in this case is image_generation.
      • file_id: A unique identifier for the generated file.
      • type: The type of chunk, which in this case is tool_file.
      • file_name: The name of the generated file.
      • file_type: The type of the generated file, which in this case is png.
    • object: The type of object, which is entry.
    • type: The type of entry, which is message.output.
    • created_at and completed_at: Timestamps indicating when the message was created and completed.
    • id: A unique identifier for the message.
    • agent_id: A unique identifier for the agent that generated the message.
    • model: The model used to generate the message, which in this case is mistral-medium-2505.
    • role: The role of the message, which is assistant.
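
To make this concrete, here is a minimal sketch of how you might walk through these entries in Python. It relies on the entry and chunk attributes shown in the raw output above, and assumes TextChunk is importable from mistralai.models alongside ToolFileChunk.

from mistralai.models import TextChunk, ToolFileChunk  # TextChunk location assumed

for entry in response.outputs:
    if entry.type == "tool.execution":
        # Metadata about the image generation tool call
        print(f"Tool '{entry.name}' ran from {entry.created_at} to {entry.completed_at}")
    elif entry.type == "message.output":
        # The assistant message: a list of interleaved chunks
        for chunk in entry.content:
            if isinstance(chunk, TextChunk):
                print("Text:", chunk.text)
            elif isinstance(chunk, ToolFileChunk):
                print(f"Generated file {chunk.file_name}.{chunk.file_type} (id: {chunk.file_id})")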

Download Images

To access the image, you can download it via our files endpoint.

# file_chunk is the ToolFileChunk from the message output,
# e.g. the second chunk of the last entry: response.outputs[-1].content[1]

# Download using the ToolFileChunk ID
file_bytes = client.files.download(file_id=file_chunk.file_id).read()

# Save the file locally
with open("image_generated.png", "wb") as file:
    file.write(file_bytes)

Generated Image:

A full code snippet to download all generated images from a response could look like so:

from mistralai.models import ToolFileChunk

for i, chunk in enumerate(response.outputs[-1].content):
    # Check if the chunk corresponds to a ToolFileChunk
    if isinstance(chunk, ToolFileChunk):
        # Download using the ToolFileChunk ID
        file_bytes = client.files.download(file_id=chunk.file_id).read()

        # Save the file locally
        with open(f"image_generated_{i}.png", "wb") as file:
            file.write(file_bytes)
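
Since the conversation keeps its state, you can ask the same agent for more images later on. Below is a minimal sketch; it assumes the conversations API exposes an append method that takes the existing conversation_id and new inputs.

# Ask for another image in the same conversation.
# Assumes client.beta.conversations.append(conversation_id=..., inputs=...) is available.
follow_up = client.beta.conversations.append(
    conversation_id=response.conversation_id,
    inputs="Now generate the same cat, but wearing a tiny tie.",
)

# Download any newly generated images, reusing the loop above
for i, chunk in enumerate(follow_up.outputs[-1].content):
    if isinstance(chunk, ToolFileChunk):
        file_bytes = client.files.download(file_id=chunk.file_id).read()
        with open(f"follow_up_image_{i}.png", "wb") as file:
            file.write(file_bytes)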