We use essential cookies to make our site work. With your consent, we may also use non-essential cookies to improve user experience and analyze website traffic…
mistralai/Mistral-Small-3.2-24B-Instruct-2506 cover image

mistralai/Mistral-Small-3.2-24B-Instruct-2506

Mistral-Small-3.2-24B-Instruct is a drop-in upgrade over the 3.1 release, with markedly better instruction following, roughly half the infinite-generation errors, and a more robust function-calling interface—while otherwise matching or slightly improving on all previous text and vision benchmarks.

Mistral-Small-3.2-24B-Instruct is a drop-in upgrade over the 3.1 release, with markedly better instruction following, roughly half the infinite-generation errors, and a more robust function-calling interface—while otherwise matching or slightly improving on all previous text and vision benchmarks.

Public
$0.05/$0.10 in/out Mtoken
fp8
128,000
Function
mistralai/Mistral-Small-3.2-24B-Instruct-2506 cover image

Mistral-Small-3.2-24B-Instruct-2506

Ask me anything

0.00s

Mistral-Small-3.2-24B-Instruct-2506 is a minor update of Mistral-Small-3.1-24B-Instruct-2503.

Small-3.2 improves in the following categories:

  • Instruction following: Small-3.2 is better at following precise instructions
  • Repetition errors: Small-3.2 produces less infinite generations or repetitive answers
  • Function calling: Small-3.2's function calling template is more robust (see here and examples)

In all other categories Small-3.2 should match or slightly improve compared to Mistral-Small-3.1-24B-Instruct-2503.

Key Features

Benchmark Results

We compare Mistral-Small-3.2-24B to Mistral-Small-3.1-24B-Instruct-2503. For more comparison against other models of similar size, please check Mistral-Small-3.1's Benchmarks'

Text

Instruction Following / Chat / Tone

ModelWildbench v2Arena Hard v2IF (Internal; accuracy)
Small 3.1 24B Instruct55.6%19.56%82.75%
Small 3.2 24B Instruct65.33%43.1%84.78%

Infinite Generations

Small 3.2 reduces infinite generations by 2x on challenging, long and repetitive prompts.

ModelInfinite Generations (Internal; Lower is better)
Small 3.1 24B Instruct2.11%
Small 3.2 24B Instruct1.29%

STEM

ModelMMLUMMLU Pro (5-shot CoT)MATHGPQA Main (5-shot CoT)GPQA Diamond (5-shot CoT )MBPP Plus - Pass@5HumanEval Plus - Pass@5SimpleQA (TotalAcc)
Small 3.1 24B Instruct80.62%66.76%69.30%44.42%45.96%74.63%88.99%10.43%
Small 3.2 24B Instruct80.50%69.06%69.42%44.22%46.13%78.33%92.90%12.10%

Vision

ModelMMMUMathvistaChartQADocVQAAI2D
Small 3.1 24B Instruct64.00%68.91%86.24%94.08%93.72%
Small 3.2 24B Instruct62.50%67.09%87.4%94.86%92.91%