We use essential cookies to make our site work. With your consent, we may also use non-essential cookies to improve user experience and analyze website traffic…

DeepInfra raises $107M Series B to scale the inference cloud — read the announcement

MiniMaxAI logo

MiniMaxAI/

MiniMax-M3

$0.30

in

$1.20

out

$0.06

cached

/ 1M tokens

MiniMax-M3 is a native multimodal model with 1M context. It has ~428B parameters and ~23B activated parameters.

Deploy Private Endpoint
Public
524,288
JSON
Function
Multimodal
ProjectPaper
MiniMaxAI/MiniMax-M3 cover image
MiniMaxAI/MiniMax-M3 cover image
MiniMax-M3

Ask me anything

0.00s

You need to log in to use this model

Log In

Settings

Model Information

Highlights:

  • Native Multimodality: M3 undergoes mixed-modality training from the very first step, enabling deeper semantic fusion across text, image, and video.
  • Context Scaling via Sparse Attention: M3 introduces MiniMax Sparse Attention (MSA) to improve long context efficiency. M3 delivers 9× prefill and 15× decode speedups compared to M2 at 1M context, reducing per-token compute to 1/20.
  • Coding & Cowork Capability: M3 achieves frontier-level performance across long-horizon agentic benchmarks, excelling in both coding and cowork.