We use essential cookies to make our site work. With your consent, we may also use non-essential cookies to improve user experience and analyze website traffic…

DeepInfra raises $107M Series B to scale the inference cloud — read the announcement

XiaomiMiMo/

MiMo-V2.5

$0.40

in

$2.00

out

$0.08

cached

/ 1M tokens

MiMo-V2.5 is a native omnimodal model with strong agentic capabilities, supporting text, image, video, and audio understanding within a unified architecture. Built upon the MiMo-V2-Flash backbone and extended with dedicated vision and audio encoders, it delivers robust performance across multimodal perception, long-context reasoning, and agentic workflows.

Deploy Private Endpoint
Public
262,144
JSON
Function
XiaomiMiMo/MiMo-V2.5 cover image