XiaomiMiMo/
Pricing: $0.40 in / $2.00 out / $0.08 cached, per 1M tokens
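As a rough illustration of how the per-1M-token rates above translate into a per-request cost, here is a minimal sketch. The `estimate_cost` helper is hypothetical and not part of any DeepInfra SDK. It assumes that cached tokens are a subset of the input tokens and are billed at the cached rate instead of the input rate; actual billing rules may differ.

```python
# Rates copied from the listing above, in USD per 1M tokens.
PRICE_PER_MILLION = {"in": 0.40, "out": 2.00, "cached": 0.08}


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached_tokens counts the part of input_tokens served from
    cache, billed at the cached rate rather than the regular input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * PRICE_PER_MILLION["in"]
        + cached_tokens * PRICE_PER_MILLION["cached"]
        + output_tokens * PRICE_PER_MILLION["out"]
    )
    # Rates are quoted per 1M tokens, so scale down at the end.
    return total / 1_000_000


# Example: 200k input tokens (50k of them cached) and 10k output tokens.
print(f"${estimate_cost(200_000, 10_000, cached_tokens=50_000):.4f}")
```

Under these assumptions, output tokens dominate the cost for generation-heavy workloads, since the output rate is five times the input rate.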
MiMo-V2.5 is a native omnimodal model with strong agentic capabilities, supporting text, image, video, and audio understanding within a unified architecture. Built upon the MiMo-V2-Flash backbone and extended with dedicated vision and audio encoders, it delivers robust performance across multimodal perception, long-context reasoning, and agentic workflows.

2026-04-29T23:38:10+00:00
© 2026 DeepInfra. All rights reserved.