DeepInfra raises $107M Series B to scale the inference cloud — read the announcement
deepseek-ai/
$0.14
in
$0.28
out
$0.028
cached
/ 1M tokens
DeepSeek V4 Flash is an efficiency-focused MoE model with 284B total parameters (13B active) and a 1M-token context window. It's tuned for fast inference and high-throughput use cases while still holding up on reasoning and coding tasks.

© 2026 DeepInfra. All rights reserved.