DeepInfra raises $107M Series B to scale the inference cloud — read the announcement
meta-llama/
$0.10
in
$0.30
out
/ 1M tokens
| Tier | Input | Output |
|---|---|---|
Priority (1.5×)Learn More | $0.15 | $0.45 |
per 1M tokens
The Llama 4 collection of models are natively multimodal AI models that enable text and multimodal experiences. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding. Llama 4 Scout, a 17 billion parameter model with 16 experts

© 2026 DeepInfra. All rights reserved.