Models
ByteDance/
$0.10 in, $0.40 out, $0.02 cached / 1M tokens
Built for low-latency, high-concurrency, cost-sensitive use cases, with flexible deployment, four-tier thinking, and multimodal
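The per-1M-token rates listed above can be turned into a rough per-request cost estimate. The sketch below is illustrative only: `estimate_cost` is a hypothetical helper, not part of any DeepInfra SDK, and it assumes that cached tokens are a subset of the input tokens and are billed at the cached rate instead of the input rate. The listing does not state that billing rule, so check it against the provider's pricing documentation.

```python
from decimal import Decimal

# Rates in USD per 1M tokens, copied from the listing above.
RATE_INPUT = Decimal("0.10")
RATE_OUTPUT = Decimal("0.40")
RATE_CACHED = Decimal("0.02")
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached_tokens counts input tokens served from cache,
    billed at the cached rate rather than the regular input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * RATE_INPUT
        + cached_tokens * RATE_CACHED
        + output_tokens * RATE_OUTPUT
    )
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # 2,000 prompt tokens (1,000 of them cached) and 500 completion tokens.
    print(estimate_cost(2_000, 500, cached_tokens=1_000))
```

`Decimal` is used instead of `float` so that small per-request amounts sum without binary rounding drift when aggregated over many requests.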
© 2026 DeepInfra. All rights reserved.