Llama 4 Scout & Maverick models are now available. Try them!
Models
Docs
Pricing
Chat
DeepStart
Blog
Documentation
GPU Instances
Contents
GPU Instances provide on-demand access to high-performance GPU compute resources in the cloud:
Latest Models
openai/
whisper-tiny
bigcode/
starcoder2-15b
Gryphe/
MythoMax-L2-13b
Phind/
Phind-CodeLlama-34B-v2
openchat/
openchat_3.5
Featured Models
microsoft/
phi-4-reasoning-plus
Qwen/
Qwen3-14B
Phi-4-multimodal-instruct
meta-llama/
Llama-3.3-70B-Instruct
sesame/
csm-1b
deepseek-ai/
DeepSeek-V3
Company
Compare
About
Careers
Trust Center
Privacy
Terms
© 2025 Deep Infra. All rights reserved.