DeepInfra raises $107M Series B to scale the inference cloud — read the announcement

DeepInfra is serving NVIDIA Cosmos 3, NVIDIA's open world foundation model for physical AI, from day zero of its release. As the first omnimodel for physical AI that reasons before it generates, Cosmos 3 is live on DeepInfra today as two variants—Cosmos 3 Nano and Cosmos 3 Super—at the industry's best prices, empowering developers to build physical AI systems without compromising on budget or performance.
Most generative models just generate. Cosmos 3 does something different: it reasons first, then generates. That distinction matters a great deal if you're building physical AI systems like robots or autonomous vehicles, where generating plausible-but-wrong outputs isn't just a quality issue—it's a safety one. As NVIDIA describes it, Cosmos 3 is the first OmniModel that unifies reasoning, world, and action generation in a single architecture.
Under the hood it uses a Mixture-of-Transformer architecture that combines an autoregressive reasoner with a diffusion-based generator. Inputs and outputs span text, image, video, audio, and action, making Cosmos 3 genuinely multimodal in both directions—not just for perception, but for generation and decision-making as well.
Ranked #1 open world generation model for synthetic data generation. Use it to generate training data for physical AI at scale, without expensive real-world data collection.
Ranked #1 backbone for world action models. A strong foundation for robotics, embodied AI, and AV policy training.
Ranked #1 open model for visual understanding on fixed infrastructure cameras—useful for smart city, warehouse, logistics deployments, infrastructure monitoring, and industrial automation.
Designed for closed-loop learning and simulation workflows. Pairs with NVIDIA AV Sim and Isaac Sim for training, testing, and evaluating physical AI systems in simulated environments before deployment.
The lighter variant. A good starting point for experimentation, fine-tuning, and latency-sensitive workloads.
The full-capability variant. Tops the PAI Bench and R-Bench leaderboards. Use it where quality and reasoning performance are the priority.
Both are available on DeepInfra today via our standard API—the same setup as any other model, with no special configuration needed to get started.
Cosmos 3 Nano and Cosmos 3 Super are live on DeepInfra now. If you're building physical AI, robots, or AV systems and want to experiment with world modeling, reasoning, action generation, and synthetic data creation, this is a strong place to start.
Visit our models page to explore competitive rates for Cosmos 3 inference, or check out the DeepInfra docs to learn more about our complete model ecosystem and developer resources.
Build a Streaming Chat Backend in 10 Minutes<p>When large language models move from demos into real systems, expectations change. The goal is no longer to produce clever text, but to deliver predictable latency, responsive behavior, and reliable infrastructure characteristics. In chat-based systems, especially, how fast a response starts often matters more than how fast it finishes. This is where token streaming becomes […]</p>
OpenClaw Use Cases That Deliver Real ROI<p>An OpenClaw agent that reads your email, opens pull requests, and watches a server is only useful if running it doesn’t feel like leaving the meter running. That’s the quiet constraint behind every OpenClaw use cases discussion. Most of the workflows people show off (morning briefings, multi-agent research, ambient monitoring) only make sense if each […]</p>
OpenClaw Security: Prevent Prompt Injection & Supply Chain Attacks<p>In early 2026, the China’s Ministry of Industry and Information Technology issued an emergency warning about an AI agent runtime that had quietly grown to 135,000 GitHub stars. By mid-February, security researchers were tracking a coordinated campaign called ClawHavoc. The Moltbook breach had exposed customer email archives from 41 enterprises. OpenClaw’s maintainers had shipped three […]</p>
© 2026 DeepInfra. All rights reserved.