anthropic/
$1.00 in / $5.00 out per 1M tokens
The next generation of Anthropic's fastest and most cost-effective model, optimal for use cases where speed and affordability matter.
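
As a rough illustration of how the per-token prices listed above translate into a per-request cost, here is a minimal sketch. It assumes billing is strictly linear in token count with no minimums, caching discounts, or other adjustments, which the listing does not confirm. The helper name `estimate_cost` and the constant names are hypothetical and not part of any DeepInfra or Anthropic API.

```python
from decimal import Decimal

# Prices taken from the listing, in USD per 1M tokens.
INPUT_PRICE_PER_MTOK = Decimal("1.00")
OUTPUT_PRICE_PER_MTOK = Decimal("5.00")
TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes purely linear per-token billing, which is an assumption
    and not something the listing states.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_MTOK
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_MTOK
    )
    return total / TOKENS_PER_MTOK


if __name__ == "__main__":
    # Example: a 2,000-token prompt with a 500-token reply.
    print(estimate_cost(2_000, 500))
```

Under these assumptions, a request with 2,000 input tokens and 500 output tokens comes to (2,000 × $1.00 + 500 × $5.00) / 1,000,000 = $0.0045. `Decimal` is used rather than floats so that small per-request amounts add up without rounding drift.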

Claude Haiku 4.5 delivers near-frontier performance for a wide range of use cases and stands out as one of the best coding and agent models, with the right speed and cost to power free products and high-volume user experiences.
What was recently at the frontier is now cheaper and faster. Five months ago, Claude Sonnet 4 was a state-of-the-art model. Today, Claude Haiku 4.5 gives you similar levels of coding performance but at one-third the cost and more than twice the speed.
Claude Haiku 4.5 even surpasses Claude Sonnet 4 at certain tasks, like using computers.
Users who rely on AI for real-time, low-latency tasks like chat assistants, customer service agents, or pair programming will appreciate Haiku 4.5's combination of high intelligence and remarkable speed. Users of Claude Code will find that Haiku 4.5 makes the coding experience, from multiple-agent projects to rapid prototyping, markedly more responsive.