🚀 New models by Bria.ai, generate and edit images at scale 🚀
Developed by Anthropic, Claude is a family of highly performant, trustworthy AI models built for complex reasoning, advanced coding, and nuanced language understanding. The latest Claude 4 generation delivers breakthrough capabilities in analytical thinking, with Claude 4 Opus setting new standards for intelligence and Claude 4 Sonnet providing exceptional performance with remarkable efficiency.
Claude models excel at understanding context, following complex instructions, and maintaining coherent conversations across extended interactions. With advanced features like extended thinking for deeper reasoning, prompt caching that reduces costs by up to 90%, vision capabilities for image analysis, and robust safety measures, Claude is designed for enterprise applications that demand both sophistication and reliability.
Available with comprehensive API features including streaming responses, batch processing for 50% cost savings, multilingual support across dozens of languages, and flexible context windows up to 200K tokens (1M in beta), Claude is perfect for building intelligent applications like customer support agents, content analysis systems, coding assistants, and complex reasoning workflows that require both accuracy and trustworthiness.
Claude 4 Sonnet delivers the perfect balance of intelligence and efficiency—exceptional reasoning capabilities with fast response times and competitive pricing, making it ideal for production applications that need reliable performance at scale.
Price per 1M input tokens
$3.30
Price per 1M output tokens
$16.50
Release Date
06/12/2025
Context Size
200,000
Quantization
# Assume openai>=1.0.0
from openai import OpenAI
# Create an OpenAI client with your deepinfra token and endpoint
openai = OpenAI(
api_key="$DEEPINFRA_TOKEN",
base_url="https://api.deepinfra.com/v1/openai",
)
chat_completion = openai.chat.completions.create(
model="anthropic/claude-4-sonnet",
messages=[{"role": "user", "content": "Hello"}],
)
print(chat_completion.choices[0].message.content)
print(chat_completion.usage.prompt_tokens, chat_completion.usage.completion_tokens)
# Hello! It's nice to meet you. Is there something I can help you with, or would you like to chat?
# 11 25
DeepInfra provides access to Anthropic's latest Claude models, featuring the most advanced reasoning capabilities and balanced performance options, all with enterprise-grade safety and reliability.
Model | Context | $ per 1M input tokens | $ per 1M output tokens | Actions |
---|---|---|---|---|
claude-4-opus | 195k | $16.50 | $82.50 | |
claude-4-sonnet | 195k | $3.30 | $16.50 | |
claude-3-7-sonnet-latest | 195k | $3.30 | $16.50 |
Claude is a family of state-of-the-art large language models developed by Anthropic, designed for trustworthy and intelligent AI applications. Built with advanced reasoning capabilities, Claude models excel at complex analytical tasks, coding, multilingual communication, and nuanced language understanding. Available in multiple variants including Claude 4 Opus for maximum intelligence, Claude 4 Sonnet for balanced performance, and Claude 3.7 Sonnet with extended thinking capabilities, Claude is engineered with robust safety measures and constitutional AI training to provide reliable, helpful, and harmless AI assistance for enterprise and developer applications.
Claude offers several enterprise-grade features that set it apart from other AI models:
These features combine to create a trustworthy, scalable AI solution for mission-critical applications.
© 2025 Deep Infra. All rights reserved.