🚀 New models by Bria.ai, generate and edit images at scale 🚀
Developed by Google DeepMind, Gemini is a family of state-of-the-art thinking models with native multimodal capabilities, designed for advanced reasoning, complex problem-solving, and comprehensive understanding across text, audio, video, and images. Built with revolutionary thinking architecture, Gemini models reason through problems step-by-step before responding, delivering enhanced accuracy and performance for sophisticated applications.
Gemini 2.5 Pro sets new standards for complex reasoning and coding excellence, while Gemini 2.5 Flash provides optimal price-performance for high-volume tasks. With massive context windows up to 1 million tokens, native multimodal processing that handles hours of video and audio, and transparent reasoning capabilities that show step-by-step thinking processes, Gemini excels at document analysis, code generation, scientific research, and agentic workflows.
Perfect for building intelligent applications that require deep reasoning, multimodal understanding, long-context processing, and transparent AI decision-making with Google's enterprise-grade reliability and performance.
Gemini 2.5 Pro is Google's most advanced thinking model, leading in complex reasoning, advanced coding, and multimodal understanding—with transparent step-by-step reasoning and state-of-the-art performance across academic and real-world benchmarks.
Price per 1M input tokens
$0.875
Price per 1M output tokens
$7.00
Release Date
04/17/2025
Context Size
1,000,000
Quantization
# Assume openai>=1.0.0
from openai import OpenAI
# Create an OpenAI client with your deepinfra token and endpoint
openai = OpenAI(
api_key="$DEEPINFRA_TOKEN",
base_url="https://api.deepinfra.com/v1/openai",
)
chat_completion = openai.chat.completions.create(
model="google/gemini-2.5-pro",
messages=[{"role": "user", "content": "Hello"}],
)
print(chat_completion.choices[0].message.content)
print(chat_completion.usage.prompt_tokens, chat_completion.usage.completion_tokens)
# Hello! It's nice to meet you. Is there something I can help you with, or would you like to chat?
# 11 25
DeepInfra provides access to Google's latest Gemini models, featuring advanced thinking capabilities, native multimodal processing, and industry-leading performance for complex reasoning and development tasks.
Model | Context | $ per 1M input tokens | $ per 1M output tokens | Actions |
---|---|---|---|---|
gemini-2.5-pro | 976k | $0.875 | $7.00 | |
gemini-2.5-flash | 976k | $0.21 | $1.75 | |
gemini-2.0-flash-001 | 976k | $0.10 | $0.40 |
Gemini is a family of state-of-the-art thinking models developed by Google DeepMind, designed with native multimodal capabilities and advanced reasoning architecture. Built as thinking models, Gemini can reason through complex problems step-by-step before responding, resulting in enhanced accuracy and performance.
Available in multiple variants including Gemini 2.5 Pro for maximum reasoning capabilities, Gemini 2.5 Flash for optimal price-performance, and Gemini 2.0 Flash for next-generation features, Gemini models excel at complex coding, scientific reasoning, document analysis, and multimodal understanding across text, audio, video, and images with transparent reasoning processes and enterprise-grade reliability.
Yes. DeepInfra's infrastructure delivers optimized performance for Gemini models with intelligent load balancing and efficient resource allocation. Gemini 2.5 Flash is specifically designed for low-latency, high-volume tasks while maintaining thinking capabilities. The models feature adjustable thinking budgets that automatically calibrate processing time based on query complexity—providing faster responses for simple requests and deeper reasoning for complex problems.
Gemini's thinking capabilities represent a breakthrough in AI reasoning through several key innovations:
This combination enables sophisticated problem-solving, strategic planning, and complex coding tasks while maintaining full visibility into the AI's reasoning process.
© 2025 Deep Infra. All rights reserved.