We use essential cookies to make our site work. With your consent, we may also use non-essential cookies to improve user experience and analyze website traffic…

Browse deepinfra models:

All categories and models you can try out and directly use in deepinfra:

Viewing all

featured

text-generation

automatic-speech-recognition

text-to-speech

embeddings

text-to-video

text-to-image

reranker

zero-shot-image-classification

multimodal

Category/all

32k

$0.025 / Mtoken

Qwen/

Qwen3-Reranker-4B

reranker

The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks. Building upon the dense foundational models of the Qwen3 series, it provides a comprehensive range of text embeddings and reranking models in various sizes (0.6B, 4B, and 8B)

32k

$0.050 / Mtoken

Qwen/

Qwen3-Reranker-8B

reranker

$10.00 per M characters

ResembleAI/

chatterbox

text-to-speech

New model named Chatterbox by Resemble AI's first production-grade open source TTS model. Licensed under MIT, Chatterbox has been benchmarked against leading closed-source systems like ElevenLabs, and is consistently preferred in side-by-side evaluations. Whether you're working on memes, videos, games, or AI agents, Chatterbox brings your content to life. It's also the first open source TTS model to support emotion exaggeration control, a powerful feature that makes your voices stand out.

fp8

Replaced

Sao10K/

L3-70B-Euryale-v2.1

text-generation

Euryale 70B v2.1 is a model focused on creative roleplay from Sao10k

bfloat16

Replaced

Sao10K/

L3-8B-Lunaris-v1

text-generation

A generalist / roleplaying model merge based on Llama 3. Sao10K has carefully selected the values based on extensive personal experimentation and has fine-tuned them to create a customized recipe.

fp8

$0.02/$0.05 in/out Mtoken

Sao10K/

L3-8B-Lunaris-v1-Turbo

text-generation

Sao10K/L3.1-70B-Euryale-v2.2 cover image

fp8

128k

$0.65/$0.75 in/out Mtoken

Sao10K/

L3.1-70B-Euryale-v2.2

text-generation

Euryale 3.1 - 70B v2.2 is a model focused on creative roleplay from Sao10k

Sao10K/L3.3-70B-Euryale-v2.3 cover image

fp8

128k

$0.65/$0.75 in/out Mtoken

Sao10K/

L3.3-70B-Euryale-v2.3

text-generation

L3.3-70B-Euryale-v2.3 is a model focused on creative roleplay from Sao10k

$0.10 / video

Wan-AI/

Wan2.1-T2V-1.3B

text-to-video

The Wan2.1 1.3B model is a lightweight, efficient text-to-video generator. Despite its compact size, it delivers impressive performance across benchmarks and generates high-quality 480P videos.

$0.40 / video

Wan-AI/

Wan2.1-T2V-14B

text-to-video

The Wan2.1 14B model is a high-capacity, state-of-the-art video foundation model capable of producing both 480P and 720P videos. It excels at capturing complex prompts and generating visually rich, detailed scenes, making it ideal for high-end creative tasks.

Replaced

XpucT/

Deliberate

text-to-image

The Deliberate Model allows for the creation of anything desired, with the potential for better results as the user's knowledge and detail in the prompt increase. The model is ideal for meticulous anatomy artists, creative prompt writers, art designers, and those seeking explicit content.

$7.00 per M characters

Zyphra/

Zonos-v0.1-hybrid

text-to-speech

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers. Our model enables highly natural speech generation from text prompts when given a speaker embedding or audio prefix, and can accurately perform speech cloning when given a reference clip spanning just a few seconds. The conditioning setup also allows for fine control over speaking rate, pitch variation, audio quality, and emotions such as happiness, fear, sadness, and anger. The model outputs speech natively at 44kHz.

$7.00 per M characters

Zyphra/

Zonos-v0.1-transformer

text-to-speech

195k

$3.30/$16.50 in/out Mtoken

anthropic/

claude-3-7-sonnet-latest

text-generation

bigcode/starcoder2-15b-instruct-v0.1 cover image

fp16

Replaced

bigcode/

starcoder2-15b-instruct-v0.1

text-generation

We introduce StarCoder2-15B-Instruct-v0.1, the very first entirely self-aligned code Large Language Model (LLM) trained with a fully permissive and transparent pipeline. Our open-source pipeline uses StarCoder2-15B to generate thousands of instruction-response pairs, which are then used to fine-tune StarCoder-15B itself without any human annotations or distilled data from huge and proprietary LLMs.

$0.012 x (width / 1024) x (height / 1024) x (iters / 25)

black-forest-labs/

FLUX-1-Redux-dev

text-to-image

FLUX.1 Redux [dev] is an image variation generation adapter for all FLUX.1 base models. It enables users to refine images with slight variations and supports text-based restyling via API. Integrated with FLUX1.1 [pro] Ultra, it allows for high-quality 4-megapixel outputs. The model can be used with Diffusers in Python for efficient image generation. While powerful, it has ethical and factual limitations and is governed by a non-commercial license.

$0.009 x (width / 1024) x (height / 1024) x (iters / 25)

black-forest-labs/

FLUX-1-dev

text-to-image

FLUX.1-dev is a state-of-the-art 12 billion parameter rectified flow transformer developed by Black Forest Labs. This model excels in text-to-image generation, providing highly accurate and detailed outputs. It is particularly well-regarded for its ability to follow complex prompts and generate anatomically accurate images, especially with challenging details like hands and faces.

$0.0005 x (width / 1024) x (height / 1024) x iters

black-forest-labs/

FLUX-1-schnell

text-to-image

FLUX.1 [schnell] is a 12 billion parameter rectified flow transformer capable of generating images from text descriptions. This model offers cutting-edge output quality and competitive prompt following, matching the performance of closed source alternatives. Trained using latent adversarial diffusion distillation, FLUX.1 [schnell] can generate high-quality images in only 1 to 4 steps.

Unlock the most affordable AI hosting

Run models at scale with our fully managed GPU infrastructure, delivering enterprise-grade uptime at the industry's best rates.

Contact Sales Get Started

Latest Models

Gryphe/

MythoMax-L2-13b

bigcode/

starcoder2-15b

openchat/

openchat_3.5

openai/

whisper-tiny

Phind/

Phind-CodeLlama-34B-v2

Featured Models

deepseek-ai/

DeepSeek-V3-0324-Turbo

mistralai/

Mistral-Small-3.2-24B-Instruct-2506

deepseek-ai/

DeepSeek-V3

hexgrad/

Kokoro-82M

mistralai/

Devstral-Small-2507

meta-llama/

Llama-3.3-70B-Instruct

Company

Pricing

Docs

Compare

DeepStart

About

Careers

Trust Center

Privacy

Terms

Have questions or need a custom solution?

Contact Sales