Browse deepinfra models:

All categories and models you can try out and directly use in deepinfra:

Viewing all

featured

automatic-speech-recognition

text-generation

text-to-image

embeddings

custom

zero-shot-image-classification

Category/all

featured

$0.00045 / minute

openai/

whisper-large-v3

automatic-speech-recognition

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.

meta-llama/Meta-Llama-3.1-405B-Instruct cover image

meta-llama/

Meta-Llama-3.1-405B-Instruct

text-generation

Meta developed and released the Meta Llama 3.1 family of large language models (LLMs), a collection of pretrained and instruction tuned generative text models in 8B, 70B and 405B sizes

meta-llama/Meta-Llama-3.1-70B-Instruct cover image

featured

bfloat16

128k

$0.52/$0.75 in/out Mtoken

meta-llama/

Meta-Llama-3.1-70B-Instruct

text-generation

Meta developed and released the Meta Llama 3.1 family of large language models (LLMs), a collection of pretrained and instruction tuned generative text models in 8B, 70B and 405B sizes

meta-llama/Meta-Llama-3.1-8B-Instruct cover image

meta-llama/

Meta-Llama-3.1-8B-Instruct

text-generation

Meta developed and released the Meta Llama 3.1 family of large language models (LLMs), a collection of pretrained and instruction tuned generative text models in 8B, 70B and 405B sizes

featured

$0.27 / Mtoken

google/

gemma-2-27b-it

text-generation

Gemma is a family of lightweight, state-of-the-art open models from Google. Gemma-2-27B delivers the best performance for its size class, and even offers competitive alternatives to models more than twice its size.

featured

$0.09 / Mtoken

google/

gemma-2-9b-it

text-generation

Gemma is a family of lightweight, state-of-the-art open models from Google. The 9B Gemma 2 model delivers class-leading performance, outperforming Llama 3 8B and other open models in its size category.

cognitivecomputations/dolphin-2.9.1-llama-3-70b cover image

featured

bfloat16

$0.59/$0.79 in/out Mtoken

cognitivecomputations/

dolphin-2.9.1-llama-3-70b

text-generation

Dolphin 2.9.1, a fine-tuned Llama-3-70b model. The new model, trained on filtered data, is more compliant but uncensored. It demonstrates improvements in instruction, conversation, coding, and function calling abilities.

featured

fp8

$0.59/$0.79 in/out Mtoken

Sao10K/

L3-70B-Euryale-v2.1

text-generation

Euryale 70B v2.1 is a model focused on creative roleplay from Sao10k

meta-llama/Meta-Llama-3-70B-Instruct cover image

featured

bfloat16

$0.52/$0.75 in/out Mtoken

meta-llama/

Meta-Llama-3-70B-Instruct

text-generation

Model Details Meta developed and released the Meta Llama 3 family of large language models (LLMs), a collection of pretrained and instruction tuned generative text models in 8 and 70B sizes.

featured

bfloat16

32k

$0.56/$0.77 in/out Mtoken

Qwen/

Qwen2-72B-Instruct

text-generation

The 72 billion parameter Qwen2 excels in language understanding, multilingual capabilities, coding, mathematics, and reasoning.

microsoft/Phi-3-medium-4k-instruct cover image

featured

bfloat16

$0.14 / Mtoken

microsoft/

Phi-3-medium-4k-instruct

text-generation

The Phi-3-Medium-4K-Instruct is a powerful and lightweight language model with 14 billion parameters, trained on high-quality data to excel in instruction following and safety measures. It demonstrates exceptional performance across benchmarks, including common sense, language understanding, and logical reasoning, outperforming models of similar size.

featured

bfloat16

$0.064 / Mtoken

openchat/

openchat-3.6-8b

text-generation

Openchat 3.6 is a LLama-3-8b fine tune that outperforms it on multiple benchmarks.

mistralai/Mistral-7B-Instruct-v0.3 cover image

mistralai/

Mistral-7B-Instruct-v0.3

text-generation

Mistral-7B-Instruct-v0.3 is an instruction-tuned model, next iteration of of Mistral 7B that has larger vocabulary, newer tokenizer and supports function calling.

meta-llama/Meta-Llama-3-8B-Instruct cover image

featured

bfloat16

$0.06 / Mtoken

meta-llama/

Meta-Llama-3-8B-Instruct

text-generation

Meta developed and released the Meta Llama 3 family of large language models (LLMs), a collection of pretrained and instruction tuned generative text models in 8 and 70B sizes.

mistralai/Mixtral-8x22B-Instruct-v0.1 cover image

mistralai/

Mixtral-8x22B-Instruct-v0.1

text-generation

This is the instruction fine-tuned version of Mixtral-8x22B - the latest and largest mixture of experts large language model (LLM) from Mistral AI. This state of the art machine learning model uses a mixture 8 of experts (MoE) 22b models. During inference 2 experts are selected. This architecture allows large models to be fast and cheap at inference.

microsoft/

WizardLM-2-8x22B

text-generation

WizardLM-2 8x22B is Microsoft AI's most advanced Wizard model. It demonstrates highly competitive performance compared to those leading proprietary models.

microsoft/

WizardLM-2-7B

text-generation

WizardLM-2 7B is the smaller variant of Microsoft AI's latest Wizard model. It is the fastest and achieves comparable performance with existing 10x larger open-source leading models

mistralai/Mixtral-8x7B-Instruct-v0.1 cover image

mistralai/

Mixtral-8x7B-Instruct-v0.1

text-generation

Mixtral is mixture of expert large language model (LLM) from Mistral AI. This is state of the art machine learning model using a mixture 8 of experts (MoE) 7b models. During inference 2 expers are selected. This architecture allows large models to be fast and cheap at inference. The Mixtral-8x7B outperforms Llama 2 70B on most benchmarks.

Latest Models

Phind/

Phind-CodeLlama-34B-v2

openchat/

openchat_3.5

openai/

whisper-tiny

Gryphe/

MythoMax-L2-13b

bigcode/

starcoder2-15b

Featured Models

lizpreciatior/

lzlv_70b_fp16_hf

mistralai/

Mixtral-8x22B-Instruct-v0.1

google/

gemma-2-27b-it

meta-llama/

Meta-Llama-3.1-70B-Instruct

stability-ai/

sdxl

llava-hf/

llava-1.5-7b-hf

Company

Pricing

Docs

Compare

DeepStart

About

Careers

Privacy

Terms