Browse deepinfra models:

All categories and models you can try out and directly use in deepinfra:

Viewing all

featured

text-generation

text-to-image

automatic-speech-recognition

embeddings

token-classification

fill-mask

text-classification

question-answering

image-classification

object-detection

custom

zero-shot-image-classification

Category/text-generation

Text generation AI models can generate coherent and natural-sounding human language text, making them useful for a variety of applications from language translation to content creation.

There are several types of text generation AI models, including rule-based, statistical, and neural models. Neural models, and in particular transformer-based models like GPT, have achieved state-of-the-art results in text generation tasks. These models use artificial neural networks to analyze large text corpora and learn the patterns and structures of language.

While text generation AI models offer many exciting possibilities, they also present some challenges. For example, it's essential to ensure that the generated text is ethical, unbiased, and accurate, to avoid potential harm or negative consequences.

meta-llama/Meta-Llama-3-70B-Instruct cover image

$0.59/$0.79 in/out Mtoken

meta-llama/

Meta-Llama-3-70B-Instruct

text-generation

Model Details Meta developed and released the Meta Llama 3 family of large language models (LLMs), a collection of pretrained and instruction tuned generative text models in 8 and 70B sizes.

meta-llama/Meta-Llama-3-8B-Instruct cover image

meta-llama/

Meta-Llama-3-8B-Instruct

text-generation

mistralai/Mixtral-8x22B-Instruct-v0.1 cover image

mistralai/

Mixtral-8x22B-Instruct-v0.1

text-generation

This is the instruction fine-tuned version of Mixtral-8x22B - the latest and largest mixture of experts large language model (LLM) from Mistral AI. This state of the art machine learning model uses a mixture 8 of experts (MoE) 22b models. During inference 2 experts are selected. This architecture allows large models to be fast and cheap at inference.

microsoft/WizardLM-2-8x22B cover image

microsoft/

WizardLM-2-8x22B

text-generation

WizardLM-2 8x22B is Microsoft AI's most advanced Wizard model. It demonstrates highly competitive performance compared to those leading proprietary models.

microsoft/WizardLM-2-7B cover image

microsoft/

WizardLM-2-7B

text-generation

WizardLM-2 7B is the smaller variant of Microsoft AI's latest Wizard model. It is the fastest and achieves comparable performance with existing 10x larger open-source leading models

google/gemma-1.1-7b-it cover image

google/

gemma-1.1-7b-it

text-generation

Gemma is an open-source model designed by Google. This is Gemma 1.1 7B (IT), an update over the original instruction-tuned Gemma release. Gemma 1.1 was trained using a novel RLHF method, leading to substantial gains on quality, coding capabilities, factuality, instruction following and multi-turn conversation quality.

mistralai/Mixtral-8x7B-Instruct-v0.1 cover image

mistralai/

Mixtral-8x7B-Instruct-v0.1

text-generation

Mixtral is mixture of expert large language model (LLM) from Mistral AI. This is state of the art machine learning model using a mixture 8 of experts (MoE) 7b models. During inference 2 expers are selected. This architecture allows large models to be fast and cheap at inference. The Mixtral-8x7B outperforms Llama 2 70B on most benchmarks.

mistralai/Mistral-7B-Instruct-v0.2 cover image

mistralai/

Mistral-7B-Instruct-v0.2

text-generation

The Mistral-7B-Instruct-v0.2 Large Language Model (LLM) is a instruct fine-tuned version of the Mistral-7B-v0.2 generative text model using a variety of publicly available conversation datasets.

meta-llama/Llama-2-70b-chat-hf cover image

$0.64/$0.80 in/out Mtoken

meta-llama/

Llama-2-70b-chat-hf

text-generation

LLaMa 2 is a collections of LLMs trained by Meta. This is the 70B chat optimized version. This endpoint has per token pricing.

cognitivecomputations/dolphin-2.6-mixtral-8x7b cover image

cognitivecomputations/

dolphin-2.6-mixtral-8x7b

text-generation

The Dolphin 2.6 Mixtral 8x7b model is a finetuned version of the Mixtral-8x7b model, trained on a variety of data including coding data, for 3 days on 4 A100 GPUs. It is uncensored and requires trust_remote_code. The model is very obedient and good at coding, but not DPO tuned. The dataset has been filtered for alignment and bias. The model is compliant with user requests and can be used for various purposes such as generating code or engaging in general chat.

lizpreciatior/lzlv_70b_fp16_hf cover image

$0.59/$0.79 in/out Mtoken

lizpreciatior/

lzlv_70b_fp16_hf

text-generation

A Mythomax/MLewd_13B-style merge of selected 70B models A multi-model merge of several LLaMA2 70B finetunes for roleplaying and creative work. The goal was to create a model that combines creativity with intelligence for an enhanced experience.

openchat/openchat_3.5 cover image

openchat/

openchat_3.5

text-generation

OpenChat is a library of open-source language models that have been fine-tuned with C-RLFT, a strategy inspired by offline reinforcement learning. These models can learn from mixed-quality data without preference labels and have achieved exceptional performance comparable to ChatGPT. The developers of OpenChat are dedicated to creating a high-performance, commercially viable, open-source large language model and are continuously making progress towards this goal.

llava-hf/llava-1.5-7b-hf cover image

llava-hf/

llava-1.5-7b-hf

text-generation

LLaVa is a multimodal model that supports vision and language models combined.

deepinfra/airoboros-70b cover image

$0.70/$0.90 in/out Mtoken

deepinfra/

airoboros-70b

text-generation

Latest version of the Airoboros model fine-tunned version of llama-2-70b using the Airoboros dataset. This model is currently running jondurbin/airoboros-l2-70b-2.2.1

meta-llama/Llama-2-7b-chat-hf cover image

meta-llama/

Llama-2-7b-chat-hf

text-generation

Llama 2 is a collection of pretrained and fine-tuned generative text models ranging in scale from 7 billion to 70 billion parameters. This is the repository for the 7B fine-tuned model, optimized for dialogue use cases and converted for the Hugging Face Transformers format.

01-ai/Yi-34B-Chat cover image

01-ai/

Yi-34B-Chat

text-generation

Austism/chronos-hermes-13b-v2 cover image

Austism/

chronos-hermes-13b-v2

text-generation

This offers the imaginative writing style of chronos while still retaining coherency and being capable. Outputs are long and utilize exceptional prose. Supports a maxium context length of 4096. The model follows the Alpaca prompt format.

Gryphe/MythoMax-L2-13b cover image

Gryphe/

MythoMax-L2-13b

text-generation

Latest Models

Phind/

Phind-CodeLlama-34B-v2

Gryphe/

MythoMax-L2-13b

openai/

whisper-tiny

bigcode/

starcoder2-15b

openchat/

openchat_3.5

Featured Models

microsoft/

WizardLM-2-8x22B

mistralai/

Mixtral-8x7B-Instruct-v0.1

meta-llama/

Llama-2-70b-chat-hf

microsoft/

WizardLM-2-7B

stability-ai/

sdxl

BAAI/

bge-large-en-v1.5

Company

Pricing

Docs

Compare

DeepStart

About

Privacy

Terms

© 2024 Deep Infra. All rights reserved.