openai/whisper-small.en

Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation, trained on 680k hours of labelled data without the need for fine-tuning. It is a Transformer based encoder-decoder model, trained on either English-only or multilingual data, and is available in five configurations of varying model sizes. The models were trained on the tasks of speech recognition and speech translation, predicting transcriptions in the same or different languages as the audio.

Due to low usage this model has been replaced by openai/whisper-large-v3. Your inference requests are still working but they are redirected. Please update your code to use another model.

Public

demoversions

HTTP/cURL API

You can use cURL or any other http client to run inferences:

curl -X POST \
    -H "Authorization: bearer $DEEPINFRA_TOKEN"  \
    -F audio=@my_voice.mp3  \
    'https://api.deepinfra.com/v1/inference/openai/whisper-small.en'

which will give you back something similar to:

{
  "text": "",
  "segments": [
    {
      "id": 0,
      "text": "Hello",
      "start": 0.0,
      "end": 1.0
    },
    {
      "id": 1,
      "text": "World",
      "start": 4.0,
      "end": 5.0
    }
  ],
  "language": "en",
  "input_length_ms": 0,
  "request_id": null,
  "inference_status": {
    "status": "unknown",
    "runtime_ms": 0,
    "cost": 0.0,
    "tokens_generated": 0,
    "tokens_input": 0
  }
}

Input fields

`audio`string

audio to transcribe

`task`string

task to perform

Default value: "transcribe"

Allowed values: transcribetranslate

Input Schema

Output Schema

Latest Models

openai/

whisper-tiny

Phind/

Phind-CodeLlama-34B-v2

Gryphe/

MythoMax-L2-13b

bigcode/

starcoder2-15b

openchat/

openchat_3.5

Featured Models

Qwen/

Qwen2-72B-Instruct

meta-llama/

Meta-Llama-3-70B-Instruct

meta-llama/

Meta-Llama-3.1-405B-Instruct

meta-llama/

Meta-Llama-3.1-8B-Instruct

microsoft/

WizardLM-2-7B

microsoft/

WizardLM-2-8x22B

Company

Pricing

Docs

Compare

DeepStart

About

Careers

Privacy

Terms

openai/whisper-small.en

HTTP/cURL API

Input fields

`audio`string

`task`string

`initial_prompt`string

`temperature`number

`language`string

`webhook`file

Input Schema

Output Schema

openai/whisper-small.en

HTTP/cURL API

Input fields

audiostring

taskstring

initial_promptstring

temperaturenumber

languagestring

webhookfile

Input Schema

Output Schema

`audio`string

`task`string

`initial_prompt`string

`temperature`number

`language`string

`webhook`file