Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.
Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.
condition_on_previous_text
booleanprovide the previous output of the model as a prompt for the next window
Default value: true
temperature_increment_on_fallback
numbertemperature to increase when falling back when the decoding fails to meet either of the thresholds below
Default value: 0.2
webhook
fileThe webhook to call when inference is done, by default you will get the output in the response of your inference request