Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. Trained on 680k hours of labelled data, Whisper models demonstrate a strong ability to generalise to many datasets and domains without fine-tuning. The primary intended users of these models are AI researchers studying robustness, generalisation, and capabilities of the current model.
Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. Trained on 680k hours of labelled data, Whisper models demonstrate a strong ability to generalise to many datasets and domains without fine-tuning. The primary intended users of these models are AI researchers studying robustness, generalisation, and capabilities of the current model.
condition_on_previous_text
booleanprovide the previous output of the model as a prompt for the next window
Default value: true
temperature_increment_on_fallback
numbertemperature to increase when falling back when the decoding fails to meet either of the thresholds below
Default value: 0.2
webhook
fileThe webhook to call when inference is done, by default you will get the output in the response of your inference request