Browse deepinfra models:

All categories and models you can try out and directly use in deepinfra:
Search

Category/text-to-speech

hexgrad/Kokoro-82M cover image
featured
$5.00 per M characters
  • text-to-speech

Kokoro is a frontier TTS model for its size of 82 million parameters (text in/audio out). On 25 Dec 2024, Kokoro v0.19 weights were permissively released in full fp32 precision under an Apache 2.0 license. As of 2 Jan 2025, 10 unique Voicepacks have been released, and a .onnx version of v0.19 is available.