New model named Chatterbox by Resemble AI's first production-grade open source TTS model. Licensed under MIT, Chatterbox has been benchmarked against leading closed-source systems like ElevenLabs, and is consistently preferred in side-by-side evaluations. Whether you're working on memes, videos, games, or AI agents, Chatterbox brings your content to life. It's also the first open source TTS model to support emotion exaggeration control, a powerful feature that makes your voices stand out.
New model named Chatterbox by Resemble AI's first production-grade open source TTS model. Licensed under MIT, Chatterbox has been benchmarked against leading closed-source systems like ElevenLabs, and is consistently preferred in side-by-side evaluations. Whether you're working on memes, videos, games, or AI agents, Chatterbox brings your content to life. It's also the first open source TTS model to support emotion exaggeration control, a powerful feature that makes your voices stand out.
DeepInfra supports custom voices.
The following creates a voice using the curl
command.
curl -X POST "https://api.deepinfra.com/v1/voices/add" \
-H "Content-Type: multipart/form-data" \
-H "Authorization: Bearer $DEEPINFRA_TOKEN" \
-F "audio=@hello.wav" \
-F "name=John Doe" \
-F "description=John Doe's voice"
which will return something similar to
{
"user_id": "gh:10000000",
"voice_id": "abcd1234abcd1234abcd",
"name": "John Doe",
"description": "John Doe's voice",
"created_at": 1723851387,
"updated_at": 1723851387
}
description
stringDescription of the voice