Text-to-Speech (TTS) technology converts written text into spoken audio through speech synthesis. TTS systems are used in applications such as virtual assistants, accessibility tools for visually impaired users, and language learning software.
DeepInfra supports custom voices.
The following curl command creates a voice from the audio file hello.wav:
curl -X POST "https://api.deepinfra.com/v1/voices/add" \
  -H "Content-Type: multipart/form-data" \
  -H "Authorization: Bearer $DEEPINFRA_TOKEN" \
  -F "audio=@hello.wav" \
  -F "name=John Doe" \
  -F "description=John Doe's voice"
The request returns a response similar to the following:
{
  "user_id": "gh:10000000",
  "voice_id": "abcd1234abcd1234abcd",
  "name": "John Doe",
  "description": "John Doe's voice",
  "created_at": 1723851387,
  "updated_at": 1723851387
}
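The same request can be sketched in Python using only the standard library. This is an illustrative sketch, not an official client: the helper names `build_add_voice_request` and `parse_voice` are hypothetical, the `application/octet-stream` part type for the audio file is an assumption, and `created_at` is assumed to be a Unix timestamp in seconds. The endpoint, form field names, and sample response are taken from the example above.

```python
import json
import uuid
import urllib.request
from datetime import datetime, timezone

API_URL = "https://api.deepinfra.com/v1/voices/add"  # endpoint from the curl example


def build_add_voice_request(token, audio_bytes, filename, name, description):
    """Build (but do not send) a multipart/form-data POST mirroring the curl call."""
    boundary = uuid.uuid4().hex
    body = b""
    # Plain text form fields, like -F "name=..." and -F "description=..."
    for field, value in (("name", name), ("description", description)):
        body += (
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="{field}"\r\n\r\n'
            f"{value}\r\n"
        ).encode("utf-8")
    # File field, like -F "audio=@hello.wav"
    body += (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="audio"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"  # assumed part type
    ).encode("utf-8") + audio_bytes + b"\r\n"
    # Closing boundary marks the end of the multipart body
    body += f"--{boundary}--\r\n".encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )


def parse_voice(response_text):
    """Extract the voice ID and creation time from a response like the sample."""
    voice = json.loads(response_text)
    # Assumption: created_at is a Unix timestamp in seconds (UTC).
    created = datetime.fromtimestamp(voice["created_at"], tz=timezone.utc)
    return voice["voice_id"], created


# Sample response copied from the documentation above
SAMPLE_RESPONSE = """{
  "user_id": "gh:10000000",
  "voice_id": "abcd1234abcd1234abcd",
  "name": "John Doe",
  "description": "John Doe's voice",
  "created_at": 1723851387,
  "updated_at": 1723851387
}"""

voice_id, created = parse_voice(SAMPLE_RESPONSE)
print(voice_id, created.isoformat())
```

To actually send the request, read the bytes of hello.wav, pass them to `build_add_voice_request` along with the value of `DEEPINFRA_TOKEN`, and hand the result to `urllib.request.urlopen`. Keeping request construction separate from sending makes the sketch easy to inspect without network access.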
description (string): Description of the voice