bosonai/HiggsAudioV2.5

$20.00 / 1M characters
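
As a rough illustration of the per-character pricing, the sketch below estimates the cost of synthesizing a piece of text. It assumes that one billed character corresponds to one character of a Python string; how the service actually counts characters (whitespace, multi-byte characters, markup) is not stated here, so treat the result as an approximation. The function name `estimate_cost_usd` is made up for this example.

```python
# Listed price on this page: $20.00 per 1,000,000 input characters.
PRICE_PER_MILLION_CHARS_USD = 20.00


def estimate_cost_usd(text: str) -> float:
    """Approximate synthesis cost in USD.

    Assumption: one billed character per Python str character.
    The provider's real counting rules may differ.
    """
    return len(text) * PRICE_PER_MILLION_CHARS_USD / 1_000_000


if __name__ == "__main__":
    narration = "Hello, world. " * 1000  # 14,000 characters
    print(f"{len(narration)} chars -> ${estimate_cost_usd(narration):.4f}")
```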

HiggsAudioV2.5 is a high-quality neural text-to-speech (TTS) model designed for natural-sounding voice generation across a wide range of use cases. It focuses on clarity, stable prosody, and consistent pacing, making it suitable for both short prompts and longer narration.

Public

Input

Input text

Text to convert to speech

Settings

Voice

Response Format

Output format. Only pcm is supported (default: pcm).
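
Because raw PCM has no header, most players cannot open it directly. The sketch below wraps PCM bytes in a WAV container using Python's standard `wave` module. The sample rate (24000 Hz), channel count (mono), and sample width (16-bit) are assumptions chosen for illustration; this page does not state them, so check the model's actual output parameters before relying on these defaults. The function name `pcm_to_wav_bytes` is hypothetical.

```python
import io
import wave


def pcm_to_wav_bytes(
    pcm: bytes,
    sample_rate: int = 24000,  # assumed, not confirmed by this page
    channels: int = 1,         # assumed mono
    sample_width: int = 2,     # assumed 16-bit samples (2 bytes)
) -> bytes:
    """Wrap headerless PCM audio in a WAV container and return the file bytes."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(sample_width)
        wav.setframerate(sample_rate)
        wav.writeframes(pcm)
    return buf.getvalue()


if __name__ == "__main__":
    silence = b"\x00\x00" * 24000  # one second of silence under the assumed format
    with open("out.wav", "wb") as f:
        f.write(pcm_to_wav_bytes(silence))
```

If the audio plays back too fast, too slow, or as noise, the assumed sample rate or sample width is the first thing to revisit.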

Stream

Whether to stream audio bytes in chunks
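
When audio arrives in chunks, a chunk boundary can fall in the middle of a sample. The sketch below shows one way to re-align incoming byte chunks to whole frames before passing them to a player or writer. It is transport-agnostic: `chunks` can be any iterable of bytes, for example the chunk iterator of an HTTP client. The 2-byte frame size assumes 16-bit mono PCM, which this page does not confirm, and the name `iter_aligned_pcm` is made up for this example.

```python
from typing import Iterable, Iterator


def iter_aligned_pcm(chunks: Iterable[bytes], frame_bytes: int = 2) -> Iterator[bytes]:
    """Yield PCM data in pieces whose length is a multiple of frame_bytes.

    Assumption: frame_bytes=2 corresponds to 16-bit mono audio.
    """
    pending = b""
    for chunk in chunks:
        pending += chunk
        # Largest prefix of the buffer that contains only whole frames.
        usable = len(pending) - (len(pending) % frame_bytes)
        if usable:
            yield pending[:usable]
            pending = pending[usable:]
    # Any trailing partial frame left in `pending` is dropped.


if __name__ == "__main__":
    incoming = [b"\x01", b"\x02\x03", b"\x04\x05\x06"]
    for piece in iter_aligned_pcm(incoming):
        print(len(piece), "bytes")
```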

Output
