FLUX.2 is live! High-fidelity image generation made simple.
bosonai/
$20.00
/ 1M characters
HiggsAudioV2.5 is a high-quality neural text-to-speech (TTS) model designed for natural-sounding voice generation across a wide range of use cases. It focuses on clarity, stable prosody, and consistent pacing, making it suitable for both short prompts and longer narration.

Input text
Text to convert to speech
Settings
IN Voice --
Response Format
Output format (only pcm supported). (Default: pcm)
Stream
Whether to stream audio bytes in chunks
Waiting for audio data... Submit request to start streaming.
© 2026 Deep Infra. All rights reserved.