GLM-5.1 - state-of-the-art agentic engineering, now available on DeepInfra!

inworld-ai/
$50.00
/ 1M characters
High-quality multilingual text-to-speech model by Inworld AI with 130+ preset voices across 15 languages. Supports voice cloning, word-level timestamps, and streaming. Optimized for natural, expressive speech with <250ms time-to-first-audio.

Voices
© 2026 Deep Infra. All rights reserved.