FLUX.2 is live! High-fidelity image generation made simple.
nvidia/
$0.20
in
$0.60
out
/ 1M tokens
The model is an auto-regressive vision language model that uses an optimized transformer architecture. The model enables multi-image reasoning and video understanding, along with strong document intelligence, visual Q&A and summarization capabilities.

9rfoqv3r
2025-10-28T19:24:26+00:00
© 2025 Deep Infra. All rights reserved.