nvidia/
$0.20 in / $0.60 out
This auto-regressive vision-language model uses an optimized transformer architecture. It supports multi-image reasoning and video understanding, along with strong document intelligence, visual Q&A, and summarization.
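
The listed input and output prices can be turned into a rough per-request cost estimate. The sketch below assumes the prices are in US dollars per 1 million tokens; the page does not state the billing unit, so treat that as an assumption. The helper name `estimate_cost` is hypothetical and not part of any Deep Infra API.

```python
from decimal import Decimal

# Prices as listed on the page.
PRICE_IN = Decimal("0.20")   # USD, input tokens
PRICE_OUT = Decimal("0.60")  # USD, output tokens

# ASSUMPTION: prices are quoted per 1,000,000 tokens (unit not stated on the page).
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = PRICE_IN * input_tokens + PRICE_OUT * output_tokens
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # Example: a request with 2,000 input tokens and 500 output tokens.
    print(estimate_cost(2_000, 500))
```

Using `Decimal` rather than `float` avoids binary rounding noise in the estimate. If the actual billing unit differs, only `TOKENS_PER_UNIT` needs to change.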

© 2025 Deep Infra. All rights reserved.