GLM-5.1 - state-of-the-art agentic engineering, now available on DeepInfra!

Wan-AI/
$0.10 / second
*Generates video content from images while stably preserving details such as subject, style, and text elements. Ensures visual consistency and information fidelity throughout dynamic transitions.

Prompt
Text prompt describing the video content. (Default: empty)
You need to login to use this model
LoginSettings
Negative Prompt
Negative prompt describing content to exclude. (Default: empty)
First Frame
First-frame image: URL or base64-encoded image data. Used for first-frame and first+last-frame video generation modes. (Default: empty)
Last Frame
Last-frame image: URL or base64-encoded image data. Used together with first_frame or first_clip. (Default: empty)
Driving Audio
URL of audio file used as a driving source for lip-sync and timing (WAV/MP3, 2-30s, max 15MB). Only valid with first_frame. (Default: empty)
First Clip
URL of initial video clip for video continuation (MP4/MOV, 2-10s, max 100MB). (Default: empty)
Resolution
Resolution tier of the generated video (720P or 1080P). Default 1080P
Duration
Duration of the generated video in seconds (2-15) (Default: empty, 2 ≤ duration ≤ 15)
Prompt Extend
Whether to enable prompt rewriting for better quality. Default true
Watermark
Whether to add AI Generated watermark. Default false
Seed
Random seed for reproducibility (Default: empty, 0 ≤ seed ≤ 2147483647)
Generates video content from images while stably preserving details such as subject, style, and text elements. Ensures visual consistency and information fidelity throughout dynamic transitions.
Wan2.7 image to video, performance fully reimagined. Delivering nuanced and organic emotional depth in narrative arcs and visceral, bone-crunching impact in action sequences. Enhanced by rhythmic cinematic cuts for unparalleled storytelling power.
© 2026 Deep Infra. All rights reserved.