LTX 2.3 Fast image-to-video example: Use the uploaded image as the
This LTX 2.3 Fast image to video example shows Use the uploaded image as the. It highlights audio-enabled output with 14-second timing · 16:9 · 1080p output.
Prompt breakdown
Text-to-video prompt used to generate this render.
Subject
Use the uploaded image as the strict start-frame anchor. Preserve the exact same crowded metro carriage, the same gorilla in a dark tailored suit, the same alpaca in a formal suit and glasses, and the same surrounding c…
Workflow
Image to video
Camera
Image To Video
Output
14s · 16:9 · 1080p
Audio
Enabled
Constraints
Image To Video, Audio Enabled, Reference Image
Reference image
Provided
Show full promptHide full prompt
Use the uploaded image as the strict start-frame anchor. Preserve the exact same crowded metro carriage, the same gorilla in a dark tailored suit, the same alpaca in a formal suit and glasses, and the same surrounding commuters. Keep the framing realistic and cinematic. The train is moving steadily through the tunnel with subtle carriage sway, soft metallic rattling, low rail noise, distant tunnel rumble, fluorescent hum, and realistic motion blur outside the windows. No exaggerated action. The entire scene is driven by performance, timing, breathing, silence, and eye contact. The gorilla and the alpaca stand face to face in the middle of the crowded metro, both completely serious, tired, and slightly awkward, like two strangers who are not sure whether a social interaction has just happened. Performance direction: - very subtle body movement only - natural breathing visible in the chest and shoulders - tiny eye movements - slight hesitation before each line - uncomfortable but controlled silence - deadpan British-style social awkwardness - surrounding commuters remain mostly quiet and serious, with minimal reaction Dialogue timing and acting: 0:00–0:03 The train sways gently. The gorilla briefly glances toward the alpaca, then away, then back again. Gorilla, low voice, awkward, almost apologetic: “Sorry… were you talking to me?” 0:03–0:05 A short silence. The alpaca blinks once, keeps a straight face, tiny inhale. Alpaca, calm and dry: “No.” 0:05–0:07 Another pause. The gorilla looks slightly confused, shifts his grip, breathes out through the nose. Gorilla: “Right… and you?” 0:07–0:09 The alpaca gives the smallest possible side glance, still perfectly serious. Alpaca: “No, not really.” 0:09–0:11 A longer silence. The train rattles. One nearby commuter subtly looks up, then looks away again. Gorilla, almost to himself: “No one talks anymore anyway.” 0:11–0:14 Silence. The alpaca stares forward, then gives a tiny thoughtful nod. Alpaca, quietly: “That’s true, actually.” Audio direction: - realistic moving metro ambience throughout - soft rail clatter and low tunnel rumble - fluorescent carriage hum - subtle clothing movement and breathing during pauses - dialogue clean, dry, understated, intimate, no theatrical projection - leave natural silence between lines - no music - no subtitles - no text on screen - no logos - no extra fantasy elements Visual direction: prestige cinematic realism, restrained performance comedy, subtle depth of field, grounded lighting, natural commuter stillness, premium film look, humor comes entirely from timing, silence, and serious acting.
Why LTX 2.3 Fast fits this shot
Generate fast AI video with LTX 2.3 Fast on MaxVideoAI. Text and image workflows support 6–20s clips, 1080p/1440p/4K, native audio, and 25/50 fps options.
Image input
Audio option
20s max
Key frames



Related examples
View all examples
LTX 2.3 FastLTX 2.3 Pro anime sword attack image-to-video example
This LTX 2.3 Pro draft uses a first-frame image to animate an anime sword attack with water effects, forward camera motion, sharp character detail and 16:9 framing.
LTX 2.3 FastLTX 2.3 Pro office image-to-video transition example
This LTX 2.3 Pro example uses image-to-video controls to preserve a scene across a directed office transition with strong frame continuity.
LTX 2.3 FastKling 3 Pro dark warrior red katana example
This Kling 3 Pro draft animates a dark warrior reference image into a 16:9 red-katana temple scene with five camera beats, fog, moonlight and audio.
LTX 2.3 FastSeedance 2.0 Fast space dog red button example
This Seedance 2.0 Fast draft uses a start image to build a 16:9 space-station scene where a suited dog drifts toward a red button while mission control reacts.