Wan 2.5 Text & Image to Video audio-enabled video example: city camera move
This Wan 2.5 Text & Image to Video text to video example shows city camera move. It highlights audio-enabled output with 10-second timing · 9:16 output.
Prompt breakdown
Text-to-video prompt used to generate this render.
Subject
Ultra-realistic walking selfie shot filmed with a smartphone held in one hand. The person is speed-walking through a busy urban street in daylight. Camera movement is dynamic: fast steps, sudden micro-shakes, quick tilt…
Workflow
Text to video
Camera
Audio Enabled
Output
10s · 9:16
Audio
Enabled
Constraints
Text To Video, Audio Enabled, Camera Move
Show full promptHide full prompt
Ultra-realistic walking selfie shot filmed with a smartphone held in one hand. The person is speed-walking through a busy urban street in daylight. Camera movement is dynamic: fast steps, sudden micro-shakes, quick tilts as the person avoids people and obstacles. Natural motion blur, realistic stabilization drift, shifting sunlight and shadows on their face. High-detail skin texture, real reflections in the eyes. The person speaks extremely fast, slightly out of breath, trying to explain something urgently while walking. Lip-sync must perfectly match the following rapid line: “Okay listen, I don’t have much time but everything’s happening way faster than I expected and I swear I’ll explain everything once I get there!” Audio: realistic city ambience (footsteps, passing cars, faint horns), wind hitting the phone mic, breath sounds, occasional clothing rustle. Keep the phone-mic quality: compressed, slightly distorted on loud peaks. Mood: energetic, chaotic, spontaneous. No filters, no beautification. Keep it raw and real.
Why Wan 2.5 Text & Image to Video fits this shot
Wan 2.5 handles 5 or 10 second clips with optional background audio plus prompt expansion when you need extra detail.
Audio option
5s or 10s
480p–1080p
Key frames



Related examples
View all examples
Wan 2.5 Text & Image to VideoWan 2.5 vertical spy-to-Zoom comedy video example
This Wan 2.5 watch page shows a vertical comedy prompt that opens like a spy action scene and ends with a Zoom-call reveal.
Wan 2.5 Text & Image to VideoWan 2.5 vertical smartwatch runner ad example
This Wan 2.5 example turns a smartwatch prompt into a vertical runner ad with beat-timed motion, rain details and audio-enabled pacing.
Wan 2.5 Text & Image to VideoLTX 2.3 Pro rooftop lightning fashion shot example
This LTX 2.3 Pro page shows a rooftop fashion prompt with storm lighting, neon city atmosphere and cinematic subject isolation.
Wan 2.5 Text & Image to VideoSora 2 gorilla dance video example with strobe lighting
This Sora 2 watch page shows a gorilla-mask dance prompt rendered with strobe lighting, changing camera angles, native audio and a 16:9 output.