Start a render
Wan 2.5 Text & Image to Video
Audio enabled
0:00 / 0:00

Wan 2.5 Text & Image to Video audio-enabled video example: Ultra-realistic handheld…

This Wan 2.5 Text & Image to Video text to video example shows Ultra-realistic handheld selfie filmed inside a. It highlights audio-enabled output with 10-second timing · 9:16 output.

Wan 2.5 Text & Image to VideoText to video10s9:16Enabled$0.65
Wan 2.5 Text & Image to VideoText to video10s9:16Audio

Prompt breakdown

Text-to-video prompt used to generate this render.

Ultra-realistic handheld selfie filmed inside a parked car at night. The person is sitting in the driver’s seat, illuminated softly by streetlights and reflections of rain droplets sliding down the windows. Camera held…

Subject

Ultra-realistic handheld selfie filmed inside a parked car at night. The person is sitting in the driver’s seat, illuminated softly by streetlights and reflections of rain droplets sliding down the windows. Camera held…

Workflow

Text to video

Camera

Audio Enabled

Output

10s · 9:16

Audio

Enabled

Constraints

Text To Video, Audio Enabled

Show full prompt

Ultra-realistic handheld selfie filmed inside a parked car at night. The person is sitting in the driver’s seat, illuminated softly by streetlights and reflections of rain droplets sliding down the windows. Camera held close to the face, slight breathing motion, narrow depth of field, cinematic low-light grain. Realistic skin texture, natural eye reflections from passing headlights. The person speaks with a quiet, reflective tone. Lip-sync must match the line: "I didn’t expect tonight to end like this… but maybe it’s exactly what I needed." Audio: include soft rain hitting the windshield, distant traffic, the muffled hum of the car interior. Phone-mic quality with slight reverb from the cabin. Mood: introspective, raw, intimate. No beauty filters. No smoothing. Keep the moment grounded, honest, emotional.

Why Wan 2.5 Text & Image to Video fits this shot

Wan 2.5 handles 5 or 10 second clips with optional background audio plus prompt expansion when you need extra detail.

Audio option

5s or 10s

480p–1080p

Key frames

Opening frame
Motion beat
Final shot

Related examples

View all examples