Strong prompts read like a director’s shot note. Stack these six parts, in this order, and the model has far less to guess.
Who or what is on screen. Use concrete nouns, not vague ones.
e.g. a vintage red convertible
One clear thing happening. Keep it to a single action per shot.
e.g. driving along a coastal road
Where and when: location plus time of day or weather.
e.g. Amalfi cliffs at golden hour
Shot size and movement. Direct it like a cinematographer.
e.g. aerial tracking shot, slow follow
The light source and its quality. This sets the whole mood.
e.g. warm low sun, long shadows
Medium, look and grade. Name a film stock or genre if it helps.
e.g. cinematic, shot on 35mm, warm grade
All six parts in one prompt: copy it and swap the details for your own shot.
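If you build prompts in a script, the six parts above can be stacked in code. The sketch below is one possible way to do it, not a Vivideo API: the `Shot` class and its field names (`subject`, `action`, `setting`, `camera`, `lighting`, `style`) are labels assumed for this example, and the values are the examples given above.

```python
from dataclasses import dataclass


@dataclass
class Shot:
    """Six prompt parts; the field names are assumed labels for this sketch."""
    subject: str
    action: str
    setting: str
    camera: str
    lighting: str
    style: str

    def to_prompt(self) -> str:
        # Subject and action read as one phrase; the rest follow in order.
        parts = [
            f"{self.subject} {self.action}",
            self.setting,
            self.camera,
            self.lighting,
            self.style,
        ]
        return ", ".join(part.strip() for part in parts)


shot = Shot(
    subject="a vintage red convertible",
    action="driving along a coastal road",
    setting="Amalfi cliffs at golden hour",
    camera="aerial tracking shot, slow follow",
    lighting="warm low sun, long shadows",
    style="cinematic, shot on 35mm, warm grade",
)
print(shot.to_prompt())
```

Keeping each part in its own field makes it easy to swap a single detail, such as the camera move, without retyping the rest of the prompt.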
The vocabulary that actually changes a shot.
Copy a template, swap the [bracketed] parts for your own, and generate. 13 categories, ready to use.
Cinematic [wide/aerial] establishing shot of [location] at [time of day], [weather/atmosphere], [camera move], anamorphic, shot on film, [mood] grade — no text.
[shot size] of [character] [emotion/action] in [setting], [lighting], shallow depth of field, slow [camera move], cinematic, [film stock] — no text.
Dynamic [shot] of [subject] [fast action] through [environment], motion blur, fast [camera move], high frame rate, cinematic, dramatic lighting.
A [product] slowly rotating on a [surface], studio softbox lighting, soft reflections, seamless [color] background, macro detail, photoreal product shot — no text.
[product] being used by [person] in [real setting], natural light, shallow depth of field, candid, photoreal, [mood].
Extreme macro shot of [product detail], slow [camera move], dramatic side lighting, glistening texture, photoreal, shallow depth of field.
A [person] speaking directly to camera in [setting], natural lighting, eye-level medium shot, shallow depth of field, authentic, [tone].
Handheld vertical selfie video of a [person] excitedly showing [product] in [casual setting], phone-camera look, natural light, authentic UGC style.
[person] sitting and talking warmly in [home/office], soft window light, shallow depth of field, documentary interview framing, realistic.
Aerial drone shot sweeping over [landscape] at [time of day], [weather], volumetric light, cinematic, ultra-wide, [mood] grade.
Slow-motion close-up of [animal] [action] in [habitat], natural light, shallow depth of field, documentary, photoreal.
Macro slow-motion of [natural element], soft natural light, mesmerizing detail, cinematic, shallow depth of field.
Anime style, [character] [action] in [setting], expressive, vibrant colors, dynamic camera, studio-quality 2D animation.
Pixar-style 3D animation of [character] [action], soft global illumination, expressive, colorful, cinematic depth of field.
Sleek motion-graphics animation of [shape/logo concept], smooth easing, [color] gradient, clean and minimal, looping, modern.
Smooth gliding walkthrough of a [style] [room], bright natural light, wide-angle, real-estate showcase, photoreal, inviting.
Slow aerial reveal of a [property type] surrounded by [environment], golden hour, cinematic, luxury real-estate style.
Slow pan across [amenity/detail], warm light, shallow depth of field, premium real-estate aesthetic, photoreal.
Close-up of [dish] with [appetizing detail], soft natural light, shallow depth of field, mouth-watering food-film style.
Slow-motion macro of [liquid] being poured over [food/drink], glistening, warm light, satisfying, photoreal.
Top-down shot of [cooking action], dynamic, fresh ingredients, bright clean kitchen light, vibrant, foodie social style.
[model] wearing [outfit] walking through [setting], confident movement, fashion-editorial lighting, slow motion, cinematic, [mood].
Beauty close-up of [subject/detail], glowing skin, soft diffused light, shallow depth of field, luxury cosmetics style, photoreal.
Macro slow-motion of [fabric/texture] moving in the wind, soft light, elegant, high-fashion aesthetic.
A [device] emerging from darkness with light tracing its edges, sleek, futuristic, studio lighting, premium tech-ad style — no text.
Abstract 3D visualization of [tech concept], glowing nodes and lines, dark background, sleek, futuristic, smooth camera motion.
Close-up of hands using [device] in [setting], crisp screen, natural light, shallow depth of field, modern tech lifestyle.
Cinematic travel montage of [destination] — [landmarks/scenes], golden light, dynamic camera, vibrant, wanderlust mood.
POV walking through [place], handheld, immersive, natural light, authentic travel-vlog style.
Aerial drone orbit around [landmark] at [time of day], cinematic, ultra-wide, breathtaking, [mood] grade.
Add subtle motion: [what moves gently], slow [camera move], keep the subject and composition unchanged, natural, seamless loop.
Animate this product photo: slow [rotation/parallax], soft moving reflections, studio feel, keep the product exactly as shown.
Cinemagraph: only [element] moves while everything else stays perfectly still, [subtle motion], slow, mesmerizing loop.
A [presenter] explaining [topic] to camera in [setting], friendly, clear, medium shot, soft even light, professional.
Clean animated visual illustrating [concept], simple shapes, [brand colors], minimal, modern explainer style, smooth motion.
Step-by-step visual of [process], clear icon-style stages, bright friendly style, smooth transitions, educational.
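Swapping the [bracketed] parts can also be scripted. The following is a minimal sketch, assuming each placeholder is keyed by the exact text inside its brackets; `fill_template` is a hypothetical helper name and the fill-in values are invented for illustration.

```python
import re
from typing import Mapping

# Matches one [placeholder] and captures the text inside the brackets.
PLACEHOLDER = re.compile(r"\[([^\[\]]+)\]")


def fill_template(template: str, values: Mapping[str, str]) -> str:
    """Replace every [placeholder] with its value; fail loudly if one is missing."""
    missing = [name for name in PLACEHOLDER.findall(template) if name not in values]
    if missing:
        raise KeyError(f"no value for placeholders: {missing}")
    return PLACEHOLDER.sub(lambda match: values[match.group(1)], template)


template = (
    "Close-up of [dish] with [appetizing detail], soft natural light, "
    "shallow depth of field, mouth-watering food-film style."
)
prompt = fill_template(
    template,
    {"dish": "ramen", "appetizing detail": "steam rising from the broth"},
)
print(prompt)
```

Raising an error on a missing value means a stray [bracket] never reaches the model. A placeholder that appears twice in one template, such as [camera move], receives the same value in both places.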
The quickest wins come from avoiding these.
Name the subject, the action and the setting — the model can’t guess what “cool” looks like.
Concrete craft terms (light, lens, camera) beat a pile of vague hype adjectives.
Models render one main action per shot reliably; cramming several muddies all of them.
A short list of negative cues removes the most common AI artefacts.
Paste this at the end of any prompt to strip out the usual AI artefacts.
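For scripted prompts, appending negative cues might look like the sketch below. The helper name `with_negative_cues` is hypothetical, and the cue list is illustrative only (text, watermarks, extra fingers); it is not the page's recommended negative prompt. Some models may take negatives in a separate field instead, in which case the join step would not apply.

```python
from typing import Sequence


def with_negative_cues(prompt: str, cues: Sequence[str]) -> str:
    """Append a short 'No ...' sentence to the end of a prompt."""
    cleaned = [cue.strip() for cue in cues if cue.strip()]
    if not cleaned:
        return prompt
    # Drop trailing punctuation so the result does not end in ".." or ",.".
    base = prompt.rstrip(" .,")
    return f"{base}. No {', no '.join(cleaned)}."


# Illustrative cue list, assumed for this example.
NEGATIVE_CUES = ["text", "watermarks", "extra fingers"]

final_prompt = with_negative_cues(
    "A vintage red convertible driving along a coastal road.",
    NEGATIVE_CUES,
)
print(final_prompt)
```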
One or two clear sentences is the sweet spot. Be specific, not long — a few precise details (subject, action, camera, light) beat a wall of adjectives.
Yes. The structure is model-agnostic. On Vivideo you can run the same prompt through different models and compare — some favor realism, others motion or speed.
A negative prompt lists what to exclude (text, watermarks, extra fingers). It’s optional but cleans up results noticeably — copy the recommended one above as a starting point.
Models weight the start of the prompt most heavily and can drop later details. Move the key element earlier, or split it into its own scene.
Reuse the exact same subject description in every prompt, or use image-to-video / an avatar so the look stays locked. For products, image-to-video keeps them accurate.
Absolutely — that’s what they’re for. Copy a template, swap the [bracketed] parts for your own, and generate. Then change one thing at a time to refine.
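Two of the habits above, reusing the exact same subject description and changing one thing at a time, can be sketched in a few lines. Everything named here (`SUBJECT`, `BASE_SHOT`, `build_prompt`, `vary`) is hypothetical and only illustrates the idea; the subject is placed first because the start of the prompt is described above as carrying the most weight.

```python
# Hypothetical subject description, reused verbatim in every prompt.
SUBJECT = "a barista in a green apron with short curly hair"

BASE_SHOT = {
    "action": "pouring latte art",
    "setting": "in a sunlit corner cafe",
    "camera": "slow push-in, medium close-up",
    "lighting": "soft window light",
}


def build_prompt(subject: str, shot: dict) -> str:
    """Put the subject first, then the remaining parts in a fixed order."""
    return (
        f"{subject} {shot['action']} {shot['setting']}, "
        f"{shot['camera']}, {shot['lighting']}"
    )


def vary(base: dict, field: str, options: list) -> list:
    """Return copies of `base` that differ from it only in `field`."""
    return [{**base, field: option} for option in options]


# Test two camera choices while every other part stays fixed.
camera_tests = vary(BASE_SHOT, "camera", ["static wide shot", "handheld close-up"])
prompts = [build_prompt(SUBJECT, shot) for shot in camera_tests]
for prompt_text in prompts:
    print(prompt_text)
```

Because only one field changes between runs, any difference in the output can be traced back to that field.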
Social & ads
Hook opener (vertical)
Vertical 9:16, fast punchy [shot] of [attention-grabbing subject/action], bold motion, vibrant, energetic, social-ad style — leave headroom for captions.
Before / after reveal
Vertical, [subject] transforming from [before] to [after], satisfying transition, bright clean lighting, upbeat, social style.
Trend-style montage
Fast-cut vertical montage of [theme], punchy beats, vibrant grade, dynamic camera, trendy social-media energy.