A good AI video prompt is not a magic spell. It is a production note written clearly enough for a model to follow and specific enough for an editor to use.
Most prompt failures come from missing context: no subject details, no motion, no camera direction, no duration, no style boundaries, and no explanation of what must stay consistent. Better prompts do not need to be longer. They need to be more intentional.
The prompt formula
Use this structure: subject, action, setting, camera, lighting, style, duration, aspect ratio, audio needs, and constraints. If you are using image-to-video, include what should stay stable and what should move.
40 examples
- 1. A skincare bottle on wet stone, slow push-in, morning light, condensation, 6 seconds, vertical 9:16, label stays readable.
- 2. A founder explaining a SaaS dashboard as animated UI cards appear beside them, clean office, 30 seconds.
- 3. A real-estate listing exterior at golden hour, drone-like rise, no people, realistic, 8 seconds.
- 4. A chef plating pasta, close-up hands, steam, warm restaurant lighting, 10 seconds.
- 5. A teacher avatar explains photosynthesis with three simple animated diagrams, friendly tone, 45 seconds.
- 6. A phone case dropped onto a table, impact absorbs cleanly, macro lens, product ad style.
- 7. A before/after desk setup transformation, fast cuts, captions, creator-style vertical video.
- 8. A SaaS onboarding step: user imports a CSV, dashboard populates, cursor highlights three actions.
- 9. A fitness coach avatar explains one beginner mistake in 20 seconds, direct and non-hype tone.
- 10. A coffee shop menu board animates into three featured drinks, local Instagram Reel style.
- 11. A blog article becomes a 45-second explainer with animated headings and clean B-roll.
- 12. A customer support concept: confused user becomes relieved after using live chat, no fake testimonial.
- 13. A hotel room photo becomes a slow cinematic walkthrough, stable furniture, no layout changes.
- 14. A product unboxing sequence, hands open box naturally, packaging accurate, label unchanged.
- 15. A medical clinic avatar explains appointment preparation, calm tone, no diagnosis claims.
- 16. A restaurant lunch special with close-up sizzling sound, vertical 9:16, no extra text.
- 17. A faceless history short with map animation and archival-style generated visuals, clearly labeled as reconstruction.
- 18. A music visualizer with abstract neon waves moving to beat, no lyrics shown.
- 19. A comparison split screen: old workflow vs AI workflow, minimal icons, business style.
- 20. A city neighborhood guide, street-level shots, subtle map labels, relocation audience.
- 21. A UGC-style ad hook: creator holds product and says one surprising objection.
- 22. A product demo with three use cases in 15 seconds, captions synced to actions.
- 23. An app release-note video, three feature cards, crisp UI animation.
- 24. A language-localized avatar intro in Spanish, same brand background and pacing.
- 25. A toy product scene, bright room, parent hand shows scale, no child likeness.
- 26. A luxury watch macro shot, rotating bezel, black background, controlled reflections.
- 27. A restaurant chef introduces a new dish with subtitles and quick ingredient cuts.
- 28. A college lesson recap with animated whiteboard diagrams and quiz question close.
- 29. A B2B webinar clip repurposed into vertical highlights with speaker cutaways and captions.
- 30. A real estate agent explains “three inspection red flags” with simple visuals.
- 31. A nonprofit fundraising story using illustrated scenes, no fake beneficiaries.
- 32. A travel packing tip with top-down suitcase shots and quick text overlays.
- 33. A fashion e-commerce clip showing fabric movement in wind, label and color accurate.
- 34. A cybersecurity training avatar explains phishing in 40 seconds, plain language.
- 35. A product FAQ video answering “Will it fit?” with scale comparison.
- 36. A YouTube Shorts loop: final frame returns to the first visual.
- 37. A TikTok teardown style video: highlight why the first three seconds work.
- 38. A founder pitch video with generated B-roll of problem, solution, and result.
- 39. A dental office patient education clip about what to expect at a cleaning, no treatment promises.
- 40. A developer API demo: prompt enters terminal, video preview appears, dashboard logs request.
How to revise prompts
Do not rewrite everything at once. Change one variable: camera, action, style, duration, or constraint. Keep a simple prompt log so you learn which details actually improve outputs.
Keep a prompt notebook, including the misses

Most people copy the prompt that worked and toss the three that didn't. Flip that habit: the rejects are the lesson. A clip that came back wrong is the clearest record you have of how the model read your words, and the way it broke points straight at the fix. Watch for the usual culprits when you log a miss: the camera move you asked for that never happened, motion that stuttered or never started, something that disappeared partway through, on-screen text that came out as gibberish, a brand element that shifted, or timing that dragged or rushed.
Set up a small table with one row per attempt and these columns:
- Goal — the outcome you wanted from the clip
- Prompt — the exact wording you typed
- Inputs — any image, product shot, reference clip, voice, or brand kit you attached
- Result — what came out right and what came out wrong
- Next try — the change you made for the following generation
Twenty or thirty rows in, the table writes its own conclusions. You will see which model keeps product labels crisp, which one moves an image-to-video clip without warping, which one wobbles on faces, and which one shines on abstract or stylized scenes. A notebook built from your own clips beats a borrowed list of "perfect prompts" every time, because it is tuned to the work you actually make.
Change one thing, then re-roll
Here is the rule that saves the most renders: move exactly one big lever between generations. Swap the subject, the camera, the lighting, the style, and the length all at once and you get a different clip with no idea which edit earned the win. Isolate the variable and every re-roll teaches you something.
Work the fixes in this sequence:
- Correct anything factually or brand-wrong first.
- Sort out the composition next.
- Then deal with the motion.
- Tune the style after that.
- Save polish for the end.
The instinct to fight is the one that pushes you to perfect the look first. Plenty of people will re-roll for a more cinematic grade while the product label in the shot is still misspelled. Get the label right, then make it pretty.
A practical AI video prompt workflow
Start with one shot, not a whole video. A single prompt describes one continuous clip, so trying to cram a three-scene story into one prompt is the fastest way to get drift, morphing, and confused motion. Pick the one shot you want and write it cleanly.
Name the subject, then the action, then the camera move, then the constraints. Write that base prompt once. Generate it, look at exactly one thing that failed, and change only the variable that controls it. Re-roll, compare the two outputs side by side, and keep the winner as your new base. Repeat until the clip holds.
That is the prompt loop for a single shot:
- Subject
- Action
- Setting
- Camera and motion
- Lighting and style
- Duration and aspect ratio
- Stability rules (image-to-video)
- Avoid line
- Generate
- Revise one variable
Most prompts fail because the writer types a vague idea and hits generate, hoping the model fills in the blanks. It rarely does. Decide the shot, the motion, and the constraints in writing first, then prompt.
The pre-prompt checklist
Before you hit generate, read your prompt back against five questions:
- Is the subject described specifically enough that the model cannot guess wrong?
- Did you direct the motion and the camera, not just the look?
- Did you state the duration, aspect ratio, and any audio needs?
- For image-to-video, did you say what must stay stable and what should move?
- Did you add the "avoid" line that blocks the known failures (warped text, drifting logos, extra limbs)?
If any answer is no, fix the prompt before you spend a render on it. A clearer prompt is cheaper than a re-roll, and it gives you a result you can actually revise one variable at a time.
The prompt formula that actually helps

Use this order: subject, action, setting, camera, motion, mood, constraints, and output format. For example: “A close-up vertical product shot of a matte black travel mug on a wooden desk, steam rising slowly, morning window light, subtle push-in camera, realistic commercial style, no text, no logo distortion, 8 seconds.”
That prompt works because it tells the model what matters and what not to invent. When a result fails, revise one variable at a time. Prompting is not guessing. It is controlled iteration.
Where prompting fits in Vivideo
Vivideo gives you three ways to put these prompts to work. One-prompt generation is where the formula above pays off directly: write the subject, action, camera, and constraints, and get a draft to revise. When you would rather describe the goal than craft the prompt, the agentic AI chat can plan the shots and build the video for you, and manual mode is there when a clip needs hands-on control. Templates, brand kits, avatars, and AI voices keep the output on-brand, and API/CLI/MCP access lets you run the same prompt patterns at scale once they work.
AI video prompt examples: write for controllable motion
A strong AI video prompt does not just describe a scene. It directs time. That means the prompt should tell the model what happens first, what changes, what stays consistent, and how the camera behaves.
Use this compact structure:
[Subject] does [action] in [environment]. Camera [movement/framing]. Style is [visual style]. Keep [important object/person/detail] consistent. Avoid [known failure].Example:
A founder opens a laptop on a small cafe table and reviews a clean analytics dashboard. Camera starts over the shoulder, then slowly pushes toward the screen. Natural morning light, realistic documentary style. Keep the dashboard layout consistent and avoid unreadable text.The “avoid” line is underrated. It tells the model what failure looks like: extra fingers, warped logos, flickering faces, unreadable product labels, floating objects, unrealistic camera movement, or sudden outfit changes.
Perfect prompting is not about magic words. It is about making the model’s job easier and your review process cleaner.
Conclusion
A prompt works best when it reads like a production note, not a wish. The 40 examples above all share the same backbone: a specific subject, a directed motion, a stated duration and aspect ratio, and a clear line about what must not break. The model fills in less, so you re-roll less.
Use the formula in this guide as a checklist on every prompt you write: name the subject, direct the action and camera, set the duration and aspect ratio, lock what must stay consistent, and add the "avoid" line that blocks warped text and drifting logos. Then revise one variable at a time. That is how prompting becomes controlled iteration instead of slot-machine guessing.
If you want one place to write these prompts, generate from them, and revise variant by variant with avatars, voices, and brand kits attached, try Vivideo free at vivideo.ai.
