Calabi Labs · Guide · 2026-05-24
Use cinematic prompts, add reference images, and refine with post-processing — here's the exact workflow.
To make Midjourney video output look real, focus on three stages: prompt design, parameter tuning, and post-processing polish. Each stage adds a layer of realism that stacks into results indistinguishable from live action.
The single biggest lever is your prompt language. Vague prompts produce flat, generic output. Specific, scene-building prompts trigger Midjourney's strongest rendering pathways.
Key prompt additions for realism:
Example prompt:
> Cinematic documentary shot on ARRI Alexa, 35mm lens, shallow depth of field, overcast natural light, woman walking through a rain-slicked Tokyo alley, slight film grain, kodak portra 800, ultra-realistic, 8k --ar 16:9 --s 750 --style raw
The --style raw parameter is critical — it reduces Midjourney's default stylization, pulling output closer to photographic reference.
Parameters to always use:
--style raw — disables artistic enhancement, prioritizes accuracy over stylization--ar 16:9 or --ar 21:9 — widescreen ratios trigger cinematic rendering modes--s 250 to --s 600 — stylize at medium range; too high adds painterly artifacts--v 6.1 or --v 7 — newer models render significantly more photorealistic skin tones and texturesParameters to avoid for realism:
--s 1000+ — high stylize values add glow, oversaturation, and painterly effects--style expressive or --style diverse — these push output toward illustrationUpload a reference photo alongside your prompt using the --cref parameter. This anchors Midjourney to a specific lighting environment, camera profile, and subject appearance.
Reference image tips:
--sref (style reference) with --cref (character reference) for maximum realism control`` /imagine [your prompt] --cref [reference-url] --sref [style-reference-url] --style raw ``
This two-reference technique is one of the most underused methods for achieving convincing real-world visuals.
Raw Midjourney output has subtle digital artifacts — overly smooth skin, slightly wrong micro-reflections, soft edges — that read as AI-generated when watched in motion. Clean these up:
Primary fixes in order of impact:
If you're assembling frames into video, Midjourney's main weakness is inconsistency between frames — lighting shifts, identity drift, micro-movement artifacts.
Fix it:
--seed locked, varying only the seed offset slightly between frames--tile for seamless spatial patterns where applicable--cw 100) to lock identity across generations| Mistake | Why It Fails | Fix |
|---|---|---|
| High stylize values | Adds glow and painterly effects | Keep --s at 250–600 |
| Generic prompts | Outputs look like stock photos | Add specific lighting, lens, and material language |
| Skipping post-processing | Reveals AI artifacts on motion | Add grain and color grade every time |
| No reference images | Model defaults to stylized rendering | Always use --cref with --style raw |
| Single-frame generation | Inconsistency between frames | Lock seed and use character reference |
`` /imagine [specific scene description] cinematic documentary style shot on ARRI Alexa 35mm natural overcast light slight film grain ultra photorealistic --style raw --ar 16:9 --s 400 --v 7 ``
Then feed output into your editor: add real grain → mild color grade → ambient audio → stabilize.
That three-step post workflow does more for perceived realism than any prompt trick.
The gap between "AI video" and "looks real" is mostly post-processing. Midjourney gives you raw photorealistic frames — the real work is in the polish after generation. Start with precise prompts, lock your parameters, and never skip the grain and grade layer.
Try Calabi free at calabilabs.com — 3 cleans, no card.