Calabi Labs · Guide · 2026-06-18
Three major research efforts — Kapwing's 2026 AI Diversity Report, a peer-reviewed study published in Nature Scientific Reports, and a Brookings Institution analysis — all reach the same conclusion: AI video generators exhibit significant race and gender bias, and the problem is not merely visual. The bias lives in the metadata layer too, which is exactly what platforms scan to detect AI content in the first place.
Kapwing's AI Diversity Report, released January 2026, analyzed a large sample of videos from the leading AI video platforms — Google's Veo 3, OpenAI's Sora 2, Kling, and Hailuo Claude — to examine gender and racial representation across occupation types. The numbers are stark. Women appeared in high-paying jobs in only 8.67% of AI-generated workplace videos. In low-paying jobs, that figure jumped to 53.73%. The disparity is not subtle — it is a systematic skew baked into what the models consider "normal" to generate.
A Nature Scientific Reports study (April 2025) found something equally important: inclusive AI-generated faces measurably reduce people's existing racial and gender biases, while non-inclusive faces reinforce them. This means the problem compounds — AI models that overrepresent one group in professional roles and underrepresent others shape real-world perception, not just digital content.
The Brookings Institution analysis (April 2024) of AI image generators found a dual failure: models either reproduce existing stereotypes with a lack of diversity, or overcorrect in ways that feel artificial and inauthentic. Both outcomes are harmful. UNESCO's separate study (July 2024) confirmed that large language models and generative AI tools produce homophobia and racial stereotyping as a baseline behavior, not an edge case.
Here is what most people miss: bias in AI-generated content operates on two layers simultaneously. There is the visible layer — who appears on screen, in what roles, with what skin tones. And there is an invisible layer — the metadata that travels with the file, identifying it as AI-generated at a forensic level.
Platforms like Instagram, TikTok, YouTube, and Reddit do not flag content by watching it. They scan files automatically using forensic tools — the same tools newsrooms and researchers use. Specifically, they look for:
DigitalSourceType: trainedAlgorithmicMedia explicitly label a file as AI-generated in its metadata structure.When a creator posts an AI-generated video showing diverse representation — exactly the kind of content the research says the field needs more of — the platform may still flag it as AI-generated because of these invisible signals, regardless of what appears on screen.
If you have tried to post an AI video only to have it flagged, you may have attempted some common workarounds:
The core issue is that the detection metadata — C2PA, XMP flags, encoder fingerprints — is structural, not visual. Cropping a frame does not change the file's manifest. Re-encoding does not automatically purge XMP namespaces. Only targeted metadata stripping addresses it.
Calabi is a one-pass web tool that strips the detection metadata from AI-generated video and image files and replaces it with authentic phone-capture identity. Here is how the process works:
DigitalSourceType: trainedAlgorithmicMedia XMP flag, and encoder fingerprints like Lavc and x264 SEI from video bitstreams. A raw AI export carrying 144 metadata tags is reduced to roughly 94 neutral structural tags.This matters for creators specifically because platforms like Instagram, TikTok, YouTube, and Reddit all run automated scans within seconds of upload. A diverse, well-crafted AI-generated video can still be flagged based on metadata alone. Calabi addresses the detection layer rather than the visual content.
Does stripping metadata change how the video looks?
No. Calabi works entirely on invisible file signals and metadata. The visual content — the people, scenes, and diversity represented on screen — is untouched. This is not an editing tool; it operates at the forensic metadata layer.
I already crop my AI videos before posting. Is that enough?
Cropping removes visible elements from the frame, but the C2PA manifest, XMP flags, and encoder fingerprints are embedded in the file structure, not the pixels. Those signals survive cropping and will be detected by platform scanners. Targeted metadata stripping is required to remove them.
Will this guarantee my video is not flagged on TikTok or Instagram?
No tool can guarantee platform outcomes, which vary by platform and source model. Calabi fully removes the metadata detection layer — C2PA, XMP flags, encoder fingerprints — that automated scanners read. Visible content policies (terms about AI-generated content disclosure) are separate and must be managed independently.
Try Calabi free at calabilabs.com — 10 cleans, no card.