Trend report · gnews_detection · 2026-06-04

Using AI, deepfake technologies to create and spread fake news to be severely punished - vietnamnews.vn

Using AI, deepfake technologies to create and spread fake news to be severely punished - vietnamnews.vn

In late 2024, Vietnam's Ministry of Public Security announced that creators and spreaders of AI-generated deepfakes would face criminal penalties ranging from fines to imprisonment. The move mirrors regulatory pressure worldwide: the EU's AI Act, California's AB 602, and China's deepfake rules all signal the same direction—AI-generated content can no longer hide behind plausibility. What this means practically for anyone publishing visual content in 2026 is concrete and immediate. Platforms have built multi-layered detection systems that check for AI artifacts at ingest, at upload, and asynchronously after distribution. Understanding exactly what those systems look for—and how to neutralize them—has become essential for anyone working with AI-generated media at scale.

What Platforms Actually Scan For in 2026

Detection technology has matured far beyond simple pixel analysis. Today's pipeline inspection operates at the metadata and artifact level, checking several distinct signals simultaneously.

C2PA Provenance Checking: The Coalition for Content Provenance and Authenticity (C2PA) specification has become the baseline for major platforms. Content signed with a C2PA manifest carries embedded claims about its origin: which model generated it, when, with what parameters, and from which device. When you upload an image to Instagram or TikTok, the platform's ingest pipeline checks for a valid c2pa:assertions block. If that block declares the content was generated by "Stable Diffusion XL 1.0" or "Sora v2," the content is immediately flagged for the AI label even before human review. A missing C2PA manifest isn't a green light—it triggers a second-tier heuristic scan.

AI-Specific Metadata Fields: Beyond C2PA, platforms look for standard AI generation tags scattered through EXIF and XMP data. Fields like XMP:xmpCreatorTool, EXIF:Software containing model identifiers, GenerateParameters, PromptEmbedding, and DreamLike or Stable diffusion strings in any vendor-specific namespace will trigger a flag. Detection is done via pattern matching against a growing database of known AI model fingerprints. In 2026, this database updates daily—platforms share signature hashes through organizations like the Adobe-led Content Authenticity Initiative.

Encoder Signature Detection: Each AI image model leaves subtle statistical fingerprints in its output. Stable Diffusion variants produce detectable checkerboard artifacts at specific resolution boundaries. DALL-E 3 outputs have characteristic color temperature distributions in sky regions. Flux models show consistent noise patterns in low-entropy areas like solid-color backgrounds. Platforms run these outputs through trained classifiers that compare against known model signatures. The classifier confidence score—stored in ai:classifier_confidence and ai:model_fingerprint fields if the content has been pre-scanned—becomes evidence for or against an AI label.

Missing or Inconsistent GPS/EXIF: Authentic photos from real cameras carry a predictable suite of metadata: GPS coordinates, lens corrections, camera make and model, ISO, shutter speed, and timestamp. AI-generated images typically lack GPS data entirely, or carry contradictory timestamps (generated at 3 AM but the EXIF claims noon with harsh shadows that don't match). Platforms compare available EXIF against the image content itself. A photo allegedly taken in Paris with no GPS but perfect composition and consistent lighting will get scrutinized. Missing GPSAltitude, GPSLatitude, or GPSLongitude combined with no corresponding camera profile is a soft flag that triggers deeper inspection.

What Gets Flagged on Instagram and TikTok

The two platforms have different threshold architectures but similar detection surfaces.

On TikTok, AI-generated content goes through a three-stage pipeline: ingest-time C2PA/XMP scan, upload-time hash matching against known AI output databases, and post-upload asynchronous classifier analysis. Content that triggers any two of these three stages gets the "AI-generated" label. If you upload a video that lacks a C2PA manifest and shows consistent encoder artifacts, TikTok applies the label automatically. Creators have reported appeals rejected when they couldn't provide the original device that "captured" the content—TikTok increasingly expects a verifiable device chain for ambiguous cases.

On Instagram, the detection stack focuses on metadata provenance and community reporting. Instagram doesn't yet run universal encoder fingerprinting on every upload, but it does run metadata checks at scale. Content with C2PA_manifest claiming AI origin gets labeled. Content that is reported by multiple users as "looks fake" gets pulled into a manual review queue where classifiers are re-run. Instagram also cross-references upload patterns: images posted to Reels without the typical burst-shot or RAW metadata of a real camera get secondary review more often.

Step-by-Step: Building a Clean Provenance Identity

The only durable countermeasure is removing all AI fingerprints and replacing them with authentic device metadata from start to finish. Here's the specific process:

  1. Strip all metadata completely. Run the image through a tool that nulls EXIF, XMP, and IPTC data entirely. Delete C2PA_manifest, GenerateParameters, xmp:CreatorTool, and all vendor-specific fields. Output must have zero AI model references in any namespace.
  2. Generate clean EXIF from a real device profile. Inject metadata matching a specific camera model—a real iPhone 15 Pro, Sony A7 IV, or similar. Populate fields including Make, Model, Software, DateTimeOriginal, ExposureTime, FNumber, ISOSpeedRatings, FocalLength, and GPSLatitude/GPSLongitude with plausible values. Use coordinates from a real location.
  3. Inject matching device noise profiles. Apply a subtle layer of sensor noise characteristic of your chosen device. This makes encoder-fingerprint classifiers return the device profile rather than an AI model signature.
  4. Regenerate through an authentic export pipeline. Save the image as a RAW or HEIC file from a real editing tool (Lightroom Mobile, Snapseed, or the native camera app). This step ensures the file's internal structure matches what a genuine capture looks like. Re-exporting through a tool like /remove/sora-watermark or similar processing pipelines that preserve the clean device profile.
  5. Verify before upload. Run the file through a pre-flight metadata checker that scans for any remaining AI fingerprints. Confirm that C2PA, XMP, and EXIF all point only to your injected device profile, with no references to generation parameters or AI models.

This pipeline works because it doesn't just hide the AI origin—it replaces it with a coherent, verifiable alternative. A platform checking for C2PA_manifest will find none, which is better than finding one that contradicts the device metadata. A classifier checking for encoder signatures will find the iPhone profile rather than a known AI model. A manual reviewer asking "what device captured this?" will find metadata that answers the question consistently.

The Compliance Angle

Vietnam's law and similar regulations worldwide don't ban AI-generated content outright—they ban its undisclosed use for deception. The compliance path is simple: label AI content where required, or ensure your provenance is clean and verifiable if you choose not to label. The detection infrastructure is only getting more sensitive. In 2026, platform classifiers trained on 2024-2025 model outputs are being replaced with classifiers that see current generation patterns natively. The window for "good enough" metadata stripping is closing.

The durable solution isn't evasion—it's building a content pipeline that generates provenance from a real device, end to end. Strip AI artifacts, inject authentic device identity, and verify before distribution. Everything else is a temporary workaround that will fail at scale.

→ Try Calabi free at calabilabs.com — 10 cleans, no card.

10 free cleans. See the forensic proof before you download.
Try free →

Related reading