Trend report · hn_show · 2026-06-12
The Hacker News community is buzzing about a uncomfortable truth: "Co-Authored-By" labels on AI-generated code are largely theatrical. The attribution exists in metadata, but metadata is trivially stripped, rewritten, or spoofed. What HN user @rduffy exposed in Co-Authored-By Is a Lie applies far beyond code commits—it exposes the fundamental weakness in every platform's AI detection regime. If you want your AI-assisted content to survive 2026's increasingly aggressive scanners, you need to understand exactly what they're looking for and how to beat them.
Modern AI detection operates in layers. Each layer checks a different signal, and a piece of content fails if any layer flags it. Here's what's actually running under the hood.
The Coalition for Content Provenance and Authenticity standard has moved from draft to deployment. Major platforms now parse C2PA metadata embedded in JUMBF boxes. The critical fields:
digital_source_type set to "http://cv.iptc.org/newscodes/digitalSourceType/trainedAlgorithmicMedia" is a death sentence.Instagram and TikTok parse C2PA on upload. If your image carries an Adobe Firefly provenance claim, expect an immediate label: "AI-generated" in small gray text. Some accounts get throttled; others get shadowbanned for "synthetic media."
Beyond C2PA, each AI model leaves distinctive artifacts. These aren't official standards—they're machine learning fingerprints that detection models have learned to recognize:
Here's a subtle but critical signal: real photos have GPS coordinates or, failing that, timezone-inferred location data. They have:
A "photo" with no GPS IFD, no DeviceMake, and DateTimeOriginal in UTC (rather than a device-set timezone) looks like a rendered frame, not a captured image.
Based on platform enforcement patterns documented across 2024-2025:
digital_source_type indicating AI generation. Applies "AI info" labels. For repeated offenders or "misleading synthetic media," reduces reach by 40-60%.Metadata stripping alone doesn't work. You strip the Firefly provenance claim, but you're left with a file that has no metadata—the absence itself is suspicious. The solution requires two steps in sequence.
JUMBF byte sequences after running your stripper.The "Co-Authored-By is a Lie" analysis on HN showed that removing attribution is easy—but a file with no identity is more suspicious than one with a clean AI label. Platforms have learned this. They're not just looking for "AI present"—they're looking for "plausible natural origin." A Canon EOS R6 image with GPS, realistic timestamps, and standard LensModel metadata passes the smell test. The same content with zero metadata does not.
Phone identity injection works because it shifts the question from "Is this AI?" to "Does this look like a real photo from a real device?" A properly injected device identity makes the answer yes—even when the underlying image is AI-generated.
The arms race continues. C2PA adoption grows. Detection models sharpen. But right now, in 2026, the gap between "AI content" and "plausibly natural content" is bridgeable. The window is open. How you traverse it determines whether your content thrives or gets labeled into irrelevance.
→ Try Calabi free at calabilabs.com — 10 cleans, no card.