Nothing You See Online Can Be Trusted—And AI Is Making It Worse

In January 2024, a photograph began circulating on social media showing an explosion at the Pentagon. The image was visually convincing: billowing smoke, emergency vehicles, a crowd of onlookers. It appeared on accounts with hundreds of thousands of followers. Within minutes, it had been shared millions of times. The stock market briefly dipped. It was, of course, entirely fake—generated by an AI image synthesis system and spread without verification by accounts that may have had financial motivations to manipulate markets. The incident was resolved quickly, the fake was debunked, and the market recovered. But the warning it sent was received by very few people: the era of AI-generated visual disinformation is not coming. It is already here.

The term "deepfake"—a portmanteau of "deep learning" and "fake"—was coined in 2017 when an anonymous Reddit user began posting AI-synthesized faces superimposed on pornographic videos, primarily targeting celebrities. The technology has advanced at a pace that makes that crude original look like cave painting. By 2024, systems like Midjourney Version 6, DALL-E 3, and Stable Diffusion XL could generate photorealistic images from text prompts of arbitrary complexity. Sora, released by OpenAI in February 2024, demonstrated the ability to generate high-fidelity videos of complex scenes from text descriptions—opening capabilities that had previously existed only in science fiction. A video of a CEO announcing unexpected layoffs, a politician confessing to crimes they never committed, a financial regulator revealing market manipulation—any of these can now be fabricated with a consumer laptop and no specialized technical skills.

The Detection Arms Race

The response to synthetic media has spawned what security researchers call an "arms race"—a continuous competition between generation systems that make fakes more convincing and detection systems that try to identify them. This competition has a structural asymmetry that favors the generators: it is generally easier to create a convincing fake than to prove that a given piece of media is fake. Detection systems must find universal tells—the fingerprints left by generation models—while generation systems are specifically trained to eliminate those fingerprints.

The current state of the art in deepfake detection relies on several distinct technical approaches. The first is frequency analysis—the observation that AI image generators tend to produce artifacts in the high-frequency Fourier components of images that are subtly different from natural photographs. When you decompose a real photograph into its frequency components, the distribution follows predictable statistical patterns that emerge from the physics of real cameras and optics. AI-generated images, synthesized from learned probability distributions rather than physical processes, tend to have frequency distributions that deviate from these patterns in ways that are detectable by trained classifiers.

The second approach focuses on physiological signals. Many deepfake videos of real people involve face swapping, which often fails to fully replicate the natural patterns of blinking, pulse-induced skin color changes, and fine blood vessel patterns in the eyes. Detection systems trained to analyze these signals have achieved detection rates above 90% on standard benchmark datasets. However, these systems degrade significantly when evaluated on "in the wild" videos—compressed, resized, edited, and otherwise degraded in the process of being shared on social media.

The third approach uses watermarking analysis. Several AI generation companies have committed to embedding invisible watermarks in AI-generated content—statistical patterns that allow the content to be identified as synthetic even after processing. The problem is that there is no technical mechanism forcing all generators to watermark their outputs, and watermarks can be removed by basic image processing operations. The approach provides useful provenance tracking for honest actors but does not address malicious actors who deliberately avoid watermarking.

The Deepfake Detection Challenge: Industry Benchmark Performance

Comprehensive evaluations of deepfake detection systems are conducted regularly by academic groups and, increasingly, by government agencies. The most rigorous benchmarks use "in the wild" datasets—collections of real and fake videos scraped from the internet as they actually appear in social media contexts, with compression, resizing, and editing artifacts included. A 2025 benchmark conducted by NIST (National Institute of Standards and Technology) evaluated 12 leading deepfake detection systems against a standardized challenge set. Top-performing systems achieved approximately 87% accuracy on the full dataset, with performance dropping to 62% on the most challenging subset—videos that had been significantly compressed and edited before reaching the test set.

These numbers reveal a sobering reality: current deepfake detection technology is good enough to be useful as a first line of defense, but not reliable enough to be the sole determinant of authenticity. A journalist or fact-checker using these tools would correctly identify the majority of fakes but would miss a significant minority—and missing a sophisticated fake that appears authentic can be more damaging than identifying a genuine piece of media as suspicious, because the false negative creates false confidence in disinformation.

The Dark Table: Deepfake Detection Methods Compared

Method	Best For	Accuracy (Lab)	Accuracy (In-The-Wild)	Key Limitation
Frequency/Fourier Analysis	AI-generated images	91–94%	72–78%	Degrades with compression
Physiological Signal Detection	Face swap videos	88–93%	65–75%	Fails on high-quality recent models
Caption/Metadata Analysis	Provenance tracking	85–90%	60–70%	Metadata stripped by social media
Behavioral Biometrics	Speaker verification	89–95%	75–82%	Requires reference samples
Digital Provenance (C2PA)	Chain of custody	N/A (structural)	Depends on adoption	Requires creator cooperation
Ensemble Methods	General-purpose	90–95%	80–87%	Computationally expensive

Adobe's Content Authenticity Initiative: Betting on Provenance

Adobe, whose Photoshop software is both a casualty of the synthetic media revolution and a potential solution provider, has invested heavily in the Content Authenticity Initiative (CAI)—a coalition now numbering over 2,000 members across media, technology, and academic organizations. The CAI's core technology is the Content Credentials standard (now formalized as C2PA, the Coalition for Content Provenance and Authenticity), which embeds cryptographic metadata into digital files at the moment of creation, recording information about the editing history, authorship, and creation process of the content.

Under the C2PA framework, a photograph taken on a smartphone would carry metadata indicating: who took it, when, with what device, and what software was used to process it. If that photo is edited in Photoshop, the editing software would update the metadata to reflect the changes. The result is a verifiable "nutrition label" for digital content—viewable through a free browser extension and mobile app called Check Your Work—that allows anyone to see the complete provenance of an image, from capture to final publication.

The critical limitation of C2PA is adoption. For the system to work, it must be implemented at every point in the creation and distribution pipeline—from the camera sensor in a smartphone to the social media platform that hosts the final image. Apple, Adobe, Microsoft, Google, and most major camera manufacturers have committed to implementing C2PA in their products, and the first C2PA-enabled smartphones began shipping in late 2023. But social media platforms have been slower to adopt, and the existing base of non-C2PA-enabled devices and software remains enormous. The framework is the right long-term solution but will take years to reach the coverage needed to be truly effective.

"We are not going to win the detection arms race by building better classifiers. Every time we identify a new artifact, the generators learn to eliminate it. The real solution is provenance—proving what is authentic, not just flagging what is fake. We need a chain of custody for digital content, the same way we have for physical evidence." — Andy Parsons, Senior Director, Adobe Content Authenticity Initiative

The Societal Impact: What Deepfakes Actually Break

The public discourse around deepfakes tends to focus on the most dramatic scenarios: fake videos of world leaders starting wars, fabricated evidence in court, celebrity pornographic content. These scenarios are real risks and deserve serious attention. But the more pervasive damage from synthetic media may come from a subtler phenomenon: the general corrosion of evidentiary trust. If any piece of media can potentially be fake, the default assumption shifts from "authentic until proven otherwise" to "suspicious until proven otherwise"—and this epistemic shift has consequences that go far beyond any individual deepfake.

Researchers at UC Berkeley and MIT have documented what they call the "liar's dividend"—the observation that the mere existence of deepfake technology gives political figures, corporate executives, and other public figures a ready-made excuse to dismiss authentic video evidence of their own words or actions. "That's a deepfake," a public figure can now say, and the statement creates enough doubt to be valuable even if it is false. The New York Times reported in 2024 that legal teams for several high-profile defendants facing credible video evidence had explicitly explored the deepfake defense—not because the videos were actually fake, but because the defense was available and created reasonable doubt.

The psychological impact is also measurable. A 2024 survey by the Reuters Institute found that 73% of news consumers in the United States said they were more skeptical of video news content than they had been three years earlier. Among that group, 41% said their increased skepticism had made them more likely to dismiss accurate news content—a phenomenon researchers call "truth skepticism" that benefits bad actors by making audiences distrust genuine reporting alongside fakes. The weaponization of doubt is, perhaps, the deepest harm that synthetic media technology enables.

Regulatory Responses: Piecemeal and Reactive

Legislative responses to synthetic media have been fragmented and inconsistent. China's 2022 regulations on deep synthesis technology require AI-generated content to be labeled, mandate consent from individuals before their likeness can be synthesized, and prohibit the creation of fake news. The EU's AI Act addresses synthetic media as part of its broader AI governance framework, requiring transparency disclosures for AI-generated content. In the United States, no comprehensive federal legislation exists as of mid-2026, though several states have passed targeted laws: California prohibits deepfake political ads within 60 days of an election, Texas criminalized deepfake content intended to influence elections, and New York has specific laws targeting deepfake pornography without consent.

The gap between the pace of technology development and the pace of legislation is not surprising—it is a structural feature of democratic governance in a technology-driven world. But the gap is wider in synthetic media than in almost any other AI application area, because the technology is advancing so rapidly that any specific regulatory provision risks becoming obsolete before it is even fully implemented. The most effective responses are likely to come from industry self-regulation combined with technical standards—platform-level policies that require synthetic content to be labeled, provenance standards that allow authenticity verification, and detection tools that are freely available to journalists, fact-checkers, and the public.

The photograph of the Pentagon explosion was taken down within 30 minutes of its first appearance on Twitter. That 30 minutes was long enough for it to reach millions of people, briefly move markets, and demonstrate how quickly a single convincing fake can propagate through a connected world. In the next major synthetic media incident—and there will be a next one—the response time may be the only variable that matters. Everything else is infrastructure we need to build before, not after, the crisis hits.