Manufacturing Reality: When Seeing Is No Longer Believing
An overview of deepfakes, how to detect them, and implications for society at large
When explicit AI-generated images of Taylor Swift flooded social media earlier this year, it sparked immediate Congressional attention and public outrage. But while celebrity deepfakes capture headlines, they represent only the visible surface of a much deeper technological revolution. The real story – and the real danger – lies in the thousands of synthetic media incidents that never make the news.
Consider what happened in Singapore last month: over 100 public servants received blackmail attempts using deepfake images. These weren't celebrities or political figures, just ordinary people targeted with extraordinary technology.
This is where the true threat of deepfakes reveals itself – not in the high-profile cases that draw media attention, but in the quiet, everyday incidents that are increasingly reshaping our digital reality.
The numbers paint a startling picture. Since 2019, we've witnessed a 550% increase in deepfake creation, with synthetic media now accounting for 7% of all fraud attempts according to a recent study.
But these statistics only hint at the revolutionary change taking place: the democratization of sophisticated media manipulation.
Evolution of deepfakes
Gone are the days when video manipulation required expensive software and years of expertise. Today's deepfake technology runs in web browsers, often for free or at minimal cost, accessible to anyone with basic computer skills.
What once demanded hours of work from skilled professionals can now be accomplished in minutes by amateurs. This democratization of synthetic media creation represents one of the most significant shifts in how we produce and consume digital content since the advent of social media.
The technology behind these developments has evolved through several distinct generations. The early days of deepfakes, around 2017, produced crude face-swaps with obvious artifacts.
By 2019, we saw improved facial mapping and the emergence of synthetic audio. Today's technology enables full-body manipulation, real-time voice synthesis, and environmental generation that can fool even trained observers.
How Deepfakes Actually Work: A Non-Technical Explanation
Think of deepfake creation like having an artist and an art critic in the same room. One is trying to create forgeries, while the other is attempting to spot them. They repeat this cycle of creation and critique until the forger becomes so good that the critic cannot tell the difference.
This is essentially how Generative Adversarial Networks (GANs) work, though GANs are not the main method for generating deepfakes. Modern deepfake tools typically use one of two approaches: encoder-decoder pairs or a first-order motion model.
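Before moving on, here's what that artist-and-critic cycle looks like as code: a minimal sketch of one GAN training step in PyTorch. The network sizes, learning rates, and the random stand-in batch are all illustrative placeholders, not drawn from any real deepfake tool.

```python
import torch
import torch.nn as nn

latent_dim, image_dim = 64, 28 * 28  # toy sizes, purely illustrative

# The "artist": maps random noise to a fake image.
generator = nn.Sequential(
    nn.Linear(latent_dim, 256), nn.ReLU(),
    nn.Linear(256, image_dim), nn.Tanh(),
)
# The "critic": scores an image, higher meaning "looks real".
discriminator = nn.Sequential(
    nn.Linear(image_dim, 256), nn.LeakyReLU(0.2),
    nn.Linear(256, 1),
)

g_opt = torch.optim.Adam(generator.parameters(), lr=2e-4)
d_opt = torch.optim.Adam(discriminator.parameters(), lr=2e-4)
loss_fn = nn.BCEWithLogitsLoss()

def training_step(real_images: torch.Tensor) -> None:
    batch = real_images.size(0)
    noise = torch.randn(batch, latent_dim)

    # 1) Critic's turn: learn to score real images high, fakes low.
    fakes = generator(noise).detach()
    d_loss = (loss_fn(discriminator(real_images), torch.ones(batch, 1))
              + loss_fn(discriminator(fakes), torch.zeros(batch, 1)))
    d_opt.zero_grad(); d_loss.backward(); d_opt.step()

    # 2) Artist's turn: adjust the generator so the critic scores
    #    its fakes as real. Repeat until the critic can't tell.
    g_loss = loss_fn(discriminator(generator(noise)), torch.ones(batch, 1))
    g_opt.zero_grad(); g_loss.backward(); g_opt.step()

training_step(torch.randn(32, image_dim))  # random stand-in for real data
```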
First, we'll discuss encoder-decoder pairs.
Think about a cat. Immediately, you form a mental image based on the cats you've seen in the past. Now imagine teaching a computer to understand the essence of a face the same way you understand the essence of a cat. When you think “cat,” your brain constructs an image from its accumulated understanding of cat-ness; these systems try to operate in the same way.
The encoder compresses a face into a compact set of latent features, and the decoder reconstructs the face from that latent version. In deepfake tools, the encoder and decoder are typically convolutional neural networks, trained on large collections of source and target images until the reconstructions are convincing. The swap works by training one shared encoder with two decoders, one per person: encode person A's face, decode it with person B's decoder, and you get person B rendered with person A's pose and expression.
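Here's a minimal sketch of that shared-encoder, two-decoder structure in PyTorch. The flattened 64x64 inputs and layer sizes are assumptions for brevity; production tools use deep convolutional networks on aligned face crops, but the swap mechanics are the same.

```python
import torch
import torch.nn as nn

face_dim, latent_dim = 64 * 64 * 3, 128  # flattened 64x64 RGB crop (assumed)

# One shared encoder learns a person-agnostic "essence of a face":
# pose, expression, lighting.
encoder = nn.Sequential(
    nn.Linear(face_dim, 512), nn.ReLU(),
    nn.Linear(512, latent_dim),
)

def make_decoder() -> nn.Module:
    # Each decoder learns to render exactly one person's identity.
    return nn.Sequential(
        nn.Linear(latent_dim, 512), nn.ReLU(),
        nn.Linear(512, face_dim), nn.Tanh(),
    )

decoder_a = make_decoder()  # trained only on faces of person A
decoder_b = make_decoder()  # trained only on faces of person B

def swap(face_of_a: torch.Tensor) -> torch.Tensor:
    # The trick: encode A's face, but decode with B's decoder.
    # The latent code carries A's pose and expression; decoder_b
    # renders them with B's identity. That is the face swap.
    return decoder_b(encoder(face_of_a))

fake = swap(torch.randn(1, face_dim))  # untrained demo on random input
```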
Next, the first-order motion model.
If you’ve ever seen or used a Snapchat filter, you’ve seen this in action.
The system tracks key features of an image, say a face, and maps them onto something else.
A keypoint-detection network is trained on hours of real video footage, in this case of faces or people, and learns to locate elements such as the eyes, mouth, teeth, ears, and eyebrows. Since a video is just a sequence of images (think: frames), the model can “photoshop” each frame in turn: it extracts the motion of those keypoints from a driving video and uses that motion to animate a single still source image.
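Here's a simplified sketch of that keypoint-tracking loop using OpenCV and MediaPipe's face-mesh detector. The actual warping network is the hard part and is stubbed out below as apply_motion, a hypothetical placeholder for a trained model.

```python
import cv2
import mediapipe as mp

# MediaPipe's face mesh gives us per-frame facial keypoints.
face_mesh = mp.solutions.face_mesh.FaceMesh(static_image_mode=False)

def keypoints(frame):
    """Return normalized (x, y) facial landmarks for one BGR frame."""
    result = face_mesh.process(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
    if not result.multi_face_landmarks:
        return None
    return [(p.x, p.y) for p in result.multi_face_landmarks[0].landmark]

def apply_motion(source_image, source_kp, driving_kp):
    # Hypothetical stand-in for the trained warping network, which
    # moves the source image's keypoints to match the driving ones.
    raise NotImplementedError("placeholder for a trained model")

def animate(source_image, driving_video_path):
    """Re-animate one still image with motion from a driving video."""
    source_kp = keypoints(source_image)
    video = cv2.VideoCapture(driving_video_path)
    output_frames = []
    while True:
        ok, frame = video.read()
        if not ok:
            break
        driving_kp = keypoints(frame)
        if driving_kp is not None:
            output_frames.append(apply_motion(source_image, source_kp, driving_kp))
    video.release()
    return output_frames
```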
Business implications and real-world consequences
The business implications of these advances extend far beyond entertainment and social media. Financial institutions face new forms of fraud where synthetic video conferences feature perfect imitations of executives authorizing transfers. Customer service centers struggle with voice-cloned authorization calls that pass traditional security checks. Marketing departments confront waves of AI-generated review campaigns that can make or break a product launch.
Corporate security teams now grapple with increasingly sophisticated social engineering attacks.
Imagine receiving a video call from your CEO asking you to handle an urgent wire transfer, complete with their familiar mannerisms and voice patterns. How would you verify its authenticity under time pressure? This isn't a hypothetical scenario – such attacks have already cost companies millions.
Detection Strategies: A Multi-Layered Approach
The challenge of detecting deepfakes grows more complex as the technology improves. Early detection methods focused on obvious visual artifacts: unnatural blinking patterns, lighting inconsistencies, or asymmetric facial features.
While these indicators remain useful, modern deepfakes have evolved to avoid such obvious tells. Today's detection requires a multi-layered approach combining visual analysis, metadata examination, and behavioral pattern recognition.
Borrowing from an excellent research paper put out by the MIT Media Lab, here's a breakdown of ways to detect deepfakes.
Visual Analysis
Look for these key indicators in images and video (a code sketch of one temporal check follows the list):
Temporal Inconsistencies:
Unnatural jump cuts
Perspective shifts
Frame rate variations
Physical Anomalies:
Shadow inconsistencies
Lighting mismatches
Reflection errors
Blinking patterns
Facial asymmetry
Contextual Errors:
Environmental inconsistencies
Physical impossibilities
Cultural anachronisms or faux pas
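As a concrete example of one temporal check from the list above, here's a minimal OpenCV sketch that flags abrupt jump cuts by measuring how much each frame differs from the previous one. The threshold is an illustrative guess, not a tuned value.

```python
import cv2
import numpy as np

JUMP_THRESHOLD = 40.0  # mean absolute pixel difference (0-255 scale), a guess

def find_jump_cuts(video_path: str) -> list[int]:
    """Return frame indices where the image changes suspiciously fast."""
    video = cv2.VideoCapture(video_path)
    suspects, prev, index = [], None, 0
    while True:
        ok, frame = video.read()
        if not ok:
            break
        gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY).astype(np.float32)
        if prev is not None:
            # A large mean difference between consecutive frames suggests
            # a hard cut or a splice worth inspecting by eye.
            if np.abs(gray - prev).mean() > JUMP_THRESHOLD:
                suspects.append(index)
        prev = gray
        index += 1
    video.release()
    return suspects
```

A check like this only surfaces candidates for human review; smooth, well-made fakes will sail past it, which is why it belongs in a layered approach rather than standing alone.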
Technical Detection Methods
Going beyond visual inspection, there are platform tools and forensic techniques that can confirm or challenge your initial assessment (a metadata-reading sketch follows the list):
Metadata Analysis:
Digital fingerprinting
Compression artifacts
File structure analysis
AI-powered Detection:
Pattern recognition
Behavioral analysis
Anomaly detection
Platform-specific Tools:
Content authentication systems
Digital watermarking
Blockchain verification
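As a starting point for metadata analysis, here's a small sketch that reads EXIF tags with Pillow. Absent metadata isn't proof of a fake (platforms routinely strip it), but inconsistent camera, software, or timestamp fields are worth noting. The file name is hypothetical.

```python
from PIL import Image, ExifTags

def read_exif(path: str) -> dict:
    """Return human-readable EXIF tags, or an empty dict if stripped."""
    exif = Image.open(path).getexif()
    return {ExifTags.TAGS.get(tag_id, tag_id): value
            for tag_id, value in exif.items()}

tags = read_exif("suspect.jpg")  # hypothetical file name
if not tags:
    print("No EXIF metadata: common for screenshots and AI-generated images.")
else:
    # Mismatches here (e.g., editing software on a "straight from
    # camera" photo) are leads worth following up.
    for key in ("Make", "Model", "Software", "DateTime"):
        print(key, "=", tags.get(key, "<absent>"))
```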
Deepfakes are here to stay – what then?
On the human side, we can try our best to spot deepfakes as we come across them, but focusing solely on detection misses the larger point. Humans are at their best spotting unnatural images and video, since we have a lifetime of visual experience to draw on.
When it comes to voice deepfakes and other social engineering type problems, our intuition is far less reliable. The real solution isn't better detection technology – it's fundamentally rethinking how we establish and verify truth in digital communications.
Organizations are beginning to implement multi-factor verification systems that don't rely solely on what we see and hear. Some have introduced time-delayed authorization for significant actions, while others maintain separate communication channels for critical decisions.
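As a toy illustration of what time-delayed authorization might look like, here's a sketch in which high-risk requests sit in a cooling-off window and also require confirmation over a separate channel before they can execute. The window length and the wire-transfer example are invented for illustration.

```python
import time
import uuid
from dataclasses import dataclass, field

DELAY_SECONDS = 4 * 60 * 60  # e.g., a four-hour cooling-off window (assumed)

@dataclass
class PendingAction:
    description: str
    requested_at: float = field(default_factory=time.time)
    approved_out_of_band: bool = False  # flipped via a separate channel
    token: str = field(default_factory=lambda: uuid.uuid4().hex)

def may_execute(action: PendingAction) -> bool:
    # Both conditions must hold: the delay has elapsed AND a second
    # channel (phone callback, in-person check) confirmed the request.
    waited_long_enough = time.time() - action.requested_at >= DELAY_SECONDS
    return waited_long_enough and action.approved_out_of_band

transfer = PendingAction("Wire $250,000 to new vendor")  # hypothetical
print("Execute now?", may_execute(transfer))  # False: must wait and verify
```

The point of the design is that a convincing voice or video alone can never complete the action; the attacker would also have to compromise the second channel and wait out the delay.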
Despite these challenges, deepfake technology isn't entirely malevolent. Educational institutions use it to create engaging historical reenactments. Healthcare providers employ it for medical training and patient education. Film studios leverage it for creative content production.
These positive applications remind us that, like any powerful technology, deepfakes are ultimately tools whose impact depends on how we choose to use them.
In closing
Looking ahead, we're entering an era where seeing and hearing can no longer serve as the basis for believing. This isn't unprecedented – society has adapted to new forms of information manipulation before, from the printing press to Photoshop.
But the speed and scale of this change present unique challenges.
We're still in the awkward teenage years of synthetic media, where our social and legal frameworks haven't caught up to technical capabilities.
The solution won't come from better detection algorithms or stricter regulations alone. It requires a fundamental shift in how we think about digital truth and trust.
Organizations need to build systems that remain reliable even when traditional trust mechanisms fail. Individuals must develop new forms of digital literacy. Society as a whole needs new frameworks for establishing and maintaining truth in an increasingly synthetic world.
The deepfake revolution isn't just changing how we create and consume media – it's forcing us to rethink the very nature of truth in the digital age.
The sooner we accept and adapt to this reality, the better prepared we'll be for what comes next. Because in a world where seeing is no longer believing, our systems for establishing truth must evolve beyond what our eyes and ears tell us.
This isn't just about spotting fakes anymore. It's about building a future where truth can prevail even when we can't trust our senses. And that might be the most important challenge of our digital age.