
Big Tech says AI watermarks could curb misinformation, but they're easy to sidestep

Mar 28, 2024
Major tech companies driving artificial intelligence have agreed to watermarking standards, but people who use their technology to mislead and harm others aren’t participating.

Watermarking has been floated by Big Tech as one of the most promising methods to combat the escalating AI misinformation problem online. But so far, the results don’t seem promising, according to experts and a review of misinformation conducted by NBC News.

Adobe’s general counsel and trust officer Dana Rao wrote in a February blog post that Adobe’s C2PA watermarking standard, which Meta and other Big Tech companies have signed onto, would be instrumental in educating the public about misleading AI. 

“With more than two billion voters expected to participate in elections around the world this year, advancing C2PA’s mission has never been more critical,” Rao wrote.

The technologies are still in their infancy and only narrowly deployed, but watermarking has already proven easy to bypass.

Many contemporary watermarking technologies meant to identify AI-generated media use two components: an invisible tag contained in an image’s metadata and a visible label superimposed on an image.

But both invisible watermarks, which can take the form of microscopic pixels or metadata, and visible labels can be removed, sometimes through rudimentary methods such as screenshotting and cropping. 
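
To see how little effort that takes, consider a minimal sketch in Python using the Pillow imaging library. The filenames and the 64-pixel label strip are assumptions for illustration, not details of any specific product.

```python
# Minimal sketch (Python + Pillow) of why screenshotting and cropping
# defeat naive watermarks. "labeled.png" stands in for a hypothetical
# AI-generated image with tags in its metadata and a visible label in
# one corner.
from PIL import Image

img = Image.open("labeled.png")
print(img.info)  # metadata-based tags, if any, live in this dict

# A screenshot effectively re-encodes only the pixels, so the copy
# starts from a blank buffer with none of the original metadata.
screenshot = Image.new(img.mode, img.size)
screenshot.putdata(list(img.getdata()))
screenshot.save("screenshot.png")  # img.info is not carried over

# Cropping away the corner that carries the visible label removes it.
w, h = img.size
img.crop((0, 0, w, h - 64)).save("cropped.png")  # assumes a 64-pixel strip
```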

So far, major social media and tech companies have not strictly mandated or enforced that labels be put on AI-generated or AI-edited content. 

The vulnerabilities of watermarking were on display Wednesday, when Meta CEO Mark Zuckerberg updated his cover photo on Facebook with an AI-generated image of llamas standing on computers. It was created with Imagine, Meta’s AI image generator, which launched in December. The generator is supposed to produce images with built-in labels, which show up as a tiny symbol in the bottom left corner of images like Zuckerberg’s llamas.

Mark Zuckerberg's AI-generated cover photo crops out Meta's watermark. (Facebook)

But on Zuckerberg’s AI-generated llama image, the label wasn’t visible to users logged out of Facebook. It also wasn’t visible unless you clicked on and opened Zuckerberg’s cover photo. When NBC News created AI-generated images of llamas with Imagine, the label could easily be removed by screenshotting part of the image that didn’t have the label in it. According to Meta, the invisible watermark is carried over in screenshots.

In February, Meta announced it would begin identifying AI-generated content through watermarking technology and labeling AI-generated content on Facebook, Instagram and Threads. The watermarks Meta uses are contained in metadata, which is invisible data that can only be viewed with technology built to extract it. In its announcement, Meta acknowledged that watermarking isn’t totally effective and can be removed or manipulated in bad faith efforts. 

The company said it will also require users to disclose whether content they post is AI-generated and “may apply penalties” if they don’t. These standards are coming in the next several months, Meta said.
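
What “technology built to extract it” looks like in practice can be as simple as a metadata dump. The hedged sketch below uses the Pillow library to read the standard containers, EXIF and XMP, where such provenance tags are typically stored; the filename is illustrative.

```python
# Hedged sketch: reading the metadata containers where provenance tags
# are typically stored. Requires Pillow; getxmp() also needs the
# defusedxml package to parse the XMP packet.
from PIL import Image

img = Image.open("downloaded.jpg")  # illustrative filename
print(dict(img.getexif()))          # EXIF tag-id/value pairs, if any
print(img.getxmp())                 # XMP metadata parsed into a dict
```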

AI watermarks can even be removed unintentionally: simply uploading a photo online sometimes strips its metadata in the process.

The visible labels associated with watermarking pose further issues. 

“It takes about two seconds to remove that sort of watermark,” said Sophie Toura, who works for a U.K. tech lobbying and advocacy firm called Control AI, which launched in October 2023. “All these claims about being more rigorous and hard to remove tend to fall flat.”

The original AI-generated image on the left is watermarked, but the watermark was easy to crop out to create the image on the right. (Generated with Imagine by Meta)

A senior technologist for the Electronic Frontier Foundation, a digital civil liberties nonprofit group, wrote that even the most robust and sophisticated watermarks can be removed by someone who has the skill and desire to manipulate the file itself. 

Beyond being stripped, watermarks can also be replicated, opening up the possibility of false positives that make real, unedited media appear to be AI-generated.

The companies that have committed to cooperative watermarking standards are major players such as Meta, Google, OpenAI, Microsoft, Adobe and Midjourney. But there are thousands of AI models available to download and use on app stores like Google Play and websites like Microsoft’s GitHub that aren’t beholden to watermarking standards. 

For Adobe’s C2PA standard, which has been adopted by Google, Microsoft, Meta, OpenAI, major news outlets including NBCU News Group, and major camera companies, images are intended to automatically have a watermark paired with a visible label called “content credentials.” 

The label, which is a small symbol composed of the letters “CR” in the corner of an image, is similar to Meta’s Imagine label. These invisible watermarks are contained in metadata located in a pixel in a visually important part of the image, Adobe’s Rao told NBC News in February. Both the visual label and the metadata would contain information like whether the image is AI-generated or edited with AI tools.
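
A full verifier, such as the C2PA coalition's open-source c2patool, parses and cryptographically validates the embedded manifest. But even a crude byte scan illustrates that the credential is ordinary data riding inside the file; the sketch below only checks for the “c2pa” label that names the manifest's JUMBF container, and proves nothing about the manifest's validity.

```python
# Crude presence check for an embedded C2PA manifest. The manifest
# store is a JUMBF container labeled "c2pa", so those bytes appear in
# files carrying content credentials. Presence only; a real verifier
# must also validate the manifest's cryptographic signatures.
with open("image.jpg", "rb") as f:  # illustrative filename
    data = f.read()

print("C2PA label found" if b"c2pa" in data else "no C2PA label found")
```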

“It’s well-intentioned, it’s a step in the right direction. I don’t think it should be remotely relied on as the solution to, for example, all the issues that come with deepfakes,” Toura said. 

Deepfakes are misleading images, videos and audio that have been edited or generated with AI. They are frequently used to target people, overwhelmingly women and girls, with images and videos that depict their faces and likenesses in nude and sexually explicit scenarios without their consent. More of these deepfakes were posted online in 2023 than in every other year combined, and high-profile incidents have continued into 2024.

Earlier this month, NBC News found that Meta had hosted hundreds of ads since September for a deepfake app that offered the ability to “undress” photos; 11 of the ads showed blurred nude “undressed” images of actress Jenna Ortega, made from a photo taken when she was just 16. Meta had previously suspended dozens of the ads, but it suspended the company behind them only after NBC News reached out.

Deepfakes have also increasingly been used in scams and political disinformation, including about the 2024 elections.

In January, a deepfake robocall that phoned thousands of New Hampshire Democrats imitated Joe Biden’s voice with AI and told them not to vote in the primary election. NBC News reported that a Democratic consultant with ties to a rival campaign paid a magician to create the audio, which he did with AI software from the company ElevenLabs.

ElevenLabs embeds watermarks, inaudible to the human ear, into audio files produced with its software. Anyone can upload a sample to its free “speech classifier” to scan for those watermarks.

But the act of using deepfake audio for nefarious purposes in the real world can alter the sound file and remove those watermarks. When NBC News uploaded the magician’s original file to the speech classifier, ElevenLabs said there was a 98% chance its software made that sample. But when NBC News uploaded a recording of the same fake Biden call that had been recorded from the voicemail of a New Hampshire resident who received the call — a process that added some distortion to the audio file — the classifier said there was only a 2% chance that ElevenLabs’ software was involved.
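
That drop from 98% to 2% is what a correlation-style detector would predict. The toy model below, written with NumPy, is an illustration under assumed mechanics, not a description of ElevenLabs’ actual scheme: it hides a faint pseudorandom signal in stand-in audio, detects it by correlating against the known pattern, and shows the score collapsing once playback attenuation and recording noise are added.

```python
# Toy model of an inaudible audio watermark and why re-recording
# weakens detection. Illustrative only; not ElevenLabs' real method.
import numpy as np

rng = np.random.default_rng(0)
n = 200_000                        # a few seconds' worth of samples
audio = rng.normal(size=n)         # stand-in for speech
mark = 0.05 * rng.normal(size=n)   # faint pseudorandom watermark
marked = audio + mark              # the "original" watermarked file

def score(signal: np.ndarray) -> float:
    """Correlation against the known watermark; near 1.0 if intact."""
    return float(signal @ mark) / float(mark @ mark)

print(f"clean file:  {score(marked):.2f}")  # close to 1.0

# Playing the call over a phone line and re-recording it from a
# voicemail attenuates the signal and adds noise, dragging the
# detection score well below that of the clean file.
degraded = 0.5 * marked + 0.2 * rng.normal(size=n)
print(f"re-recorded: {score(degraded):.2f}")  # roughly 0.5
```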

Social media platforms and search engines are already littered with deepfakes, and app stores are full of services that advertise their creation. Some of these posts and advertisements include deepfake nude and sexually explicit images featuring the faces of children.

Rao was pragmatic about the reach that Adobe’s watermarking initiative could have. First, he said, the public has to learn to recognize the labels that indicate AI-generated content. To be widely effective, the system would also require people to verify visual media before trusting it, which would be a major feat. Rao compared that potential shift, in which expecting and checking content credentials becomes second nature, to the growth of public awareness of online phishing campaigns, which have themselves sharply increased alongside the rise of ChatGPT.

“We don’t have to believe everything correctly,” he said in an NBC News interview in February. “It’s really the important things that we should do the extra work on to believe whether it’s true or not.”
