Technology

Creators say they didn’t know Google uses YouTube to train AI

Published

6 months ago

June 19, 2025

admin

Silhouettes of laptop and mobile device users are seen next to a screen projection of the YouTube logo.

Dado Ruvic | Reuters

Google is using its expansive library of YouTube videos to train its artificial intelligence models, including Gemini and the Veo 3 video and audio generator, CNBC has learned.

The tech company is turning to its catalog of 20 billion YouTube videos to train these new-age AI tools, according to a person who was not authorized to speak publicly about the matter. Google confirmed to CNBC that it relies on its vault of YouTube videos to train its AI models, but the company said it only uses a subset of its videos for the training and that it honors specific agreements with creators and media companies.

“We’ve always used YouTube content to make our products better, and this hasn’t changed with the advent of AI,” said a YouTube spokesperson in a statement. “We also recognize the need for guardrails, which is why we’ve invested in robust protections that allow creators to protect their image and likeness in the AI era — something we’re committed to continuing.”

Such use of YouTube videos has the potential to lead to an intellectual property crisis for creators and media companies, experts said.

While YouTube says it has shared this information previously, experts who spoke with CNBC said it’s not widely understood by creators and media organizations that Google is training its AI models using its video library.

YouTube didn’t say how many of the 20 billion videos on its platform or which ones are used for AI training. But given the platform’s scale, training on just 1% of the catalog would amount to 2.3 billion minutes of content, which experts say is more than 40 times the training data used by competing AI models.

The company shared in a blog post published in September that YouTube content could be used to “improve the product experience … including through machine learning and AI applications.” Users who have uploaded content to the service have no way of opting out of letting Google train on their videos.

“It’s plausible that they’re taking data from a lot of creators that have spent a lot of time and energy and their own thought to put into these videos,” said Luke Arrigoni, CEO of Loti, a company that works to protect digital identity for creators. “It’s helping the Veo 3 model make a synthetic version, a poor facsimile, of these creators. That’s not necessarily fair to them.”

CNBC spoke with multiple leading creators and IP professionals, none were aware or had been informed by YouTube that their content could be used to train Google’s AI models.

Google DeepMind Veo 3.

Courtesy: Google DeepMind

The revelation that YouTube is training on its users’ videos is noteworthy after Google in May announced Veo 3, one of the most advanced AI video generators on the market. In its unveiling, Google showcased cinematic-level video sequences, including a scene of an old man on a boat and another showing Pixar-like animals talking with one another. The entirety of the scenes, both the visual and the audio, were entirely AI generated.

According to YouTube, an average of 20 million videos are uploaded to the platform each day by independent creators by nearly every major media company. Many creators say they are now concerned they may be unknowingly helping to train a system that could eventually compete with or replace them.

“It doesn’t hurt their competitive advantage at all to tell people what kind of videos they train on and how many they trained on,” Arrigoni said. “The only thing that it would really impact would be their relationship to creators.”

Even if Veo 3’s final output does not directly replicate existing work, the generated content fuels commercial tools that could compete with the creators who made the training data possible, all without credit, consent or compensation, experts said.

When uploading a video to the platform, the user is agreeing that YouTube has a broad license to the content.

“By providing Content to the Service, you grant to YouTube a worldwide, non-exclusive, royalty-free, sublicensable and transferable license to use that Content,” the terms of service read.

“We’ve seen a growing number of creators discover fake versions of themselves circulating across platforms — new tools like Veo 3 are only going to accelerate the trend,” said Dan Neely, CEO of Vermillio, which helps individuals protect their likeness from being misused and also facilitates secure licensing of authorized content.

Neely’s company has challenged AI platforms for generating content that allegedly infringes on its clients’ intellectual property, both individual and corporate. Neely says that although YouTube has the right to use this content, many of the content creators who post on the platform are unaware that their videos are being used to train video-generating AI software.

Vermillio uses a proprietary tool called Trace ID to asses whether an AI-generated video has significant overlap with a human-created video. Trace ID assigns scores on a scale of zero to 100. Any score over 10 for a video with audio is considered meaningful, Neely said.

A video from YouTube creator Brodie Moss closely matched content generated by Veo 3. Using Vermillio’s Trace ID tool, the system attributed a score of 71 to the original video with the audio alone scoring over 90.

Vermillio

In one example cited by Neely, a video from YouTube creator Brodie Moss closely matched content generated by Veo 3. Trace ID attributed a score of 71 to the original video with the audio alone scoring over 90.

Some creators told CNBC they welcome the opportunity to use Veo 3, even if it may have been trained on their content.

“I try to treat it as friendly competition more so than these are adversaries,” said Sam Beres, a creator with 10 million subscribers on YouTube. “I’m trying to do things positively because it is the inevitable —but it’s kind of an exciting inevitable.”

Google includes an indemnification clause for its generative AI products, including Veo, which means that if a user faces a copyright challenge over AI-generated content, Google will take on legal responsibility and cover the associated costs.

YouTube announced a partnership with Creative Artists Agency in December to develop access for top talent to identify and manage AI-generated content that features their likeness. YouTube also has a tool for creators to request a video to be taken down if they believe it abuses their likeness.

However, Arrigoni said that the tool hasn’t been reliable for his clients.

YouTube also allows creators to opt out of third party training from select AI companies including Amazon, Apple and Nvidia, but users are not able to stop Google from training for its own models.

The Walt Disney Company and Universal filed a joint lawsuit last Wednesday against the AI image generator Midjourney, alleging copyright infringement, the first lawsuit of its kind out of Hollywood.

“The people who are losing are the artists and the creators and the teenagers whose lives are upended,” said Sen. Josh Hawley, R-Mo., in May at a Senate hearing about the use of AI to replicate the likeness of humans. “We’ve got to give individuals powerful enforceable rights and their images in their property in their lives back again or this is just never going to stop.”

Disclosure: Universal is part of NBCUniversal, the parent company of CNBC.

WATCH: Google buyouts highlight tech’s cost-cutting amid AI CapEx boom

Technology

CNBC Daily Open: U.S. stocks retreat from highs as Broadcom leads tech sell-off

Published

7 hours ago

December 15, 2025

admin

CNBC Daily Open: U.S. stocks retreat from highs as Broadcom leads tech sell-off

Signage at the Broadcom Inc. headquarters in San Jose, California, U.S., on Monday, June 2, 2025.

David Paul Morris | Bloomberg | Getty Images

The sell-off in artificial intelligence stocks continued unabated Friday stateside. Broadcom shares tumbled more than 11% as investors grew concerned over lower margins and uncertain deals. Names such as Nvidia, Advanced Micro Devices and Oracle fell in sympathy, which caused major U.S. indexes to close lower.

It was a motif patterning the week. Even though the Dow Jones Industrial Average rose 1.1% week on week on the back of outperformance by financial stocks, tech names dragged down the S&P 500 and the Nasdaq Composite, which fell 0.6% and 1.6% respectively for the week.

That said, investors could have just been jittery amid the narrative of an apparent AI bubble, and were spooked by any sign of bad news. After all, Broadcom’s earnings — as well as its guidance for the current quarter — breezed past expectations.

“Frankly we aren’t sure what else one could desire as the company’s AI story continues to not only overdeliver but is doing it at an accelerating rate,” Bernstein analyst Stacy Rasgon, who has a “buy” rating on Broadcom, wrote in a Friday note.

Future prospects also look rosy, according to UBS. “We expect high profitability and the accelerating impact of the AI, power and resources, and longevity themes to drive 2026 performance,” said strategist Sagar Khandelwal.

But in the near term, investors may still be flighty, unless something concretely reassuring, such as Oracle achieving positive cash flow, reassures them the snapping sound is just a twig in the forest.

What you need to know today

U.S. stocks dragged down by AI names. Major indexes fell Friday, a day after they hit record highs. Asia-Pacific markets traded lower Monday. South Korea’s Kospi retreated roughly 1.5% as of 2:45 p.m. Singapore time (1:45 a.m. ET), leading losses in the region.

China’s economic slowdown deepens. Even though the country’s retail sales and industrial production grew year on year in November, their increase missed forecasts and slowed from the previous month. Investment in fixed assets in the January-to-November period contracted from a year earlier.

The end of the ‘Berkshire way’? Several aspects of Berkshire Hathaway’s leadership transition are signaling that the conglomerate is drifting away from the famously decentralized “Berkshire way,” CNBC’s Alex Crippen writes.

Hong Kong court finds Jimmy Lai guilty. The 78-year-old pro-democracy activist and media baron was ruled guilty of sedition and collusion with foreign countries by a Hong Kong court on Monday. The results might unsettle foreign investors, analysts say.

[PRO] China’s food security strategy. The spat between Beijing and Washington over soybean purchases has highlighted the evolution of China’s domestic agriculture industry. Goldman Sachs thinks this is the best way to play the sector.

And finally…

Copper prices have soared this year, hitting multiple record highs, fueled by supply disruptions and fears over U.S. tariffs.

Imagebroker/sunny Celeste | Imagebroker | Getty Images

Copper could hit ‘stratospheric new highs’ as hoarding of the metal in U.S. continues

Copper prices have hit multiple record highs this year, fueled by supply disruptions and as fears over U.S. tariffs have led to a surge in demand. The rally is set to continue into 2026.

Citi analysts expect prices of the red metal to skyrocket on the back of stronger demand led by the energy transition and artificial intelligence sectors. Electrification, grid expansion and data-center build-outs require large amounts of the metal for wiring, power transmission and cooling infrastructure.

— Lee Ying Shan

Technology

CNBC Daily Open: Investors sell off tech despite steady Broadcom numbers

Published

13 hours ago

December 15, 2025

admin

CNBC Daily Open: Investors sell off tech despite steady Broadcom numbers

Signage at the Broadcom Inc. headquarters in San Jose, California, U.S., on Monday, June 2, 2025.

David Paul Morris | Bloomberg | Getty Images

What you need to know today

U.S. stocks dragged down by AI names. Major indexes fell Friday, a day after they hit record highs. The pan-European Stoxx 600 retreated almost 0.5%. Separately, the U.K. economy unexpectedly shrank 0.1% in the three months to October.

Oracle will finish data centers on time. The company issued its response to a Bloomberg report, which cited unnamed people, that Oracle will complete data centers for OpenAI in 2028 rather than 2027. “There have been no delays,” Oracle said.

Coinbase to have an in-house prediction market. It will be powered be Kalshi, a source close to the matter told CNBC, and is a play to expand asset classes available on the cryptocurrency exchange.

[PRO] China’s food security strategy. The spate between Beijing and Washington over soybean purchases has highlighted the evolution of China’s domestic agriculture industry. Goldman Sachs thinks this is the best way to play the sector.

And finally…

A bear statue stands outside the Frankfurt Stock Exchange, operated by Deutsche Boerse AG, in Frankfurt, Germany, on Friday, March 13, 2020. Top European CEOs are fearing a euro zone recession as a confluence of economic shocks continues to threaten the outlook for the bloc.

Alex Kraus | Bloomberg | Getty Images

Global week ahead: Europe under fire

U.S. President Donald Trump’s verdict on Europe: a “decaying” group of nations led by “weak” people. His criticism in a recent Politico interview adds to a tough period for the bloc, with challenges on multiple fronts testing European leaders in the final weeks of the year.

This week looks set to be critical, with a high-stakes summit in Brussels and the European Central Bank’s final policy meeting of the year. Key topics for this week include defrosting frozen Russian assets for Ukraine aid; EU vs. U.S. in trade and tech, and updated economic figures at the ECB meeting.

— Leonie Kidd