Connect with us

Published

on

Breaking down AI chips, from Nvidia GPUs to ASICs by Google and Amazon

Nvidia outperformed all expectations, reporting soaring profits Wednesday thanks to its graphics processing units that excel at AI workloads. But more categories of AI chips are gaining ground.

Custom ASICs, or application-specific integrated circuits, are now being designed by all the major hyperscalers, from Google‘s TPU to Amazon‘s Trainium and OpenAI’s plans with Broadcom. These chips are smaller, cheaper, accessible and could reduce these companies’ reliance on Nvidia GPUs. Daniel Newman of the Futurum Group told CNBC that he sees custom ASICs “growing even faster than the GPU market over the next few years.”

Besides GPUs and ASICs, there are also field-programmable gate arrays, which can be reconfigured with software after they’re made for use in all sorts of applications, like signal processing, networking and AI. There’s also an entire group of AI chips that power AI on devices rather than in the cloud. Qualcomm, Apple and others have championed those on-device AI chips.

CNBC talked to experts and insiders at the Big Tech companies to break down the crowded space and the various kinds of AI chips out there.

GPUs for general compute

Once used primarily for gaming, GPUs made Nvidia the world’s most valuable public company after their use shifted toward AI workloads. Nvidia shipped some 6 million current-generation Blackwell GPUs over the past year.

Nvidia senior director of AI infrastructure Dion Harris shows CNBC’s Katie Tarasov how 72 Blackwell GPUs work together as one in a GB200 NVL72 rack-scale server system for AI at Nvidia headquarters in Santa Clara, California, on November 12, 2025.

Marc Ganley

The shift from gaming to AI started around 2012, when Nvidia’s GPUs were used by researchers to build AlexNet, what many consider to be modern AI’s big bang moment. AlexNet was a tool that was entered into a prominent image recognition contest. Whereas others in the contest used central processing units for their applications, AlexNet reliance on GPUs provided incredible accuracy and obliterated its competition.

AlexNet’s creators discovered that the same parallel processing that helps GPUs render lifelike graphics was also great for training neural networks, in which a computer learns from data rather than relying on a programmer’s code. AlexNet showcased the potential of GPUs.

Today, GPUs are often paired with CPUs and sold in server rack systems to be placed in data centers, where they run AI workloads in the cloud. CPUs have a small number of powerful cores running sequential general-purpose tasks, while GPUs have thousands of smaller cores more narrowly focused on parallel math like matrix multiplication.

Because GPUs can perform many operations simultaneously, they’re ideal for the two main phases of AI computation: training and inference. Training teaches the AI model to learn from patterns in large amounts of data, while inference uses the AI to make decisions based on new information.

GPUs are the general-purpose workhorses of Nvidia and its top competitor, Advanced Micro Devices. Software is a major differentiator between the two GPU leaders. While Nvidia GPUs are tightly optimized around CUDA, Nvidia’s proprietary software platform, AMD GPUs use a largely open-source software ecosystem.

AMD and Nvidia sell their GPUs to cloud providers like Amazon, Microsoft, Google, Oracle and CoreWeave. Those companies then rent the GPUs to AI companies by the hour or minute. Anthropic’s $30 billion deal with Nvidia and Microsoft, for example, includes 1 gigawatt of compute capacity on Nvidia GPUs. AMD has also recently landed big commitments from OpenAI and Oracle.

Nvidia also sells directly to AI companies, like a recent deal to sell at least 4 million GPUs to OpenAI, and to foreign governments, including South Korea, Saudi Arabia and the U.K.

The chipmaker told CNBC that it charges around $3 million for one of its server racks with 72 Blackwell GPUs acting as one, and ships about 1,000 each week. 

Dion Harris, Nvidia’s senior director of AI infrastructure, told CNBC he couldn’t have imagined this much demand when he joined Nvidia over eight years ago.

“When we were talking to people about building a system that had eight GPUs, they thought that was overkill,” he said.

ASICs for custom cloud AI

Training on GPUs has been key in the early boom days of large language models, but inference is becoming more crucial as the models mature. Inference can happen on less powerful chips that are programmed for more specific tasks. That’s where ASICs come in.

While a GPU is like a Swiss Army Knife able to do many kinds of parallel math for different AI workloads, an ASIC is like a single-purpose tool. It’s very efficient and fast, but hard-wired to do the exact math for one type of job.

Google released its 7th generation TPU, Ironwood, in November 2025, a decade after making its first custom ASIC for AI in 2015.

Google

“You can’t change them once they’re already carved into silicon, and so there’s a trade off in terms of flexibility,” said Chris Miller, author of “Chip War.”

Nvidia’s GPUs are flexible enough for adoption by many AI companies, but they cost up to $40,000 and can be hard to get. Still, startups rely on GPUs because designing a custom ASIC has an even higher up-front cost, starting at tens of millions of dollars, according to Miller.

For the biggest cloud providers who can afford them, analysts say custom ASICs pay off in the long-run.

“They want to have a little bit more control over the workloads that they build,” Newsom said. “At the same time, they’re going to continue to work very closely with Nvidia, with AMD, because they also need the capacity. The demand is so insatiable.”

Google was the first Big Tech company to make a custom ASIC for AI acceleration, coining the term Tensor Processing Unit when its first ASIC came out in 2015. Google said it considered making a TPU as far back as 2006, but the situation became “urgent” in 2013 as it realized AI was going to double its number of data centers. In 2017, the TPU also contributed to Google’s invention of the Transformer, the architecture powering almost all modern AI.

A decade after its first TPU, Google released its seventh generation TPU in November. Anthropic announced it will train its LLM Claude on up to 1 million TPUs. Some people think TPUs are technically on par or superior to Nvidia’s GPUs, Miller said.

“Traditionally, Google has only used them for in-house purposes,” Miller said. “There’s a lot of speculation that in the longer run, Google might open up access to TPUs more broadly.”

Amazon Web Services was the next cloud provider to design its own AI chips, after acquiring Israeli chip startup Annapurna labs in 2015. AWS announced Inferentia in 2018, and it launched Trainium in 2022. AWS is expected to announce Trainium’s third generation as soon December.

Ron Diamant, Trainium’s head architect, told CNBC that Amazon’s ASIC has 30% to 40% better price performance compared to other hardware vendors in AWS.

“Over time, we’ve seen that Trainium chips can serve both inference and training workloads quite well,” Diamant said.

CNBC’s Katie Tarasov holds Amazon Web Services’ Trainium 2 AI chip that fill its new AI data center in New Carlisle, Indiana, on October 8, 2025.

Erin Black

In October, CNBC went to Indiana for the first on-camera tour of Amazon’s biggest AI data center, where Anthropic is training its models on half a million Trainium2 chips. AWS fills its other data centers with Nvidia GPUs to meet the demand from AI customers like OpenAI.

Building ASICs isn’t easy. This is why companies turn to chip designers Broadcom and Marvell. They “provide the IP and the know-how and the networking” to help their clients build their ASICs, Miller said.

“So you’ve seen Broadcom in particular be one of the biggest beneficiaries of the AI boom,” Miller said.

Broadcom helped build Google’s TPUs and Meta‘s Training and Inference Accelerator launched in 2023, and has a new deal to help OpenAI build its own custom ASICs starting in 2026.

Microsoft is also getting into the ASIC game, telling CNBC that its in-house Maia 100 chips are currently deployed in its data centers in the eastern U.S. Others include Qualcomm with the A1200, Intel with its Gaudi AI accelerators and Tesla with its AI5 chip. There’s also a slew of start-ups going all in on custom AI chips, including Cerebras, which makes huge full-wafer AI chips, and Groq, with inference-focused language processing units.

In China, Huawei, ByteDance, and Alibaba are making custom ASICs, although export controls on the most advanced equipment and AI chips pose a challenge.

Edge AI with NPUs and FPGAs

The final big category of AI chips are those made to run on devices, rather than in the cloud. These chips are typically built into a device’s main System on a Chip, SoC. Edge AI chips, as they’re called, enable devices to have AI capabilities while helping them save battery life and space for other components.

“You’ll be able to do that right on your phone with very low latency, so you don’t have to have communication all the way back to a data center,” said Saif Khan, former White House AI and semiconductor policy advisor. “And you can preserve privacy of your data on your phone.”

Neural processing units are a major type of edge AI chip. Qualcomm, Intel and AMD are making NPUs that enable AI capabilities in personal computers.

Although Apple doesn’t use the term NPU, the in-house M-series chips inside its MacBooks include a dedicated neural engine. Apple also built neural accelerators into the latest iPhone A-series chips.

“It is efficient for us. It is responsive. We know that we are much more in control over the experience,” Tim Millet, Apple platform architecture vice president, told CNBC in an exclusive September interview

The latest Android phones also have NPUs built into their primary Qualcomm Snapdragon chips, and Samsung has its own NPU on its Galaxy phones, too. NPUs by companies like NXP and Nvidia power AI embedded in cars, robots, cameras, smart home devices and more.

“Most of the dollars are going towards the data center, but over time that’s going to change because we’ll have AI deployed in our phones and our cars and wearables, all sorts of other applications to a much greater degree than today,” Miller said.

Then there’s field-programmable gate arrays, or FPGAs, which can be reconfigured with software after they’re made. Although far more flexible than NPUs or ASICs, FPGAs have lower raw performance and lower energy efficiency for AI workloads.

AMD became the largest FPGA maker after acquiring Xilinx for $49 billion in 2022, with Intel in second thanks to its $16.7 billion purchase of Altera in 2015.

These players designing AI chips rely on a single company to manufacture them all: Taiwan Semiconductor Manufacturing Company.

TSMC has a giant new chip fabrication plant in Arizona, where Apple has committed to moving some chip production. In October, Nvidia CEO Jensen Huang said Blackwell GPUs were in “full production” in Arizona, too. 

Although the AI chip space is crowded, dethroning Nvidia won’t come easily.

“They have that position because they’ve earned it and they’ve spent the years building it,” Newman said. “They’ve won that developer ecosystem.”

Watch the video to see a breakdown of how all the AI chips work: https://www.cnbc.com/video/2025/11/21/nvidia-gpus-google-tpus-aws-trainium-comparing-the-top-ai-chips.html

Continue Reading

Technology

Shares of Chinese chipmaker MetaX soar nearly 700% in blockbuster Shanghai debut

Published

on

By

Shares of Chinese chipmaker MetaX soar nearly 700% in blockbuster Shanghai debut

Narumon Bowonkitwanchai | Moment | Getty Images

Shares of Chinese chipmaker MetaX Integrated Circuits soared about 700% in their market debut in Shanghai on Wednesday, after the company raised nearly $600 million in its initial public offering.

Shares, which were priced at 104.66 yuan in the IPO, surged to over 835 yuan on debut, marking a 697% jump.

Similar to Moore Threads, which saw a robust debut at the start of the month, MetaX develops graphics processing units for artificial intelligence applications, tapping into a fast-growing sector driven by rising adoption of AI services.

MetaX is part of a growing cohort of local chipmakers building AI processors, reflecting Beijing’s push to reduce dependence on U.S. chips following Washington’s tech curbs on export of high-end technology to China.

Washington has imposed export curbs on U.S. chip behemoth Nvidia, barring sales of its most advanced AI chips to China.

Newer Chinese players such as Enflame Technology and Biren Technology have also entered the AI space, aiming to capture a share of the billions in graphics processing unit, or GPU, demand no longer served by Nvidia. Chinese regulators have also been clearing more semiconductor IPOs in their drive for greater AI independence.

Earlier this month, shares of Moore Threads, a Beijing-based GPU manufacturer often referred to as “China’s Nvidia,” soared by more than 400% on its debut in Shanghai following its $1.1 billion listing.

Macquarie’s equity analyst Eugene Hsiao said investor enthusiasm around Chinese AI-chip IPOs such as MetaX is partly shaped by longer-term expectations that China will build a self-sufficient semiconductor ecosystem as tensions with the U.S. persist.

“For that to work, you need these players. You need names like Moore Threads, Meta X, etc,” he said.

“So I think when investors are looking at these IPOs, they implicitly are thinking about the nationalistic element,” Hsiao noted, adding that the main driver of the frenzy, however, was the firms’ growth potential.

— CNBC’s Dylan Butts contributed to this article.

Continue Reading

Technology

Alphabet-owned Waymo in talks to raise $15 billion in funding

Published

on

By

Alphabet-owned Waymo in talks to raise  billion in funding

Waymo co-CEOs (L-R): Tekedra Mawakana and Dmitri Dolgov

Waymo

Self-driving car company Waymo is in talks to raise $15 billion in funding in the new year.

The robotaxi company plans to raise billions from Alphabet, its parent company, as well as outside investors at a valuation as high as $110 billion, according to a person familiar with the discussions.

The latest funding discussions are indicative of Waymo’s status as the leader of the pack in the U.S. robotaxi market. The company has been spending heavily to ramp up its fleet and continue expanding to more regions. Waymo is now either operating its robotaxis, planning to launch service or starting to test its vehicles in 26 markets, in the U.S. and abroad.

Alphabet CEO Sundar Pichai said Waymo will “meaningfully” contribute to Alphabet’s financials as soon as 2027, CNBC reported Tuesday.  

If the Google sister company winds up raising as much as $15 billion, that would represent more than double the amount of its last funding round. That was a series C round of $5.6 billion at a $45 billion valuation, which closed in October 2024. Alphabet had committed $5 billion in a multiyear investment to Waymo at the time.

That round was led by Alphabet alongside previous backers, including Andreessen Horowitz, Fidelity, Perry Creek, Silver Lake, Tiger Global and T. Rowe Price. At the time, Waymo co-CEOs Tekedra Mawakana and Dmitri Dolgov said the funding would go toward expanding its robotaxi service.

Waymo currently serves paid rides to the public in the Austin, San Francisco Bay Area, Phoenix, Atlanta and Los Angeles markets.

Earlier this month, CNBC reported that Waymo crossed an estimated 450,000 weekly paid rides, and the company in December said it had served 14 million trips in 2025, putting it on pace to end the year at more than 20 million trips total since launching in 2020.

The company plans to open service next year in Dallas, Denver, Detroit, Houston, Las Vegas, Miami, Nashville, Orlando, San Antonio, San Diego and Washington, D.C. Waymo also announced plans to launch its service in London in 2026, which will mark the company’s first overseas service region.

Amazon’s Zoox this year began offering free driverless rides to the public around the Las Vegas Strip and certain San Francisco neighborhoods. Tesla launched a Robotaxi-branded service in Austin and the San Francisco Bay Area, but those cars still had human drivers or safety supervisors on board as of mid-December.

Fundraising plans were first reported by The Information.

WATCH: 2025: The year that the robotaxi went mainstream with Waymo leading the pack

2025: The year that the robotaxi went mainstream with Waymo leading the pack

Continue Reading

Technology

California judge rules that Tesla engaged in deceptive marketing around Autopilot

Published

on

By

California judge rules that Tesla engaged in deceptive marketing around Autopilot

Tesla electric vehicles (EV) in front of the company’s store in Colma, California, US, on Monday, Nov. 10, 2025.

David Paul Morris | Bloomberg | Getty Images

A California administrative law judge recently ruled recently that Tesla’s marketing around its “Autopilot” and “Full Self-Driving” systems had been deceptive, and that the company should face a 30-day suspension of each of its licenses to sell and manufacture cars in the state, according to California’s Department of Motor Vehicles.

The California DMV made formal accusations of false advertising against Tesla in 2022. Steve Gordon, the agency’s director, said in a press conference on Tuesday that the regulator will now give Elon Musk’s automaker 90 days to clarify or remove deceptive or confusing language about its Autopilot and FSD systems before implementing a 30-day suspension of the company’s sales license.

Gordon also said the DMV will stay the order to suspend Tesla’s manufacturing license so there will be no interruption to the company’s factory operations in the state.

In 2022, the DMV said that Tesla’s “Autopilot” and “Full Self-Driving” marketing suggested the company’s cars were capable of operating autonomously, though they required an attentive driver at the wheel, ready to steer or brake at any time.

Since that time, Tesla has changed the name of its premium, driver assistance option to Full Self-Driving (Supervised).

Tesla didn’t immediately respond to a request for comment on Tuesday.

Tesla’s stock price closed at a record on Tuesday, largely due to increased enthusiasm on Wall Street surrounding the company’s plans for its Robotaxis.

This is breaking news. Please check back for updates.

Continue Reading

Trending