Connect with us

Published

on

Jensen Huang, co-founder and chief executive officer of Nvidia Corp., during the Nvidia GPU Technology Conference (GTC) in San Jose, California, US, on Tuesday, March 19, 2024. 

David Paul Morris | Bloomberg | Getty Images

Nvidia’s 27% rally in May pushed its market cap to $2.7 trillion, behind only Microsoft and Apple among the most-valuable public companies in the world. The chipmaker reported a tripling in year-over-year sales for the third straight quarter driven by soaring demand for its artificial intelligence processors.

Mizuho Securities estimates that Nvidia controls between 70% and 95% of the market for AI chips used for training and deploying models like OpenAI’s GPT. Underscoring Nvidia’s pricing power is a 78% gross margin, a stunningly high number for a hardware company that has to manufacture and ship physical products.

Rival chipmakers Intel and Advanced Micro Devices reported gross margins in the latest quarter of 41% and 47%, respectively.

Nvidia’s position in the AI chip market has been described as a moat by some experts. Its flagship AI graphics processing units (GPUs), such as the H100, coupled with the company’s CUDA software led to such a head start on the competition that switching to an alternative can seem almost unthinkable.

Still, Nvidia CEO Jensen Huang, whose net worth has swelled from $3 billion to about $90 billion in the past five years, has said he’s “worried and concerned” about his 31-year-old company losing its edge. He acknowledged at a conference late last year that there are many powerful competitors on the rise.

“I don’t think people are trying to put me out of business,” Huang said in November. “I probably know they’re trying to, so that’s different.”

Nvidia has committed to releasing a new AI chip architecture every year, rather than every other year as was the case historically, and to putting out new software that could more deeply entrench its chips in AI software.

But Nvidia’s GPU isn’t alone in being able to run the complex math that underpins generative AI. If less powerful chips can do the same work, Huang might be justifiably paranoid.

The transition from training AI models to what’s called inference — or deploying the models — could also give companies an opportunity to replace Nvidia’s GPUs, especially if they’re less expensive to buy and run. Nvidia’s flagship chip costs roughly $30,000 or more, giving customers plenty of incentive to seek alternatives.

“Nvidia would love to have 100% of it, but customers would not love for Nvidia to have 100% of it,” said Sid Sheth, co-founder of aspiring rival D-Matrix. “It’s just too big of an opportunity. It would be too unhealthy if any one company took all of it.”

Founded in 2019, D-Matrix plans to release a semiconductor card for servers later this year that aims to reduce the cost and latency of running AI models. The company raised $110 million in September.

In addition to D-Matrix, companies ranging from multinational corporations to nascent startups are fighting for a slice of the AI chip market that could reach $400 billion in annual sales in the next five years, according to market analysts and AMD. Nvidia has generated about $80 billion in revenue over the past four quarters, and Bank of America estimates the company sold $34.5 billion in AI chips last year.

Many companies taking on Nvidia’s GPUs are betting that a different architecture or certain trade-offs could produce a better chip for particular tasks. Device makers are also developing technology that could end up doing a lot of the computing for AI that’s currently taking place in large GPU-based clusters in the cloud.

“Nobody can deny that today Nvidia is the hardware you want to train and run AI models,” Fernando Vidal, co-founder of 3Fourteen Research, told CNBC. “But there’s been incremental progress in leveling the playing field, from hyperscalers working on their own chips, to even little startups, designing their own silicon.”

AMD CEO Lisa Su wants investors to believe there’s plenty of room for many successful companies in the space.

“The key is that there are a lot of options there,” Su told reporters in December, when her company launched its most recent AI chip. “I think we’re going to see a situation where there’s not only one solution, there will be multiple solutions.”

Other big chipmakers

Lisa Su displays an AMD Instinct MI300 chip as she delivers a keynote address at CES 2023 in Las Vegas, Nevada, on Jan. 4, 2023.

David Becker | Getty Images

Nvidia’s top customers

How AWS is designing its own chips to help catch Microsoft and Google in generative A.I. race

One potential challenge for Nvidia is that it’s competing against some of its biggest customers. Cloud providers including Google, Microsoft and Amazon are all building processors for internal use. The Big Tech three, plus Oracle, make up over 40% of Nvidia’s revenue.

Amazon introduced its own AI-oriented chips in 2018, under the Inferentia brand name. Inferentia is now on its second version. In 2021, Amazon Web Services debuted Tranium targeted to training. Customers can’t buy the chips but they can rent systems through AWS, which markets the chips as more cost efficient than Nvidia’s.

Google is perhaps the cloud provider most committed to its own silicon. The company has been using what it calls Tensor Processing Units (TPUs) since 2015 to train and deploy AI models. In May, Google announced the sixth version of its chip, Trillium, which the company said was used to develop its models, including Gemini and Imagen.

Google also uses Nvidia chips and offers them through its cloud.

Microsoft isn’t as far along. The company said last year that it was building its own AI accelerator and processor, called Maia and Cobalt.

Meta isn’t a cloud provider, but the company needs massive amounts of computing power to run its software and website and to serve ads. While the Facebook parent company is buying billions of dollars worth of Nvidia processors, it said in April that some of its homegrown chips were already in data centers and enabled “greater efficiency” compared to GPUs.

JPMorgan analysts estimated in May that the market for building custom chips for big cloud providers could be worth as much as $30 billion, with potential growth of 20% per year.

Startups

Cerebras’ WSE-3 chip is one example of new silicon from upstarts designed to run and train artificial intelligence.

Cerebras Systems

Venture capitalists see opportunities for emerging companies to jump into the game. They invested $6 billion in AI semiconductor companies in 2023, up slightly from $5.7 billion a year earlier, according to data from PitchBook.

It’s a tough area for startups as semiconductors are expensive to design, develop and manufacture. But there are opportunities for differentiation.

For Cerebras Systems, an AI chipmaker in Silicon Valley, the focus is on basic operations and bottlenecks for AI, versus the more general purpose nature of a GPU. The company was founded in 2015 and was valued at $4 billion during its most recent fundraising, according to Bloomberg.

The Cerebras chip, WSE-2, puts GPU capabilities as well as central processing and additional memory into a single device, which is better for training large models, said CEO Andrew Feldman.

“We use a giant chip, they use a lot of little chips,” Feldman said. “They’ve got challenges of moving data around, we don’t.”

Feldman said his company, which counts Mayo Clinic, GlaxoSmithKline, and the U.S. Military as clients, is winning business for its supercomputing systems even going up against Nvidia.

“There’s ample competition and I think that’s healthy for the ecosystem,” Feldman said.

Sheth from D-Matrix said his company plans to release a card with its chiplet later this year that will allow for more computation in memory, as opposed to on a chip like a GPU. D-Matrix’s product can be slotted into an AI server along existing GPUs, but it takes work off of Nvidia chips, and helps to lower the cost of generative AI.

Customers “are very receptive and very incentivized to enable a new solution to come to market,” Sheth said.

Apple and Qualcomm

Apple iPhone 15 series devices are displayed for sale at The Grove Apple retail store on release day in Los Angeles, California, on September 22, 2023. 

Patrick T. Fallon | Afp | Getty Images

Don’t miss these exclusives from CNBC PRO

Nvidia and Microsoft has overshadowed weakness in tech stocks, says Solus' Dan Greenhaus

Continue Reading

Technology

Tesla is offering Cybertruck discounts as EV market gets crowded

Published

on

By

Tesla is offering Cybertruck discounts as EV market gets crowded

A soldier walks next to a Tesla Cybertruck, which was donated to the National Guard, after powerful winds fueling devastating wildfires in the Los Angeles area forced people to evacuate, in the Pacific Palisades neighborhood on the west side of Los Angeles, California, U.S. Jan. 13, 2025. 

Daniel Cole | Reuters

Tesla started offering discounts on new Cybertruck vehicles in its inventory this week, according to listings on the company’s website.

Discounts are as high as $1,600 off new Cybertrucks, with the reduced price depending on configuration, and up to around $2,600 for demo versions of the trucks in inventory, the listings show. Production of the angular, unpainted steel pickups has reportedly slowed in recent weeks at Tesla’s factory in Austin, Texas.

Deliveries of the unconventional pickup began reaching customers in 2023. CEO Elon Musk originally unveiled the Cybertruck in 2019 and said it would cost around $40,000, but its base price in the U.S. was closer to $80,000 over the course of 2024.

Wall Street previously viewed the Cybertruck as an important driver of growth for Tesla’s core automotive sales.

While the Cybertruck outsold the Ford Lightning F-150 last year in the U.S. and became the fifth best-selling EV domestically, according to data tracked by Cox Automotive, its high price, repeat recalls and production issues in Austin hampered growth. In November, Tesla initiated its sixth recall in a year  to replace defective drive inverters.

As CNBC previously reported, Tesla’s deliveries declined slightly year-over-year in 2024, even as EV demand worldwide reached a record. A slew of new competitive models from a wide range of automakers eroded Tesla’s market share.

According to Cox data, full-year EV sales reached an estimated 1.3 million in 2024 in the U.S., an increase of 7.3% from the prior year. But Tesla’s sales for the year declined by about 37,000 vehicles.

The Tesla Model Y SUV and Model 3 sedan ranked as the top two best-selling EVs by a wide margin. But both older, more affordable Tesla models saw sales drop from the previous year. Cox estimated Tesla sold around 38,965 Cybertrucks in the U.S. last year.

In recent days, Musk apologized to customers in California for delays in delivering their Cybertrucks. He said the trucks are now being used to bring supplies and wireless internet service to people in Los Angeles impacted by devastating wildfires.

“Apologies to those expecting Cybertruck deliveries in California over the next few days,” Musk wrote on X. “We need to use those trucks as mobile base stations to provide power to Starlink Internet terminals in areas of LA without connectivity. A new truck will be delivered end of week.”

WATCH: Here’s why Bank of America downgraded Tesla

Tesla: Here's why Bank of America downgraded the stock to neutral

Continue Reading

Technology

Goldman Sachs CEO Solomon says IPO market is ‘going to pick up’ along with dealmaking

Published

on

By

Goldman Sachs CEO Solomon says IPO market is 'going to pick up' along with dealmaking

David Solomon, CEO of Goldman Sachs, speaks during the Reuters NEXT conference, in New York City, U.S., December 10, 2024. 

Mike Segar | Reuters

Goldman Sachs CEO David Solomon says there’s an end in sight to the multi-year IPO drought.

“It’s going to pick up,” Solomon said on Wednesday, in an on-stage interview with Cisco CEO Chuck Robbins at a summit hosted by the computer networking company in Silicon Valley. “It’s been slow, it’s been turned off.”

Solomon, who flew to California for the event just after his Wall Street bank reported fourth-quarter results that blew past analysts’ estimates, said the capital markets broadly are showing signs of life ahead of President-elect Donald Trump’s inauguration next week.

The tech IPO market has largely been dormant since the end of 2021, when tech stocks started falling out of favor due to soaring inflation and rising interest rates. Mergers and acquisitions have been difficult in technology because of hefty regulation that’s restricted the ability for the biggest companies to grow through dealmaking.

Solomon said the mood is changing, and he expects momentum M&A as well as in IPOs.

“We have a more constructive kind of optimism, which always helps,” Solomon said. He later added that, “broadly speaking, I think it’s an improved business environment.”

Earlier in the day, Solomon said on his company’s earnings call that Trump’s election and a swing back to Republican power in Washington is already starting to make an impact in the business world. He noted on the call that “there is a significant backlog from sponsors and an overall increased appetite for dealmaking supported by an improved regulatory backdrop.”

Solomon’s comments on the call and at the Cisco event came on a day when the S&P 500 posted his biggest gain since November, helped by a tame inflation report and Goldman’s results. Goldman’s stock popped 6% on Wednesday.

While the stock market has had a strong two-year run and the S&P 500 and Nasdaq hit fresh records last month, IPOs have yet to see a resurgence. Cloud software vendor ServiceTitan debuted on the Nasdaq in December, marking the first significant venture-backed IPO in the U.S. since Rubrik in April.

“The values came down after 2021, people are growing back into those values,” Solomon said at the Cisco summit.

Some companies have said they’re ready. Chipmaker Cerebras filed to go public in September, but the process was slowed down due to a review by the Treasury Department’s Committee on Foreign Investment in the U.S., or CFIUS. In November, online lender Klarna said it had confidentially filed IPO paperwork with the SEC.

Though he’s bullish about what’s coming, Solomon said that there are structural reasons not to go public. He said 25 years ago there were roughly 13,000 public companies in the U.S., and today that number has come down to 3,800. There are higher standards around disclosure for being public, and there’s now tons of private capital available “at scale.”

“It’s not fun being a public company,” Solomon acknowledged. “Who would want to be a public company?”

WATCH: Goldman Sachs tops estimates

Goldman Sachs tops estimates on strong trading results

Continue Reading

Technology

How VPNs might allow Americans to continue using TikTok

Published

on

By

How VPNs might allow Americans to continue using TikTok

Dado Ruvic | Reuters

If TikTok does indeed go dark on Sunday for Americans, there may be a tool for them to continue accessing the popular social app: VPNs. 

The Chinese-owned app is set to be removed from mobile app stores and the web for U.S. users on Sunday as a result of a law signed by President Joe Biden in April 2024 requiring that the app be sold to a qualified buyer before the deadline. 

Barring a last-minute sale or reprieve from the Supreme Court, the app will almost certainly vanish from the app stores for iPhones and Android phones. It won’t be removed from people’s phones, but the app could stop working. 

TikTok plans to shut its service for Americans on Sunday, meaning that even those who already have the app downloaded won’t be able to continue using it, according to reports this week from Reuters and The Information. Apple and Google didn’t comment on their plans for taking down the apps from their app stores on Sunday.

“Basically, an app or a website can check where users came from,” said Justas Palekas, a head of product at IProyal.com, a proxy service. “Based on that, then they can impose restrictions based on their location.”

Masking your physical internet access point

That may stop most users, but for the particularly driven Americans, using VPNs might allow them to continue using the app. 

VPNs and a related business-to-business technology called proxies work by tunneling a user’s internet traffic through a server in another country, making it look like they are accessing the internet from a location different than the one they are physically in. 

This works because every time a computer connects to the internet, it is identified through an IP number, which is a 12-digit number that is different for every single computer. The first six digits of the number identifies the network, which also includes information about the physical region the request came from.

In China, people have used VPNs for years to get around the country’s firewall, which blocks U.S. websites such as Google and Facebook. VPNs saw big spikes in traffic when India banned TikTok in 2020, and people often use VPNs to watch sporting events from countries where official broadcasts aren’t available. 

As of 2022, the VPN market was worth nearly $38 billion, according to the VPN Trust Initiative, a lobbying group.

“We consistently see significant spikes in VPN demand when access to online platforms is restricted, and this situation is no different,” said Lauren Hendry Parsons, privacy advocate at ExpressVPN, a VPN provider that costs $5 per month to use.

“We’re not here to endorse TikTok, but the looming U.S. ban highlights why VPNs matter— millions rely on them for secure, private, and unrestricted access to the internet,” ProtonVPN posted on social media earlier this week. ProtonVPN offers its service for $10 a month. 

The price of VPNs

Both ExpressVPN and ProtonVPN allow users to set their internet-access location. 

Most VPN services charge a monthly fee to pay for their servers and traffic, but some use a business model where they collect user data or traffic trends, such as when Meta offered a free VPN so it could keep an eye on which competitors’ apps were growing quickly.

A key tradeoff for those who use VPN is speed due to requests having to flow through a middleman computer to mask a users’ physical location. 

And although VPNs have worked in the past when governments have banned apps, that doesn’t ensure that VPNs will work if TikTok goes dark. It won’t be clear if ExpressVPN would be able to access TikTok until after the ban takes place, Parsons told CNBC in an email. It’s also possible that TikTok may be able to determine Americans who try to use VPNs to access the app.  

(L-R) Sarah Baus of Charleston, S.C., holds a sign that reads “Keep TikTok” as she and other content creators Sallye Miley of Jackson, Mississippi, and Callie Goodwin of Columbia, S.C., stand outside the U.S. Supreme Court Building as the court hears oral arguments on whether to overturn or delay a law that could lead to a ban of TikTok in the U.S., on January 10, 2025 in Washington, DC. 

Andrew Harnik | Getty Images

VPNs and proxies to evade regional restrictions have been part of the internet’s landscape for decades, but their use is increasing as governments seek to ban certain services or apps.

Apps are removed by government request all the time. Nearly 1500 apps were removed in regions due to government takedown demands in 2023, according to Apple, with over 1,000 of them in China. Most of them are fringe apps that break laws such as those against gambling, or Chinese video game rules, but increasingly, countries are banning apps for national security or economic development reasons.

Now, the U.S. is poised to ban one of the most popular apps in the country — with 115 million users, it was the second most downloaded app of 2024 across both iOS and Android, according to an estimate provided to CNBC from Sensor Tower, a market intelligence firm.

“As we witness increasing attempts to fragment and censor the internet, the role of VPNs in upholding internet freedom is becoming increasingly critical,” Parsons said.

WATCH: Chinese TikTok alternative surges

Chinese TikTok alternative surges

Continue Reading

Trending