DeepSeek has rattled the U.S.-led AI ecosystem with its latest model, shaving hundreds of billions of dollars off chip leader Nvidia’s market cap. While the sector leaders grapple with the fallout, smaller AI companies see an opportunity to scale with the Chinese startup.
Several AI-related firms told CNBC that DeepSeek’s emergence is a “massive” opportunity for them, rather than a threat.
“Developers are very keen to replace OpenAI’s expensive and closed models with open source models like DeepSeek R1…” said Andrew Feldman, CEO of artificial intelligence chip startup Cerebras Systems.
The company competes with Nvidia’s graphics processing units and offers cloud-based services through its own computing clusters. Feldman said the release of the R1 model generated one of Cerebras’ largest-ever spikes in demand for its services.
“R1 shows that [AI market] growth will not be dominated by a single company — hardware and software moats do not exist for open-source models,” Feldman added.
Open source refers to software whose source code is made freely available on the web for possible modification and redistribution. DeepSeek’s models are open source, unlike those of competitors such as OpenAI.
DeepSeek also claims its R1 reasoning model rivals the best American tech, despite running at lower costs and being trained without cutting-edge graphics processing units, though industry watchers and competitors have questioned these assertions.
“Like in the PC and internet markets, falling prices help fuel global adoption. The AI market is on a similar secular growth path,” Feldman said.
Inference chips
DeepSeek could increase the adoption of new chip technologies by accelerating the AI cycle’s shift from the training phase to the “inference” phase, chip startups and industry experts said.
Inference refers to the act of using and applying AI to make predictions or decisions based on new information, rather than the building or training of the model.
“To put it simply, AI training is about building a tool, or algorithm, while inference is about actually deploying this tool for use in real applications,” said Phelix Lee, an equity analyst at Morningstar, with a focus on semiconductors.
While Nvidia holds a dominant position in GPUs used for AI training, many competitors see room for expansion in the “inference” segment, where they promise higher efficiency for lower costs.
AI training is very compute-intensive, but inference can work with less powerful chips that are programmed to perform a narrower range of tasks, Lee added.
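The distinction Lee draws can be sketched with a toy example. The snippet below is purely illustrative and is not drawn from any company quoted here: it assumes a hypothetical one-feature linear model, where `train` loops over the data thousands of times to find the parameters and `infer` applies them to a new input with a single multiply and add.

```python
# Toy illustration only: a one-feature linear model, not a real AI workload.
# The function names, data and hyperparameters are all hypothetical.

def train(xs, ys, steps=2000, lr=0.01):
    """Training: repeatedly adjust the parameters to fit known examples."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        # Gradients of the mean squared error with respect to w and b.
        grad_w = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


def infer(model, x):
    """Inference: apply the already-trained parameters to a new input."""
    w, b = model
    return w * x + b


xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]  # generated from y = 2x + 1
model = train(xs, ys)           # many passes over the data
prediction = infer(model, 4.0)  # one cheap calculation per request
print(round(prediction, 2))
```

Even at this scale the asymmetry is visible: training runs thousands of update steps, while each inference request costs one multiplication and one addition.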
A number of AI chip startups told CNBC that they were seeing more demand for inference chips and computing as clients adopt and build on DeepSeek’s open source model.
“[DeepSeek] has demonstrated that smaller open models can be trained to be as capable or more capable than larger proprietary models and this can be done at a fraction of the cost,” said Sid Sheth, CEO of AI chip startup d-Matrix.
“With the broad availability of small capable models, they have catalyzed the age of inference,” he told CNBC, adding that the company has recently seen a surge in interest from global customers looking to speed up their inference plans.
Robert Wachen, co-founder and COO of AI chipmaker Etched, said dozens of companies have reached out to the startup since DeepSeek released its reasoning models.
“Companies are [now] shifting their spend from training clusters to inference clusters,” he said.
“DeepSeek-R1 proved that inference-time compute is now the [state-of-the-art] approach for every major model vendor and thinking isn’t cheap – we’ll only need more and more compute capacity to scale these models for millions of users.”
Jevons Paradox
Analysts and industry experts agree that DeepSeek’s accomplishments are a boost for AI inference and the wider AI chip industry.
“DeepSeek’s performance appears to be based on a series of engineering innovations that significantly reduce inference costs while also improving training cost,” according to a report from Bain & Company.
“In a bullish scenario, ongoing efficiency improvements would lead to cheaper inference, spurring greater AI adoption,” it added.
This pattern illustrates the Jevons Paradox, a theory in which cost reductions in a new technology drive increased demand.
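Whether cheaper inference leads to more total spending depends on how strongly demand responds to price. The sketch below is a simplified illustration rather than anything from the Bain report: it assumes a constant-elasticity demand curve, and the elasticity values and the `total_spend` helper are hypothetical.

```python
# Simplified illustration, assuming constant-elasticity demand:
#   quantity = scale * price ** (-elasticity)
# The elasticity values below are hypothetical, not measured figures.

def total_spend(price, elasticity, scale=1.0):
    """Total spending (price * quantity) at a given unit price."""
    quantity = scale * price ** (-elasticity)
    return price * quantity


# Elastic demand (elasticity > 1): halving the price raises total spending.
elastic_before = total_spend(1.0, elasticity=1.5)
elastic_after = total_spend(0.5, elasticity=1.5)

# Inelastic demand (elasticity < 1): halving the price lowers total spending.
inelastic_before = total_spend(1.0, elasticity=0.5)
inelastic_after = total_spend(0.5, elasticity=0.5)

print(elastic_after / elastic_before)      # ratio above 1
print(inelastic_after / inelastic_before)  # ratio below 1
```

Under these assumptions, the bullish scenario corresponds to the elastic case, where usage grows faster than prices fall.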
Financial services and investment firm Wedbush said in a research note last week that it continues to expect the use of AI across enterprise and retail consumers globally to drive demand.
Speaking to CNBC’s “Fast Money” last week, Sunny Madra, COO at Groq, which develops chips for AI inference, suggested that as the overall demand for AI grows, smaller players will have more room to grow.
“As the world is going to need more tokens [a unit of data that an AI model processes] Nvidia can’t supply enough chips to everyone, so it gives opportunities for us to sell into the market even more aggressively,” Madra said.
Marc Benioff, Chairman & CEO of Salesforce, speaking on CNBC’s “Squawk Box” outside the World Economic Forum in Davos, Switzerland, on Jan. 22, 2025.
Salesforce on Wednesday announced plans to invest $1 billion in Singapore over the next five years.
The cloud software giant said the investment is designed to accelerate the country’s digital transformation and the adoption of Salesforce’s flagship AI offering Agentforce.
Salesforce is among the many technology companies hoping to boost revenue with generative AI features.
The company launched the newest version of Agentforce last month. It has previously described the system — which it says can tackle sophisticated questions in Salesforce’s Slack communications app, based on all available data — as the first digital AI platform for enterprises.
Salesforce CEO Marc Benioff is scheduled to speak at CNBC’s CONVERGE LIVE at around 9:25 a.m. Singapore time (9:25 p.m. ET) on Wednesday.
“We are in an incredible new era of digital labor where every business will be transformed by autonomous agents that augment the work of humans, revolutionizing productivity and enabling every company to scale without limits,” Benioff said in a statement.
“Singapore is at the forefront of this shift, and as the world’s largest provider of digital labor through our Agentforce platform,” he added.
Salesforce said Agentforce can help Singapore to “rapidly expand” its labor force in several key service and public sector roles at a time when the country is grappling with an aging population and declining birth rates.
Jermaine Loy, managing director of the Singapore Economic Development Board, welcomed Salesforce’s investment, saying it will help to boost the country’s efforts “to build a vibrant hub for AI innovation.”
Reddit CEO Steve Huffman stands on the floor of the New York Stock Exchange (NYSE) after ringing a bell on the floor setting the share price at $47 in its initial public offering (IPO) on March 21, 2024 in New York City.
Reddit shares rose more than 10% on Tuesday, reversing a three-day slump that coincided with a broader decline among technology companies.
Despite Tuesday’s gains, Reddit shares are still roughly 30% below the close on Wednesday.
Reddit’s stock market upswing was likely bolstered by a Loop Capital analyst note published Tuesday that reiterated a buy rating and characterized the company’s shares as “extremely attractive.” The analyst note said that Reddit’s 50% drop on Wall Street in the past month “is excessive,” and that the social media company “has the biggest upside potential relative to Street estimates in our coverage universe.”
The company’s shares dropped more than 15% in February after the company reported weaker-than-expected fourth-quarter user numbers as a result of a Google search change that temporarily hurt its search-derived traffic. Although Reddit said at the time that it had recovered from the algorithmic shift, the user number miss spooked investors.
Loop Capital managing director Alan Gould acknowledged in the note that investors are operating in a “risk-off market environment,” but he contended that Reddit “has been one of the top performing stocks over the past year,” aside from its most recent dip.
“RDDT wildly exceeded ours and Street estimates for 2024, which explains why the stock increased almost 7-fold from a $34 IPO price to a peak of $230 in less than a year,” Gould wrote, noting Reddit’s growing revenue and improved advertising tools, among other positive developments.
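As a quick check on the “almost 7-fold” figure, the multiple can be recomputed from the two prices Gould cites. The variable names below are illustrative only.

```python
# Recompute the IPO-to-peak multiple from the prices quoted in the note.
ipo_price = 34.0    # IPO price cited by Gould, in dollars
peak_price = 230.0  # peak price cited by Gould, in dollars

multiple = peak_price / ipo_price
print(f"{multiple:.2f}x")
```

The result is a little under 6.8 times the IPO price, consistent with the “almost 7-fold” description.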
Reddit’s fourth-quarter sales grew 71% year over year to $428 million, which represents the fastest growth rate for any quarter since 2022.
“In our view, RDDT deserves the revaluation it had experiencing based on the growth it has shown in the recent earnings reports and future projected growth driven by the ability to narrow the ARPU gap, and data licensing possibilities,” Gould wrote.
Waymo self-driving cars with roof-mounted sensor arrays traveling near palm trees and modern buildings along the Embarcadero, San Francisco, California, February 21, 2025.
Waymo on Tuesday announced it is expanding its service to include another 27 square miles of coverage around the San Francisco Bay Area.
With the expansion, Waymo will now take passengers around Mountain View, Los Altos, Palo Alto and parts of Sunnyvale, California. The Alphabet-owned company opened its robotaxi service to the general public in San Francisco in June.
Waymo will initially limit the availability of its Silicon Valley service to users of the Waymo One app who are residents with ZIP codes in the area, the company said. Waymo plans to serve more riders across the region over time. The fleet serving the new coverage areas consists of fully electric Jaguar I-Pace vehicles with Waymo’s fifth generation of self-driving sensors, software and other technology.
“Opening our fully autonomous ride-hailing service in Silicon Valley marks a special milestone in our Bay Area journey,” Waymo product chief Saswat Panigrahi said in a statement. “This is where Waymo began and where we’re headquartered.”
Waymo expanded its San Francisco Bay Area robotaxi service last summer into Daly City, Broadmoor and Colma. Its robotaxis do not yet carry passengers to San Francisco International Airport.
A spokesperson told CNBC that Waymo is in “active discussions with SFO,” and added that the company is “working to connect” Silicon Valley and San Francisco to “provide seamless autonomous rides across more of the Bay Area in the future.”
Waymo also recently launched a commercial robotaxi service in Austin, Texas, just in time for the city’s annual South by Southwest festival.
While would-be competitors, including Elon Musk’s automaker Tesla and Amazon-owned Zoox, are continuing their own robotaxi testing and development, Waymo has pulled far ahead of other self-driving companies in the U.S.
Before Tuesday’s expansion, Waymo said it was serving more than 200,000 paid trips per week across San Francisco, Los Angeles and Phoenix.
Alphabet doesn’t disclose financial results for the autonomous vehicle business, but Waymo is part of its “Other Bets.” That business unit generated $400 million in revenue in the fourth quarter of 2024 and incurred operating losses of $1.17 billion, according to the company’s most recent financial filing.