Connect with us

Published

on

Sam Altman, co-founder and CEO of OpenAI and co-founder of Tools for Humanity, participates remotely in a discussion on the sidelines of the IMF/World Bank Spring Meetings in Washington, D.C., April 24, 2025.

Brendan Smialowski | AFP | Getty Images

Not long ago, Silicon Valley was where the world’s leading artificial intelligence experts went to perform cutting-edge research. 

Meta, Google and OpenAI opened their wallets for top talent, giving researchers staff, computing power and plenty of flexibility. With the support of their employers, the researchers published high-quality academic papers, openly sharing their breakthroughs with peers in academia and at rival companies.

But that era has ended. Now, experts say, AI is all about the product.

Since OpenAI released ChatGPT in late 2022, the tech industry has shifted its focus to building consumer-ready AI services, in many cases prioritizing commercialization over research, AI researchers and experts in the field told CNBC. The profit potential is massive — some analysts predict $1 trillion in annual revenue by 2028. The prospective repercussions terrify the corner of the AI universe concerned about safety, industry experts said, particularly as leading players pursue artificial general intelligence, or AGI, which is technology that rivals or exceeds human intelligence.

In the race to stay competitive, tech companies are taking an increasing number of shortcuts when it comes to the rigorous safety testing of their AI models before they are released to the public, industry experts told CNBC.

James White, chief technology officer at cybersecurity startup CalypsoAI, said newer models are sacrificing security for quality, that is, better responses by the AI chatbots. That means they’re less likely to reject malicious kinds of prompts that could cause them to reveal ways to build bombs or sensitive information that hackers could exploit, White said.

“The models are getting better, but they’re also more likely to be good at bad stuff,” said White, whose company performs safety and security audits of popular models from Meta, Google, OpenAI and other companies. “It’s easier to trick them to do bad stuff.”

The changes are readily apparent at Meta and Alphabet, which have deprioritized their AI research labs, experts say. At Facebook’s parent company, the Fundamental Artificial Intelligence Research, or FAIR, unit has been sidelined by Meta GenAI, according to current and former employees. And at Alphabet, the research group Google Brain is now part of DeepMind, the division that leads development of AI products at the tech company.

CNBC spoke with more than a dozen AI professionals in Silicon Valley who collectively tell the story of a dramatic shift in the industry away from research and toward revenue-generating products. Some are former employees at the companies with direct knowledge of what they say is the prioritization of building new AI products at the expense of research and safety checks. They say employees face intensifying development timelines, reinforcing the idea that they can’t afford to fall behind when it comes to getting new models and products to market. Some of the people asked not to be named because they weren’t authorized to speak publicly on the matter.

Mark Zuckerberg, CEO of Meta Platforms, during the Meta Connect event in Menlo Park, California, on Sept. 25, 2024.

David Paul Morris | Bloomberg | Getty Images

Meta’s AI evolution

When Joelle Pineau, a Meta vice president and the head of the company’s FAIR division, announced in April that she would be leaving her post, many former employees said they weren’t surprised. They said they viewed it as solidifying the company’s move away from AI research and toward prioritizing developing practical products.

“Today, as the world undergoes significant change, as the race for AI accelerates, and as Meta prepares for its next chapter, it is time to create space for others to pursue the work,” Pineau wrote on LinkedIn, adding that she will formally leave the company May 30. 

Pineau began leading FAIR in 2023. The unit was established a decade earlier to work on difficult computer science problems typically tackled by academia. Yann LeCun, one of the godfathers of modern AI, initially oversaw the project, and instilled the research methodologies he learned from his time at the pioneering AT&T Bell Laboratories, according to several former employees at Meta. Small research teams could work on a variety of bleeding-edge projects that may or may not pan out.  

The shift began when Meta laid off 21,000 employees, or nearly a quarter of its workforce, starting in late 2022. CEO Mark Zuckerberg kicked off 2023 by calling it the “year of efficiency.” FAIR researchers, as part of the cost-cutting measures, were directed to work more closely with product teams, several former employees said.

Two months before Pineau’s announcement, one of FAIR’s directors, Kim Hazelwood, left the company, two people familiar with the matter said. Hazelwood helped oversee FAIR’s NextSys unit, which manages computing resources for FAIR researchers. Her role was eliminated as part of Meta’s plan to cut 5% of its workforce, the people said.

Joelle Pineau of Meta speaks at the Advancing Sustainable Development through Safe, Secure, and Trustworthy AI event at Grand Central Terminal in New York, Sept. 23, 2024.

Bryan R. Smith | Via Reuters

OpenAI’s 2022 launch of ChatGPT caught Meta off guard, creating a sense of urgency to pour more resources into large language models, or LLMs, that were captivating the tech industry, the people said. 

In 2023, Meta began heavily pushing its freely available and open-source Llama family of AI models to compete with OpenAI, Google and others.

With Zuckerberg and other executives convinced that LLMs were game-changing technologies, management had less incentive to let FAIR researchers work on far-flung projects, several former employees said. That meant deprioritizing research that could be viewed as having no impact on Meta’s core business, such as FAIR’s previous health care-related research into using AI to improve drug therapies.

Since 2024, Meta Chief Product Officer Chris Cox has been overseeing FAIR as a way to bridge the gap between research and the product-focused GenAI group, people familiar with the matter said. The GenAI unit oversees the Llama family of AI models and the Meta AI digital assistant, the two most important pillars of Meta’s AI strategy. 

Under Cox, the GenAI unit has been siphoning more computing resources and team members from FAIR due to its elevated status at Meta, the people said. Many researchers have transferred to GenAI or left the company entirely to launch their own research-focused startups or join rivals, several of the former employees said. 

While Zuckerberg has some internal support for pushing the GenAI group to rapidly develop real-world products, there’s also concern among some staffers that Meta is now less able to develop industry-leading breakthroughs that can be derived from experimental work, former employees said. That leaves Meta to chase its rivals.

A high-profile example landed in January, when Chinese lab DeepSeek released its R1 model, catching Meta off guard. The startup claimed it was able to develop a model as capable as its American counterparts but with training at a fraction of the cost.

Meta quickly implemented some of DeepSeek’s innovative techniques for its Llama 4 family of AI models that were released in April, former employees said. The AI research community had a mixed reaction to the smaller versions of Llama 4, but Meta said the biggest and most powerful Llama 4 variant is still being trained.

The company in April also released security and safety tools for developers to use when building apps with Meta’s Llama 4 AI models. These tools help mitigate the chances of Llama 4 unintentionally leaking sensitive information or producing harmful content, Meta said.

“Our commitment to FAIR remains strong,” a Meta spokesperson told CNBC. “Our strategy and plans will not change as a result of recent developments.”

In a statement to CNBC, Pineau said she is enthusiastic about Meta’s overall AI work and strategy.

“There continues to be strong support for exploratory research and FAIR as a distinct organization in Meta,” Pineau said. “The time was simply right for me personally to re-focus my energy before jumping into a new adventure.”

Meta on Thursday named FAIR co-founder Rob Fergus as Pineau’s replacement. Fergus will return to the company to serve as a director at Meta and head of FAIR, according to his LinkedIn profile. He was most recently a research director at Google DeepMind.

“Meta’s commitment to FAIR and long term research remains unwavering,” Fergus said in a LinkedIn post. “We’re working towards building human-level experiences that transform the way we interact with technology and are dedicated to leading and advancing AI research.”

Demis Hassabis, co-founder and CEO of Google DeepMind, attends the Artificial Intelligence Action Summit at the Grand Palais in Paris, Feb. 10, 2025.

Benoit Tessier | Reuters

Google ‘can’t keep building nanny products’

Google released its latest and most powerful AI model, Gemini 2.5, in March. The company described it as “our most intelligent AI model,” and wrote in a March 25 blog post that its new models are “capable of reasoning through their thoughts before responding, resulting in enhanced performance and improved accuracy.”

For weeks, Gemini 2.5 was missing a model card, meaning Google did not share information about how the AI model worked or its limitations and potential dangers upon its release.

Model cards are a common tool for AI transparency.

A Google website compares model cards to food nutrition labels: They outline “the key facts about a model in a clear, digestible format,” the website says.

“By making this information easy to access, model cards support responsible AI development and the adoption of robust, industry-wide standards for broad transparency and evaluation practices,” the website says.

Google wrote in an April 2 blog post that it evaluates its “most advanced models, such as Gemini, for potential dangerous capabilities prior to their release.” Google later updated the blog to remove the words “prior to their release.”

Without a model card for Gemini 2.5, the public had no way of knowing which safety evaluations were conducted or whether DeepMind checked for dangerous capabilities at all.

In response to CNBC’s inquiry on April 2 about Gemini 2.5’s missing model card, a Google spokesperson said that a “tech report with additional safety information and model cards are forthcoming.” Google published an incomplete model card on April 16 and updated it on April 28, more than a month after the AI model’s release, to include information about Gemini 2.5’s “dangerous capability evaluations.” 

Those assessments are important for gauging the safety of a model — whether people can use the models to learn how to build chemical or nuclear weapons or hack into important systems. These checks also determine whether a model is capable of autonomously replicating itself, which could lead to a company losing control of it. Running tests for those capabilities requires more time and resources than simple, automated safety evaluations, according to industry experts.

Google co-founder Sergey Brin

Kelly Sullivan | Getty Images Entertainment | Getty Images

The Financial Times in March reported that Google DeepMind CEO Demis Hassabis had installed a more rigorous vetting process for internal research papers to be published. The clampdown at Google is particularly notable because the company’s “Transformers” technology gained recognition across Silicon Valley through that type of shared research. Transformers were critical to OpenAI’s development of ChatGPT and the rise of generative AI. 

Google co-founder Sergey Brin told staffers at DeepMind and Gemini in February that competition has accelerated and “the final race to AGI is afoot,” according to a memo viewed by CNBC. “We have all the ingredients to win this race but we are going to have to turbocharge our efforts,” he said in the memo.

Brin said in the memo that Google has to speed up the process of testing AI models, as the company needs “lots of ideas that we can test quickly.” 

“We need real wins that scale,” Brin wrote. 

In his memo, Brin also wrote that the company’s methods have “a habit of minor tweaking and overfitting” products for evaluations and “sniping” the products at checkpoints. He said employees need to build “capable products” and to “trust our users” more.

“We can’t keep building nanny products,” Brin wrote. “Our products are overrun with filters and punts of various kinds.”

A Google spokesperson told CNBC that the company has always been committed to advancing AI responsibly. 

“We continue to do that through the safe development and deployment of our technology, and research contributions to the broader ecosystem,” the spokesperson said.

Sam Altman, CEO of OpenAI, is seen through glass during an event on the sidelines of the Artificial Intelligence Action Summit in Paris, Feb. 11, 2025.

Aurelien Morissard | Via Reuters

OpenAI’s rush through safety testing

The debate of product versus research is at the center of OpenAI’s existence. The company was founded as a nonprofit research lab in 2015 and is now in the midst of a contentious effort to transform into a for-profit entity.

That’s the direction co-founder and CEO Sam Altman has been pushing toward for years. On May 5, though, OpenAI bowed to pressure from civic leaders and former employees, announcing that its nonprofit would retain control of the company even as it restructures into a public benefit corporation.

Nisan Stiennon worked at OpenAI from 2018 to 2020 and was among a group of former employees urging California and Delaware not to approve OpenAI’s restructuring effort. “OpenAI may one day build technology that could get us all killed,” Stiennon wrote in a statement in April. “It is to OpenAI’s credit that it’s controlled by a nonprofit with a duty to humanity.”

But even with the nonprofit maintaining control and majority ownership, OpenAI is speedily working to commercialize products as competition heats up in generative AI. And it may have rushed the rollout of its o1 reasoning model last year, according to some portions of its model card.

Results of the model’s “preparedness evaluations,” the tests OpenAI runs to assess an AI model’s dangerous capabilities and other risks, were based on earlier versions of o1. They had not been run on the final version of the model, according to its model card, which is publicly available.

Johannes Heidecke, OpenAI’s head of safety systems, told CNBC in an interview that the company ran its preparedness evaluations on near-final versions of the o1 model. Minor variations to the model that took place after those tests wouldn’t have contributed to significant jumps in its intelligence or reasoning and thus wouldn’t require additional evaluations, he said. Still, Heidecke acknowledged that OpenAI missed an opportunity to more clearly explain the difference.

OpenAI’s newest reasoning model, o3, released in April, seems to hallucinate more than twice as often as o1, according to the model card. When an AI model hallucinates, it produces falsehoods or illogical information. 

OpenAI has also been criticized for reportedly slashing safety testing times from months to days and for omitting the requirement to safety test fine-tuned models in its latest “Preparedness Framework.” 

Heidecke said OpenAI has decreased the time needed for safety testing because the company has improved its testing effectiveness and efficiency. A company spokesperson said OpenAI has allocated more AI infrastructure and personnel to its safety testing, and has increased resources for paying experts and growing its network of external testers.

In April, the company shipped GPT-4.1, one of its new models, without a safety report, as the model was not designated by OpenAI as a “frontier model,” which is a term used by the tech industry to refer to a bleeding-edge, large-scale AI model.

But one of those small revisions caused a big wave in April. Within days of updating its GPT-4o model, OpenAI rolled back the changes after screenshots of overly flattering responses to ChatGPT users went viral online. OpenAI said in a blog post explaining its decision that those types of responses to user inquiries “raise safety concerns — including around issues like mental health, emotional over-reliance, or risky behavior.”

OpenAI said in the blogpost that it opted to release the model even after some expert testers flagged that its behavior “‘felt’ slightly off.”

“In the end, we decided to launch the model due to the positive signals from the users who tried out the model. Unfortunately, this was the wrong call,” OpenAI wrote. “Looking back, the qualitative assessments were hinting at something important, and we should’ve paid closer attention. They were picking up on a blind spot in our other evals and metrics.”

Metr, a company OpenAI partners with to test and evaluate its models for safety, said in a recent blog post that it was given less time to test the o3 and o4-mini models than predecessors.

“Limitations in this evaluation prevent us from making robust capability assessments,” Metr wrote, adding that the tests it did were “conducted in a relatively short time.”

Metr also wrote that it had insufficient access to data that would be important in determining the potential dangers of the two models.

The company said it wasn’t able to access the OpenAI models’ internal reasoning, which is “likely to contain important information for interpreting our results.” However, Metr said, “OpenAI shared helpful information on some of their own evaluation results.”

OpenAI’s spokesperson said the company is piloting secure ways of sharing chains of thought for Metr’s research as well as for other third-party organizations. 

Steven Adler, a former safety researcher at OpenAI, told CNBC that safety testing a model before it’s rolled out is no longer enough to safeguard against potential dangers.

“You need to be vigilant before and during training to reduce the chance of creating a very capable, misaligned model in the first place,” Adler said.

He warned that companies such as OpenAI are backed into a corner when they create capable but misaligned models with goals that are different from the ones they intended to build.

“Unfortunately, we don’t yet have strong scientific knowledge for fixing these models — just ways of papering over the behavior,” Adler said. 

WATCH: OpenAI closes $40 billion funding round, largest private tech deal on record

OpenAI closes $40 billion funding round, largest private tech deal on record

Continue Reading

Technology

Silicon Valley’s new defense tech ‘neoprimes’ are pulling billions in funding to challenge legacy giants

Published

on

By

Silicon Valley’s new defense tech ‘neoprimes’ are pulling billions in funding to challenge legacy giants

Guvendemir | E+ | Getty Images

A wave of defense tech startups in Silicon Valley is drawing billions in funding and reshaping America’s national security.

Anduril Industries, recently valued at $30.5 billion following its latest funding round, is among the so-called “neoprimes” — companies challenging the dominance of legacy contractors, dubbed “primes,” such as Lockheed MartinNorthrop Grumman, Boeing, General Dynamics, and RTX (formerly Raytheon).

“There’s more money than ever going to what we call the ‘neoprimes'” Jameson Darby, co-founder and director of autonomy at investment syndicate MilVet Angels, or MVA, told CNBC. “It’s still a fraction of the overall budget, but the trend is all positive.”

Other examples of defense tech startups challenging the incumbents include SpaceX and Palantir Technologies, said Darby, who is also a founding member of the U.S. Department of Defense’s Defense Innovation Unit.

Unlike the primes, these startups are faster, leaner and software-first — with many of them building things that can help close “critical technology gaps that are really important to national security,” said Ernestine Fu Mak, co-founder of MVA and founder of Brave Capital, a venture capital firm.

Venture funding for U.S.-based defense tech startups totaled about $38 billion through the first half of 2025, and could exceed its 2021 peak if the pace remains constant for the rest of the year, according to JPMorgan.

‘The battlefield is changing’

As the global war landscape changed over the past decades, the U.S. Department of Defense has identified several technologies that are critical to national security, including hypersonics, energy resilience, space technology, integrated sensing and cyber.

“In a post-9/11 world, the entire Department of Defense effectively focused on … the global war on terrorism. It was our military versus insurgents, guerrillas, asymmetric warfare, relatively low-tech fighters in most cases,” said Darby.

But war today is more focused on “great power competition,” said Mak.

The battlefield is changing and new technologies are needed … warfare no longer being limited to land, sea, air. There’s also cyber and space domains that have become contested.

Ernestine Fu Mak

Co-founder, MilVet Angels

“The focus is more on deterring and competing with [adversaries] in these very high-tech, multi-domain conflicts,” Mak added. “The battlefield is changing and new technologies are needed… warfare no longer being limited to land, sea, air. There’s also cyber and space domains that have become contested.”

Today, some of these Silicon Valley “neoprimes” are developing not just weapons, but also dual-use technologies that can be applied both commercially and by militaries.

“So things like artificial intelligence and autonomy have broad, sweeping commercial applications, but they’re also clearly a force multiplier in a military context,” said Darby. “[The] Department of War is rapidly assessing and adopting these dual-use technologies … they’re sending signals to the investment world, to the defense industrial base, that the U.S. government needs these things.”

That direction from the government has, in turn, provided a clear and strategic roadmap for both investors and entrepreneurs, said Mak.

The ‘new guard’

On Sept. 17, MVA came out of stealth mode after quietly backing some leading defense tech startups since 2021.

Today, Mak says the syndicate’s roughly 250 members include tech founders, Wall Street financiers, company executives, intelligence officials, former military leaders and Navy SEALs. Together, they’ve invested in companies like Anduril Industries, Shield AI, Hermeus, Ursa Major and Aetherflux.

“Overall, we believe that ‘neoprimes’ cannot exist in the abstract. They require people — individuals who bring technical expertise, who carry a deep sense of mission, and who contribute complementary voices and talents. Together, this coalition forms what we are convening and calling the ‘new guard,'” said Mak.

She added that modern national security requires both the “warrior’s insight on the battlefield” and the “builder’s drive for innovation”.

“Working together with engaged, informed patriots whose participation strengthens our defense ecosystem and reinforces the very fabric of national security,” Mak said.

Mak and Darby both agree that as new technologies develop and make their way onto battlefields globally, it’s changing the way militaries fight, which can also pose new threats.

“You’re seeing these technologists, these builders … building defense tech, and the reason why they’re doing so, is not to initiate conflict, but rather to create a credible deterrent that discourages aggression,” said Mak.

“No one in defense tech is looking to wage war, rather, it’s looking to deter it and wanting adversaries to think twice before threatening peace and stability,” Mak added.

Continue Reading

Technology

Amazon faces FAA, NTSB probe after two delivery drones crashed into crane in Arizona

Published

on

By

Amazon faces FAA, NTSB probe after two delivery drones crashed into crane in Arizona

Two Amazon Prime Air MK30 drones collided with a crane on Oct. 2, 2025 in Tolleson, Arizona.

Courtesy: 12News

Amazon is facing federal probes after two of its Prime Air delivery drones collided with a crane in Arizona, prompting the company to temporarily pause drone service in the area.

The incident occurred on Wednesday around 1 p.m. EST in Tolleson, Arizona, a city west of Phoenix. Two MK30 drones crashed into the boom of a stationary construction crane that was in a commercial area just a few miles away from an Amazon warehouse.

One person was evaluated on the scene for possible smoke inhalation, said Sergeant Erik Mendez of the Tolleson Police Department.

“We’re aware of an incident involving two Prime Air drones in Tolleson, Arizona,” Amazon spokesperson Terrence Clark said in a statement. “We’re currently working with the relevant authorities to investigate.”

Both drones sustained “substantial” damage from the collision on Wednesday, which occurred when the aircraft were mid-route, according to preliminary FAA crash reports.

The Federal Aviation Administration and National Transportation Safety Board are investigating the incident. The NTSB didn’t immediately respond to a request for comment.

Read more CNBC tech news

The drones were believed to be flying northeast back-to-back when they collided with the crane that was being used for roof work on a distribution facility, Tolleson police said in a release. The drones landed in the backyard of a nearby building, according to the release.

The probes come just a few months after Amazon, in January, paused drone deliveries in Tolleson and College Station, Texas, temporarily following two crashes at its Pendleton, Oregon, test site. Those crashes also prompted investigations by the FAA and NTSB. The company resumed deliveries in March after it said it had resolved issues with the drone’s software, CNBC previously reported.

Amazon says its delivery drones are equipped with a sense-and-avoid system that enables them to “detect and stay away from obstacles in the air and on the ground.” The system also allows the aircraft to operate without visual observers over greater distances, the company said.

For over a decade, Amazon has been working to bring to life founder Jeff Bezos’ vision of drones whizzing toothpaste, books and batteries to customers’ doorsteps in 30 minutes or less. But progress has been slow, as Prime Air has only been made available in a handful of U.S. cities.

Amazon has set a goal to deliver 500 million packages by drone per year by the end of the decade.

Google and Amazon race to upgrade voice assistants with AI as OpenAI raises the stakes

Continue Reading

Technology

Intel stock is up 50% over the last month, putting U.S. stake at $16 billion

Published

on

By

Intel stock is up 50% over the last month, putting U.S. stake at  billion

Signage outside the Intel headquarters in San Jose, California, US, on Thursday, Sept. 18, 2025.

David Paul Morris | Bloomberg | Getty Images

Shares of U.S. chipmaker Intel climbed 3% Thursday, putting the monthly gain over 50%.

The surge pushed the stock past $37, hiking the value of the U.S. government’s 10% stake in Intel to roughly $16 billion.

The Trump administration negotiated an $8.9 billion investment in Intel common stock in August, purchasing 433.3 million shares at $20.47 per share.

Press secretary Karoline Leavitt celebrated the surge with a post on X from the Association of Mature American Citizens, a conservative organization.

Intel shares jumped 7% on Wednesday after news that the company is in early talks with AMD to add the hardware-maker as a customer.

Read more CNBC tech news

Continue Reading

Trending