Connect with us

Published

on

David Silver, leader of the reinforcement learning research group at DeepMind, being awarded an honorary “ninth dan” professional ranking for AlphaGo.
JUNG YEON-JE | AFP | Getty Images

Computer scientists are questioning whether DeepMind, the Alphabet-owned U.K. firm that’s widely regarded as one of the world’s premier AI labs, will ever be able to make machines with the kind of “general” intelligence seen in humans and animals.

In its quest for artificial general intelligence, which is sometimes called human-level AI, DeepMind is focusing a chunk of its efforts on an approach called “reinforcement learning.”

This involves programming an AI to take certain actions in order to maximize its chance of earning a reward in a certain situation. In other words, the algorithm “learns” to complete a task by seeking out these preprogrammed rewards. The technique has been successfully used to train AI models how to play (and excel at) games like Go and chess. But they remain relatively dumb, or “narrow.” DeepMind’s famous AlphaGo AI can’t draw a stickman or tell the difference between a cat and a rabbit, for example, while a seven-year-old can.

Despite this, DeepMind, which was acquired by Google in 2014 for around $600 million, believes that AI systems underpinned by reinforcement learning could theoretically grow and learn so much that they break the theoretical barrier to AGI without any new technological developments.

Researchers at the company, which has grown to around 1,000 people under Alphabet’s ownership, argued in a paper submitted to the peer-reviewed Artificial Intelligence journal last month that “Reward is enough” to reach general AI. The paper was first reported by VentureBeat last week.

In the paper, the researchers claim that if you keep “rewarding” an algorithm each time it does something you want it to, which is the essence of reinforcement learning, then it will eventually start to show signs of general intelligence.

“Reward is enough to drive behavior that exhibits abilities studied in natural and artificial intelligence, including knowledge, learning, perception, social intelligence, language, generalization and imitation,” the authors write.

“We suggest that agents that learn through trial and error experience to maximize reward could learn behavior that exhibits most if not all of these abilities, and therefore that powerful reinforcement learning agents could constitute a solution to artificial general intelligence.”

Not everyone is convinced, however.

Samim Winiger, an AI researcher in Berlin, told CNBC that DeepMind’s “reward is enough” view is a “somewhat fringe philosophical position, misleadingly presented as hard science.”

He said the path to general AI is complex and that the scientific community is aware that there are countless challenges and known unknowns that “rightfully instill a sense of humility” in most researchers in the field and prevent them from making “grandiose, totalitarian statements” such as “RL is the final answer, all you need is reward.”

DeepMind told CNBC that while reinforcement learning has been behind some of its most well-known research breakthroughs, the AI technique accounts for only a fraction of the overall research it carries out. The company said it thinks it’s important to understand things at a more fundamental level, which is why it pursues other areas such as “symbolic AI” and “population-based training.”

“In somewhat typical DeepMind fashion, they chose to make bold statements that grabs attention at all costs, over a more nuanced approach,” said Winiger. “This is more akin to politics than science.”

Stephen Merity, an independent AI researcher, told CNBC that there’s “a difference between theory and practice.” He also noted that “a stack of dynamite is likely enough to get one to the moon, but it’s not really practical.”

Ultimately, there’s no proof either way to say whether reinforcement learning will ever lead to AGI.

Rodolfo Rosini, a tech investor and entrepreneur with a focus on AI, told CNBC: “The truth is nobody knows and that DeepMind’s main product continues to be PR and not technical innovation or products.”

Entrepreneur William Tunstall-Pedoe, who sold his Siri-like app Evi to Amazon, told CNBC that even if the researchers are correct “that doesn’t mean we will get there soon, nor does it mean that there isn’t a better, faster way to get there.”

DeepMind’s “Reward is enough” paper was co-authored by DeepMind heavyweights Richard Sutton and David Silver, who met DeepMind CEO Demis Hassabis at the University of Cambridge in the 1990s.

“The key problem with the thesis put forth by ‘Reward is enough’ is not that it is wrong, but rather that it cannot be wrong, and thus fails to satisfy Karl Popper’s famous criterion that all scientific hypotheses be falsifiable,” said a senior AI researcher at a large U.S. tech firm, who wished to remain anonymous due to the sensitive nature of the discussion.

“Because Silver et al. are speaking in generalities, and the notion of reward is suitably underspecified, you can always either cherry pick cases where the hypothesis is satisfied, or the notion of reward can be shifted such that it is satisfied,” the source added.

“As such, the unfortunate verdict here is not that these prominent members of our research community have erred in any way, but rather that what is written is trivial. What is learned from this paper, in the end? In the absence of practical, actionable consequences from recognizing the unalienable truth of this hypothesis, was this paper enough?”

What is AGI?

While AGI is often referred to as the holy grail of the AI community, there’s no consensus on what AGI actually is. One definition is it’s the ability of an intelligent agent to understand or learn any intellectual task that a human being can.

But not everyone agrees with that and some question whether AGI will ever exist. Others are terrified about its potential impacts and whether AGI would build its own, even more powerful, forms of AI, or so-called superintelligences.

Ian Hogarth, an entrepreneur turned angel investor, told CNBC that he hopes reinforcement learning isn’t enough to reach AGI. “The more that existing techniques can scale up to reach AGI, the less time we have to prepare AI safety efforts and the lower the chance that things go well for our species,” he said.

Winiger argues that we’re no closer to AGI today than we were several decades ago. “The only thing that has fundamentally changed since the 1950/60s, is that science-fiction is now a valid tool for giant corporations to confuse and mislead the public, journalists and shareholders,” he said.

Fueled with hundreds of millions of dollars from Alphabet every year, DeepMind is competing with the likes of Facebook and OpenAI to hire the brightest people in the field as it looks to develop AGI. “This invention could help society find answers to some of the world’s most pressing and fundamental scientific challenges,” DeepMind writes on its website.

DeepMind COO Lila Ibrahim said on Monday that trying to “figure out how to operationalize the vision” has been the biggest challenge since she joined the company in April 2018.

Continue Reading

Technology

Microsoft stock sinks on report AI product sales are missing growth goals

Published

on

By

Microsoft stock sinks on report AI product sales are missing growth goals

Microsoft: Have not lowered sales quotas or targets for salespeople

Microsoft pushed back on a report Wednesday that the company lowered growth targets for artificial intelligence software sales after many of its salespeople missed those goals in the last fiscal year.

The company’s stock sank more than 2% on The Information report.

A Microsoft spokesperson said the company has not lowered sales quotas or targets for its salespeople.

The sales lag occurred for Microsoft’s Foundry product, an Azure enterprise platform where companies can build and manage AI agents, according to The Information, which cited two salespeople in Azure’s cloud unit.

AI agents can carry out a series of actions for a user or organization autonomously.

Less than a fifth of salespeople in one U.S. Azure unit met the Foundry sales growth target of 50%, according to The Information.

In another unit, the quota was set to double Foundry sales, The Information reported. The quota was dropped to 50% after most salespeople didn’t meet it.

In a statement, the company said the news outlet inaccurately combined the concepts of growth and quotas.

Read more CNBC tech news

“Aggregate sales quotas for AI products have not been lowered, as we informed them prior to publication,” a Microsoft Spokesperson said.

The AI boom has presented opportunities for businesses to add efficiencies and streamline tasks, with the companies that build these agents touting the power of the tools to take on work and allow workers to do more.

OpenAI, Google, Anthropic, Salesforce, Amazon and others all have their own tools to create and manage these AI assistants.

But the adoption of these tools by traditional businesses hasn’t seen the same surge as other parts of the AI ecosystem.

The Information noted AI adoption struggles at private equity firm Carlyle last year, in which the tools wouldn’t reliably connect data from other places. The company later reduced how much it spent on the tools.

Read the full story from The Information here.

Continue Reading

Technology

Waymo expanding to Baltimore, Pittsburgh and St. Louis with manual test drives

Published

on

By

Waymo expanding to Baltimore, Pittsburgh and St. Louis with manual test drives

Waymo partners with Uber to bring robotaxi service to Atlanta and Austin.

Uber Technologies Inc.

Waymo on Wednesday said humans will begin test driving the Alphabet-owned company’s robotaxi vehicles in Baltimore, Pittsburgh and St. Louis.

The three cities represent the latest additions to Waymo’s quickly growing list of cities where the Google sister company is either operating its robotaxis, planning to launch service or starting to test its vehicles. That list now stands at 26 markets.

Waymo will begin manual drives in the trio of new cities this week with hopes to eventually begin serving fully-autonomous rides there, spokesperson Ethan Teicher told CNBC.

Over the past month, Waymo has been aggressively making announcements for new markets and developments at the Google sister company. This comes as tech rivals Amazon and Tesla made advancements in the robotaxi market in 2025. Amazon’s Zoox began offering free rides in Las Vegas and San Francisco, and Tesla this year launched ride-hailing service with human supervisors in the Austin and San Francisco markets.

In November, Waymo announced that it will soon begin manually driving in Minneapolis, Tampa and New Orleans. The company also added Houston, San Antonio and Orlando to its list of cities where it’ll launch service in 2026. Waymo also began offering rides on freeways in the San Francisco, Los Angeles and Phoenix markets, and it named a new finance chief.

With more than 250,000 weekly paid trips, Waymo’s robotaxi service currently operates in Austin, the San Francisco Bay Area, Phoenix, Atlanta and Los Angeles markets. The company in May said it had provided more than 10 million paid rides since launching in 2020.

The new cities further signal that Waymo is increasingly confident its service can work well in locations with colder weather conditions.

WATCH: Waymo launches paid robotaxi rides on freeways

Watch: Waymo launches paid robotaxi rides on freeways

Continue Reading

Technology

Security startup Verkada hits $5.8 billion valuation in latest funding round led by CapitalG

Published

on

By

Security startup Verkada hits .8 billion valuation in latest funding round led by CapitalG

Filip Kaliszan, CEO of Verkada.

Courtesy: Verkada

Security technology startup Verkada has reached a $5.8 billion valuation after a new funding round led by CapitalG, Alphabet’s venture capital arm, announced Wednesday.

“I think Google saw the opportunity with us in the application of AI and everything we’re driving to apply AI to the physical security industry,” CEO Filip Kaliszan told CNBC’s Deirdre Bosa.

The company said in a release that the investment will be used to bolster its artificial intelligence capabilities and provide liquidity.

The financing totaled $100 million, a person familiar with the terms of the round told CNBC, raising the company’s valuation by $1.3 billion from its Series E funding in February. The person asked not to be named in order to discuss details of the funding.

CapitalG also recently contributed to a $435 million fundraise for cybersecurity startup Armis in November.

The new funding comes as Verkada surpasses $1 billion in annualized bookings across 30,000 customers globally.

The company develops physical security products, including cameras, alarms and sensors, that are connected under a single cloud-based software platform.

Kaliszan said his company serves a broad span of businesses, such as retailers, government properties, schools, and transportation.

For example, TeraWatt Infrastructure, which supplies charging sites to electric vehicles like Google’s Waymo, uses Verkada technology to protect EV facilities.

In September, the company rolled out over 60 new AI features and platform updates, including tools like “AI-Powered Unified Timeline.”

Read more CNBC tech news

The tool can automatically synthesize videos and images from several cameras into a single visual timeline, rather than requiring security teams to dig through multiple videos during an investigation.

“The genius of Filip and the team of Verkada is that they’re leveraging AI as a Rosetta Stone to really help unlock insights from cameras to help companies become safer and more efficient,” CapitalG general partner Derek Zanutto told Bosa.

By capturing over 20 million images per hour, Verkada can provide notable data like foot traffic, occupancy rates, security violations and other trends, Zanutto said.

He added that the physical security is a sleeping $60 billion market that is led by legacy hardware like “cameras that just record, not cameras that think” — a gap that Verkada is hoping to fill.

However, AI-powered technology will not necessarily replace human security guards any time soon.

“I think humans will be providing security to other humans for as long as I can think,” Kaliszan said. “But AI can empower these first responders to be more aware, to have situational knowledge, to know what to do, and in some cases, actually prevent the problems from happening.”

He pointed to the Louvre heist in October, where multiple crown jewels were robbed from the museum, as an opportunity where AI-assisted devices that could actively monitor, then immediately alert security forces, would be more effective than only physical personnel.

“If you could intervene right then, if you could know in real time that that’s happening, the potential for savings and preventing damage is tremendous,” he said.

xAI raises $15B in series E round

Continue Reading

Trending