Google expects its more advanced large language models to arrive on Android phones next year

Published

2 years ago

February 26, 2024

admin

Betul Abali | Anadolu | Getty Images

Google is bullish on the prospect of its more advanced artificial intelligence models coming to smartphones in the next year.

The internet giant expects that its currently available Gemini large language model (LLM), which competes with Microsoft-backed OpenAI’s most powerful GPT-4 AI model, will begin to get embedded into devices starting next year.

Google already offers Gemini Nano, which is the company’s most efficient model for “on-device” AI, across its Pixel devices and on all other capable Android devices.

Brian Rakowski, vice president of product management for Google’s Pixel unit, said that he expects the company’s most advanced large language models, which are currently only accessible via remote data centers via an internet connection, to begin arriving on smartphones directly next year.

“There are smaller versions of our Gemini model on the cloud,” Rakowski told CNBC. “There’s been quite a few breakthroughs to compress these models to get them to run on device.”

“Some have already been proven and some are being explored for some applications. It’d be great having all models on device. It still has wondrous applications.”

“Gemini Nano is performing at a level that our online models were at less than one year ago,” Rakowski added. “You can do a lot with these distilled small versions of the models on device.”

“If you just follow that trajectory, some of that stuff we thought we would have had to go to the cloud for next year will be on device, which is pretty exciting, which is instantaneous without requiring a connection or subscription.”

Large language models, or LLMs, are AI models capable of understanding and generating language in a humanlike way. Gemini Ultra is Google’s top LLM, clocking in at a whopping 1.56 trillion parameters. For comparison, OpenAI’s GPT-4 consists of 1.76 trillion parameters.

Dreaming of a smartphone ‘supercycle’

Smartphone makers have been dreaming of a “supercycle” in their industry, driven by artificial intelligence, after a bruising few years that saw device sales slow aggressively. In 2023, smartphone sales fell to 1.16 billion units, the lowest point for unit shipments in a decade.

Analysts say a supercycle is unlikely to occur within the next few years as there’s not enough going on in the market in terms of novel features and innovation that will convince people holding their aging smartphones to upgrade.

“Unfortunately, we don’t expect that boom,” Francisco Jeronimo, vice president of data and analytics at research firm IDC, told CNBC.

“The last supercycle we saw was between 2010 and 2015, where in five years the market grew five times from around 300 million smartphones per year to 1.5 billion.”

Nonetheless, more and more smartphone makers are making big investments in AI in the hope that it will drive some more excitement around mobile technology.

The likes of Humane, Rabbit, and China’s Meizu are betting on a future for smartphones that doesn’t even look like a traditional smartphone. These are devices that would be smaller and more compact, and that we could interact with via voice activation, like an Amazon Echo speaker but on the go.

Google has been making huge bets on AI in an effort to gain an edge over its rivals like OpenAI, the Microsoft-backed company behind ChatGPT.

Google recently announced a major rebrand of Bard, its ChatGPT alternative, including a fresh app and subscription options. Bard was renamed Gemini, the same name as the suite of AI models that power the chatbot.

Android users can download a dedicated Android app for Gemini, while iPhone users can use Gemini in the Google app on iOS.

Alphabet CEO Sundar Pichai highlighted the firm’s commitment to AI during the company’s Jan. 30 earnings call. Pichai said he eventually wants to offer an AI agent that can complete more and more tasks on a user’s behalf, including within Google Search, although he said there is “a lot of execution ahead.”

Likewise, chief executives at tech giants from Microsoft to Amazon have underlined their commitment to building AI agents as productivity tools.

Technology

Luma AI raises $900 million in funding round led by Saudi AI firm Humain

Published

32 mins ago

November 19, 2025

admin

Luma AI raises 0 million in funding round led by Saudi AI firm Humain

Thomas Fuller | SOPA Images | Lightrocket | Getty Images

Video generation startup Luma AI said it raised $900 million in a new funding round led by Humain, an artificial intelligence company owned by Saudi Arabia’s Public Investment Fund.

The financing, which included participation from Advanced Micro Devices’ venture arm and existing investors Andreessen Horowitz, Amplify Partners and Matrix Partners, was announced at the U.S.-Saudi Investment Forum on Wednesday.

The company is now valued upwards of $4 billion, CNBC has confirmed.

Luma develops multimodal “world models” that are able to learn from not only text, but also video, audio and images in order to simulate reality. CEO Amit Jain told CNBC in an interview that these models expand beyond large language models, which are solely trained on text, to be more effective in “helping in the real, physical world.”

“With this funding, we plan to scale our and accelerate our efforts in training and then deploying these world models today,” Jain said.

Luma released Ray3 in September, the first reasoning video model that can interpret prompts to create videos, images and audio. Jain said Ray3 currently benchmarks higher than OpenAI’s Sora 2 and around the same level as Google’s Veo 3.

Humain, which was launched in May, is aiming to deliver full-stack AI capabilities to bolster Saudi Arabia’s position as a global AI hub. The company is led by industry veteran Tareq Amin, who previously ran Aramco Digital and before that was CEO of Rakuten Mobile.

Luma and Humain will also partner to build a 2-gigawatt AI supercluster, dubbed Project Halo, in Saudi Arabia. The buildout will be one of the one of the largest deployments of graphic processing units (GPUs) in the world, Jain said.

Major tech companies have been investing in supercomputers across the globe to train massive AI models. In July, Meta announced plans to build a 1-gigawatt supercluster called Prometheus, and Microsoft deployed the first supercomputing cluster using the Nvidia GB300 NVL72 platform in October.

“Our investment in Luma AI, combined with HUMAIN’s 2GW supercluster, positions us to train, deploy, and scale multimodal intelligence at a frontier level,” Amin said in a release. “This partnership sets a new benchmark for how capital, compute, and capability come together.”

The collaboration also includes Humain Create, an initiative to create sovereign AI models trained on Arabic and regional data. Along with focusing on building the world’s first Arabic video model, Jain said Luma models and capabilities will be deployed to Middle Eastern businesses.

He added that since most models are trained by scraping data from the internet, countries outside the U.S. and Asia are often less represented in AI-generated content.

“It’s really important that we bring these cultures, their identities, their representation — visual and behavioral and everything — to our model,” Jain said.

AI-generated content tools have received significant backlash over the past year from entertainment studios over copyright concerns. Luma’s flagship text-to-video platform Dream Machine garnered some accusations of copying IP earlier this year, but Jain the company has installed safeguards to prevent unwanted usage.

“Even if you really try to trick it, we are constantly improving it,” he said. “We have built very robust systems that are actually using models we trained to detect them.”

WATCH: Humain CEO on building an Arabic rival to ChatGPT

Saudi Arabia's Humain CEO on building an Arabic rival to ChatGPT

Technology

Perplexity announces free product to streamline online shopping

Published

1 hour ago

November 19, 2025

admin

Perplexity announces free product to streamline online shopping

Davide Bonaldo | Lightrocket | Getty Images

Perplexity on Wednesday announced it will roll out a free agentic shopping product for U.S. users next week, as consumers ramp up spending for the holiday season.

“The agentic part is the seamless purchase right from the answer,” Dmitry Shevelenko, Perplexity’s chief business officer, told CNBC in an interview. “Most people want to still do their own research. They want that streamlined and simplified, and so that’s the part that is agentic in this launch.”

The artificial intelligence startup has partnered with PayPal ahead of the launch, and users will eventually be able to directly purchase items from more than 5,000 merchants through Perplexity’s search engine.

Perplexity initially released a shopping offering called “Buy With Pro” for its paid subscribers late last year. The company said its new free product will be better at detecting shopping intent and will deliver more personalized results by drawing on memory from a user’s previous searches.

Perplexity declined to share whether it will earn revenue from transactions that are completed through its platform.

The startup’s competitor OpenAI announced a similar e-commerce feature called Instant Checkout in September, which allows ChatGPT users to buy items from merchants without leaving the chatbot’s interface. OpenAI has said it will take a fee from those purchases.

Nvidia reports third-quarter earnings after the bell

Published

1 hour ago

November 19, 2025

admin

Nvidia reports third-quarter earnings after the bell

Nvidia founder and CEO Jensen Huang reacts during a press conference at the Asia-Pacific Economic Cooperation (APEC) CEO Summit in Gyeongju on October 31, 2025.

Jung Yeon-je | Afp | Getty Images

Artificial intelligence chipmaker Nvidia is scheduled to report fiscal third-quarter earnings on Wednesday after the market closes.

Here’s what Wall Street is expecting, per LSEG consensus estimates:

EPS: $1.25
Revenue: $54.92 billion

Wall Street is expecting the chipmaker to guide in the current quarter to $1.43 in earnings per share on $61.66 billion of revenue. Nvidia typically provides one quarter of revenue guidance.

Anything Nvidia or CEO Jensen Huang says about the company’s outlook and its sales backlog will be closely scrutinized.

He’ll have lots to talk about.

Nvidia is at the center of the AI boom, and it counts counts every major cloud company and AI lab as a customer. All of the major AI labs use Nvidia chips to develop next-generation models, and a handful of companies called hyperscalers have committed hundreds of billions of dollars to construct new data centers around Nvidia technology in unprecedented build-outs.

Last month, Huang said Nvidia had $500 billion in chip orders in calendar 2025 and 2026, including the forthcoming Rubin chip, which will start shipping in volume next year. Analysts will want to know more about what Nvidia sees coming from the AI infrastructure world next year, because all five of the top AI model developers in the U.S. use the company’s chips.

As of Tuesday, analysts polled by LSEG expect Nvidia’s sales to rise 39% in the company’s fiscal 2027, which starts in early 2026.

Investors will want to hear about Nvidia’s equity deals with customers and suppliers, including an agreement to invest in OpenAI, a deal with Nokia and an investment into former rival Intel. Nvidia has kept its pace of deal-making up, agreeing to invest $10 billion into AI company Anthropic earlier this week.

Nvidia management will also be asked about China, and the possibility that the company could gain licenses from the U.S. government to export a version of its current-generation Blackwell AI chip to the country. Analysts say Nvidia’s sales could get a boost of as much as $50 billion per year if it is allowed to sell current-generation chips to Chinese companies.

WATCH: There’s a lot riding on Nvidia’s earnings, says Interactive Brokers’ Sosnick