A team at Google has proposed using AI technology to create a “bird’s-eye” view of users’ lives using mobile phone data such as photographs and searches.
Dubbed “Project Ellmann,” after biographer and literary critic Richard David Ellmann, the idea would be to use LLMs like Gemini to ingest search results, spot patterns in a user’s photos, create a chatbot, and “answer previously impossible questions,” according to a copy of a presentation viewed by CNBC. Ellmann’s aim, it states, is to be “Your Life Story Teller.”
It’s unclear if the company has plans to produce these capabilities within Google Photos, or any other product. Google Photos has more than one billion users and four trillion photos and videos, according to a company blog post.
Project Ellman is just one of many ways Google is proposing to create or improve its products with AI technology. On Wednesday, Google launched its latest “most capable” and advanced AI model yet, Gemini, which in some cases outperformed OpenAI’s GPT-4. The company is planning to license Gemini to a wide range of customers through Google Cloud for them to use in their own applications. One of Gemini’s standout features is that it’s multimodal, meaning it can process and understand information beyond text, including images, video and audio.
A product manager for Google Photos presented Project Ellman alongside Gemini teams at a recent internal summit, according to documents viewed by CNBC. They wrote that the teams spent the past few months determining that large language models are the ideal tech to make this bird’s-eye approach to one’s life story a reality.
Ellmann could pull in context using biographies, previous moments, and subsequent photos to describe a user’s photos more deeply than “just pixels with labels and metadata,” the presentation states. It proposes to be able to identify a series of moments like university years, Bay Area years, and years as a parent.
“We can’t answer tough questions or tell good stories without a bird’s-eye view of your life,” one description reads alongside a photo of a small boy playing with a dog in the dirt.
“We trawl through your photos, looking at their tags and locations to identify a meaningful moment,” a presentation slide reads. “When we step back and understand your life in its entirety, your overarching story becomes clear.”
The presentation said large language models could infer moments like a user’s child’s birth. “This LLM can use knowledge from higher in the tree to infer that this is Jack’s birth, and that he’s James and Gemma’s first and only child.”
“One of the reasons that an LLM is so powerful for this bird’s-eye approach, is that it’s able to take unstructured context from all different elevations across this tree, and use it to improve how it understands other regions of the tree,” a slide reads, alongside an illustration of a user’s various life “moments” and “chapters.”
Presenters gave another example of determining one user had recently been to a class reunion. “It’s exactly 10 years since he graduated and is full of faces not seen in 10 years so it’s probably a reunion,” the team inferred in its presentation.
The team also demonstrated “Ellmann Chat,” with the description: “Imagine opening ChatGPT but it already knows everything about your life. What would you ask it?”
It displayed a sample chat in which a user asks “Do I have a pet?” To which it answers that yes, the user has a dog which wore a red raincoat, then offered the dog’s name and the names of the two family members it’s most often seen with.
Another example for the chat was a user asking when their siblings last visited. Another asked it to list similar towns to where they live because they are thinking of moving. Ellmann offered answers to both.
Ellmann also presented a summary of the user’s eating habits, other slides showed. “You seem to enjoy Italian food. There are several photos of pasta dishes, as well as a photo of a pizza.” It also said that the user seemed to enjoy new food because one of their photos had a menu with a dish it didn’t recognize.
The technology also determined what products the user was considering purchasing, their interests, work, and travel plans based on the user’s screenshots, the presentation stated. It also suggested it would be able to know their favorite websites and apps, giving examples Google Docs, Reddit and Instagram.
A Google spokesperson told CNBC, “Google Photos has always used AI to help people search their photos and videos, and we’re excited about the potential of LLMs to unlock even more helpful experiences. This is a brainstorming concept a team is at the early stages of exploring. As always, we’ll take the time needed to ensure we do it responsibly, protecting users’ privacy as our top priority.”
Big Tech’s race to create AI-driven ‘Memories’
The proposed Project Ellmann could help Google in the arms race among tech giants to create more personalized life memories.
Google Photos and Apple Photos have for years served “memories” and generated albums based on trends in photos.
In November, Google announced that with the help of AI, Google Photos can now group together similar photos and organize screenshots into easy-to-find albums.
Apple announced in June that its latest software update will include the ability for its photo app to recognize people, dogs, and cats in their photos. It already sorts out faces and allows users to search for them by name.
Apple also announced an upcoming Journal App, which will use on-device AI to create personalized suggestions to prompt users to write passages that describe their memories and experiences based on recent photos, locations, music and workouts.
But Apple, Google and other tech giants are still grappling with the complexities of displaying and identifying images appropriately.
For instance, Apple and Google still avoid labeling gorillas after reports in 2015 found the company mislabeling Black people as gorillas. A New York Times investigation this year found Apple and Google’s Android software, which underpins most of the world’s smartphones, turned off the ability to visually search for primates for fear of labeling a person as an animal.
Companies including Google, Facebook and Apple have over time added controls to minimize unwanted memories, but users have reported they sometimes still surface unwanted memories and require the users to toggle through several settings in order to minimize them.
Chinese e-commerce behemoth Alibaba on Friday beat profit expectations in its September quarter, but sales fell short as sluggishness in the world’s second-largest economy hit consumer spending.
Alibaba said net income rose 58% year on year to 43.9 billion yuan ($6.07 billion) in the company’s quarter ended Sept. 30, on the back of the performance of its equity investments. This compares with an LSEG forecast of 25.83 billion yuan.
“The year-over-year increases were primarily attributable to the mark-to-market changes from our equity investments, decrease in impairment of our investments and increase in income from operations,” the company said of the annual profit jump in its earnings statement.
Revenue, meanwhile, came in at 236.5 billion yuan, 5% higher year on year but below an analyst forecast of 238.9 billion yuan, according to LSEG data.
The company’s New York-listed shares have gained ground this year to date, up more than 13%. The stock fell more than 2% in morning trading on Friday, after the release of the quarterly earnings.
Sales sentiment
Investors are closely watching the performance of Alibaba’s main business units, Taobao and Tmall Group, which reported a 1% annual uptick in revenue to 98.99 billion yuan in the September quarter.
The results come at a tricky time for Chinese commerce businesses, given a tepid retail environment in the country. Chinese e-commerce group JD.com also missed revenue expectations on Thursday, according to Reuters.
Markets are now watching whether a slew of recent stimulus measures from Beijing, including a five-year 1.4 trillion yuan package announced last week, will help resuscitate the country’s growth and curtail a long-lived real estate market slump.
The impact on the retail space looks promising so far, with sales rising by a better-than-expected 4.8% year on year in October, while China’s recent Singles’ Day shopping holiday — widely seen as a barometer for national consumer sentiment — regained some of its luster.
Alibaba touted “robust growth” in gross merchandise volume — an industry measure of sales over time that does not equate to the company’s revenue — for its Taobao and Tmall Group businesses during the festival, along with a “record number of active buyers.”
“Alibaba’s outlook remains closely aligned with the trajectory of the Chinese economy and evolving regulatory policies,” ING analysts said Thursday, noting that the company’s Friday report will shed light on the Chinese economy’s growth momentum.
The e-commerce giant’s overseas online shopping businesses, such as Lazada and Aliexpress, meanwhile posted a 29% year-on-year hike in sales to 31.67 billion yuan.
Cloud business accelerates
Alibaba’s Cloud Intelligence Group reported year-on-year sales growth of 7% to 29.6 billion yuan in the September quarter, compared with a 6% annual hike in the three-month period ended in June. The slight acceleration comes amid ongoing efforts by the company to leverage its cloud infrastructure and reposition itself as a leader in the booming artificial intelligence space.
“Growth in our Cloud business accelerated from prior quarters, with revenues from public cloud products growing in double digits and AI-related product revenue delivering triple-digit growth. We are more confident in our core businesses than ever and will continue to invest in supporting long-term growth,” Alibaba CEO Eddie Wu said in a statement Friday.
Stymied by Beijing’s sweeping 2022 crackdown on large internet and tech companies, Alibaba last year overhauled the division’s leadership and has been shaping it as a future growth driver, stepping up competition with rivals including Baidu and Huawei domestically, and Microsoft and OpenAI in the U.S.
Alibaba, which rolled out its own ChatGPT-style product Tongyi Qianwen last year, this week unveiled its own AI-powered search tool for small businesses in Europe and the Americas, and clinched a key five-year partnership to supply cloud services to Indonesian tech giant GoTo in September.
Speaking at the Apsara Conference in September, Alibaba’s Wu said the company’s cloud unit is investing “with unprecedented intensity, in the research and development of AI technology and the building of its global infrastructure,” noting that the future of AI is “only beginning.”
Correction: This article has been updated to reflect that Alibaba’s Cloud Intelligence Group reported quarterly revenue of 29.6 billion yuan in the September quarter.
Elon Musk listens as US President-elect Donald Trump speaks during a House Republicans Conference meeting at the Hyatt Regency on Capitol Hill on November 13, 2024 in Washington, DC.
Allison Robbert | Getty Images
Elon Musk’s artificial intelligence company xAI is raising up to $6 billion at a $50 billion valuation, according to CNBC’s David Faber.
Sources told Faber that the funding, which should close early next week, is a combination of $5 billion expected from sovereign funds in the Middle East and $1 billion from other investors, some of whom may want to re-up their investments.
The money will be used to acquire 100,000 Nvidia chips, per sources familiar with the situation. Tesla‘s Full Self Driving is expected to rely on the new Memphis supercomputer.
Musk’s AI startup, which he announced in July 2023, seeks to “understand the true nature of the universe,” according to its website. Last November, X.AI released a chatbot called Grok, which the company said was modeled after “The Hitchhiker’s Guide to the Galaxy.” The chatbot debuted with two months of training and had real-time knowledge of the internet, the company claimed at the time.
With Grok, X.AI aims to directly compete with companies including ChatGPT creator OpenAI, which Musk helped start before a conflict with co-founder Sam Altman led him to depart the project in 2018. It will also be vying with Google’s Bard technology and Anthropic’s Claude chatbot.
Now that Donald Trump is President-elect, Elon Musk is beginning to actively work with the new administration on its approach to AI and tech more broadly, as part of Trump’s inner circle in recent weeks.
Trump plans to repeal President Biden’s executive order on AI, according to his campaign platform, stating that it “hinders AI Innovation, and imposes Radical Leftwing ideas on the development of this technology” and that “in its place, Republicans support AI Development rooted in Free Speech and Human Flourishing.”
Amazon logo on a brick building exterior, San Francisco, California, August 20, 2024.
Smith Collection | Gado | Archive Photos | Getty Images
Amazon representatives met with the House China committee in recent months to discuss lawmaker concerns over the company’s partnership with TikTok, CNBC confirmed.
A spokesperson for the House Select Committee on the Chinese Communist Party confirmed the meeting, which centered on a shopping deal between Amazon and TikTok announced in August. The agreement allows users of TikTok, owned by China’s ByteDance, to link their account with Amazon and make purchases from the site without leaving TikTok.
“The Select Committee conveyed to Amazon that it is dangerous and unwise for Amazon to partner with TikTok given the grave national security threat the app poses,” the spokesperson said. The parties met in September, according to Bloomberg, which first reported the news.
Representatives from Amazon and TikTok did not immediately respond to CNBC’s request for comment.
TikTok’s future viability in the U.S. is uncertain. In April, President Joe Biden signed a law that requires ByteDance to sell TikTok by Jan. 19. If TikTok fails to cut ties with its parent company, app stores and internet hosting services would be prohibited from offering the app.
President-elect Donald Trump could rescue TikTok from a potential U.S. ban. He promised on the campaign trail that he would “save” TikTok, and said in a March interview with CNBC’s “Squawk Box” that “there’s a lot of good and there’s a lot of bad” with the app.
In his first administration, Trump had tried to implement a TikTok ban. He changed his stance around the time he met with billionaire Jeff Yass. The Republican megadonor’s trading firm, Susquehanna International Group, owns a 15% stake in ByteDance, while Yass has a 7% stake in the company, NBC and CNBC reported in March.
— CNBC’s Jonathan Vanian contributed to this report.