But these image-generating programs — which look like toys today — could be the start of a big wave in technology. Technologists call them generative models, or generative AI.
“In the last three months, the words ‘generative AI’ went from, ‘no one even discussed this’ to the buzzword du jour,” said David Beisel, a venture capitalist at NextView Ventures.
In the past year, generative AI has gotten so much better that it’s inspired people to leave their jobs, start new companies and dream about a future where artificial intelligence could power a new generation of tech giants.
The field of artificial intelligence has been in a boom phase for the past half-decade or so, but most of those advancements have been related to making sense of existing data. AI models have quickly grown efficient enough to recognize whether there’s a cat in a photo you just took on your phone and reliable enough to help power Google search results billions of times per day.
But generative AI models can produce something entirely new that wasn’t there before — in other words, they’re creating, not just analyzing.
“The impressive part, even for me, is that it’s able to compose new stuff,” said Boris Dayma, creator of the Craiyon generative AI. “It’s not just creating old images, it’s new things that can be completely different to what it’s seen before.”
Sequoia Capital — one of the most successful venture capital firms in the industry’s history, with early bets on companies like Apple and Google — says in a blog post on its website that “Generative AI has the potential to generate trillions of dollars of economic value.” The VC firm predicts that generative AI could change every industry that requires humans to create original work, from gaming to advertising to law.
In a twist, Sequoia also notes in the post that the message was partially written by GPT-3, a generative AI that produces text.
How generative AI works
Image generation uses techniques from a subset of machine learning called deep learning, which has driven most of the advancements in the field of artificial intelligence since a landmark 2012 paper about image classification ignited renewed interest in the technology.
Deep learning uses models trained on large sets of data until the program understands relationships in that data. Then the model can be used for applications, like identifying if a picture has a dog in it, or translating text.
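As a rough illustration of that kind of analysis, the short Python sketch below asks a pretrained image classifier whether a photo likely shows a dog. It assumes the open-source PyTorch and torchvision libraries are installed, and the file name is just a placeholder.

```python
# Minimal sketch: ask a pretrained classifier what is in a photo.
# Assumes PyTorch and torchvision are installed; "photo.jpg" is a placeholder path.
import torch
from torchvision import models
from PIL import Image

weights = models.ResNet50_Weights.DEFAULT          # model pretrained on ImageNet
model = models.resnet50(weights=weights).eval()

preprocess = weights.transforms()                  # resize and normalize as the model expects
image = preprocess(Image.open("photo.jpg").convert("RGB")).unsqueeze(0)

with torch.no_grad():
    probs = model(image).softmax(dim=1)[0]

# ImageNet's label set includes many dog breeds, e.g. "golden retriever"
print(weights.meta["categories"][probs.argmax().item()])
```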
Image generators work by turning this process on its head. Instead of translating from English to French, for example, they translate an English phrase into an image. They usually have two main parts: one that processes the initial phrase, and a second that turns that data into an image.
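The sketch below is a deliberately simplified, hypothetical illustration of that two-part structure. The functions are stand-ins, not the code any real generator uses, but they show the flow: a phrase is first turned into numbers, and those numbers are then turned into pixels.

```python
# Hypothetical two-part structure of an image generator (illustration only).

def text_encoder(phrase: str) -> list[float]:
    """Map a phrase to a numeric representation (an 'embedding')."""
    # In real systems this is a large neural network; this is a stand-in.
    return [float(ord(c)) for c in phrase]

def image_generator(embedding: list[float], size: int = 64) -> list[list[float]]:
    """Turn the embedding into a grid of pixel values."""
    # Real generators are diffusion models or GANs; this just fills a grid.
    value = sum(embedding) % 255
    return [[value] * size for _ in range(size)]

pixels = image_generator(text_encoder("an astronaut riding a horse"))
```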
The first wave of generative AIs was based on an approach called GANs, or generative adversarial networks. GANs were famously used in a tool that generates photos of people who don’t exist. Essentially, they work by pitting two AI models against each other: one generates images while the other judges them, pushing the generator to produce images that better fit a goal.
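The Python sketch below, which assumes PyTorch and uses deliberately tiny stand-in networks, shows that adversarial setup in miniature: the discriminator learns to label real images as real and generated ones as fake, while the generator learns to fool it.

```python
# Minimal sketch of a single GAN training step (assumes PyTorch; tiny toy networks).
import torch
from torch import nn

generator = nn.Sequential(nn.Linear(64, 256), nn.ReLU(), nn.Linear(256, 28 * 28), nn.Tanh())
discriminator = nn.Sequential(nn.Linear(28 * 28, 256), nn.LeakyReLU(0.2), nn.Linear(256, 1))

opt_g = torch.optim.Adam(generator.parameters(), lr=2e-4)
opt_d = torch.optim.Adam(discriminator.parameters(), lr=2e-4)
loss_fn = nn.BCEWithLogitsLoss()

real_images = torch.rand(32, 28 * 28)   # stand-in for a batch of real training images

# Discriminator step: label real images 1 and generated images 0.
fake_images = generator(torch.randn(32, 64))
d_loss = loss_fn(discriminator(real_images), torch.ones(32, 1)) + \
         loss_fn(discriminator(fake_images.detach()), torch.zeros(32, 1))
opt_d.zero_grad(); d_loss.backward(); opt_d.step()

# Generator step: try to make the discriminator label the fakes as real.
g_loss = loss_fn(discriminator(fake_images), torch.ones(32, 1))
opt_g.zero_grad(); g_loss.backward(); opt_g.step()
```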
Newer approaches generally use transformers, a type of model first described in a 2017 Google paper. It’s an emerging technique that can take advantage of much bigger datasets, and the resulting models can cost millions of dollars to train.
The first image generator to gain a lot of attention was DALL-E, a program announced in 2021 by OpenAI, a well-funded startup in Silicon Valley. OpenAI released a more powerful version this year.
“With DALL-E 2, that’s really the moment when, sort of, we crossed the uncanny valley,” said Christian Cantrell, a developer focusing on generative AI.
Another commonly used AI-based image generator is Craiyon, formerly known as Dall-E Mini, which is available on the web. Users can type in a phrase and see it illustrated in minutes in their browser.
Since launching in July 2021, Craiyon has grown to generate about 10 million images a day, adding up to roughly 1 billion images that never existed before, according to Dayma. He made Craiyon his full-time job after usage skyrocketed earlier this year, and he says he’s focused on using advertising to keep the website free for users because server costs are high.
Stable Diffusion, another popular image generator, has been integrated into Adobe Photoshop through a plug-in, allowing users to generate backgrounds and other parts of images and then manipulate them directly inside the application using layers and other Photoshop tools. That turns generative AI from something that produces finished images into a tool professionals can use.
“I wanted to meet creative professionals where they were and I wanted to empower them to bring AI into their workflows, not blow up their workflows,” said Cantrell, developer of the plug-in.
Cantrell, who was a 20-year Adobe veteran before leaving his job this year to focus on generative AI, says the plug-in has been downloaded tens of thousands of times. Artists tell him they use it in myriad ways that he couldn’t have anticipated, such as animating Godzilla or creating pictures of Spider-Man in any pose the artist could imagine.
“Usually, you start from inspiration, right? You’re looking at mood boards, those kinds of things,” Cantrell said. “So my initial plan with the first version, let’s get past the blank canvas problem, you type in what you’re thinking, just describe what you’re thinking and then I’ll show you some stuff, right?”
An emerging art in working with generative AIs is framing the “prompt,” the string of words that leads to the image. A search engine called Lexica catalogs Stable Diffusion images and the exact strings of words that can be used to generate them.
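In code, prompt crafting looks something like the sketch below, which uses the open-source diffusers library to run Stable Diffusion. It assumes diffusers, PyTorch and a GPU are available, and the checkpoint name refers to the publicly released Stable Diffusion v1.5 weights; changing the wording, such as the style cues at the end of the prompt, changes the image.

```python
# Sketch of prompt-driven image generation with Stable Diffusion via diffusers.
# Assumes the diffusers and torch libraries are installed and a GPU is available.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

# The wording of the prompt, including style cues, strongly shapes the result.
prompt = "a cat sitting on the moon, in the style of Pablo Picasso, detailed, stars"
image = pipe(prompt, num_inference_steps=30, guidance_scale=7.5).images[0]
image.save("cat_on_moon.png")
```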
Startups, cloud providers, and chip makers could thrive
Image generated by DALL-E with prompt: A cat sitting on the moon, in the style of Pablo Picasso, detailed, stars
Some investors are looking at generative AI as a potentially transformative platform shift, like the smartphone or the early days of the web. These kinds of shifts greatly expand the total addressable market of people who might be able to use the technology, moving from a few dedicated nerds to business professionals — and eventually everyone else.
“It’s not as though AI hadn’t been around before this — and it wasn’t like we hadn’t had mobile before 2007,” said Beisel, the seed investor. “But it’s like this moment where it just kind of all comes together. That real people, like end-user consumers, can experiment and see something that’s different than it was before.”
Cantrell sees generative machine learning as akin to an even more foundational technology: the database. Originally pioneered by companies like Oracle in the 1970s as a way to store and organize discrete bits of information in clearly delineated rows and columns (think of an enormous Excel spreadsheet), databases have since been re-envisioned to store every type of data for every conceivable kind of computing application, from the web to mobile.
“Machine learning is kind of like databases, where databases were a huge unlock for web apps. Almost every app you or I have ever used in our lives is on top of a database,” Cantrell said. “Nobody cares how the database works, they just know how to use it.”
Michael Dempsey, managing partner at Compound VC, says moments where technologies previously limited to labs break into the mainstream are “very rare” and attract a lot of attention from venture investors, who like to make bets on fields that could be huge. Still, he warns that this moment in generative AI might end up being a “curiosity phase” closer to the peak of a hype cycle. And companies founded during this era could fail because they don’t focus on specific uses that businesses or consumers would pay for.
Others in the field believe that startups pioneering these technologies today could eventually challenge the software giants that currently dominate the artificial intelligence space, including Google, Facebook parent Meta and Microsoft, paving the way for the next generation of tech giants.
“There’s going to be a bunch of trillion-dollar companies — a whole generation of startups who are going to build on this new way of doing technologies,” said Clement Delangue, the CEO of Hugging Face, a developer platform like GitHub that hosts pre-trained models, including those for Craiyon and Stable Diffusion. Its goal is to make AI technology easier for programmers to build on.
Some of these firms are already sporting significant investment.
Hugging Face was valued at $2 billion after raising money earlier this year from investors including Lux Capital and Sequoia; and OpenAI, the most prominent startup in the field, has received over $1 billion in funding from Microsoft and Khosla Ventures.
Meanwhile, Stability AI, the maker of Stable Diffusion, is in talks to raise venture funding at a valuation of as much as $1 billion, according to Forbes. A representative for Stability AI declined to comment.
Cloud providers like Amazon, Microsoft and Google could also benefit because generative AI can be very computationally intensive.
Meta and Google have hired some of the most prominent talent in the field in hopes of integrating advances into their products. In September, Meta announced an AI program called “Make-A-Video” that takes the technology one step further by generating videos, not just images.
“This is pretty amazing progress,” Meta CEO Mark Zuckerberg said in a post on his Facebook page. “It’s much harder to generate video than photos because beyond correctly generating each pixel, the system also has to predict how they’ll change over time.”
On Wednesday, Google matched Meta, announcing and releasing code for a program called Phenaki that also does text-to-video and can generate minutes of footage.
The boom could also bolster chipmakers like Nvidia, AMD and Intel, which make the kind of advanced graphics processors that are ideal for training and deploying AI models.
At a conference last week, Nvidia CEO Jensen Huang highlighted generative AI as a key use for the company’s newest chips, saying these kinds of programs could soon “revolutionize communications.”
Profitable end uses for generative AI are currently rare, and a lot of today’s excitement revolves around free or low-cost experimentation. For example, some writers have experimented with using image generators to make images for articles.
Prompt: “A cat sitting on the moon, in the style of picasso, detailed”
Ultimately, everyone developing generative AI will have to grapple with some of the ethical issues that image generators raise.
First, there’s the jobs question. Even though many programs require a powerful graphics processor, computer-generated content is still going to be far less expensive than the work of a professional illustrator, which can cost hundreds of dollars per hour.
That could spell trouble for artists, video producers and other people whose job it is to generate creative work. For example, a person whose job is choosing images for a pitch deck or creating marketing materials could be replaced by a computer program very shortly.
“It turns out, machine-learning models are probably going to start being orders of magnitude better and faster and cheaper than that person,” said Compound VC’s Dempsey.
There are also complicated questions around originality and ownership.
Generative AIs are trained on huge numbers of images, and it’s still being debated in the field and in the courts whether the creators of the original images have any copyright claim over images generated in their style.
One artist won an art competition in Colorado using an image largely created by a generative AI called Midjourney, although he said in interviews afterward that he chose the image from hundreds he had generated and then tweaked it in Photoshop.
Some images generated by Stable Diffusion appear to contain watermarks, suggesting that parts of the original training data were copyrighted. Some prompt guides recommend including specific living artists’ names in prompts to get results that mimic those artists’ styles.
Image generators can also be used to create new images of trademarked characters or objects, such as the Minions, Marvel characters or the throne from Game of Thrones.
As image-generating software gets better, it also has the potential to fool users into believing false information, or to display images or videos of events that never happened.
Developers also have to grapple with the possibility that models trained on large amounts of data may have biases related to gender, race or culture included in the data, which can lead to the model displaying that bias in its output. For its part, Hugging Face, the model-sharing website, publishes materials such as an ethics newsletter and holds talks about responsible development in the AI field.
“What we’re seeing with these models is one of the short-term and existing challenges is that because they’re probabilistic models, trained on large datasets, they tend to encode a lot of biases,” Delangue said, offering an example of a generative AI drawing a picture of a “software engineer” as a white man.
OpenAI CEO Sam Altman visits “Making Money With Charles Payne” at Fox Business Network Studios in New York on Dec. 4, 2024.
OpenAI CEO Sam Altman’s sister, Ann Altman, filed a lawsuit on Monday, alleging that her brother sexually abused her regularly between 1997 and 2006.
The lawsuit, which was filed in U.S. District Court in the Eastern District of Missouri, alleges that the abuse took place at the family’s home in Clayton, Missouri, and began when Ann, who goes by Annie, was three and Sam was 12. The filing claims that the abusive activities took place “several times per week,” beginning with oral sex and later involving penetration.
The lawsuit claims that “as a direct and proximate result of the foregoing acts of sexual assault,” the plaintiff has experienced “severe emotional distress, mental anguish, and depression, which is expected to continue into the future.”
The younger Altman has publicly made similar sexual assault allegations against her brother in the past on platforms like X, but this is the first time she’s taken him to court. She’s being represented by Ryan Mahoney, whose Illinois-based firm specializes in matters including sexual assault and harassment.
The lawsuit requests a jury trial and damages in excess of $75,000.
In a joint statement on X with his mother, Connie, and his brothers Jack and Max, Sam Altman denied the allegations.
“Annie has made deeply hurtful and entirely untrue claims about our family, and especially Sam,” the statement said. “We’ve chosen not to respond publicly, out of respect for her privacy and our own. However, she has now taken legal action against Sam, and we feel we have no choice but to address this.”
Their response says “all of these claims are utterly untrue,” adding that “this situation causes immense pain to our entire family.” They said that Ann Altman faces “mental health challenges” and “refuses conventional treatment and lashes out at family members who are genuinely trying to help.”
Sam Altman has gained international prominence since OpenAI’s debut of the artificial intelligence chatbot ChatGPT in November 2022. Backed by Microsoft, the company was most recently valued at $157 billion, with funding coming from Thrive Capital, chipmaker Nvidia, SoftBank and others.
Altman was briefly ousted from the CEO role by OpenAI’s board in November 2023, but was quickly reinstated due to pressure from investors and employees.
This isn’t the only lawsuit the tech exec faces.
In March, Tesla and SpaceX CEO Elon Musk sued OpenAI and co-founders Altman and Greg Brockman, alleging breach of contract and fiduciary duty. Musk, who now runs a competing AI startup, xAI, was a co-founder of OpenAI when it began as a nonprofit in 2015. Musk left the board in 2018 and has publicly criticized OpenAI for allegedly abandoning its original mission.
Musk is suing to keep OpenAI from turning into a for-profit company. In June, Musk withdrew the original complaint filed in a San Francisco state court and later refiled in federal court.
Last month, OpenAI clapped back against Musk, claiming in a blog post that in 2017 Musk “not only wanted, but actually created, a for-profit” to serve as the company’s proposed new structure.
This photo illustration created on January 7, 2025, in Washington, DC, shows an image of Mark Zuckerberg, CEO of Meta, and an image of the Meta logo.
Meta employees took to their internal forum on Tuesday, criticizing the company’s decision to end third-party fact-checking on its services two weeks before President-elect Donald Trump’s inauguration.
Company employees voiced their concern after Joel Kaplan, Meta’s new chief global affairs officer and former White House deputy chief of staff under former President George W. Bush, announced the content policy changes on Workplace, the in-house communications tool.
“We’re optimistic that these changes help us return to that fundamental commitment to free expression,” Kaplan wrote in the post, which was reviewed by CNBC.
The content policy announcement follows a string of decisions that appear targeted to appease the incoming administration. On Monday, Meta added new members to its board, including UFC CEO Dana White, a longtime friend of Trump, and the company confirmed last month that it was contributing $1 million to Trump’s inauguration.
Among the latest changes, Kaplan announced that Meta will scrap its fact-checking program and shift to a user-generated system like X’s Community Notes. Kaplan, who took over his new role last week, also said that Meta will lift restrictions on certain topics and focus its enforcement on illegal and high-severity violations while giving users “a more personalized approach to political content.”
One worker wrote they were “extremely concerned” about the decision, saying it appears Meta is “sending a bigger, stronger message to people that facts no longer matter, and conflating that with a victory for free speech.”
Another employee commented that “simply absolving ourselves from the duty to at least try to create a safe and respective platform is a really sad direction to take.” Other comments expressed concern about the impact the policy change could have on the discourse around topics like immigration, gender identity and gender, which, according to one employee, could result in an “influx of racist and transphobic content.”
A separate employee said they were scared that “we’re entering into really dangerous territory by paving the way for the further spread of misinformation.”
The changes weren’t universally criticized, as some Meta workers praised the company’s decision to end third-party fact-checking. One wrote that X’s Community Notes feature has “proven to be a much better representation of the ground truth.”
Another employee commented that the company should “provide an accounting of the worst outcomes of the early years” that necessitated the creation of a third-party fact-checking program, and whether the new policies would prevent the same type of fallout from happening again.
As part of the company’s massive layoffs in 2023, Meta also scrapped an internal fact-checking project, CNBC reported. That project would have let third-party fact checkers like the Associated Press and Reuters, in addition to credible experts, comment on flagged articles in order to verify the content.
Although Meta announced the end of its fact-checking program on Tuesday, the company had already been pulling it back. In September, a spokesperson for the AP told CNBC that the news agency’s “fact-checking agreement with Meta ended back in January” 2024.
Dana White, CEO of the Ultimate Fighting Championship, gestures as he speaks during a rally for Republican presidential nominee and former U.S. President Donald Trump at Madison Square Garden in New York, U.S., Oct. 27, 2024.
After the announcement of White’s addition to the board on Monday, employees also posted criticism, questions and jokes on Workplace, according to posts reviewed by CNBC.
White, who has led UFC since 2001, became embroiled in controversy in 2023 after a video published by TMZ showed him slapping his wife at a New Year’s Eve party in Mexico. White issued a public apology, and his wife, Anne White, issued a statement to TMZ, calling it an isolated incident.
Commenters on Workplace made jokes asking whether performance reviews would now involve mixed martial arts style fights.
In addition to White, John Elkann, the CEO of Italian auto holding company Exor, was named to Meta’s board.
Some employees asked what value auto and entertainment executives could bring to Meta, and whether White’s addition reflects the company’s values. One post suggested the new board appointments would help with political alliances that could be valuable but could also change the company culture in unintended or unwanted ways.
Comments in Workplace alluding to White’s personal history were flagged and removed from the discussion, according to posts from the internal app read by CNBC.
An employee who said he was with Meta’s Internal Community Relations team posted a reminder to Workplace about the company’s “community engagement expectations” policy, or CEE, for using the platform.
“Multiple comments have been flagged by the community for review,” the employee posted. “It’s important that we maintain a respectful work environment where people can do their best work.”
The internal community relations team member added that “insulting, criticizing, or antagonizing our colleagues or Board members is not aligned with the CEE.”
Several workers responded to that note saying that even respectful posts, if critical, had been removed, amounting to a corporate form of censorship.
One worker said that because critical comments were being removed, the person wanted to voice support for “women and all voices.”
Meta declined to comment.
— CNBC’s Salvador Rodriguez contributed to this report.
Bitcoin slumped on Tuesday as a spike in Treasury yields weighed on risk assets broadly.
The price of the flagship cryptocurrency was last lower by 4.8% at $97,183.80, according to Coin Metrics. The broader market of cryptocurrencies, as measured by the CoinDesk 20 index, dropped more than 5%.
The moves followed a sudden increase in the 10-year U.S. Treasury yield after data released by the Institute for Supply Management reflected faster-than-expected growth in the U.S. services sector in December, adding to concerns about stickier inflation. Rising yields tend to pressure growth-oriented risk assets.
Bitcoin traded above $102,000 on Monday and is widely expected to roughly double from that level this year. Investors are hopeful that clearer regulation will support digital asset prices and in turn benefit stocks like Coinbase and Robinhood.
However, uncertainty about the path of Federal Reserve interest rate cuts could put bumps in the road for crypto prices. In December, the central bank signaled that although it was cutting rates for a third time, it might make fewer cuts in 2025 than investors had anticipated. Historically, rate cuts have had a positive effect on bitcoin’s price, while hikes have had a negative impact.
Bitcoin is up more than 3% since the start of the year. It posted a 120% gain for 2024.