
How the job of Amazon delivery has changed with Rivian’s electric vans and routing software
More Videos
Published
2 years agoon
By
admin
For the 275,000 Amazon drivers dropping off 10 million packages a day around the world, the job can be a grind. But a lot has changed since drivers in 2021 told CNBC about unrealistic workloads, peeing in bottles, dog bites and error-prone routing software.
Among the biggest developments is the arrival of a brand-new electric van from Rivian.
Amazon was a big and early investor in the electric vehicle company, which went public in late 2021 with a plan to build trucks and SUVs for consumers and delivery vans for businesses. Since July, Amazon has rolled out more than 1,000 new Rivian vans, which are now making deliveries in more than 100 U.S. cities, including Baltimore, Chicago, Las Vegas, Nashville, New York City and Austin, Texas.
The partnership began in 2019, when Amazon founder and ex-CEO Jeff Bezos announced Amazon had purchased 100,000 electric vans from Rivian as one step toward his company’s ambitious promise of reaching net-zero carbon emissions by 2040.
″[We] will have prototypes on the road next year, but 100,000 deployed by 2024,” Bezos said at the National Press Club in Washington, D.C., in September 2019. Amazon has since revised the timeline, saying it expects all 100,000 Rivian vans on the road by 2030.
Rivian has faced several challenges in recent months. It cut back 2022 production amid supply chain and assembly line issues. Its stock price dropped so sharply last year that Amazon recorded a combined $11.5 billion markdown on its holdings in the first two quarters.
CNBC talked to drivers to see what’s changed with the driving experience. We also went to Amazon’s Delivering the Future event in Boston in November for a look at the technology designed to maximize safety and efficiency for delivery personnel.
For now, most Amazon drivers are still in about 110,000 gas-powered vans — primarily Ford Transits, Mercedes-Benz Sprinters and Ram ProMasters. Amazon wouldn’t share how it determines which of its 3,500 third-party delivery firms, or delivery service partners (DSPs), are receiving Rivian vans first.
The e-commerce giant has been using DSPs to deliver its packages since 2018, allowing the company to reduce its reliance on UPS and the U.S. Postal Service for the so-called last mile, the most expensive portion of the delivery journey. The DSP, which works exclusively with Amazon, employs the drivers and is responsible for the liabilities of the road, vehicle maintenance, and the costs of hiring, benefits and overtime pay.
Amazon leases the vans to DSP owners at a discount. The company covers the fuel for gas-powered vans and installs charging stations for electric vehicles.
The company says DSP owners have generated $26 billion in revenue and now operate in 15 countries, including Saudi Arabia, India, Brazil, Canada, and all over Europe.
What drivers think
In the early days of testing the Rivian vans, some drivers voiced concerns about range. An Amazon spokesperson told CNBC the vans can travel up to 150 miles on a single charge, which is typically plenty of power for a full shift and allows drivers to recharge the vehicle overnight.
As for maintenance, Amazon says that takes place at Rivian service centers near delivery stations or by a Rivian mobile service team, depending on location.
Julietta Dennis launched a DSP, Kangaroo Direct, in Baltimore three years ago. She employs about 75 drivers and leases more than 50 vans from Amazon. She now has 15 Rivian vehicles.
“It’s very easy to get in and out with all of the different handles to hold on to,” Dennis said. She said that some drivers were hesitant at first because the vehicles were so new and different, “but the moment they get in there and have their first experience, that’s the van that they want to drive.”
Baltimore DSP owner Julieta Dennis shows off a Rivian electric van at Amazon’s Delivering the Future event in Boston, Maryland, on November 10, 2022.
Erin Black
Brandi Monroe has been delivering for Kangaroo Direct for two years. She pointed to features on a Rivian van that are upgrades over what she’s driven in the past. There’s a large non-slip step at the back, a hand cart for helping with heavy packages and extra space for standing and walking in the cargo area.
“We have two shelves on both sides to allow for more space,” Monroe said, adding that she’d prefer to drive a Rivian for every shift. “And then the lights at the top: very innovative to help us see the packages and address a lot easier, especially at nighttime.”
There’s even a heated steering wheel.
Former driver B.J. Natividad, who goes by Avionyx on YouTube, says his non-electric van could get very cramped.
“I remember one time I had 23 or 24 bags and over 40 oversize packages and I had to be able to figure out how to stuff that all in there within the 15 minutes that they give us to load up in the morning,” said Natividad, who now works for USPS.
The Rivian vans have at least 100 more cubic feet than the Sprinter and up to double the cargo space of the Ford Transit vans Natividad drove in Las Vegas. Rivian vans are still small enough that they don’t require a special license to drive, though Amazon provides its own training for drivers.
One driver in Seattle, who asked to remain unnamed, was especially excited about the new Rivian vans. He offered an extensive tour of the new driving experience on his YouTube channel called Friday Adventure Club.
He said one of his favorite features is a light bar “that goes all the way around the back.” He also likes that the windshield is “absolutely massive,” the wide doors allow for easy entry and exit, and the cargo door automatically opens when the van is parked. There are two rows of shelves that fold up and down in the cargo area.
There’s also new technology, such as an embedded tablet with the driving route and a 360-degree view that shows all sides of the van.
Mai Le, Amazon’s vice president of Last Mile, oversaw the testing of the center console and Rivian’s integrated software.
“We did a lot of deliveries as a test,” Le said. “As a woman, I want to make sure that the seats are comfortable for me and that my legs can reach the pedals, I can see over the steering wheel.”
She demonstrated some of the benefits of the new technology.
“When we start to notice that you’re slowing down, that means that we can tell you’re getting near to your destination,” she said. “The map begins to zoom in, so you begin to find where’s your delivery location, which building and where parking could be.”
The new vans have keyless entry. They automatically lock when the driver is 15 feet away and unlock as the driver approaches.
Workers load packages into Amazon Rivian Electric trucks at an Amazon facility in Poway, California, November 16, 2022.
Sandy Huffaker | Reuters
Cameras and safety
Above all else, Amazon says the changes were designed to make the delivery job safer.
A ProPublica report found Amazon’s contract drivers were involved in more than 60 serious crashes from 2015 to 2019, at least 10 of which were fatal. Amazon put cameras and sensors all over the Rivian vans, which enable warnings and lane assist technology that autocorrects if the vehicle veers out of the lane.
Dennis mentioned the importance of automatic braking and the steering wheel that starts “just kind of shaking when you get too close to something.”
“There’s just so many features that would really, really help cut back on some of those incidental accidents,” she said.
Amazon vans have driver-facing cameras inside, which can catch unsafe driving practices as they happen.
“The in-vehicle safety technology we have watches for poor safety behaviors like distracted driving, seat belts not being fastened, running stop signs, traffic lights,” said Beryl Tomay, who helps run the technology side of delivery as vice president of Last Mile for Amazon.
“We’ve seen over the past year a reduction of 80% to 95% in these events when we’ve warned drivers real time,” she said. “But the really game-changing results that we’ve seen have been almost a 50% reduction in accidents.”
As a DSP owner, Dennis gets alerts if her drivers exhibit patterns of unsafe behavior.
“If something with a seat belt or just something flags, then our team will contact the driver and make sure that that’s coached on and taken care of and figured out, like what actually happened,” Dennis said.
That level of constant surveillance may be unsettling for some drivers. Dennis said that issues haven’t come up among her staffers. And Amazon stresses it’s focused on driver privacy.
“We’ve taken great care from a privacy perspective,” Tomay said. “There’s no sound ever being recorded. There’s no camera recording if the driver’s not driving and there’s a privacy mode.”
Amazon says the cabin-facing camera automatically switches off when the ignition is off, and privacy mode means it also turns off if the vehicle is stationary for more than 30 seconds.
Safety concerns extend beyond the vehicle itself. For example, an Amazon driver in Missouri was found dead in a front yard in October, allegedly after a dog attack.
Amazon says new technology can help. Drivers can choose to manually notify customers ahead of a delivery, giving them time to restrain pets. Another feature that’s coming, according to Le, will allow drivers to mark delivery locations that have pets.
Natividad said he had multiple close calls with dogs charging at him during deliveries.
“You customers out there, please restrain your dogs when you know a package is coming,” he said. “Please keep them inside. Don’t leave them just outside.”
Optimizing routes
Providing drivers with more efficient and better detailed routes could improve safety, too. Drivers in 2021 told us about losing time because Amazon’s routing software made a mistake, like not recognizing a closed road or gated community. In response, they sometimes tried to save time in other ways.
“People are running through stop signs, running through yellow lights,” said Adrienne Williams, a former DSP driver. “Everybody I knew was buckling their seat belt behind their backs because the time it took just to buckle your seat belt, unbuckle your seat belt every time was enough time to get you behind schedule.”
Amazon listened. The company has been adding a huge amount of detail to driver maps, using information from 16 third-party map vendors as well as machine learning models informed by satellite driver feedback and other sources.
One example is a new in-vehicle data collection system called Fleet Edge, which is currently in a few thousand vans. Fleet Edge collects real-time data from a street view camera and GPS device during a driver’s route.
“Due to Fleet Edge, we’ve added over 120,000 new street signs to Amazon’s mapping system,” Tomay said. “The accuracy of GPS locations has increased by over two and a half times in our test areas, improving navigation safety by announcing upcoming turns sooner.”
Tomay said the maps also added points of interest like coffee shops and restrooms, so in about 95% of metro areas, “drivers can find a spot to take a break within five minutes of a stop.”
In 2021, Amazon apologized for dismissing claims that drivers were urinating in bottles as a result of demanding delivery schedules. Natividad said he occasionally found urine-filled bottles in his vans before his shift in the mornings.
“As soon as I open the van, I’m looking around, I see a bottle of urine. I’m like, ‘Oh, I’m not touching this,'” he said.
Pay for Amazon drivers is up to the discretion of each individual DSP, although Amazon says it regularly audits DSP rates to make sure they’re competitive. Indeed.com puts average Amazon driver pay at nearly $19 an hour, 16% higher than the national average.
Natividad started delivering for Amazon in 2021 when his gigs as a fulltime disc jockey dried up because of the pandemic. He liked the job at the time, generally delivering at least 200 packages along the same route. However, during the holiday season that year, he once had more than 400 packages and 200 stops in a single shift.
“Towards the end of my day, they sent out two rescues to me to help out to make sure everything’s done before 10 hours,” he said.
Amazon is working to optimize its routes. But it’s an unwieldy operation. The company says it’s generated 225,000 unique routes per day during peak season.
Tomay said the company looks at the density of packages, the complexity of delivery locations “and any other considerations like weather and traffic from past history to put a route together that we think is ideal.”
There’s no one-size-fits-all solution.
“Given that we’re in over 20 countries and every geography looks different, it’s not just about delivery vehicles or vans anymore,” Tomay said. “We have rickshaws in India. We have walkers in Manhattan.”
In Las Vegas, Amazon held a roundtable last year for DSP owners and drivers. Natividad says he spoke for 20 minutes at the event about the need for Amazon to improve its routing algorithms.
“I think they should do that probably once a month, with all the DSP supervision and a few of the drivers, and not the same drivers every time. That way different feedback is given. And like seriously listen to them,” Natividad said. “Because they’re not the ones out there seeing and experiencing what we go through.”
Natividad didn’t get to try out the routing technology in the Rivian vans before he left to deliver for USPS in July. He’s excited that the postal service is following in Amazon’s footsteps with 66,000 electric vans coming by 2028.
Amazon, meanwhile, is diversifying its electric fleet beyond Rivian. The company has ordered thousands of electric Ram vans from Stellantis and also has some on the way from Mercedes-Benz.
You may like
Technology
AI research takes a backseat to profits as Silicon Valley prioritizes products over safety, experts say
Published
3 days agoon
May 14, 2025By
admin
Sam Altman, co-founder and CEO of OpenAI and co-founder of Tools for Humanity, participates remotely in a discussion on the sidelines of the IMF/World Bank Spring Meetings in Washington, D.C., April 24, 2025.
Brendan Smialowski | AFP | Getty Images
Not long ago, Silicon Valley was where the world’s leading artificial intelligence experts went to perform cutting-edge research.
Meta, Google and OpenAI opened their wallets for top talent, giving researchers staff, computing power and plenty of flexibility. With the support of their employers, the researchers published high-quality academic papers, openly sharing their breakthroughs with peers in academia and at rival companies.
But that era has ended. Now, experts say, AI is all about the product.
Since OpenAI released ChatGPT in late 2022, the tech industry has shifted its focus to building consumer-ready AI services, in many cases prioritizing commercialization over research, AI researchers and experts in the field told CNBC. The profit potential is massive — some analysts predict $1 trillion in annual revenue by 2028. The prospective repercussions terrify the corner of the AI universe concerned about safety, industry experts said, particularly as leading players pursue artificial general intelligence, or AGI, which is technology that rivals or exceeds human intelligence.
In the race to stay competitive, tech companies are taking an increasing number of shortcuts when it comes to the rigorous safety testing of their AI models before they are released to the public, industry experts told CNBC.
James White, chief technology officer at cybersecurity startup CalypsoAI, said newer models are sacrificing security for quality, that is, better responses by the AI chatbots. That means they’re less likely to reject malicious kinds of prompts that could cause them to reveal ways to build bombs or sensitive information that hackers could exploit, White said.
“The models are getting better, but they’re also more likely to be good at bad stuff,” said White, whose company performs safety and security audits of popular models from Meta, Google, OpenAI and other companies. “It’s easier to trick them to do bad stuff.”
The changes are readily apparent at Meta and Alphabet, which have deprioritized their AI research labs, experts say. At Facebook’s parent company, the Fundamental Artificial Intelligence Research, or FAIR, unit has been sidelined by Meta GenAI, according to current and former employees. And at Alphabet, the research group Google Brain is now part of DeepMind, the division that leads development of AI products at the tech company.
CNBC spoke with more than a dozen AI professionals in Silicon Valley who collectively tell the story of a dramatic shift in the industry away from research and toward revenue-generating products. Some are former employees at the companies with direct knowledge of what they say is the prioritization of building new AI products at the expense of research and safety checks. They say employees face intensifying development timelines, reinforcing the idea that they can’t afford to fall behind when it comes to getting new models and products to market. Some of the people asked not to be named because they weren’t authorized to speak publicly on the matter.
Mark Zuckerberg, CEO of Meta Platforms, during the Meta Connect event in Menlo Park, California, on Sept. 25, 2024.
David Paul Morris | Bloomberg | Getty Images
Meta’s AI evolution
When Joelle Pineau, a Meta vice president and the head of the company’s FAIR division, announced in April that she would be leaving her post, many former employees said they weren’t surprised. They said they viewed it as solidifying the company’s move away from AI research and toward prioritizing developing practical products.
“Today, as the world undergoes significant change, as the race for AI accelerates, and as Meta prepares for its next chapter, it is time to create space for others to pursue the work,” Pineau wrote on LinkedIn, adding that she will formally leave the company May 30.
Pineau began leading FAIR in 2023. The unit was established a decade earlier to work on difficult computer science problems typically tackled by academia. Yann LeCun, one of the godfathers of modern AI, initially oversaw the project, and instilled the research methodologies he learned from his time at the pioneering AT&T Bell Laboratories, according to several former employees at Meta. Small research teams could work on a variety of bleeding-edge projects that may or may not pan out.
The shift began when Meta laid off 21,000 employees, or nearly a quarter of its workforce, starting in late 2022. CEO Mark Zuckerberg kicked off 2023 by calling it the “year of efficiency.” FAIR researchers, as part of the cost-cutting measures, were directed to work more closely with product teams, several former employees said.
Two months before Pineau’s announcement, one of FAIR’s directors, Kim Hazelwood, left the company, two people familiar with the matter said. Hazelwood helped oversee FAIR’s NextSys unit, which manages computing resources for FAIR researchers. Her role was eliminated as part of Meta’s plan to cut 5% of its workforce, the people said.
Joelle Pineau of Meta speaks at the Advancing Sustainable Development through Safe, Secure, and Trustworthy AI event at Grand Central Terminal in New York, Sept. 23, 2024.
Bryan R. Smith | Via Reuters
OpenAI’s 2022 launch of ChatGPT caught Meta off guard, creating a sense of urgency to pour more resources into large language models, or LLMs, that were captivating the tech industry, the people said.
In 2023, Meta began heavily pushing its freely available and open-source Llama family of AI models to compete with OpenAI, Google and others.
With Zuckerberg and other executives convinced that LLMs were game-changing technologies, management had less incentive to let FAIR researchers work on far-flung projects, several former employees said. That meant deprioritizing research that could be viewed as having no impact on Meta’s core business, such as FAIR’s previous health care-related research into using AI to improve drug therapies.
Since 2024, Meta Chief Product Officer Chris Cox has been overseeing FAIR as a way to bridge the gap between research and the product-focused GenAI group, people familiar with the matter said. The GenAI unit oversees the Llama family of AI models and the Meta AI digital assistant, the two most important pillars of Meta’s AI strategy.
Under Cox, the GenAI unit has been siphoning more computing resources and team members from FAIR due to its elevated status at Meta, the people said. Many researchers have transferred to GenAI or left the company entirely to launch their own research-focused startups or join rivals, several of the former employees said.
While Zuckerberg has some internal support for pushing the GenAI group to rapidly develop real-world products, there’s also concern among some staffers that Meta is now less able to develop industry-leading breakthroughs that can be derived from experimental work, former employees said. That leaves Meta to chase its rivals.
A high-profile example landed in January, when Chinese lab DeepSeek released its R1 model, catching Meta off guard. The startup claimed it was able to develop a model as capable as its American counterparts but with training at a fraction of the cost.
Meta quickly implemented some of DeepSeek’s innovative techniques for its Llama 4 family of AI models that were released in April, former employees said. The AI research community had a mixed reaction to the smaller versions of Llama 4, but Meta said the biggest and most powerful Llama 4 variant is still being trained.
The company in April also released security and safety tools for developers to use when building apps with Meta’s Llama 4 AI models. These tools help mitigate the chances of Llama 4 unintentionally leaking sensitive information or producing harmful content, Meta said.
“Our commitment to FAIR remains strong,” a Meta spokesperson told CNBC. “Our strategy and plans will not change as a result of recent developments.”
In a statement to CNBC, Pineau said she is enthusiastic about Meta’s overall AI work and strategy.
“There continues to be strong support for exploratory research and FAIR as a distinct organization in Meta,” Pineau said. “The time was simply right for me personally to re-focus my energy before jumping into a new adventure.”
Meta on Thursday named FAIR co-founder Rob Fergus as Pineau’s replacement. Fergus will return to the company to serve as a director at Meta and head of FAIR, according to his LinkedIn profile. He was most recently a research director at Google DeepMind.
“Meta’s commitment to FAIR and long term research remains unwavering,” Fergus said in a LinkedIn post. “We’re working towards building human-level experiences that transform the way we interact with technology and are dedicated to leading and advancing AI research.”
Demis Hassabis, co-founder and CEO of Google DeepMind, attends the Artificial Intelligence Action Summit at the Grand Palais in Paris, Feb. 10, 2025.
Benoit Tessier | Reuters
Google ‘can’t keep building nanny products’
Google released its latest and most powerful AI model, Gemini 2.5, in March. The company described it as “our most intelligent AI model,” and wrote in a March 25 blog post that its new models are “capable of reasoning through their thoughts before responding, resulting in enhanced performance and improved accuracy.”
For weeks, Gemini 2.5 was missing a model card, meaning Google did not share information about how the AI model worked or its limitations and potential dangers upon its release.
Model cards are a common tool for AI transparency.
A Google website compares model cards to food nutrition labels: They outline “the key facts about a model in a clear, digestible format,” the website says.
“By making this information easy to access, model cards support responsible AI development and the adoption of robust, industry-wide standards for broad transparency and evaluation practices,” the website says.
Google wrote in an April 2 blog post that it evaluates its “most advanced models, such as Gemini, for potential dangerous capabilities prior to their release.” Google later updated the blog to remove the words “prior to their release.”
Without a model card for Gemini 2.5, the public had no way of knowing which safety evaluations were conducted or whether DeepMind checked for dangerous capabilities at all.
In response to CNBC’s inquiry on April 2 about Gemini 2.5’s missing model card, a Google spokesperson said that a “tech report with additional safety information and model cards are forthcoming.” Google published an incomplete model card on April 16 and updated it on April 28, more than a month after the AI model’s release, to include information about Gemini 2.5’s “dangerous capability evaluations.”
Those assessments are important for gauging the safety of a model — whether people can use the models to learn how to build chemical or nuclear weapons or hack into important systems. These checks also determine whether a model is capable of autonomously replicating itself, which could lead to a company losing control of it. Running tests for those capabilities requires more time and resources than simple, automated safety evaluations, according to industry experts.
Google co-founder Sergey Brin
Kelly Sullivan | Getty Images Entertainment | Getty Images
The Financial Times in March reported that Google DeepMind CEO Demis Hassabis had installed a more rigorous vetting process for internal research papers to be published. The clampdown at Google is particularly notable because the company’s “Transformers” technology gained recognition across Silicon Valley through that type of shared research. Transformers were critical to OpenAI’s development of ChatGPT and the rise of generative AI.
Google co-founder Sergey Brin told staffers at DeepMind and Gemini in February that competition has accelerated and “the final race to AGI is afoot,” according to a memo viewed by CNBC. “We have all the ingredients to win this race but we are going to have to turbocharge our efforts,” he said in the memo.
Brin said in the memo that Google has to speed up the process of testing AI models, as the company needs “lots of ideas that we can test quickly.”
“We need real wins that scale,” Brin wrote.
In his memo, Brin also wrote that the company’s methods have “a habit of minor tweaking and overfitting” products for evaluations and “sniping” the products at checkpoints. He said employees need to build “capable products” and to “trust our users” more.
“We can’t keep building nanny products,” Brin wrote. “Our products are overrun with filters and punts of various kinds.”
A Google spokesperson told CNBC that the company has always been committed to advancing AI responsibly.
“We continue to do that through the safe development and deployment of our technology, and research contributions to the broader ecosystem,” the spokesperson said.
Sam Altman, CEO of OpenAI, is seen through glass during an event on the sidelines of the Artificial Intelligence Action Summit in Paris, Feb. 11, 2025.
Aurelien Morissard | Via Reuters
OpenAI’s rush through safety testing
The debate of product versus research is at the center of OpenAI’s existence. The company was founded as a nonprofit research lab in 2015 and is now in the midst of a contentious effort to transform into a for-profit entity.
That’s the direction co-founder and CEO Sam Altman has been pushing toward for years. On May 5, though, OpenAI bowed to pressure from civic leaders and former employees, announcing that its nonprofit would retain control of the company even as it restructures into a public benefit corporation.
Nisan Stiennon worked at OpenAI from 2018 to 2020 and was among a group of former employees urging California and Delaware not to approve OpenAI’s restructuring effort. “OpenAI may one day build technology that could get us all killed,” Stiennon wrote in a statement in April. “It is to OpenAI’s credit that it’s controlled by a nonprofit with a duty to humanity.”
But even with the nonprofit maintaining control and majority ownership, OpenAI is speedily working to commercialize products as competition heats up in generative AI. And it may have rushed the rollout of its o1 reasoning model last year, according to some portions of its model card.
Results of the model’s “preparedness evaluations,” the tests OpenAI runs to assess an AI model’s dangerous capabilities and other risks, were based on earlier versions of o1. They had not been run on the final version of the model, according to its model card, which is publicly available.
Johannes Heidecke, OpenAI’s head of safety systems, told CNBC in an interview that the company ran its preparedness evaluations on near-final versions of the o1 model. Minor variations to the model that took place after those tests wouldn’t have contributed to significant jumps in its intelligence or reasoning and thus wouldn’t require additional evaluations, he said. Still, Heidecke acknowledged that OpenAI missed an opportunity to more clearly explain the difference.
OpenAI’s newest reasoning model, o3, released in April, seems to hallucinate more than twice as often as o1, according to the model card. When an AI model hallucinates, it produces falsehoods or illogical information.
OpenAI has also been criticized for reportedly slashing safety testing times from months to days and for omitting the requirement to safety test fine-tuned models in its latest “Preparedness Framework.”
Heidecke said OpenAI has decreased the time needed for safety testing because the company has improved its testing effectiveness and efficiency. A company spokesperson said OpenAI has allocated more AI infrastructure and personnel to its safety testing, and has increased resources for paying experts and growing its network of external testers.
In April, the company shipped GPT-4.1, one of its new models, without a safety report, as the model was not designated by OpenAI as a “frontier model,” which is a term used by the tech industry to refer to a bleeding-edge, large-scale AI model.
But one of those small revisions caused a big wave in April. Within days of updating its GPT-4o model, OpenAI rolled back the changes after screenshots of overly flattering responses to ChatGPT users went viral online. OpenAI said in a blog post explaining its decision that those types of responses to user inquiries “raise safety concerns — including around issues like mental health, emotional over-reliance, or risky behavior.”
OpenAI said in the blogpost that it opted to release the model even after some expert testers flagged that its behavior “‘felt’ slightly off.”
“In the end, we decided to launch the model due to the positive signals from the users who tried out the model. Unfortunately, this was the wrong call,” OpenAI wrote. “Looking back, the qualitative assessments were hinting at something important, and we should’ve paid closer attention. They were picking up on a blind spot in our other evals and metrics.”
Metr, a company OpenAI partners with to test and evaluate its models for safety, said in a recent blog post that it was given less time to test the o3 and o4-mini models than predecessors.
“Limitations in this evaluation prevent us from making robust capability assessments,” Metr wrote, adding that the tests it did were “conducted in a relatively short time.”
Metr also wrote that it had insufficient access to data that would be important in determining the potential dangers of the two models.
The company said it wasn’t able to access the OpenAI models’ internal reasoning, which is “likely to contain important information for interpreting our results.” However, Metr said, “OpenAI shared helpful information on some of their own evaluation results.”
OpenAI’s spokesperson said the company is piloting secure ways of sharing chains of thought for Metr’s research as well as for other third-party organizations.
Steven Adler, a former safety researcher at OpenAI, told CNBC that safety testing a model before it’s rolled out is no longer enough to safeguard against potential dangers.
“You need to be vigilant before and during training to reduce the chance of creating a very capable, misaligned model in the first place,” Adler said.
He warned that companies such as OpenAI are backed into a corner when they create capable but misaligned models with goals that are different from the ones they intended to build.
“Unfortunately, we don’t yet have strong scientific knowledge for fixing these models — just ways of papering over the behavior,” Adler said.
WATCH: OpenAI closes $40 billion funding round, largest private tech deal on record

Technology
Stock trading app eToro pops 40% in Nasdaq debut after pricing IPO above expected range
Published
3 days agoon
May 14, 2025By
admin
Omar Marques | Sopa Images | Lightrocket | Getty Images
Shares of stock brokerage platform eToro popped in their Nasdaq debut on Wednesday after the company raised almost $310 million in its initial public offering.
The stock opened at $69.69, or 34% above its IPO, pushing its market cap to $5.6 billion. Shares were last up more than 40%.
The Israel-based company sold nearly six million shares at $52 each, above the expected range of $46 to $50. Almost six million additional shares were sold by existing investors. At the IPO price, the company was valued at roughly $4.2 billion.
Wall Street is looking to the Robinhood competitor for signs of renewed interest in IPOs after an extended drought. Many investors saw President Donald Trump’s return to the White House as a catalyst before tariff concerns led companies to delay their plans.
Etoro isn’t the only company attempting to test the waters. Fintech company Chime filed its prospectus with the U.S. Securities and Exchange Commission on Tuesday, while digital physical therapy company Hinge Health kickstarted its IPO roadshow, and said in a filing it aims to raise up to $437 million in its offering.
EToro had previously filed to go public in 2021 through a merger with a special purpose acquisition company, or SPAC, that would have valued it at more than $10 billion. It shelved those plans in 2022 as equity markets nosedived, but remained focused on an eventual IPO.
EToro was founded in 2007 by brothers Yoni and Ronen Assia and David Ring. The company makes money through trading-related fees and nontrading activities such as withdrawals. Net income increased almost thirteenfold last year to $192.4 million from $15.3 million in 2023.
The company has steadily built a growing business in cryptocurrencies. Revenue from crypto assets more than tripled to upward of $12 million in 2024, and one-quarter of its net trading contribution stemmed from crypto last year. That is up from 10% in 2023.
EToro said that for the first quarter, it expects crypto assets to account for 37% of its commission from trading activities, down from 43% a year earlier.
Spark Capital is the company’s biggest outside investor, with 14% control after the offering, followed by BRM Group at 8.7%. CEO Yoni Assia controls 9.3%.
Read more CNBC tech news

Technology
5 new Uber features you should know — including a way to avoid surge pricing
Published
3 days agoon
May 14, 2025By
admin
Travelers walk past a sign pointing toward the Uber ride-share vehicle pickup area at Los Angeles International Airport in Los Angeles on Feb. 8, 2023.
Mario Tama | Getty Images
Uber is giving commuters new ways to travel and cut costs on frequent rides.
The ride-hailing company on Wednesday announced a route share feature on its platform, prepaid ride passes and special deals week for Uber One members at its annual Go-Get showcase.
Uber’s new features come as the company accelerates its leadership position in the ride-sharing market and seeks to offer more affordable alternatives for users. It also follows last week’s first-quarter earnings as Uber swung to a profit but fell short of revenue estimates.
“The goal for us as we build our products is to put people at the center of everything, and right now for us, it means making things a little easier, a little more predictable, and above all, just a little more — or a lot more — affordable,” said Uber CEO Dara Khosrowshahi at the event.
Here are some of the big announcements from the annual product event.
Route Share
Users looking to save money on regular routes and willing to walk a short distance can select a shared ride with up to two other passengers through the new route-share feature.
The prepopulated routes run every 20 minutes along busy areas between 6 a.m. and 10 a.m. and 4 p.m. and 8 p.m. on weekdays. The initial program is slated to kick off in seven cities, including New York, San Francisco, Boston and Chicago.
Source: Uber
Uber said its new route-share fares will cost up to 50% less than an UberX option, and that it is working to partner with employers on qualifying the feature for commuter benefits. Users can book a seat from 7 days to 10 minutes before a pickup departure.
Ride Passes
Riders on Uber can now prepurchase two different types of ride passes to hold fares on frequented routes during a one-hour period every day. For $2.99 a month, riders can buy a price lock pass that holds a price between two locations for one hour every day. The pass expires after 30 days or a savings total of $50.
The feature gives riders a way to avoid surge pricing.
Ride Passes roll out in 10 cities on Wednesday, including Dallas, Orlando and San Francisco, and can be purchased for up to 10 routes a month. Uber will charge users a lower price if the fare is cheaper than the pass at departure time.
The company also debuted a prepaid pass option, allowing users to pay in advance and stock up on regular monthly trips. Uber’s pass option comes in bundles of 5, 10, 15 and 20-ride increments, with corresponding discounts between 5% and 20%.
Both pass options will be available on teen accounts in the fall, Uber said. The route share and ride passes will be available in a new commuter hub feature on the app coming later this year.
Shared autonomous rides
Uber is also expanding its autonomous vehicle partnership with Volkswagen.
The company will start testing shared AV rides later this year and is aiming for a launch in Los Angeles in 2026.
Uber rolled out autonomous rides in Austin, Texas, in March through its agreement with Alphabet-owned Waymo and is preparing for an Atlanta launch this summer. The company announced the partnership in May 2023. Autonomous Waymo rides are also currently offered through the Uber app in Phoenix, but the company does not directly manage that fleet.
Khosrowshahi called AVs “the single greatest opportunity ahead for Uber” during the company’s earnings call last week and said the Austin debut “exceeded” expectations. The company previously had an AV unit that it sold in 2020 as it faced high costs and a series of safety challenges, including a fatal accident.
Along with Volkswagen and Waymo, Uber has joined forces with Avride, May Mobility and self-driving trucking company Aurora for autonomous ride-sharing and freight services in the U.S. The company has partnerships with WeRide, Pony.AI and Momenta internationally.
Uber One Member Days
Uber is taking a page out of Amazon’s book by offering its own variation of the e-commerce giant’s beloved Prime Day, with special offers between May 16 and 23 for Uber One members.
Some of those deals include 50% off shared rides and 20% off Uber Black. The platform is also adding a new benefit of 10% back in Uber credits for users that use Uber Rent or book Lime rides.
UberEats partnership with OpenTable
UberEats also announced a partnership with OpenTable to allow users to book reservations and rides.
The new feature, powered by OpenTable, launches in six countries including the U.S. and Australia.
Through the partnership, users can book restaurant reservations and get a discount on rides. OpenTable members will also be able to transfer points to Uber and UberEats. The company is also offering OpenTable VIPs a six-month free trial of Uber One.
Read more CNBC tech news
Trending
-
Sports3 years ago
‘Storybook stuff’: Inside the night Bryce Harper sent the Phillies to the World Series
-
Sports1 year ago
Story injured on diving stop, exits Red Sox game
-
Sports2 years ago
Game 1 of WS least-watched in recorded history
-
Sports2 years ago
MLB Rank 2023: Ranking baseball’s top 100 players
-
Sports4 years ago
Team Europe easily wins 4th straight Laver Cup
-
Environment2 years ago
Japan and South Korea have a lot at stake in a free and open South China Sea
-
Environment2 years ago
Game-changing Lectric XPedition launched as affordable electric cargo bike
-
Business3 years ago
Bank of England’s extraordinary response to government policy is almost unthinkable | Ed Conway