Not the sincerest form of flattery

NY Times copyright suit wants OpenAI to delete all GPT instances

Shows evidence that GPT-based systems will reproduce Times articles if asked.

John Timmer – Dec 27, 2023 7:05 pm UTC

Microsoft is named in the suit for allegedly building the system that allowed GPT derivatives to be trained using infringing material. Credit: Just_Super

In August, word leaked out that The New York Times was considering joining the growing legion of creators that are suing AI companies for misappropriating their content. The Times had reportedly been negotiating with OpenAI regarding the potential to license its material, but those talks had not gone smoothly. So, four months after the company was reportedly considering suing, the suit has now been filed.

The Times is targeting various companies under the OpenAI umbrella, as well as Microsoft, an OpenAI partner that both uses it to power its Copilot service and helped provide the infrastructure for training the GPT Large Language Model. But the suit goes well beyond the use of copyrighted material in training, alleging that OpenAI-powered software will happily circumvent the Times’ paywall and ascribe hallucinated misinformation to the Times.

Journalism is expensive

The suit notes that The Times maintains a large staff that allows it to do things like dedicate reporters to a huge range of beats and engage in important investigative journalism, among other things. Because of those investments, the newspaper is often considered an authoritative source on many matters.

All of that costs money, and The Times earns that by limiting access to its reporting through a robust paywall. In addition, each print edition has a copyright notification, the Times’ terms of service limit the copying and use of any published material, and it can be selective about how it licenses its stories. In addition to driving revenue, these restrictions also help it to maintain its reputation as an authoritative voice by controlling how its works appear.

The suit alleges that OpenAI-developed tools undermine all of that. “By providing Times content without The Times’s permission or authorization, Defendants’ tools undermine and damage The Times’s relationship with its readers and deprive The Times of subscription, licensing, advertising, and affiliate revenue,” the suit alleges.

Part of the unauthorized use The Times alleges came during the training of various versions of GPT. Prior to GPT-3.5, information about the training dataset was made public. One of the sources used is a large collection of online material called “Common Crawl,” which the suit alleges contains information from 16 million unique records from sites published by The Times. That places the Times as the third most referenced source, behind Wikipedia and a database of US patents.

OpenAI no longer discloses as many details of the data used for training of recent GPT versions, but all indications are that full-text NY Times articles are still part of that process. (Much more on that in a moment.) Expect access to training information to be a major issue during discovery if this case moves forward.

Not just training

A number of suits have been filed regarding the use of copyrighted material during training of AI systems. But the Times’ suit goes well beyond that to show how the material ingested during training can come back out during use. “Defendants’ GenAI tools can generate output that recites Times content verbatim, closely summarizes it, and mimics its expressive style, as demonstrated by scores of examples,” the suit alleges.

The suit alleges, and we were able to verify, that it’s comically easy to get GPT-powered systems to offer up content that is normally protected by the Times’ paywall. The suit shows a number of examples of GPT-4 reproducing large sections of articles nearly verbatim.

The suit includes screenshots of ChatGPT being given the title of a piece at The New York Times and asked for the first paragraph, which it delivers. Getting the ensuing text is apparently as simple as repeatedly asking for the next paragraph.

ChatGPT has apparently closed that loophole in between the preparation of that suit and the present. We entered some of the prompts shown in the suit, and were advised “I recommend checking The New York Times website or other reputable sources,” although we can’t rule out that context provided prior to that prompt could produce copyrighted material.

Ask for a paragraph, and Copilot will hand you a wall of normally paywalled text. Credit: John Timmer

But not all loopholes have been closed. The suit also shows output from Bing Chat, since rebranded as Copilot. We were able to verify that asking for the first paragraph of a specific article at The Times caused Copilot to reproduce the first third of the article.

The suit is dismissive of attempts to justify this as a form of fair use. “Publicly, Defendants insist that their conduct is protected as ‘fair use’ because their unlicensed use of copyrighted content to train GenAI models serves a new ‘transformative’ purpose,” the suit notes. “But there is nothing ‘transformative’ about using The Times’s content without payment to create products that substitute for The Times and steal audiences away from it.”

Reputational and other damages

The hallucinations common to AI also came under fire in the suit for potentially damaging the value of the Times’ reputation, and possibly damaging human health as a side effect. “A GPT model completely fabricated that The New York Times published an article on January 10, 2020, titled ‘Study Finds Possible Link between Orange Juice and Non-Hodgkin’s Lymphoma,’” the suit alleges. “The Times never published such an article.”

Similarly, asking about a Times article on heart-healthy foods allegedly resulted in Copilot saying it contained a list of examples (which it didn’t). When asked for the list, 80 percent of the foods on it weren’t even mentioned by the original article. In another case, recommendations were ascribed to the Wirecutter when the products hadn’t even been reviewed by its staff.

As with the Times material, it’s alleged that it’s possible to get Copilot to offer up large chunks of Wirecutter articles (The Wirecutter is owned by The New York Times). But the suit notes that these article excerpts have the affiliate links stripped out of them, keeping the Wirecutter from its primary source of revenue.

The suit targets various OpenAI companies for developing the software, as well as Microsoft, the latter both for offering OpenAI-powered services and for having developed the computing systems that enabled the copyrighted material to be ingested during training. Allegations include direct, contributory, and vicarious copyright infringement, as well as DMCA and trademark violations. Finally, it alleges “Common Law Unfair Competition By Misappropriation.”

The suit seeks nothing less than the erasure of any GPT instances that the parties have trained using material from the Times, as well as the destruction of the datasets that were used for the training. It also asks for a permanent injunction to prevent similar conduct in the future. The Times also wants money, lots and lots of money: “statutory damages, compensatory damages, restitution, disgorgement, and any other relief that may be permitted by law or equity.”

John Timmer is Ars Technica’s science editor. He has a Bachelor of Arts in Biochemistry from Columbia University, and a Ph.D. in Molecular and Cell Biology from the University of California, Berkeley. When physically separated from his keyboard, he tends to seek out a bicycle, or a scenic location for communing with his hiking boots.

World

Biden allows Kyiv to begin firing US rockets deep into Russia – as Starmer calls on allies to ‘double down’ on support

Joe Biden has authorised Ukraine to begin firing US-supplied rockets deep into Russia – as Sir Keir Starmer prepares to push for “further support” for Kyiv at the G20 summit.

Mr Biden’s policy shift means Kyiv will now be able to use Army Tactical Missile Systems (ATACMS) for long-range attacks, two American officials have told Sky News’ US partner network NBC News.

Ukraine plans to conduct its first such attacks in the coming days, the sources said, without revealing details due to operational security concerns.

The US has eased restrictions on the use of ATACMS, which have a range of up to 190 miles, after Russia began deploying North Korean ground troops to supplement its own forces in the conflict.

The development was condemned by Biden officials as a possible expansion of the war.

Image: Joe Biden meets with Volodymyr Zelenskyy in the Oval Office in September last year. Pic: AP

The son of president-elect Donald Trump has criticised the move to allow Ukraine to fire deep into Russia.

Donald Trump Jr wrote on the X social media platform: “The Military Industrial Complex seems to want to make sure they get World War 3 going before my father has a chance to create peace and save lives… Imbeciles!”

The outgoing Biden administration’s move comes as there are concerns about the level of support the Trump White House may be willing to give Ukraine.

Mr Trump has previously vowed to limit US support for Ukraine and end its war with Russia.

In an evening address after Kyiv was given permission to fire deep into Russia, Ukrainian President Volodymyr Zelenskyy said: “Today, there’s a lot of talk in the media about us receiving permission for respective actions. But strikes are not carried out with words. Such things are not announced. Missiles will speak for themselves. They certainly will.”

Back in September, Russian President Vladimir Putin said if the US were to lift the ban on long-range missile use it would be seen as NATO’s “direct participation” in the war.

He added: “This, of course, will significantly change the very essence, the very nature of the conflict.”

Image: The US military tests an early version of an Army Tactical Missile System in 2021. Pic: AP

Meanwhile, the UK prime minister has said he has “no plans” to speak with the Russian president as world leaders gather for the G20 summit in Rio de Janeiro.

Mr Putin will not be attending the two-day summit which starts on Monday after saying in October that his presence would “disrupt the normal work of this forum”. Russia’s foreign minister Sergei Lavrov will be attending instead.

It will take place days after German Chancellor Olaf Scholz spoke to Mr Putin on what was the Russian leader’s first publicly announced conversation with the sitting head of a major Western power in nearly two years.

Asked if he had any plans to make a similar call, Sir Keir said: “It’s a matter for Chancellor Scholz who he speaks to. I have no plans to speak to Putin.”

Image: Firefighters work at the site of a residential area hit by a Russian missile strike in the Lviv region of Ukraine. Pic: Reuters

Speaking to reporters while on his way to the summit, he added: “We are coming up to the 1,000th day of this conflict on Tuesday.

“That’s 1,000 days of Russian aggression, 1,000 days of huge impact and sacrifice in relation to the Ukrainian people and recently we’ve seen the addition of North Korean troops working with Russians which does have serious implications.

“I think on one hand it shows the desperation of Russia, but it’s got serious implications for European security […] and for Indo-Pacific security and that’s why I think we need to double down on shoring up our support for Ukraine and that’s top of my agenda for the G20.

“There’s got to be full support as long as it takes and that certainly is top of my agenda, shoring up that further support for Ukraine.”

The latest developments come after Russia launched a large-scale attack on Ukraine on Sunday, with Mr Zelenskyy claiming Moscow had launched a total of 120 missiles and 90 drones.

The sweeping attack, which left at least eight people dead, targeted energy infrastructure across Ukraine overnight and prompted emergency power cuts.

Hours later, Moscow mayor Sergei Sobyanin said Russia’s air defence units had destroyed a drone heading towards the city.

Politics

Crypto.com to offer equities trading to Australians after acquiring Fintek

After acquiring Fintek Securities, Crypto.com can use the firm’s Australian Financial Services Licence to offer equities, derivatives, and forex trading to users in the country. 

Environment

Saldivar’s Trucking: first owner-operator to deploy Volvo VNR Electric semi

Owner-operators are a huge part of the heavy truck market, and they’ve been among the most hesitant groups to transition from diesel to electric semi trucks. That may be changing, however, as Saldivar’s Trucking becomes the first independent owner-operator in the US to deploy a Volvo VNR Electric Class 8 truck.

The higher up-front cost of electric semi trucks has been a huge obstacle for smaller fleets. That’s why there are incentives from governments, utilities, and even non-profits to help overcome that initial obstacle. And the smart dealers are the ones who are putting in the hours to learn about those incentives, educate their customers, and ultimately sell more vehicles.

TEC Equipment is a smart dealer, and it worked closely with the South Coast Air Quality Management District to ensure Saldivar’s was able to secure $410,000 in funding from CARB’s On-Road Heavy-Duty Voucher Incentive Program (HVIP), which provides funding to replace older, heavy-duty trucks with zero-emission vehicles. The program is directed exclusively to small fleets with 10 vehicles or fewer that operate in California and aims to bridge the gap between the regulatory push for clean transportation and the financial realities faced by small business owners.

“TEC Equipment has been instrumental in supporting owner-operators like Saldivar’s Trucking through the transition to battery-electric vehicles,” explains Peter Voorhoeve, president of Volvo Trucks North America. “Their dedication to providing comprehensive support and securing necessary funding demonstrates how crucial dealer partners are in turning the vision of owning a battery-electric vehicle into a reality for fleets of all sizes.”

Saldivar’s Volvo VNR Electric features a six-battery configuration, with 565 kWh of storage capacity and a 250 kW charging capability. The zero-tailpipe emission truck can charge to 80% in 90 minutes to provide a range of up to 275 miles.

Those specs mean the Volvo electric semi is more than capable of meeting Saldivar’s operational needs, which include night shifts at California ports covering 175-200 miles per night, five nights a week. And, as he adds his VNR Electric miles to Volvo’s ever-growing tally, other owner-operators will see that it works for them, too.

“While large fleets often make headlines for their ambitious investments in battery-electric vehicles, nearly half of the 3.5 million professional truck drivers in the U.S. are owner-operators running their businesses with just one truck,” adds Voorhoeve. “These small operations face unique challenges, from the initial capital investment to securing adequate charging infrastructure … this collaboration is a perfect example of the important role to be played by truck dealers and why stakeholders need to work together to succeed in this new era of sustainable transportation.”

“We need solutions that work for different fleets of all sizes in the marketplace,” added Voorhoeve.

Electrek’s Take

Saldivar’s Trucking poses with $410,000 incentive check; via Volvo Trucks.

Electrifying America’s commercial trucking fleet can’t happen soon enough – for the health of the people who live and work near these vehicles, the health of the planet they drive on, and (thanks to their substantially lower operating costs) the health of the businesses that deploy them. TEC is doing a great job advancing the cause, and acting as true expert partners for their customers.

You love to see it.

SOURCE | IMAGES: Volvo Trucks, via ACT News.
