NY Times copyright suit wants OpenAI to delete all GPT instances

Published

2 years ago

December 28, 2023

admin

Not the sincerest form of flattery — NY Times copyright suit wants OpenAI to delete all GPT instances Shows evidence that GPT-based systems will reproduce Times articles if asked.

John Timmer – Dec 27, 2023 7:05 pm UTC Enlarge / Microsoft is named in the suit for allegedly building the system that allowed GPT derivatives to be trained using infringing material.Just_Super reader comments 359

In August, word leaked out that The New York Times was considering joining the growing legion of creators that are suing AI companies for misappropriating their content. The Times had reportedly been negotiating with OpenAI regarding the potential to license its material, but those talks had not gone smoothly. So, eight months after the company was reportedly considering suing, the suit has now been filed.

The Times is targeting various companies under the OpenAI umbrella, as well as Microsoft, an OpenAI partner that both uses it to power its Copilot service and helped provide the infrastructure for training the GPT Large Language Model. But the suit goes well beyond the use of copyrighted material in training, alleging that OpenAI-powered software will happily circumvent the Times’ paywall and ascribe hallucinated misinformation to the Times. Journalism is expensive

The suit notes that The Times maintains a large staff that allows it to do things like dedicate reporters to a huge range of beats and engage in important investigative journalism, among other things. Because of those investments, the newspaper is often considered an authoritative source on many matters.

All of that costs money, and The Times earns that by limiting access to its reporting through a robust paywall. In addition, each print edition has a copyright notification, the Times’ terms of service limit the copying and use of any published material, and it can be selective about how it licenses its stories. In addition to driving revenue, these restrictions also help it to maintain its reputation as an authoritative voice by controlling how its works appear.

The suit alleges that OpenAI-developed tools undermine all of that. “By providing Times content without The Timess permission or authorization, Defendants tools undermine and damage The Timess relationship with its readers and deprive The Times of subscription, licensing, advertising, and affiliate revenue,” the suit alleges.

Part of the unauthorized use The Times alleges came during the training of various versions of GPT. Prior to GPT-3.5, information about the training dataset was made public. One of the sources used is a large collection of online material called “Common Crawl,” which the suit alleges contains information from 16 million unique records from sites published by The Times. That places the Times as the third most referenced source, behind Wikipedia and a database of US patents. Advertisement

OpenAI no longer discloses as many details of the data used for training of recent GPT versions, but all indications are that full-text NY Times articles are still part of that process (Much more on that in a moment.) Expect access to training information to be a major issue during discovery if this case moves forward. Not just training

A number of suits have been filed regarding the use of copyrighted material during training of AI systems. But the Times’ suit goes well beyond that to show how the material ingested during training can come back out during use. “Defendants GenAI tools can generate output that recites Times content verbatim, closely summarizes it, and mimics its expressive style, as demonstrated by scores of examples,” the suit alleges.

The suit allegesand we were able to verifythat it’s comically easy to get GPT-powered systems to offer up content that is normally protected by the Times’ paywall. The suit shows a number of examples of GPT-4 reproducing large sections of articles nearly verbatim.

The suit includes screenshots of ChatGPT being given the title of a piece at The New York Times and asked for the first paragraph, which it delivers. Getting the ensuing text is apparently as simple as repeatedly asking for the next paragraph.

ChatGPT has apparently closed that loophole in between the preparation of that suit and the present. We entered some of the prompts shown in the suit, and were advised “I recommend checking The New York Times website or other reputable sources,” although we can’t rule out that context provided prior to that prompt could produce copyrighted material. Ask for a paragraph, and Copilot will hand you a wall of normally paywalled text.John Timmer

But not all loopholes have been closed. The suit also shows output from Bing Chat, since rebranded as Copilot. We were able to verify that asking for the first paragraph of a specific article at The Times caused Copilot to reproduce the first third of the article. Advertisement

The suit is dismissive of attempts to justify this as a form of fair use. “Publicly, Defendants insist that their conduct is protected as ‘fair use’ because their unlicensed use of copyrighted content to train GenAI models serves a new ‘transformative’ purpose,” the suit notes. “But there is nothing ‘transformative’ about using The Timess content without payment to create products that substitute for The Times and steal audiences away from it.” Reputational and other damages

The hallucinations common to AI also came under fire in the suit for potentially damaging the value of the Times’ reputation, and possibly damaging human health as a side effect. “A GPT model completely fabricated that The New York Times published an article on January 10, 2020, titled Study Finds Possible Link between Orange Juice and Non-Hodgkins Lymphoma, the suit alleges. “The Times never published such an article.”

Similarly, asking about a Times article on heart-healthy foods allegedly resulted in Copilot saying it contained a list of examples (which it didn’t). When asked for the list, 80 percent of the foods on weren’t even mentioned by the original article. In another case, recommendations were ascribed to the Wirecutter when the products hadn’t even been reviewed by its staff.

As with the Times material, it’s alleged that it’s possible to get Copilot to offer up large chunks of Wirecutter articles (The Wirecutter is owned by The New York Times). But the suit notes that these article excerpts have the affiliate links stripped out of them, keeping the Wirecutter from its primary source of revenue.

The suit targets various OpenAI companies for developing the software, as well as Microsoftthe latter for both offering OpenAI-powered services, and for having developed the computing systems that enabled the copyrighted material to be ingested during training. Allegations include direct, contributory, and vicarious copyright infringement, as well as DMCA and trademark violations. Finally, it alleges “Common Law Unfair Competition By Misappropriation.”

The suit seeks nothing less than the erasure of both any GPT instances that the parties have trained using material from the Times, as well as the destruction of the datasets that were used for the training. It also asks for a permanent injunction to prevent similar conduct in the future. The Times also wants money, lots and lots of money: “statutory damages, compensatory damages, restitution, disgorgement, and any other relief that may be permitted by law or equity.” reader comments 359 John Timmer John is Ars Technica’s science editor. He has a Bachelor of Arts in Biochemistry rom Columbia University, and a Ph.D. in Molecular and Cell Biology from the University of California, Berkeley. When physically separated from his keyboard, he tends to seek out a bicycle, or a scenic location for communing with his hiking boots. Advertisement Channel Ars Technica ← Previous story Next story → Related Stories Today on Ars

World

Inside a secret, underground military base in eastern Ukraine

Published

38 mins ago

December 7, 2025

admin

Inside a secret, underground military base in eastern Ukraine

A hidden, underground military base in eastern Ukraine is so secret, soldiers change into civilian clothes whenever they step outside to avoid drawing attention.

Journalists are not usually allowed access.

But the unit that has been using this vast, subterranean warren of war rooms, a dormitory, kitchen, canteen and makeshift gym as its headquarters since the summer is imminently relocating, so Sky News was invited inside.

Lieutenant Colonel Arsen Dimitric – call sign Lemko – is the chief of staff of 1st Corps Azov of the National Guard of Ukraine, one of the country’s most effective combat forces.

He sat with us in the base, next to a large square table, covered by a map of the Donbas region.

His soldiers have been fighting in this area since the summer, countering a surge in Russian attacks in and around the frontline city of Pokrovsk.

“We aim to destroy as much of the enemy as possible,” he said.

“Will we take losses? Yes. Will it hurt? Absolutely.”

But he said if Russia is allowed to advance, even more Ukrainians will suffer.

“Their [the Russians’] only advantage is numbers,” he said.

“They don’t care how many people they lose.”

Lemko said almost 17,000 Russian soldiers had been killed or wounded fighting in this section of the warzone alone between August to November.

Ukrainian video footage of the battlefield showed Russian armoured vehicles being taken out by drones and artillery fire.

At one point, Russian soldiers mounted on motorbikes try to advance, only to be stopped by Ukrainian fire.

“Our task is to hit them as hard as possible in various areas,” Lemko said. “We focus on our operations, others on theirs, and leadership will negotiate the best possible terms.”

The Azov Corps soldiers are fighting over land that should be handed over to Russia, according to an initial draft of a peace deal proposal between Kyiv and Moscow put forward by the United States. This is despite swathes of the Donbas remaining under Ukrainian control.

But General Oleksandr Syrskyi, the head of the Ukrainian armed forces, has since told Sky News that simply surrendering territory would be “unacceptable”.

For Lemko, he says the job of his troops is to inflict as much damage as possible on the Russian side to help strengthen Ukraine’s hand in negotiations.

“Simply giving it [land] away isn’t the way,” he said.

“Diplomats do their work, we do ours. Our job as soldiers is to give as many advantages as possible to our negotiating team. And we’re doing exactly that.”

Lemko, who has been battling against Russia since the Crimean annexation in 2014, also had a warning for the rest of Europe about a rise in hybrid attacks, such as mysterious drone sightings, acts of sabotage and cyber hacks suspected of being linked to Moscow.

He said Ukraine’s experience showed that if attacks by Russia that fall under the threshold of conventional war are not successfully countered, full-scale conflict could follow.

“Ukraine once lost a hybrid war that had been waged since the very start of our independence,” he said.

“Because of that defeat, there was a physical operation against us in Crimea and then a physical operation in 2022.

“Now the hybrid war has reached its climax, and it is moving into the Baltic States and Europe.

“That is why, in my opinion – and in the opinion of most of our officers – now is the moment for all countries to unite and counter this hybrid war. Because the consequence may be a physical one.”

Production: Katy Scholes, security and defence producer, and Azad Safarov, Ukraine producer.

Camera operator: Mostyn Pryce

World

At least 25 people dead after major fire at nightclub in Goa, India

Published

38 mins ago

December 7, 2025

admin

At least 25 people dead after major fire at nightclub in Goa, India

At least 25 people have been killed after a fire at a nightclub in Goa, the state’s police service has said.

The fire reportedly started around midnight on Saturday local time.

The majority of victims were kitchen staff at the club – although around three to four tourists are thought to be among those killed.

Videos on social media showed emergency services lining up to help the injured – some of whom were taken to nearby hospitals.

Dr Pramod Sawant, Goa’s chief minister, said: “I am deeply grieved and offer my heartfelt condolences to all the bereaved families in this hour of unimaginable loss.”

He later said he was “closely reviewing the situation arising from the tragic fire” – adding six additional people had been injured.

“All six injured persons are in a stable condition and are receiving the best medical care,” he said.

Authorities worked through the night to bring the situation under control and all bodies have been recovered, the state’s police chief told reporters, according to Reuters news agency.

India’s Prime Minister Narendra Modi said the deadly fire was “deeply saddening”.

He said he had spoken with Goa’s chief minister and that “the state government is providing all possible assistance to those affected”.

Dr Sawant said he has “ordered an inquiry” to discover what happened after visiting the site.

“The inquiry will examine the exact cause of the fire and whether fire safety norms and building rules were followed,” he said.

“Those found responsible will face most stringent action under the law – any negligence will be dealt with firmly.”

Goa, a small state on India’s western coast, is a popular tourist destination, attracting millions of tourists every year.

US

Jeffrey Epstein’s most lucrative currency was people – six years after his death, he continues to haunt those who knew him

Published

38 mins ago

December 7, 2025

admin

Jeffrey Epstein's most lucrative currency was people - six years after his death, he continues to haunt those who knew him

Framed photos with presidents, princes and even the pope adorned the many homes of Jeffrey Epstein.

This article contains images and language that some readers may find disturbing.

The disgraced New York financier’s most lucrative currency was people. He made a career out of connections with world leaders in politics, business titans and science’s most lauded brains.

The man formerly known as Prince Andrew, Andrew Mountbatten-Windsor, described Epstein‘s appeal in his infamous TV interview: “He had the most extraordinary ability to bring extraordinary people together and that’s the bit that I remember, going to the dinner parties where you would meet academics, politicians, people from the United Nations. It was a cosmopolitan group of what I would describe as US eminence.”

His network was not just US-based but the global elite – among them hedge fund owners, bankers and hoteliers.

But as more and more new documents and photos are made public, we can build up an intimate portrait of a man who kept so much private.

Another man once called a prince, but of darkness this time, Peter Mandelson, described his “best pal” as a “prolific networker”. Epstein’s friends crossed political parties – Republican and Democratic – and continents.

Epstein’s Palm Beach mansion was just a seven-minute drive from Donald Trump‘s Mar-a-Lago. In 2002, Mr Trump told New York Magazine: “I’ve known Jeff for 15 years. Terrific guy. He’s a lot of fun to be with. It is even said that he likes beautiful women as much as I do, and many of them are on the younger side.”

They are said to have fallen out while competing to purchase a mansion in 2004.

The release of thousands of Epstein’s personal emails shows he had had plenty of world leaders in his inbox.

The former prime minister of Norway and former president of the Maldives sought his advice on politics and finance respectively.

An enigma

Epstein’s emails are short, often abrupt and riddled with spelling mistakes. The impression he wanted to give: he was a busy man, an enigma. You were lucky to be getting a reply.

He cared about appearances – his own and of the women he abused. He dated many models, including a former Miss Sweden. He followed a strict diet to keep lean and insisted the women in his life did the same.

His now notorious 50th birthday book is packed full of candid snaps, some featured here, that flaunt his lavish lifestyle. It is also brazen in its relishing of Epstein’s proclivity for young women. Images of scantily clad women are included in photos and doodles.

The anecdotes from his wealthy, powerful friends are often smutty or innuendo-led. “It’s no secret that Jeffrey appreciates beautiful women. But not many people know that he can create them out of thin air,” reads one.

Massages were entry route to abuse

Epstein’s black book of contacts had lengthy lists of women lined up for “massages” in Florida, California, New Mexico, New York, London, Paris and his island.

At least 152 women are named in it with phone numbers – they were available on speed dial.

The premise of a massage was often his entry route to abuse. The massages were scheduled, part of his daily routine. Whether on a private jet or his private island, he acted with impunity for far too long.

Epstein did not show remorse for his crimes

Multiple women went to the police to report his actions over the years. But the only jail time he was ever sentenced to was in 2008 after a controversial deal where he pleaded guilty to state charges of soliciting a minor for prostitution. He was sentenced to 18 months in jail, but only served 13 and negotiated the ability to leave the jail six days a week for up to 12 hours a day for work.

Despite becoming a registered sex offender in 2008, he was far from a social pariah. Nor did he show remorse for his crimes.

Even a decade after his conviction, he was still mocking sexual abuse. He wrote in a message to a friend in 2018, “so many guys caught in the me too, reaching out to me. Asking when does the madness stop. Funny,” and then that “breast cancer was easier to cure than the me too movement”.

‘Epstein claimed if girls had started menstruating they were of age to have sex’

Virginia Giuffre revealed in her memoir that Epstein would say that criminalising sex with teenage girls was a cultural aberration. He would point to different US states having different ages of consent – in Florida it was 18. He claimed if girls had started menstruating they were biologically of age to have sex.

Documents released by the House Oversight Committee reveal he paid to “clean up” what came up about him on Google after his conviction. On 11 December 2010 he bemoaned that despite forking out thousands, “the google page is not good” in an email.

‘An extraordinary volume’ of naked photos of young girls

On 6 July 2019, Epstein was arrested on federal charges related to sex trafficking after his private jet flew into the US from Paris.

“An extraordinary volume” of naked photos of young girls were found in his New York town house. Authorities also found a safe containing 48 loose diamonds, $70,000 (£52,000) in cash and three passports belonging to the sex offender. The expired Austrian passport had a photo of Epstein, but a different name and an address in Saudi Arabia.

👉 Follow Trump100 on your podcast app 👈

On 10 August 2019, Epstein was found dead in his New York prison cell while awaiting trial. Forty-eight hours before he died he signed a will which put his assets in a trust, the beneficiaries of which remain private.

Epstein’s most vocal victim, Ms Giuffre, who took her own life this year, closes her memoir Nobody’s Girl saying: “Epstein is dead but the attitude that allowed him to do what he did, it’s alive and well.”

Six years after his death, Epstein continues to haunt those who knew him. Some may be scared – for their reputation, careers and for what more could still come out.