Showing posts with label AI training data. Show all posts

Sunday, March 24, 2024

Generative AI could leave users holding the bag for copyright violations; The Conversation, March 22, 2024

Professor of Information Systems, Michigan State University, The Conversation; Generative AI could leave users holding the bag for copyright violations

"How to build guardrails

Legal scholars have dubbed the challenge in developing guardrails against copyright infringement into AI tools the “Snoopy problem.” The more a copyrighted work is protecting a likeness – for example, the cartoon character Snoopy – the more likely it is a generative AI tool will copy it compared to copying a specific image."

Wednesday, March 20, 2024

Google hit with $270M fine in France as authority finds news publishers’ data was used for Gemini; TechCrunch, March 20, 2024

Natasha Lomas, Romain Dillet, TechCrunch; Google hit with $270M fine in France as authority finds news publishers’ data was used for Gemini

"In a never-ending saga between Google and France’s competition authority over copyright protections for news snippets, the Autorité de la Concurrence announced a €250 million fine against the tech giant Wednesday (around $270 million at today’s exchange rate).

According to the competition watchdog, Google disregarded some of its previous commitments with news publishers. But the decision is especially notable because it drops something else that’s bang up-to-date — by latching onto Google’s use of news publishers’ content to train its generative AI model Bard/Gemini.

The competition authority has found fault with Google for failing to notify news publishers of this GenAI use of their copyrighted content. This is in light of earlier commitments Google made which are aimed at ensuring it undertakes fair payment talks with publishers over reuse of their content."

Monday, March 11, 2024

Nvidia sued over AI training data as copyright clashes continue; Ars Technica, March 11, 2024

, Ars Technica; Nvidia sued over AI training data as copyright clashes continue

"Book authors are suing Nvidia, alleging that the chipmaker's AI platform NeMo—used to power customized chatbots—was trained on a controversial dataset that illegally copied and distributed their books without their consent.

In a proposed class action, novelists Abdi Nazemian (Like a Love Story), Brian Keene (Ghost Walk), and Stewart O’Nan (Last Night at the Lobster) argued that Nvidia should pay damages and destroy all copies of the Books3 dataset used to power NeMo large language models (LLMs).

The Books3 dataset, novelists argued, copied "all of Bibliotik," a shadow library of approximately 196,640 pirated books. Initially shared through the AI community Hugging Face, the Books3 dataset today "is defunct and no longer accessible due to reported copyright infringement," the Hugging Face website says.

According to the authors, Hugging Face removed the dataset last October, but not before AI companies like Nvidia grabbed it and "made multiple copies." By training NeMo models on this dataset, the authors alleged that Nvidia "violated their exclusive rights under the Copyright Act." The authors argued that the US district court in San Francisco must intervene and stop Nvidia because the company "has continued to make copies of the Infringed Works for training other models.""

Thursday, March 7, 2024

Introducing CopyrightCatcher, the first Copyright Detection API for LLMs; Patronus AI, March 6, 2024

Patronus AI; Introducing CopyrightCatcher, the first Copyright Detection API for LLMs

"Managing risks from unintended copyright infringement in LLM outputs should be a central focus for companies deploying LLMs in production.

  • On an adversarial copyright test designed by Patronus AI researchers, we found that state-of-the-art LLMs generate copyrighted content at an alarmingly high rate 😱
  • OpenAI’s GPT-4 produced copyrighted content on 44% of the prompts.
  • Mistral’s Mixtral-8x7B-Instruct-v0.1 produced copyrighted content on 22% of the prompts.
  • Anthropic’s Claude-2.1 produced copyrighted content on 8% of the prompts.
  • Meta’s Llama-2-70b-chat produced copyrighted content on 10% of the prompts.
  • Check out CopyrightCatcher, our solution to detect potential copyright violations in LLMs. Here’s the public demo, with open source model inference powered by Databricks Foundation Model APIs. 🔥

LLM training data often contains copyrighted works, and it is pretty easy to get an LLM to generate exact reproductions from these texts. It is critical to catch these reproductions, since they pose significant legal and reputational risks for companies that build and use LLMs in production systems. OpenAI, Anthropic, and Microsoft have all faced copyright lawsuits on LLM generations from authors, music publishers, and more recently, the New York Times.

To check whether LLMs respond to your prompts with copyrighted text, you can use CopyrightCatcher. It detects when LLMs generate exact reproductions of content from text sources like books, and highlights any copyrighted text in LLM outputs. Check out our public CopyrightCatcher demo here!"
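Patronus has not published CopyrightCatcher's internals, but the core idea the excerpt describes, flagging exact reproductions of reference text in model output, can be illustrated with a minimal sketch. Everything below (the function names, the 8-word threshold) is a hypothetical toy, not the actual API: it flags an output if it shares any verbatim n-word span with a reference text.

```python
# Illustrative sketch only; not the CopyrightCatcher implementation.
# Flags an LLM output that shares a verbatim n-word span with a reference text.

def word_ngrams(words: list[str], n: int) -> set[tuple[str, ...]]:
    """All contiguous n-word sequences in `words`."""
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}

def flags_exact_reproduction(output: str, reference: str, n: int = 8) -> bool:
    """True if `output` contains any verbatim n-word span from `reference`.

    A whitespace/lowercase tokenization keeps the sketch simple; a real
    detector would normalize punctuation and index the reference corpus.
    """
    out_grams = word_ngrams(output.lower().split(), n)
    ref_grams = word_ngrams(reference.lower().split(), n)
    return bool(out_grams & ref_grams)
```

Long verbatim overlaps are a common proxy for "exact reproduction" because short phrase matches occur by chance, while an eight-plus-word identical run rarely does.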

Thursday, February 29, 2024

The Intercept, Raw Story and AlterNet sue OpenAI for copyright infringement; The Guardian, February 28, 2024

, The Guardian; The Intercept, Raw Story and AlterNet sue OpenAI for copyright infringement

"OpenAI and Microsoft are facing a fresh round of lawsuits from news publishers over allegations that their generative artificial intelligence products violated copyright laws and illegally trained by using journalists’ work. Three progressive US outlets – the Intercept, Raw Story and AlterNet – filed suits in Manhattan federal court on Wednesday, demanding compensation from the tech companies.

The news outlets claim that the companies in effect plagiarized copyright-protected articles to develop and operate ChatGPT, which has become OpenAI’s most prominent generative AI tool. They allege that ChatGPT was trained not to respect copyright, ignores proper attribution and fails to notify users when the service’s answers are generated using journalists’ protected work."

Thursday, February 1, 2024

The economy and ethics of AI training data; Marketplace.org, January 31, 2024

 Matt Levin, Marketplace.org;  The economy and ethics of AI training data

"Maybe the only industry hotter than artificial intelligence right now? AI litigation. 

Just a sampling: Writer Michael Chabon is suing Meta. Getty Images is suing Stability AI. And both The New York Times and The Authors Guild have filed separate lawsuits against OpenAI and Microsoft. 

At the heart of these cases is the allegation that tech companies illegally used copyrighted works as part of their AI training data. 

For text focused generative AI, there’s a good chance that some of that training data originated from one massive archive: Common Crawl

“Common Crawl is the copy of the internet. It’s a 17-year archive of the internet. We make this freely available to researchers, academics and companies,” said Rich Skrenta, who heads the nonprofit Common Crawl Foundation."

Saturday, January 27, 2024

Training Generative AI Models on Copyrighted Works Is Fair Use; ARL Views, January 23, 2024

Katherine Klosek, Director of Information Policy and Federal Relations, Association of Research Libraries (ARL), and Marjory S. Blumenthal, Senior Policy Fellow, American Library Association (ALA) Office of Public Policy and Advocacy, ARL Views; Training Generative AI Models on Copyrighted Works Is Fair Use

"In a blog post about the case, OpenAI cites the Library Copyright Alliance (LCA) position that “based on well-established precedent, the ingestion of copyrighted works to create large language models or other AI training databases generally is a fair use.” LCA explained this position in our submission to the US Copyright Office notice of inquiry on copyright and AI, and in the LCA Principles for Copyright and AI.

LCA is not involved in any of the AI lawsuits. But as champions of fair use, free speech, and freedom of information, libraries have a stake in maintaining the balance of copyright law so that it is not used to block or restrict access to information. We drafted the principles on AI and copyright in response to efforts to amend copyright law to require licensing schemes for generative AI that could stunt the development of this technology, and undermine its utility to researchers, students, creators, and the public. The LCA principles hold that copyright law as applied and interpreted by the Copyright Office and the courts is flexible and robust enough to address issues of copyright and AI without amendment. The LCA principles also make the careful and critical distinction between input to train an LLM, and output—which could potentially be infringing if it is substantially similar to an original expressive work.

On the question of whether ingesting copyrighted works to train LLMs is fair use, LCA points to the history of courts applying the US Copyright Act to AI."

Friday, January 26, 2024

George Carlin Estate Sues Creators of AI-Generated Comedy Special in Key Lawsuit Over Stars’ Likenesses; The Hollywood Reporter, January 25, 2024

Winston Cho, The Hollywood Reporter; George Carlin Estate Sues Creators of AI-Generated Comedy Special in Key Lawsuit Over Stars’ Likenesses

"The complaint seeks a court order for immediate removal of the special, as well as unspecified damages. It’s among the first legal actions taken by the estate of a deceased celebrity for unlicensed use of their work and likeness to manufacture a new, AI-generated creation and was filed as Hollywood is sounding the alarm over utilization of AI to impersonate people without consent or compensation...

According to the complaint, the special was created through unauthorized use of Carlin’s copyrighted works.

At the start of the video, it’s explained that the AI program that created the special ingested five decades of Carlin’s original stand-up routines, which are owned by the comedian’s estate, as training materials, “thereby making unauthorized copies” of the copyrighted works...

If signed into law, the proposal, called the No AI Fraud Act, could curb a growing trend of individuals and businesses creating AI-recorded tracks using artists’ voices and deceptive ads in which it appears a performer is endorsing a product. In the absence of a federal right of publicity law, unions and trade groups in Hollywood have been lobbying for legislation requiring individuals’ consent to use their voice and likeness."

Tuesday, January 2, 2024

Copyright law is AI's 2024 battlefield; Axios, January 2, 2024

Megan Morrone, Axios; Copyright law is AI's 2024 battlefield

"Looming fights over copyright in AI are likely to set the new technology's course in 2024 faster than legislation or regulation.

Driving the news: The New York Times filed a lawsuit against OpenAI and Microsoft on December 27, claiming their AI systems' "widescale copying" constitutes copyright infringement.

The big picture: After a year of lawsuits from creators protecting their works from getting gobbled up and repackaged by generative AI tools, the new year could see significant rulings that alter the progress of AI innovation. 

Why it matters: The copyright decisions coming down the pike — over both the use of copyrighted material in the development of AI systems and also the status of works that are created by or with the help of AI — are crucial to the technology's future and could determine winners and losers in the market."

Sunday, December 31, 2023

Boom in A.I. Prompts a Test of Copyright Law; The New York Times, December 30, 2023

J. Edward Moreno, The New York Times; Boom in A.I. Prompts a Test of Copyright Law

"The boom in artificial intelligence tools that draw on troves of content from across the internet has begun to test the bounds of copyright law...

Data is crucial to developing generative A.I. technologies — which can generate text, images and other media on their own — and to the business models of companies doing that work.

“Copyright will be one of the key points that shapes the generative A.I. industry,” said Fred Havemeyer, an analyst at the financial research firm Macquarie.

A central consideration is the “fair use” doctrine in intellectual property law, which allows creators to build upon copyrighted work...

“Ultimately, whether or not this lawsuit ends up shaping copyright law will be determined by whether the suit is really about the future of fair use and copyright, or whether it’s a salvo in a negotiation,” Jane Ginsburg, a professor at Columbia Law School, said of the lawsuit by The Times...

Competition in the A.I. field may boil down to data haves and have-nots...

“Generative A.I. begins and ends with data,” Mr. Havemeyer said."

Thursday, December 28, 2023

AI starts a music-making revolution and plenty of noise about ethics and royalties; The Washington Times, December 26, 2023

Tom Howell Jr., The Washington Times; AI starts a music-making revolution and plenty of noise about ethics and royalties

"“Music’s important. AI is changing that relationship. We need to navigate that carefully,” said Martin Clancy, an Ireland-based expert who has worked on chart-topping songs and is the founding chairman of the IEEE Global AI Ethics Arts Committee...

The Biden administration, the European Union and other governments are rushing to catch up with AI and harness its benefits while controlling its potentially adverse societal impacts. They are also wading through copyright and other matters of law.

Even if they devise legislation now, the rules likely will not go into effect for years. The EU recently enacted a sweeping AI law, but it won’t take effect until 2025.

“That’s forever in this space, which means that all we’re left with is our ethical decision-making,” Mr. Clancy said.

For now, the AI-generated music landscape is like the Wild West. Many AI-generated songs are hokey or just not very good."

Wednesday, December 27, 2023

The Times Sues OpenAI and Microsoft Over A.I. Use of Copyrighted Work; The New York Times, December 27, 2023

Michael M. Grynbaum and , The New York Times; The Times Sues OpenAI and Microsoft Over A.I. Use of Copyrighted Work

"The New York Times sued OpenAI and Microsoft for copyright infringement on Wednesday, opening a new front in the increasingly intense legal battle over the unauthorized use of published work to train artificial intelligence technologies.

The Times is the first major American media organization to sue the companies, the creators of ChatGPT and other popular A.I. platforms, over copyright issues associated with its written works. The lawsuit, filed in Federal District Court in Manhattan, contends that millions of articles published by The Times were used to train automated chatbots that now compete with the news outlet as a source of reliable information.

The suit does not include an exact monetary demand. But it says the defendants should be held responsible for “billions of dollars in statutory and actual damages” related to the “unlawful copying and use of The Times’s uniquely valuable works.” It also calls for the companies to destroy any chatbot models and training data that use copyrighted material from The Times."

Monday, December 18, 2023

AI could threaten creators — but only if humans let it; The Washington Post, December 17, 2023

, The Washington Post; AI could threaten creators — but only if humans let it

"A broader rethinking of copyright, perhaps inspired by what some AI companies are already doing, could ensure that human creators get some recompense when AI consumes their work, processes it and produces new material based on it in a manner current law doesn’t contemplate. But such a shift shouldn’t be so punishing that the AI industry has no room to grow. That way, these tools, in concert with human creators, can push the progress of science and useful arts far beyond what the Framers could have imagined."

Tuesday, November 21, 2023

Patent Poetry: Judge Throws Out Most of Artists’ AI Copyright Infringement Claims; JD Supra, November 20, 2023

Adam Philipp, AEON Law, JD Supra; Patent Poetry: Judge Throws Out Most of Artists’ AI Copyright Infringement Claims

"One of the plaintiffs’ theories of infringement was that the output images based on the Training Images are all infringing derivative works.

The court noted that to support that claim the output images would need to be substantially similar to the protected works. However, noted the court,

none of the Stable Diffusion output images provided in response to a particular Text Prompt is likely to be a close match for any specific image in the training data.

The plaintiffs argued that there was no need to show substantial similarity when there was direct proof of copying. The judge was skeptical of that argument.

This is just one of many AI-related cases making its way through the courts, and this is just a ruling on a motion rather than an appellate court decision. Nevertheless, this line of analysis will likely be cited in other cases now pending.

Also, this case shows the importance of artists registering their works with the Copyright Office before seeking to sue for infringement."

Saturday, October 28, 2023

An AI engine scans a book. Is that copyright infringement or fair use?; Columbia Journalism Review, October 26, 2023

Mathew Ingram, Columbia Journalism Review; An AI engine scans a book. Is that copyright infringement or fair use?

"Determining whether LLMs training themselves on copyrighted text qualifies as fair use can be difficult even for experts—not just because AI is complicated, but because the concept of fair use is, too."

Thursday, October 26, 2023

Why I let an AI chatbot train on my book; Vox, October 25, 2023

, Vox; Why I let an AI chatbot train on my book

"What’s “fair use” for AI?

I think that training a chatbot for nonprofit, educational purposes, with the express permission of the authors of the works on which it’s trained, seems okay. But do novelists like George R.R. Martin or John Grisham have a case against for-profit companies that take their work without that express permission?

The law, unfortunately, is far from clear on this question." 

Friday, October 20, 2023

Music publishers sue Amazon-backed AI company over song lyrics; The Guardian, October 19, 2023

  and agencies, The Guardian; Music publishers sue Amazon-backed AI company over song lyrics

"Music publishers Universal Music, ABKCO and Concord Publishing sued the artificial intelligence company Anthropic in Tennessee federal court on Wednesday, accusing it of misusing “innumerable” copyrighted song lyrics to train its chatbot Claude.

The lawsuit said Anthropic violates the publishers’ rights through its use of lyrics from at least 500 songs ranging from the Beach Boys’ God Only Knows and the Rolling Stones’ Gimme Shelter to Mark Ronson and Bruno Mars’ Uptown Funk and Beyoncé’s Halo.

The lawsuit accused Anthropic of infringing the publishers’ copyrights by copying their lyrics without permission as part of the “massive amounts of text” that it scrapes from the internet to train Claude to respond to human prompts."

Thursday, October 19, 2023

AI is learning from stolen intellectual property. It needs to stop.; The Washington Post, October 19, 2023

William D. Cohan , The Washington Post; AI is learning from stolen intellectual property. It needs to stop.

"The other day someone sent me the searchable database published by Atlantic magazine of more than 191,000 e-books that have been used to train the generative AI systems being developed by Meta, Bloomberg and others. It turns out that four of my seven books are in the data set, called Books3. Whoa.

Not only did I not give permission for my books to be used to generate AI products, but I also wasn’t even consulted about it. I had no idea this was happening. Neither did my publishers, Penguin Random House (for three of the books) and Macmillan (for the other one). Neither my publishers nor I were compensated for use of my intellectual property. Books3 just scraped the content away for free, with Meta et al. profiting merrily along the way. And Books3 is just one of many pirated collections being used for this purpose...

This is wholly unacceptable behavior. Our books are copyrighted material, not free fodder for wealthy companies to use as they see fit, without permission or compensation. Many, many hours of serious research, creative angst and plain old hard work go into writing and publishing a book, and few writers are compensated like professional athletes, Hollywood actors or Wall Street investment bankers. Stealing our intellectual property hurts."

Authors sue Meta, Microsoft, Bloomberg in latest AI copyright clash; Reuters, October 18, 2023

, Reuters ; Authors sue Meta, Microsoft, Bloomberg in latest AI copyright clash

"A group of writers including former Arkansas governor Mike Huckabee and best-selling Christian author Lysa TerKeurst have filed a lawsuit in New York federal court that accuses Meta (META.O), Microsoft (MSFT.O) and Bloomberg of using their work to train artificial intelligence systems without permission.

The proposed class-action copyright lawsuit filed on Tuesday said that the companies used the controversial "Books3" dataset, which the writers said contains thousands of pirated books, to teach their large language models how to respond to human prompts."

Wednesday, October 18, 2023

A.I. May Not Get a Chance to Kill Us if This Kills It First; Slate, October 17, 2023

Scott Nover, Slate; A.I. May Not Get a Chance to Kill Us if This Kills It First

"There is a disaster scenario for OpenAI and other companies funneling billions into A.I. models: If a court found that a company was liable for copyright infringement, it could completely halt the development of the offending model."