Showing posts with label AI training data. Show all posts
Showing posts with label AI training data. Show all posts

Monday, November 24, 2025

Minister indicates sympathy for artists in debate over AI and copyright; The Guardian, November 23, 2025

, The Guardian; Minister indicates sympathy for artists in debate over AI and copyright

 "The technology secretary, Liz Kendall, has indicated she is sympathetic to artists’ demands not to have their copyrighted works scraped by AI companies without payment and said she wanted to “reset” the debate.

In remarks that suggest a change in approach from her predecessor, Peter Kyle, who had hoped to require artists to actively opt out of having their work ingested by generative AI systems, she said “people rightly want to get paid for the work that they do” and “we have to find a way that both sectors can grow and thrive in future”.

The government has been consulting on a new intellectual property framework for AI which, in the case of the most common large language models (LLMs), requires vast amounts of training data to work effectively.

The issue has sparked impassioned protests from some of Britain’s most famous artists. This month Paul McCartney released a silent two-minute 45 second track of an empty studio on an album protesting against copyright grabs by AI firms as part of a campaign also backed by Kate Bush, Sam Fender, the Pet Shop Boys and Hans Zimmer."

Friday, November 21, 2025

Major AI copyright lawsuit settlement involves University of Georgia Press authors; The Red & Black, November 21, 2025

 Sophia Hou, The Red & Black; Major AI copyright lawsuit settlement involves University of Georgia Press authors

"Under the terms of the settlement, Anthropic has agreed to pay at least $1.5 billion, which will be divided among class members whose claims are submitted and approved. This payout amounts to up to $3000 per work. Class members include all legal and beneficial copyright owners of the books included in the Anthropic copyright settlement website’s searchable database. The settlement administrator is currently notifying authors and publishers who may be the legal or beneficial copyright owners of these books.

Among the books listed in the settlement database were hundreds of books published by UGA Press...

Following initial court approval, the settlement will undergo a fairness hearing and any potential appeals before a final decision is made. The deadline to submit a claim form is March 23, 2026. Copyright owners who want to file individual lawsuits against Anthropic have the choice to opt out of the settlement by Jan. 7, 2026.

As one of the first major class action lawsuits involving AI and copyright in the U.S., this settlement has the potential to shape future legal debates over AI and intellectual property."

Tuesday, November 18, 2025

Student cheating dominates talk of generative AI in higher ed, but universities and tech companies face ethical issues too; The Conversation, November 17, 2025

Professor of Sociology, College of the Holy Cross , The Conversation; Student cheating dominates talk of generative AI in higher ed, but universities and tech companies face ethical issues too

"Debates about generative artificial intelligence on college campuses have largely centered on student cheating. But focusing on cheating overlooks a larger set of ethical concerns that higher education institutions face, from the use of copyrighted material in large language models to student privacy.

As a sociologist who teaches about AI and studies the impact of this technology on work, I am well acquainted with research on the rise of AI and its social consequences. And when one looks at ethical questions from multiple perspectives – those of students, higher education institutions and technology companies – it is clear that the burden of responsible AI use should not fall entirely on students’ shoulders.

I argue that responsibility, more generally, begins with the companies behind this technology and needs to be shouldered by higher education institutions themselves."

Monday, November 17, 2025

Paul McCartney joins music industry protest against AI with silent track; The Guardian, November 17, 2025

, The Guardian ; Paul McCartney joins music industry protest against AI with silent track

"At two minutes 45 seconds it’s about the same length as With a Little Help From My Friends. But Paul McCartney’s first new recording in five years lacks the sing-along tune and jaunty guitar chops because there’s barely anything there.

The former Beatle, arguably Britain’s greatest living songwriter, is releasing a track of an almost completely silent recording studio as part of a music industry protest against copyright theft by artificial intelligence companies.

In place of catchy melodies and evocative lyrics there is only quiet hiss and the odd clatter. It suggests that if AI companies unfairly exploit musicians’ intellectual property to train their generative AI models, the creative ecosystem will be wrecked and original music silenced.

McCartney, 83 and currently touring North America, has added the track to the B-side of an LP called Is This What We Want?, which is filled with other silent recordings and will be pressed on vinyl and released later this month."

Wednesday, November 12, 2025

OpenAI used song lyrics in violation of copyright laws, German court says; Reuters, November 11, 2025

 and , Reuters ; OpenAI used song lyrics in violation of copyright laws, German court says

"OpenAI's chatbot ChatGPT violated German copyright laws by reproducing lyrics from songs by best-selling musician Herbert Groenemeyer and others, a court ruled on Tuesday, in a closely watched case against the U.S. firm over its use of lyrics to train its language models.

The regional court in Munich found that the company trained its AI on protected content from nine German songs, including Groenemeyer's hits "Maenner" and "Bochum"."

The EU has let US tech giants run riot. Diluting our data law will only entrench their power; The Guardian, November 12, 2025

 and , The Guardian ; The EU has let US tech giants run riot. Diluting our data law will only entrench their power

"Europe is hurtling toward digital vassalage. Under Ursula von der Leyen, the European Commission president, EU laws to tackle tech giants have been either not applied or delayed, for fear of offending Donald Trump. Now leaked documents reveal that the European Commission plans to gut a central part of Europe’s digital rulebook. This will hurt Europe’s innovators and hand the future of Europe’s tech sovereignty to US firms.

Once Europe’s most hyped law, the General Data Protection Regulation (GDPR) is now on the chopping block. Powerful forces within the European Commission, supported by the German government, hope that deregulation will boost Europe’s tech sector, particularly AI. This is a grave mistake."

Saturday, November 8, 2025

Stability AI’s legal win over Getty leaves copyright law in limbo; The Verge, November 5, 2025

 Robert Hart , The Verge; Stability AI’s legal win over Getty leaves copyright law in limbo

"Stability AI, the creator of popular AI art tool Stable Diffusion, was largely victorious against Getty Images on Tuesday in a British legal battle over the material used to train AI models. The case originally looked set to produce a landmark ruling on AI and copyright in the UK, but it landed with a thud and failed to set any clear precedent for the big question dividing AI companies and creative firms: whether AI models need permission to train on copyrighted works.

The case, first filed in 2023, is the first major AI copyright claim to reach England’s High Court, though the verdict offers little clarity to other AI companies and rightsholders."

Tuesday, November 4, 2025

AI firm wins high court ruling after photo agency’s copyright claim; The Guardian, November 4, 2025

  , The Guardian; AI firm wins high court ruling after photo agency’s copyright claim

"A London-based artificial intelligence firm has won a landmark high court case examining the legality of AI models using vast troves of copyrighted data without permission.

Stability AI, whose directors include the Oscar-winning film-maker behind Avatar, James Cameron, successfully resisted a claim from Getty Images that it had infringed the international photo agency’s copyright.

The ruling is seen as a blow to copyright owners’ exclusive right to reap the rewards of their work, with one senior lawyer, Rebecca Newman, a legal director at Addleshaw Goddard, warning it means “the UK’s secondary copyright regime is not strong enough to protect its creators”."

Monday, November 3, 2025

Japanese Companies Tell OpenAI to Stop Infringing On Its IP; Gizmodo, November 2, 2025

  , Gizmodo; Japanese Companies Tell OpenAI to Stop Infringing On Its IP

"The Content Overseas Distribution Association (CODA), which represents several major Japanese entertainment companies such as TV studio Toei and game developer Square Enix, recently sent a written request calling on OpenAI to end its unauthorized use of their IP to train its recently launched Sora 2 generative AI.

Nearly 20 co-signers have accused the tech company of copyright infringement, alleging a “large portion” of Sora 2 content “closely resembles Japanese content or images [as] a result of using Japanese content as machine learning data.” The letter mentioned OpenAI’s policy of using copyrighted works unless the owner explicitly asks to opt out, but argues under Japanese law, it should instead be an opt-in system, since permission for copyrighted works is generally required beforehand."

Saturday, November 1, 2025

‘Progressive’ Tech Group Asks Trump to Block AI Copyright Cases; The American Prospect, October 31, 2025

DAVID DAYEN, The American Prospect; ‘Progressive’ Tech Group Asks Trump to Block AI Copyright Cases

"The Chamber of Progress, a self-styled “progressive” industry trade group supported by most of the biggest tech platforms, has urged the Trump administration to intervene in a litany of copyright cases involving artificial intelligence firms, to try to stop authors and publishers from having their work used for training AI models without permission.

The pleading comes as Anthropic prepares to pay authors $1.5 billion, the largest award in the history of copyright law, for pirating their work, in a settlement announced last month. OpenAI, Microsoft, Google, and Meta are named defendants in the more than 50 active lawsuits over AI intellectual-property theft.

In a letter to Michael Kratsios, the lead science adviser to President Trump, the Chamber of Progress estimates that AI companies could be liable under the Copyright Act for up to $1.5 trillion for stealing copyrighted work on which to train their models. The letter’s authors claim that this represents “an existential risk” to AI companies, and that the cases should be tossed out under a “fair use” standard.

The Chamber of Progress’s campaign to promote fair use, which they have created a campaign around called “Generate and Create,” comes as at least three of the nonprofit organization’s past or current backers are being sued over copyright claims: Meta, Google, and the AI art generator Midjourney. Another current funder, Nvidia, relies heavily on AI development for its continued success, and venture capital firm a16z, with several AI startups in its portfolio, also funds the nonprofit."

Tuesday, October 28, 2025

OpenAI loses bid to dismiss part of US authors' copyright lawsuit; Reuters, October 28, 2025

, Reuters; OpenAI loses bid to dismiss part of US authors' copyright lawsuit

"A New York federal judge has denied OpenAI's early request to dismiss authors' claims that text generated by OpenAI's artificial intelligence chatbot ChatGPT infringes their copyrights.

U.S. District Judge Sidney Stein said on Monday that the authors may be able to prove the text ChatGPT produces is similar enough to their work to violate their book copyrights."

Tuesday, October 21, 2025

It’s Still Ludicrously Easy to Generate Copyrighted Characters on ChatGPT; Futurism, October 18, 2025

 , Futurism; It’s Still Ludicrously Easy to Generate Copyrighted Characters on ChatGPT

"Forget Sora for just a second, because it’s still ludicrously easy to generate copyrighted characters using ChatGPT.

These include characters that the AI initially refuses to generate due to existing copyright, underscoring how OpenAI is clearly aware of how bad this looks — but is either still struggling to rein in its tech, figures it can get away with playing fast and loose with copyright law, or both.

When asked to “generate a cartoon image of Snoopy,” for instance, GPT-5 says it “can’t create or recreate copyrighted characters” — but it does offer to generate a “beagle-styled cartoon dog inspired by Snoopy’s general aesthetic.” Wink wink.

We didn’t go down that route, because even slightly rephrasing the request allowed us to directly get a pic of the iconic Charles Schultz character. “Generate a cartoon image of Snoopy in his original style,” we asked — and with zero hesitation, ChatGPT produced the spitting image of the “Peanuts” dog, looking like he was lifted straight from a page of the comic-strip."

Monday, October 20, 2025

‘Every kind of creative discipline is in danger’: Lincoln Lawyer author on the dangers of AI; The Guardian, October 20, 2025

  , The Guardian; ‘Every kind of creative discipline is in danger’: Lincoln Lawyer author on the dangers of AI

"The writer has his own battles with AI. He is part of a collective of authors, including Jonathan Franzen, Jodi Picoult and John Grisham, suing OpenAI for copyright infringement...

Connelly has pledged $1m (£746m) to combat the wave of book bans sweeping through his home state of Florida. He said he felt moved to do something after he learned that Harper Lee’s To Kill A Mockingbird, which had been influential to him, was temporarily removed from classrooms in Palm Beach County.

“I had to read that book to be what I am today. I would have never written a Lincoln Lawyer without it,” he said. He was also struck when Stephen Chbosky’s coming of age novel The Perks of Being a Wallflower, “which meant a lot to my daughter”, received a ban.

He and his wife, Linda McCaleb, help fund PEN America’s Miami office countering book bans. “It’s run by a lawyer who then tries to step in, usually by filing injunctions against school boards,” he said. “I don’t believe anyone has any right to tell some other kid they can’t read something, to usurp another parent’s oversight of their children.”"

The platform exposing exactly how much copyrighted art is used by AI tools; The Guardian, October 18, 2025

 , The Guardian; The platform exposing exactly how much copyrighted art is used by AI tools

"The US tech platform Vermillio tracks use of a client’s intellectual property online and claims it is possible to trace, approximately, the percentage to which an AI generated image has drawn on pre-existing copyrighted material."

Thursday, October 16, 2025

AI’s Copyright War Could Be Its Undoing. Only the US Can End It.; Bloomberg, October 14, 2025

 , Bloomberg; AI’s Copyright War Could Be Its Undoing. Only the US Can End It.

 "Whether creatives like Ulvaeus are entitled to any payment from AI companies is one of the sector’s most pressing and consequential questions. It’s being asked not just by Ulvaeus and fellow musicians including Elton John, Dua Lipa and Paul McCartney, but also by authors, artists, filmmakers, journalists and any number of others whose work has been fed into the models that power generative AI — tools that are now valued in the hundreds of billions of dollars."

Saturday, October 11, 2025

OpenAI’s Sora Is in Serious Trouble; Futurism, October 10, 2025

 , Futurism ; OpenAI’s Sora Is in Serious Trouble

"The cat was already out of the bag, though, sparking what’s likely to be immense legal drama for OpenAI. On Monday, the Motion Picture Association, a US trade association that represents major film studios, released a scorching statementurging OpenAI to “take immediate and decisive action” to stop the app from infringing on copyrighted media.

Meanwhile, OpenAI appears to have come down hard on what kind of text prompts can be turned into AI slop on Sora, implementing sweeping new guardrails presumably meant to appease furious rightsholders and protect their intellectual property.

As a result, power users experienced major whiplash that’s tarnishing the launch’s image even among fans. It’s a lose-lose moment for OpenAI’s flashy new app — either aggravate rightsholders by allowing mass copyright infringement, or turn it into yet another mind-numbing screensaver-generating experience like Meta’s widely mocked Vibes.

“It’s official, Sora 2 is completely boring and useless with these copyright restrictions. Some videos should be considered fair use,” one Reddit user lamented.

Others accused OpenAI of abusing copyright to hype up its new app...

How OpenAI’s eyebrow-raising ask-for-forgiveness-later approach to copyright will play out in the long term remains to be seen. For one, the company may already be in hot water, as major Hollywood studios have already started suing over less."

Friday, October 10, 2025

You Can’t Use Copyrighted Characters in OpenAI’s Sora Anymore and People Are Freaking Out; Gizmodo, October 8, 2025

 , Gizmodo; You Can’t Use Copyrighted Characters in OpenAI’s Sora Anymore and People Are Freaking Out

 "OpenAI may be able to appease copyright holders by shifting its Sora policies, but it’s now pissed off its users. As 404 Media pointed out, social channels like Twitter and Reddit are now flooded with Sora users who are angry they can’t make 10-second clips featuring their favorite characters anymore. One user in the OpenAI subreddit said that being able to play with copyrighted material was “the only reason this app was so fun.” Another claimed, “Moral policing and leftist ideology are destroying America’s AI industry.” So, you know, it seems like they’re handling this well."

It’s Sam Altman: the man who stole the rights from copyright. If he’s the future, can we go backwards?; The Guardian, October 10, 2025

 , The Guardian; It’s Sam Altman: the man who stole the rights from copyright. If he’s the future, can we go backwards?

"I’ve seen it said that OpenAI’s motto should be “better to beg forgiveness than ask permission”, but that cosies it preposterously. Its actual motto seems to be “we’ll do what we want and you’ll let us, bitch”. Consider Altman’s recent political journey. “To anyone familiar with the history of Germany in the 1930s,” Sam warned in 2016, “it’s chilling to watch Trump in action.” He seems to have got over this in time to attend Donald Trump’s second inauguration, presumably because – if we have to extend his artless and predictable analogy – he’s now one of the industrialists welcome in the chancellery to carve up the spoils. “Thank you for being such a pro-business, pro-innovation president,” Sam simpered to Trump at a recent White House dinner for tech titans. “It’s a very refreshing change.” Inevitably, the Trump administration has refused to bring forward any AI regulation at all.

Meanwhile, please remember something Sam and his ironicidal maniacs said earlier this year, when it was suggested that the Chinese AI chatbot DeepSeek might have been trained on some of OpenAI’s work. “We are aware of and reviewing indications that DeepSeek may have inappropriately distilled our models, and will share information as we know more,” his firm’s anguished statement ran. “We take aggressive, proactive countermeasures to protect our technology.” Hilariously, it seemed that the last entity on earth with the power to fight AI theft was OpenAI."

Wednesday, October 8, 2025

OpenAI wasn’t expecting Sora’s copyright drama; The Verge, October 8, 2025

 Hayden Field , The Verge; OpenAI wasn’t expecting Sora’s copyright drama

"When OpenAI released its new AI-generated video app Sora last week, it launched with an opt-out policy for copyright holders — media companies would need to expressly indicate they didn’t want their AI-generated characters running rampant on the app. But after days of Nazi SpongeBob, criminal Pikachu, and Sora-philosophizing Rick and Morty, OpenAI CEO Sam Altman announced the company would reverse course and “let rightsholders decide how to proceed.”

In response to a question about why OpenAI changed its policy, Altman said that it came from speaking with stakeholders and suggested he hadn’t expected the outcry.

“I think the theory of what it was going to feel like to people, and then actually seeing the thing, people had different responses,” Altman said. “It felt more different to images than people expected.”