Showing posts with label Nvidia. Show all posts
Showing posts with label Nvidia. Show all posts

Monday, March 11, 2024

Nvidia sued over AI training data as copyright clashes continue; Ars Technica, March 11, 2024

  , Ars Technica Nvidia sued over AI training data as copyright clashes continue

"Book authors are suing Nvidia, alleging that the chipmaker's AI platform NeMo—used to power customized chatbots—was trained on a controversial dataset that illegally copied and distributed their books without their consent.

In a proposed class action, novelists Abdi Nazemian (Like a Love Story), Brian Keene (Ghost Walk), and Stewart O’Nan (Last Night at the Lobster) argued that Nvidia should pay damages and destroy all copies of the Books3 dataset used to power NeMo large language models (LLMs).

The Books3 dataset, novelists argued, copied "all of Bibliotek," a shadow library of approximately 196,640 pirated books. Initially shared through the AI community Hugging Face, the Books3 dataset today "is defunct and no longer accessible due to reported copyright infringement," the Hugging Face website says.

According to the authors, Hugging Face removed the dataset last October, but not before AI companies like Nvidia grabbed it and "made multiple copies." By training NeMo models on this dataset, the authors alleged that Nvidia "violated their exclusive rights under the Copyright Act." The authors argued that the US district court in San Francisco must intervene and stop Nvidia because the company "has continued to make copies of the Infringed Works for training other models.""

Monday, September 25, 2023

Getty Images promises its new AI contains no copyrighted art; MIT Technology Review, September 25, 2023

, MIT Technology Review; Getty Images promises its new AI contains no copyrighted art

"Getty Images is so confident its new generative AI model is free of copyrighted content that it will cover any potential intellectual-property disputes for its customers. 

The generative AI system, announced today, was built by Nvidia and is trained solely on images in Getty’s image library. It does not include logos or images that have been scraped off the internet without consent. 

“Fundamentally, it’s trained; it’s clean. It’s viable for businesses to use. We’ll stand behind that claim,” says Craig Peters, the CEO of Getty Images. Peters says companies that want to use generative AI want total legal certainty they won’t face expensive copyright lawsuits.""