Stand-up comedian and actor Sarah Silverman joins trio suing OpenAI and Meta over claims their AI models 'ingested and used' copyrighted work without permission

  • 📰 pcgamer


Large language models, even larger language problems.

[...] were found in the datasets used to train LLaMA. The complaint mentions The Pile in particular, which was created by a company named EleutherAI.

The suit quotes EleutherAI's own description of its dataset as using Bibliotik, one of several "shadow libraries" the suit condemns: "Bibliotik consists of a mix of fiction and nonfiction books [...] We included Bibliotik because books are invaluable for long-range context modelling research and coherent storytelling."

The suit then explains: "These shadow libraries have long been of interest to the AI-training community because of the large quantity of copyrighted material they host. For that reason, these shadow libraries are also flagrantly illegal." The authors' representatives, lawyers Matthew Butterick and Joseph Saveri, write on their litigation website: "Much of the material in the training datasets used by OpenAI and Meta comes from copyrighted works—including books written by Plaintiffs—that were copied by OpenAI and Meta without consent, without credit, and without compensation."

 
