OpenAI could be fined up to $150,000 for each piece of infringing content:
Weeks after The New York Times updated its terms of service (TOS) to prohibit AI companies from scraping its articles and images to train AI models, it appears that the Times may be preparing to sue OpenAI. The result, experts speculate, could be devastating to OpenAI, including the destruction of ChatGPT's dataset and fines up to $150,000 per infringing piece of content.
NPR spoke to two people "with direct knowledge" who confirmed that the Times' lawyers were mulling whether a lawsuit might be necessary "to protect the intellectual property rights" of the Times' reporting.
Neither OpenAI nor the Times immediately responded to Ars' request to comment.
If the Times were to follow through and sue ChatGPT-maker OpenAI, NPR suggested that the lawsuit could become "the most high-profile" legal battle yet over copyright protection since ChatGPT's explosively popular launch. This speculation comes a month after Sarah Silverman joined other popular authors suing OpenAI over similar concerns, seeking to protect the copyright of their books.
[...] In April, the News Media Alliance published AI principles, seeking to defend publishers' intellectual property by insisting that generative AI "developers and deployers must negotiate with publishers for the right to use" publishers' content for AI training, AI tools surfacing information, and AI tools synthesizing information.
Previously:
Sarah Silverman Sues OpenAI, Meta for Being "Industrial-Strength Plagiarists" - 20230711
Related:
The Internet Archive Reaches An Agreement With Publishers In Digital Book-Lending Case - 20230815
(Score: 0) by Anonymous Coward on Wednesday August 23 2023, @09:18AM (3 children)
You misunderstand copyright. It controls the right to make copies. It's right there in the name.
If OpenAI are not making copies they are not infringing on copyright. Using something as training material does not infringe copyright. Showing someone (or someAI) a work you bought a copy of does not infringe copyright.
Unless you make a copy, you have not infringed copyright.
(Score: 5, Insightful) by r_a_trip on Wednesday August 23 2023, @10:23AM (2 children)
Except anything to do with computing inherently has to make a copy to do something. Loading material into RAM is already copying (inevitable) and under copyright you need permission for that. That is why software comes with a license or a public domain statement. Otherwise you can't load the program/data.
Even if OpenAI doesn't store anything concrete after the training, the fact they loaded other people's material into RAM constitutes a copy which is controlled by the rightsholder.
I think both The Times and Silverman are having their panties in a twist over nothing. ChatGPT outputs convincing garbage, but it can't write books, news articles, comedy, recipes or anything else that requires human skill for that matter. It can only output unoriginal boilerplate and it has definite, discernable patterns in what it puts out when getting prompted to write in a certain topic. Anyone can see after they poke ChatGPT around for a bit.
So this unfounded fear of being replaced by a bot in the future is idiotic for the foreseeable future. ChatGPT has no agency. It is a parrot. Bluntly put, you need to say "Polly want a cracker?" for it to spring into action and regurgitate "Yaah, Yaah!"
Or, and this might equally be the case, they smell easy money and want in on the action. Paying several millions for a non-exclusive, perpetual worldwide licenses to copyright holders to be able to continue using their training data probably sounds pretty good to OpenAI. At least they get to keep their first to market advantage with such a setup.
(Score: 2, Touché) by r_a_trip on Wednesday August 23 2023, @10:35AM
Upon further review, Sarah Silverman might be in massive trouble. ChatGPT puts out better and funnier garbage than she does. So I consider her replaced.
(Score: 0) by Anonymous Coward on Wednesday August 23 2023, @02:15PM
Under that interpretation, looking at it in a mirror is a violation of copyright. You are looking at an image that you did not buy.
Conversely, the Sony decision argues against a local copy (necessary to use) being a copyright violation.