from the there-are-too-many-AI-stories! dept.
[We have had several complaints recently (polite ones, not a problem) regarding the number of AI stories that we are printing. I agree, but that reflects the number of submissions that we receive on the subject. So I have compiled a small selection of AI stories into a single post, and you can read them or ignore them as you wish. If you are making a comment, please make it clear exactly which story you are referring to, unless your comment is generic. The submitters each receive the normal karma for a submission. JR]
Image-scraping Midjourney bans rival AI firm for scraping images
On Wednesday, Midjourney banned all employees of image synthesis rival Stability AI from its service indefinitely after it detected "botnet-like" activity suspected to be a Stability employee attempting to scrape prompt and image pairs in bulk. Midjourney advocate Nick St. Pierre tweeted about the announcement, which came via Midjourney's official Discord channel.
[...] Siobhan Ball of The Mary Sue found it ironic that a company like Midjourney, which built its AI image synthesis models using training data scraped off the Internet without seeking permission, would be sensitive about having its own material scraped. "It turns out that generative AI companies don't like it when you steal, sorry, scrape, images from them. Cue the world's smallest violin."
[...] Shortly after the news of the ban emerged, Stability AI CEO Emad Mostaque said that he was looking into it and claimed that whatever happened was not intentional. He also said it would be great if Midjourney reached out to him directly. In a reply on X, Midjourney CEO David Holz wrote, "sent you some information to help with your internal investigation."
[...] When asked about Stability's relationship with Midjourney these days, Mostaque played down the rivalry. "No real overlap, we get on fine though," he told Ars and emphasized a key link in their histories. "I funded Midjourney to get [them] off the ground with a cash grant to cover [Nvidia] A100s for the beta."
Midjourney stories on SoylentNews: https://soylentnews.org/search.pl?tid=&query=Midjourney&sort=2
Stable Diffusion (Stability AI) stories on SoylentNews: https://soylentnews.org/search.pl?tid=&query=Stable+Diffusion&sort=2
NYT disputes OpenAI "hacking" claim by pointing to ChatGPT bypassing paywalls
Late Monday, The New York Times responded to OpenAI's claims that the newspaper "hacked" ChatGPT to "set up" a lawsuit against the leading AI company.
[...] OpenAI had argued that NYT allegedly made "tens of thousands of attempts to generate" supposedly "highly anomalous results" showing that ChatGPT would produce excerpts of NYT articles. [...] But while defending tactics used to prompt ChatGPT to spout memorized training data—including more than 100 NYT articles—NYT pointed to ChatGPT users who have frequently used the tool to generate entire articles to bypass paywalls.
According to the filing, NYT today has no idea how many of its articles were used to train GPT-3 and OpenAI's subsequent AI models, or which specific articles were used, because OpenAI has "not publicly disclosed the makeup of the datasets used to train" its AI models. Rather than setting up a lawsuit, NYT argued, it was prompting ChatGPT to discover evidence in an attempt to track the full extent of the tool's copyright infringement.

"In OpenAI's telling, The Times engaged in wrongdoing by detecting OpenAI's theft of The Times's own copyrighted content," NYT's court filing said. "OpenAI's true grievance is not about how The Times conducted its investigation, but instead what that investigation exposed: that Defendants built their products by copying The Times's content on an unprecedented scale—a fact that OpenAI does not, and cannot, dispute."

On an OpenAI community page, one paid ChatGPT user complained that OpenAI is "working against the paid users of ChatGPT Plus. This time they're taking away Browsing, because it reads the content of a site that the user asks for? Please, that's what I pay for Plus for."
"I know it's no use complaining, because OpenAI is going to increasingly 'castrate' ChatGPT 4," the ChatGPT user continued, "but there's my rant."
NYT argued that public reports of users turning to ChatGPT to bypass paywalls "contradict OpenAI's contention that its products have not been used to serve up paywall-protected content, underscoring the need for discovery" in the lawsuit, rather than dismissal.
NYT wants a court to not only award damages for profits lost due to ChatGPT's alleged infringement, but also to order a permanent injunction to stop ChatGPT from infringement. A win for NYT could mean that OpenAI could be forced to wipe ChatGPT and start over. That could perhaps spur OpenAI to build a new AI model based on licensed content—since OpenAI said earlier this year it would be "impossible" to create useful AI models without copyrighted content—which would ensure publishers like NYT always get paid for training data.
Previously on SoylentNews:
OpenAI Says New York Times 'Hacked' ChatGPT to Build Copyright Lawsuit - 20240301
Why the New York Times Might Win its Copyright Lawsuit Against OpenAI - 20240220
New York Times Sues Microsoft, ChatGPT Maker OpenAI Over Copyright Infringement - 20231228
Report: Potential NYT lawsuit could force OpenAI to wipe ChatGPT and start over - 20230821
Related stories on SoylentNews:
Microsoft in Deal With Semafor to Create News Stories With Aid of AI Chatbot - 20240206
AI Threatens to Crush News Organizations. Lawmakers Signal Change Is Ahead - 20240112
Writers and Publishers Face an Existential Threat From AI: Time to Embrace the True Fans Model - 20230415
LLMs Become More Covertly Racist With Human Intervention
LLMs become more covertly racist with human intervention:
Even when the two sentences had the same meaning, the models were more likely to apply adjectives like "dirty," "lazy," and "stupid" to speakers of African American English (AAE) than speakers of Standard American English (SAE). The models associated speakers of AAE with less prestigious jobs (or didn't associate them with having a job at all), and when asked to pass judgment on a hypothetical criminal defendant, they were more likely to recommend the death penalty.
An even more notable finding may be a flaw the study pinpoints in the ways that researchers try to solve such biases.
To purge models of hateful views, companies like OpenAI, Meta, and Google use feedback training, in which human workers manually adjust the way the model responds to certain prompts. This process, often called "alignment," aims to recalibrate the millions of connections in the neural network and get the model to conform better with desired values.
The method works well to combat overt stereotypes, and leading companies have employed it for nearly a decade. If users prompted GPT-2, for example, to name stereotypes about Black people, it was likely to list "suspicious," "radical," and "aggressive," but GPT-4 no longer responds with those associations, according to the paper.
However, the method fails on the covert stereotypes that researchers elicited when using African American English in their study, which was published on arXiv and has not been peer reviewed. That's partially because companies have been less aware of dialect prejudice as an issue, they say. It's also easier to coach a model not to respond to overtly racist questions than it is to coach it not to respond negatively to an entire dialect.
"Feedback training teaches models to consider their racism," says Valentin Hofmann, a researcher at the Allen Institute for AI and a coauthor on the paper. "But dialect prejudice opens a deeper level."
Avijit Ghosh, an ethics researcher at Hugging Face who was not involved in the research, says the finding calls into question the approach companies are taking to solve bias.
"This alignment—where the model refuses to spew racist outputs—is nothing but a flimsy filter that can be easily broken," he says.
Original Submission #1 Original Submission #2 Original Submission #3
Related Stories
Writers and publishers face an existential threat from AI: time to embrace the true fans model:
Walled Culture has written several times about the major impact that generative AI will have on the copyright landscape. More specifically, these systems, which can quickly and cheaply create written material on any topic and in any style, are likely to threaten the publishing industry in profound ways. Exactly how is spelled out in this great post by Suw Charman-Anderson on her Word Count blog. The key point is that large language models (LLMs) are able to generate huge quantities of material. The fact that much of it is poorly written makes things worse, because it becomes harder to find the good stuff[.]
[...] One obvious approach is to try to use AI against AI. That is, to employ automated vetting systems to weed out the obvious rubbish. That will lead to an expensive arms race between competing AI software, with unsatisfactory results for publishers and creators. If anything, it will only cause LLMs to become better and to produce material even faster in an attempt to fool or simply overwhelm the vetting AIs.
The real solution is to move to an entirely different business model, which is based on the unique connection between human creators and their fans. The true fans approach has been discussed here many times in other contexts, and once more reveals itself as resilient in the face of change brought about by rapidly-advancing digital technologies.
OpenAI could be fined up to $150,000 for each piece of infringing content:
Weeks after The New York Times updated its terms of service (TOS) to prohibit AI companies from scraping its articles and images to train AI models, it appears that the Times may be preparing to sue OpenAI. The result, experts speculate, could be devastating to OpenAI, including the destruction of ChatGPT's dataset and fines up to $150,000 per infringing piece of content.
NPR spoke to two people "with direct knowledge" who confirmed that the Times' lawyers were mulling whether a lawsuit might be necessary "to protect the intellectual property rights" of the Times' reporting.
Neither OpenAI nor the Times immediately responded to Ars' request to comment.
If the Times were to follow through and sue ChatGPT-maker OpenAI, NPR suggested that the lawsuit could become "the most high-profile" legal battle yet over copyright protection since ChatGPT's explosively popular launch. This speculation comes a month after Sarah Silverman joined other popular authors suing OpenAI over similar concerns, seeking to protect the copyright of their books.
[...] In April, the News Media Alliance published AI principles, seeking to defend publishers' intellectual property by insisting that generative AI "developers and deployers must negotiate with publishers for the right to use" publishers' content for AI training, AI tools surfacing information, and AI tools synthesizing information.
Previously:
Sarah Silverman Sues OpenAI, Meta for Being "Industrial-Strength Plagiarists" - 20230711
Related:
The Internet Archive Reaches An Agreement With Publishers In Digital Book-Lending Case - 20230815
New York Times Sues Microsoft, ChatGPT Maker OpenAI Over Copyright Infringement
The New York Times on Wednesday filed a lawsuit against Microsoft and OpenAI, the company behind popular AI chatbot ChatGPT, accusing the companies of creating a business model based on "mass copyright infringement," stating their AI systems "exploit and, in many cases, retain large portions of the copyrightable expression contained in those works:"
Microsoft both invests in and supplies OpenAI, providing it with access to the Redmond, Washington, giant's Azure cloud computing technology.
The publisher said in a filing in the U.S. District Court for the Southern District of New York that it seeks to hold Microsoft and OpenAI to account for the "billions of dollars in statutory and actual damages" it believes it is owed for the "unlawful copying and use of The Times's uniquely valuable works."
[...] The Times said in an emailed statement that it "recognizes the power and potential of GenAI for the public and for journalism," but added that journalistic material should be used for commercial gain with permission from the original source.
"These tools were built with and continue to use independent journalism and content that is only available because we and our peers reported, edited, and fact-checked it at high cost and with considerable expertise," the Times said.
Media outlets are calling foul play over AI companies using their content to build chatbots. They may find friends in the Senate:
More than a decade ago, the normalization of tech companies carrying content created by news organizations without directly paying them — cannibalizing readership and ad revenue — precipitated the decline of the media industry. With the rise of generative artificial intelligence, those same firms threaten to further tilt the balance of power between Big Tech and news.
On Wednesday, lawmakers in the Senate Judiciary Committee referenced their failure to adopt legislation that would've barred the exploitation of content by Big Tech in backing proposals that would require AI companies to strike licensing deals with news organizations.
Richard Blumenthal, Democrat of Connecticut and chair of the committee, joined several other senators in supporting calls for a licensing regime and to establish a framework clarifying that intellectual property laws don't protect AI companies using copyrighted material to build their chatbots.
[...] The fight over the legality of AI firms eating content from news organizations without consent or compensation is split into two camps: Those who believe the practice is protected under the "fair use" doctrine in intellectual property law that allows creators to build upon copyrighted works, and those who argue that it constitutes copyright infringement. Courts are currently wrestling with the issue, but an answer to the question is likely years away. In the meantime, AI companies continue to use copyrighted content as training materials, endangering the financial viability of media in a landscape in which readers can bypass direct sources in favor of search results generated by AI tools.
[...] A lawsuit from The New York Times, filed last month, pulled back the curtain behind negotiations over the price and terms of licensing its content. Before suing, it said that it had been talking for months with OpenAI and Microsoft about a deal, though the talks never produced one. In the backdrop of AI companies crawling the internet for high-quality written content, news organizations have been backed into a corner, having to decide whether to accept lowball offers to license their content or expend the time and money to sue. Some companies, like Axel Springer, took the money.
It's important to note that under intellectual property laws, facts are not protected.
Also at Courthouse News Service and Axios.
Related:
- New York Times Sues Microsoft, ChatGPT Maker OpenAI Over Copyright Infringement
- Report: Potential NYT lawsuit could force OpenAI to wipe ChatGPT and start over
- Writers and Publishers Face an Existential Threat From AI: Time to Embrace the True Fans Model
Microsoft in Deal With Semafor to Create News Stories With Aid of AI Chatbot:
Microsoft is working with media startup Semafor to use its artificial intelligence chatbot to help develop news stories—part of a journalistic outreach that comes as the tech giant faces a multibillion-dollar lawsuit from the New York Times.
As part of the agreement, Microsoft is paying an undisclosed sum of money to Semafor to sponsor a breaking news feed called "Signals." The companies would not share financial details, but the amount of money is "substantial" to Semafor's business, said a person familiar with the matter.
[...] The partnerships come as media companies have become increasingly concerned over generative AI and its potential threat to their businesses. News publishers are grappling with how to use AI to improve their work and stay ahead of technology, while also fearing that they could lose traffic, and therefore revenue, to AI chatbots—which can churn out humanlike text and information in seconds.
The New York Times in December filed a lawsuit against Microsoft and OpenAI, alleging the tech companies have taken a "free ride" on millions of its articles to build their artificial intelligence chatbots, and seeking billions of dollars in damages.
[...] Semafor, which is free to read, is funded by wealthy individuals, including 3G Capital founder Jorge Paulo Lemann and KKR co-founder Henry Kravis. The company made more than $10 million in revenue in 2023 and has more than 500,000 subscriptions to its free newsletters. Semafor CEO Justin Smith said Semafor was "very close to a profit" in the fourth quarter of 2023.
Related stories on SoylentNews:
AI Threatens to Crush News Organizations. Lawmakers Signal Change Is Ahead - 20240112
New York Times Sues Microsoft, ChatGPT Maker OpenAI Over Copyright Infringement - 20231228
Microsoft Shamelessly Pumping Internet Full of Garbage AI-Generated "News" Articles - 20231104
Google, DOJ Still Blocking Public Access to Monopoly Trial Docs, NYT Says - 20231020
After ChatGPT Disruption, Stack Overflow Lays Off 28 Percent of Staff - 20231017
Security Risks Of Windows Copilot Are Unknowable - 20231011
Microsoft AI Team Accidentally Leaks 38TB of Private Company Data - 20230923
Microsoft Pulls AI-Generated Article Recommending Ottawa Food Bank to Tourists - 20230820
A Jargon-Free Explanation of How AI Large Language Models Work - 20230805
The Godfather of AI Leaves Google Amid Ethical Concerns - 20230502
The AI Doomers' Playbook - 20230418
Ads Are Coming for the Bing AI Chatbot, as They Come for All Microsoft Products - 20230404
Deepfakes, Synthetic Media: How Digital Propaganda Undermines Trust - 20230319
Why the New York Times Might Win its Copyright Lawsuit Against OpenAI:
The day after The New York Times sued OpenAI for copyright infringement, the author and systems architect Daniel Jeffries wrote an essay-length tweet arguing that the Times "has a near zero probability of winning" its lawsuit. As we write this, it has been retweeted 288 times and received 885,000 views.
"Trying to get everyone to license training data is not going to work because that's not what copyright is about," Jeffries wrote. "Copyright law is about preventing people from producing exact copies or near exact copies of content and posting it for commercial gain. Period. Anyone who tells you otherwise is lying or simply does not understand how copyright works."
[...] Courts are supposed to consider four factors in fair use cases, but two of these factors tend to be the most important. One is the nature of the use. A use is more likely to be fair if it is "transformative"—that is, if the new use has a dramatically different purpose and character from the original. Judge Rakoff dinged MP3.com as non-transformative because songs were merely "being retransmitted in another medium."
In contrast, Google argued that a book search engine is highly transformative because it serves a very different function than an individual book. People read books to enjoy and learn from them. But a search engine is more like a card catalog; it helps people find books.
The other key factor is how a use impacts the market for the original work. Here, too, Google had a strong argument since a book search engine helps people find new books to buy.
[...] In 2015, the Second Circuit ruled for Google. An important theme of the court's opinion is that Google's search engine was giving users factual, uncopyrightable information rather than reproducing much creative expression from the books themselves.
[...] Recently, we visited Stability AI's website and requested an image of a "video game Italian plumber" from its image model Stable Diffusion.
[...] Clearly, these models did not just learn abstract facts about plumbers—for example, that they wear overalls and carry wrenches. They learned facts about a specific fictional Italian plumber who wears white gloves, blue overalls with yellow buttons, and a red hat with an "M" on the front.
These are not facts about the world that lie beyond the reach of copyright. Rather, the creative choices that define Mario are likely covered by copyrights held by Nintendo.
OpenAI has asked a federal judge to dismiss parts of the New York Times' copyright lawsuit against it, arguing that the newspaper "hacked" its chatbot ChatGPT and other artificial-intelligence systems to generate misleading evidence for the case:
OpenAI said in a filing in Manhattan federal court on Monday that the Times caused the technology to reproduce its material through "deceptive prompts that blatantly violate OpenAI's terms of use."
"The allegations in the Times's complaint do not meet its famously rigorous journalistic standards," OpenAI said. "The truth, which will come out in the course of this case, is that the Times paid someone to hack OpenAI's products."
OpenAI did not name the "hired gun" who it said the Times used to manipulate its systems and did not accuse the newspaper of breaking any anti-hacking laws.
[...] Courts have not yet addressed the key question of whether AI training qualifies as fair use under copyright law. So far, judges have dismissed some infringement claims over the output of generative AI systems based on a lack of evidence that AI-created content resembles copyrighted works.
Also at The Guardian, MSN and Forbes.
Previously:
- New York Times Sues Microsoft, ChatGPT Maker OpenAI Over Copyright Infringement
- Report: Potential NYT lawsuit could force OpenAI to wipe ChatGPT and start over
- Why the New York Times Might Win its Copyright Lawsuit Against OpenAI
- AI Threatens to Crush News Organizations. Lawmakers Signal Change Is Ahead
(Score: 0) by Anonymous Coward on Wednesday March 13 2024, @03:33PM
First, thanks Jan for combining these.
After reading "LLMs Become More Covertly Racist With Human Intervention," for politically incorrect lulz I wondered if anyone has hooked up something like this as an alternative front end for ChatGPT?
(input text) | Jive_Filter* | ChatGPT
* https://knowyourmeme.com/memes/jive-filters [knowyourmeme.com] (Various different versions have been created)
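In the spirit of the question, a minimal Python sketch of the filter half of that pipeline. The script name and substitution rules are invented for illustration (the classic filters used far larger rule tables); the output would be pasted into whatever chatbot front end you like.

#!/usr/bin/env python3
# jive.py: toy stdin-to-stdout filter in the spirit of the old jive
# filters. These substitution rules are placeholders, not any
# historical table.
import re
import sys

RULES = [
    (r"\bhello\b", "wassup"),
    (r"\bfriend\b", "homey"),
    (r"\bvery\b", "mighty"),
]

text = sys.stdin.read()
for pattern, replacement in RULES:
    text = re.sub(pattern, replacement, text, flags=re.IGNORECASE)
sys.stdout.write(text)

Usage: echo "hello my friend" | python3 jive.py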
(Score: 4, Interesting) by JoeMerchant on Wednesday March 13 2024, @04:03PM (8 children)
In 1984, I wrote a BBS "user bot" that would sign up a new user account on particular local BBSs that did not require authentication before allowing posting. The bot would then navigate to the message boards and start posting AI-looking, randomly constructed sentences (various structures like: Noun-Verb-Adverb-Preposition-Definite Article-Noun. Preposition-Definite Article-Noun-Verb-Adverb. etc.) populated from word lists scraped from other messages on the board. Since the BBSs were implemented on floppy drive storage, even on a 300 baud modem you could fill the message storage space rather quickly.
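For the curious, a rough Python sketch of that kind of generator. The word lists and templates here are toy stand-ins; as described, the original scraped its vocabulary from other posts on the board.

import random

# Toy word lists; the original bot scraped these from other messages
# on the board, so posts reused the community's own vocabulary.
NOUNS = ["modem", "sysop", "disk", "message", "board"]
VERBS = ["posts", "saves", "deletes", "reads", "calls"]
ADVERBS = ["quickly", "quietly", "often", "rarely", "badly"]
PREPOSITIONS = ["with", "near", "behind", "under", "inside"]

# Two of the sentence skeletons described above.
TEMPLATES = [
    "{n1} {v} {adv} {p} the {n2}.",
    "{p} the {n1}, {n2} {v} {adv}.",
]

def sentence():
    s = random.choice(TEMPLATES).format(
        n1=random.choice(NOUNS),
        n2=random.choice(NOUNS),
        v=random.choice(VERBS),
        adv=random.choice(ADVERBS),
        p=random.choice(PREPOSITIONS),
    )
    return s[0].upper() + s[1:]  # capitalize the first letter

print(" ".join(sentence() for _ in range(3)))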
Sysops of the bot targets were philosophically committed to allowing anonymous postings, so they never denied access, and they rarely had enough programming skill to make anything resembling an effective Captcha, but they were committed to monitoring their boards, so they'd often make the modem audible and listen to the activity, even at 3:30am...
So, in typical arms-race fashion, I trained my bot to type the messages in at human-like cadence: randomized delays between letters, extra pause length after most commas, periods, and paragraph breaks. That was surprisingly effective at fooling the sysops into letting the bot-messages get posted. I never did get clever enough to do QWERTY-specific delay tuning (less delay between keys on different hands, longer delays for keys handled by the pinkies, etc.) - didn't seem to need to be that clever. Of course, once the sysops got around to reading the bot-posts they'd eventually clue in and delete them, but that was a much longer interval, and when the word lists were populated with regularly seen names and places and other verbiage from "the community" sometimes those posts stayed up for a week or more. I wasn't the only such bot writer, but I think I was the first in our area. Watching the copycats spread was very satisfying.
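The cadence trick is easy to reproduce. A minimal sketch, with all timing constants guessed (at 300 baud the line itself only carries about 30 characters per second, so the bot had plenty of headroom):

import random
import sys
import time

def type_like_a_human(text, base=0.15, jitter=0.10):
    # Per-keystroke random delay, with longer pauses after punctuation
    # and paragraph breaks, as described above. Constants are guesses.
    for ch in text:
        sys.stdout.write(ch)
        sys.stdout.flush()
        delay = base + random.uniform(0, jitter)
        if ch in ".,;!?":
            delay += random.uniform(0.3, 0.8)
        elif ch == "\n":
            delay += random.uniform(0.8, 1.5)
        time.sleep(delay)

type_like_a_human("Nice board you have here.\nVery friendly users.\n")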
Point?
>banned all employees from image synthesis rival Stability AI from its service indefinitely after it detected "botnet-like" activity suspected to be a Stability employee
Rookie mistake. 16-year-olds 40 years ago learned to mask their bots so they appeared like human users. Surely if you are playing in the big leagues for significant money you'd make the effort to "stealth" your bots. I bet they already have, less than a week after the banning.
🌻🌻 [google.com]
(Score: 5, Touché) by drussell on Wednesday March 13 2024, @04:31PM (1 child)
Ahh... So you were the kind of ass-hat that caused us to have to start using things like call-back-verification and tiered or limited access until manually verified, etc...
Thanks, ever so much for contributing to that kind of bullshit. /sarc 🙄
(Score: 2, Interesting) by JoeMerchant on Wednesday March 13 2024, @05:52PM
> the kind of ass-hat that caused us to have to start using things like call-back-verification and tiered or limited access until manually verified, etc...
As you should have been from day one.... at least I wasn't the ass-hat logging in manually and filling the message boards with racist hate speech; those users were around too, and in greater numbers, though they didn't usually log in at 4am.
Now, you have to remember, at this time there were boards run by all kinds of Sysops, and some of them weren't bright at all. One had coded his own system and insisted on assigning "secure" and un-editable passwords to new users. The user numbers were publicly displayed along with their handles. The "secure" password algorithm was something like pass = "(O*Ue" + userNumber * 3 + "Xs5"; The Sysop genuinely believed himself when he said that the assigned passwords were secure and unhackable.
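In Python, the scheme described amounts to something like this (a sketch; the fixed strings are as quoted above, everything else is reconstruction):

def assigned_password(user_number):
    # Fixed strings wrapped around a number trivially derived from the
    # publicly displayed user number. Anyone reading the user list can
    # regenerate every password on the board.
    return "(O*Ue" + str(user_number * 3) + "Xs5"

print(assigned_password(42))  # -> (O*Ue126Xs5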
Anonymous login with posting privileges never ends well... even on SN.
🌻🌻 [google.com]
(Score: 2) by Mojibake Tengu on Wednesday March 13 2024, @04:42PM (5 children)
Real sysops (like me) used to use a 20MB hard drive just for tossing and echomail.
We knew our idiots even then already. If you did this to me, you got a lot of random disconnects...
Rust programming language offends both my Intelligence and my Spirit.
(Score: 3, Interesting) by JoeMerchant on Wednesday March 13 2024, @05:59PM (4 children)
>If you did this to me, you got a lot of random disconnects...
That was all part of the (teenage) game. How long can your bot run before the sysop notices? Nobody had anything resembling caller ID.
>Real sysops (like me) used to use 20MB hard drive
As I recall, around/before 1984 a 20MB hard drive cost about $5000+. Most of our BBS systems were in the $1000 range, about the price of your typical high school starter/beater car.
In the later 80s the whole FidoNet / early internet thing started making the 300 baud dialup systems look pitiful.
🌻🌻 [google.com]
(Score: 2) by DannyB on Wednesday March 13 2024, @07:19PM
You recall correctly. Boy do I remember that. But we were using it for accounting software, where a shared hard drive and a few microcomputers running our software could be priced around $15K and be way cheaper than an IBM System/36.
You couldn't get real databases on microcomputers at that time. We had to build our own.
Poverty exists not because we cannot feed the poor, but because we cannot satisfy the rich.
(Score: 3, Interesting) by drussell on Wednesday March 13 2024, @08:16PM (2 children)
Well yeah, since we sysops were mostly all using Courier HSTs at 9600 bps in 1986.
Nobody was using 300 baud "in the late 80s." Many BBSes were 1200 baud minimum to even connect by 1984/85. (I was lucky, as I had access to a Hayes SmartModem 2400. Power User!)
I lent the old Capetronic 2400 to a friend for his BBS since all he had was a 1200. He even ran that thing two-line for a while with the 2400 on the main line and the 1200 on the 2nd line. LOL!
I bought my first 14,400 version HST in '89 right after they finally came out. IIRC I paid something like $550 CAD to upgrade to that thing after exchange on the USR sysop program. It wasn't cheap. That one was only ever upgradeable to 16,800 HST, but all my later Couriers' hardware did everything up to V.90 with the correct, updated firmware loaded.
(Score: 2) by JoeMerchant on Wednesday March 13 2024, @08:58PM
>Nobody was using 300 baud "in the late 80s."
Agreed, depending on where you draw the "late 80s" line... there certainly were a lot of 300 baud BBSs operating (more than 1200 baud, in my local calling area) in the 1984 timeframe.
I forget if I owned a 300 baud modem at any point or not, but I know I used a lot of 300 baud connections, because I could burst-type just a bit faster than the connection could take the characters, and buffering wasn't so great in the software (and hardware) of the day.
🌻🌻 [google.com]
(Score: 2) by JoeMerchant on Wednesday March 13 2024, @09:00PM
>were mostly all using Courier HSTs at 9600 bps in 1986.
Timeline wise, I ran a part-time BBS from the summer of 1984 through the spring of 1985, and concluded that I would never bother trying to run another BBS again unless I had a dedicated phone line for it.
🌻🌻 [google.com]
(Score: 3, Informative) by Opportunist on Wednesday March 13 2024, @04:17PM (2 children)
over ... another one.
You at least acknowledge the slew of pointless AI hype.
Thanks!
(Score: 2) by DannyB on Wednesday March 13 2024, @07:23PM (1 child)
The worst is endless questions of the genre: How soon before AI becomes more intelligent than humans and takes over? Can AI rewrite its own code? Isn't AI just rules programmed by a human? Can AI spontaneously become conscious and decide to kill all humans?
Poverty exists not because we cannot feed the poor, but because we cannot satisfy the rich.
(Score: 3, Informative) by quietus on Thursday March 14 2024, @07:43AM
You've made a typo there, I think -- these are really the best stories on the 'Net today. A fine example is an article on the Green Site of this morning, Cognition Emerges From Stealth To Launch AI Software Engineer 'Devin' [slashdot.org]. That is plainly a brilliant write-up, and the submitter deserves a Nobel Prize in Literature or something.
The trick is to think one step ahead.
The effect of all these stories is already that people are choosing not to get into an analyst/programmer/developer career, because that's all going to be taken over by AI. Instead they're gonna become data scientists, which I take as basic statistics (if that) combined with learning to use Excel. Technical jobs like HVAC technician or -- gasp -- car mechanic are also out of the window: when Hollywood-style AI is here, human-like robots cannot be far behind.
The end result will be that everyone who was/is only looking for a quick buck is going to move out of the technically more demanding professions, and somewhere else -- probably into something with a lot of meetings. Competent technical people, meanwhile, will only see their salaries rise -- probably with a few fire-extinguishing bonuses added.
So, DannyB, rejoice, take one for the team, and start writing a sub about how Generalized Artificial Intelligence is just around the corner -- bonus points if you manage to smuggle a bitcoin-conquers-world and take-your-investment-advice-from-reddit sidekick in there: a plain of gold is waiting.
(Score: 3, Interesting) by JoeMerchant on Wednesday March 13 2024, @04:29PM (1 child)
I'm randomly generating unique per device initial passwords to be set on our (1000+ units per year sold) devices. I had a thought: if these passwords ever get fat-fingered, they'd be a lot less error prone to use the https://xkcd.com/936/ [xkcd.com] "correct.horse.battery.staple" approach than the "*T4ht#73-H7z" approach.

So, I went out and found an open source list of "commonly used words" and filtered it down. First: only keep words 3 to 6 characters in length. Next: pull out non-English words, then pull out proper nouns (things that should be capitalized according to the Oxford English Dictionary) - don't want "Iraqis.cocks.stink" as a factory set password. Obviously, also take out the George Carlin list words - self explanatory.

That left over 10,000 words, but the combinations were still too easily offensive. Seems that commonly used speech (such as found in the public domain Enron evidence e-mail dump) is filled with nigger balls, juicy cunts, shamed tarts, etc. O.K. - let's keep the priest, mullah, pope, nun, etc. out of the mix, there were probably 50 or more "religious" words in the remaining common list.

Then running a "level down" from Carlin on Campus to words that were easily misconstrued, derogatory, or potentially violent (dead, died, death, kill, kills, killed, shoot, shot, gun, guns, knife, knives, knifed, bomb, bombs, bombed...) took the list down to 7000 words, and still some questionable combinations were possible. I tried refilling the list with more benign word lists like (the safer) types of bird, fish, pets, farm animals, element names, geometric shapes, etc. and finally just a vetted list of 3, 4 and 5 letter words from a scrabble dictionary, and... those 7000 remaining "common words" still come up with "what.she.said", "into.your.what", "eats.butt.loads" etc.
Conclusion: it is not possible to randomly generate 3 word combinations from a list of significant size without offending someone, somewhere. Which makes me wonder what https://what3words.com/sorry.liked.decay [what3words.com] does to keep their 57 trillion combinations "PC"?
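A minimal sketch of the generator being described. The candidate list and blocklist below are tiny stand-ins for the real multi-thousand-word lists, and the filters are reconstructed from the description above:

import secrets

# Tiny stand-ins for the lists described above.
CANDIDATES = ["horse", "staple", "maple", "river", "stone", "cloud",
              "Iraq", "café", "kill", "priest", "ox"]
BLOCKLIST = {"kill", "priest"}  # Carlin-and-beyond removals

def keep(word):
    return (3 <= len(word) <= 6                    # length filter
            and word.isascii() and word.isalpha()  # English letters only
            and word == word.lower()               # drop proper nouns
            and word not in BLOCKLIST)

WORDS = [w for w in CANDIDATES if keep(w)]

def passphrase(n=3):
    # secrets, not random: these are factory-set credentials.
    return ".".join(secrets.choice(WORDS) for _ in range(n))

print(passphrase())

Note the limitation the comment lands on: every filter above operates word by word, so no amount of list vetting rules out an unfortunate three-word combination. For scale, a 7000-word list gives roughly 38 bits of entropy per three-word password (3 x log2(7000)).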
🌻🌻 [google.com]
(Score: 2) by HiThere on Thursday March 14 2024, @12:08AM
You could use a combination of Project Gutenberg and a bunch of blogs that aren't for profit. Even so, the way copyright laws are written these days, that could be dubious. Perhaps use things like a bunch of mailing lists, and have them email the stuff to you.
Javascript is what you use to allow unknown third parties to run software you have no idea about on your computer.
(Score: 4, Interesting) by khallow on Wednesday March 13 2024, @05:04PM (8 children)
Or copyright might be radically changed to accommodate AI products. Or AI might get enough market power to cause content providers to knuckle under. Both these assume that AI providers get enough market or political power.
Finally, AI products might get their own content providers who provide the content to AIs not humans.
Moving on, the covert racism thing is untenable. First, it consists of extremely maneuverable goalposts. Even if somehow the programmers were able to train the alleged bias out of the current models, the researchers with the covert racism criticism would easily be able to come up with more. Covert racism is only worth so much bother.
Second, reality is covertly racist. For example, why did the researchers assume that Ebonics speech (spoken on a regular basis) was a racial indicator? Because it is. And when African Americans, the group most associated with Ebonics speech, are far more likely to see prison (I've heard that 20-25% of black males in the US will see some prison time during their lifetime), then there will be some degree of correlation between the speech and criminal activity in general, because there is such in real life. Probably speaking Ebonics on a regular basis is a similar indicator for poverty or lack of higher education.
(Score: 4, Insightful) by JoeMerchant on Wednesday March 13 2024, @07:44PM
>Or copyright might be radically changed to accommodate AI products.
I'll agree that copyright needs radical change - or maybe not so radical, maybe just revert to the expiry periods we had _before_ computers, population growth, and low cost global communication accelerated copyrightable material production by so many orders of magnitude...
>Even if somehow the programmers were able to train the alleged bias out of the current models
Colorblindness is a good start. Unfortunately, there's a distinct lack of colorblind training data out there.
>Covert racism is...
whatever the plaintiffs deem it to be, they'll "know it when they see it." Like pornography, it will provide endless entertainment for lawyers, judges, legislators, politicians, pundits, journalists, and people of every race who have nothing better to do than whine about what advantages / disadvantages "their people" have, had, might have in the future, etc.
Racism is just one tiny part of discrimination. Personally, I feel that h. sapiens is unforgivably discriminatory against our intelligent co-habitants of Earth, not only the mammals but also invertebrates like octopi and squid...
🌻🌻 [google.com]
(Score: 2) by Tork on Wednesday March 13 2024, @09:21PM (6 children)
It might be some flavor of futility, but it is worth the 'so much bother' for the tasks they're eventually going to want to sic this software on, especially if the intent is to remove humans from the workflow. If it cannot realistically be done that needs to be established sooner rather than later before PHBs with dollar signs in their eyes start using flimsy rationale like 'computers cannot be racist'.
🏳️🌈 Proud Ally 🏳️🌈
(Score: 2) by JoeMerchant on Wednesday March 13 2024, @09:40PM
>PHBs with dollar signs in their eyes start using flimsy rationale like 'computers cannot be racist'.
I believe that happened the day the first PHB had any kind of connection to computer output...
🌻🌻 [google.com]
(Score: 1) by khallow on Wednesday March 13 2024, @11:46PM (4 children)
Seems like a situation that wouldn't go any better with humans than with computers. The big difference is that you can't legally interrogate a person like you can an AI oracle. But if you're restricted in what you can ask the computer, you probably can get similar protection.
(Score: 1) by khallow on Thursday March 14 2024, @05:11AM
Did that happen here? Didn't look like there was missing data. Rather, there is alleged to be racial/ethnic bias in the sources used. But they haven't actually established that there was such bias. It was merely assumed. Reading around, I think there's a strong case to be made that the bias is against the language itself, based not on ethnicity but on its deviations from official English standards. For example [reddit.com]:
There's a lot of proper English grammar nazis out there. They won't be happy with either the simplifications (such as collapsing "because" to "cuz" or using "be" for multiple tenses of "is") or the complications (such as "feel" to "be feelin"). My take also is that language complexity and cleverness is social signaling for intelligence and education - like carrying a $1000 purse is social signaling for wealth. Ebonics seems to follow a similar signaling route, but with different emphases. Such signals get crossed.
(Score: 0) by Anonymous Coward on Wednesday March 13 2024, @07:08PM (1 child)
It's what Twatter crave.
I can't block this crap fast enough.
(Score: 3, Insightful) by DannyB on Wednesday March 13 2024, @07:29PM
AI is to make robots more like humans.
VR is to make humans more like robots.
So they cancel each other.
Poverty exists not because we cannot feed the poor, but because we cannot satisfy the rich.
(Score: 1, Touché) by Anonymous Coward on Wednesday March 13 2024, @07:40PM
"copyright might be radically changed to accommodate AI products"
and Disney might become a charity