A startup wants to democratize the tech behind DALL-E 2, consequences be damned – TechCrunch:
DALL-E 2, OpenAI's powerful text-to-image AI system, can create photos in the style of cartoonists, 19th century daguerreotypists, stop-motion animators and more. But it has an important, artificial limitation: a filter that prevents it from creating images depicting public figures and content deemed too toxic.
Now an open source alternative to DALL-E 2 is on the cusp of being released, and it'll have few — if any — such content filters.
London- and Los Altos-based startup Stability AI this week announced the release of a DALL-E 2-like system, Stable Diffusion, to just over a thousand researchers ahead of a public launch in the coming weeks. A collaboration between Stability AI, media creation company RunwayML, Heidelberg University researchers and the research groups EleutherAI and LAION, Stable Diffusion is designed to run on most high-end consumer hardware, generating 512×512-pixel images in just a few seconds given any text prompt.
"Stable Diffusion will allow both researchers and soon the public to run this under a range of conditions, democratizing image generation," Stability AI CEO and founder Emad Mostaque wrote in a blog post. "We look forward to the open ecosystem that will emerge around this and further models to truly explore the boundaries of latent space."
But Stable Diffusion's lack of safeguards compared to systems like DALL-E 2 poses tricky ethical questions for the AI community. Even if the results aren't perfectly convincing yet, making fake images of public figures opens a large can of worms. And making the raw components of the system freely available leaves the door open to bad actors who could train them on subjectively inappropriate content, like pornography and graphic violence.
[...] "Our benchmark models that we release are based on general web crawls and are designed to represent the collective imagery of humanity compressed into files a few gigabytes big," Mostaque said. "Aside from illegal content, there is minimal filtering, and it is on the user to use it as they will."
[...] Mostaque acknowledged that the tools could be used by bad actors to create "really nasty stuff," and CompVis says that the public release of the benchmark Stable Diffusion model will "incorporate ethical considerations." But Mostaque argues that — by making the tools freely available — it allows the community to develop countermeasures.
"We hope to be the catalyst to coordinate global open source AI, both independent and academic, to build vital infrastructure, models and tools to maximize our collective potential," Mostaque said. "This is amazing technology that can transform humanity for the better and should be open infrastructure for all."
[...] Stable Diffusion contains little in the way of mitigations besides training dataset filtering. So what's to prevent someone from generating, say, photorealistic images of protests, pornographic pictures of underage actors, "evidence" of fake moon landings and general misinformation? Nothing really. But Mostaque says that's the point.
"A percentage of people are simply unpleasant and weird, but that's humanity," Mostaque said. "Indeed, it is our belief this technology will be prevalent, and the paternalistic and somewhat condescending attitude of many AI aficionados is misguided in not trusting society ... We are taking significant safety measures including formulating cutting-edge tools to help mitigate potential harms across release and our own services. With hundreds of thousands developing on this model, we are confident the net benefit will be immensely positive and as billions use this tech harms will be negated."
What could possibly go wrong?
(Score: 0, Troll) by Anonymous Coward on Thursday August 18 2022, @02:12AM (1 child)
I hope so. We can fuck catgirls while our desktops churn out millions of racist memes.
(Score: -1, Troll) by Anonymous Coward on Thursday August 18 2022, @04:30AM
With our penis tentacles