OpenAI Teases a New Generative Video Model Called Sora

posted by hubie on Friday February 23 2024, @11:12AM

OpenAI teases an amazing new generative video model called Sora [technologyreview.com]:

OpenAI has built a striking new generative video model called Sora that can take a short text description and turn it into a detailed, high-definition film clip up to a minute long.

Based on four sample videos that OpenAI shared with MIT Technology Review ahead of today's announcement, the San Francisco–based firm has pushed the envelope of what's possible with text-to-video generation (a hot new research direction that we flagged as a trend to watch in 2024).

"We think building models that can understand video, and understand all these very complex interactions of our world, is an important step for all future AI systems," says Tim Brooks, a scientist at OpenAI.

[...] Impressive as they are, the sample videos shown here were no doubt cherry-picked to show Sora at its best. Without more information, it is hard to know how representative they are of the model's typical output.

It may be some time before we find out. OpenAI's announcement of Sora today is a tech tease, and the company says it has no current plans to release it to the public. Instead, OpenAI will today begin sharing the model with third-party safety testers for the first time.

In particular, the firm is worried about the potential misuses [technologyreview.com] of fake but photorealistic video [technologyreview.com]. "We're being careful about deployment here and making sure we have all our bases covered before we put this in the hands of the general public," says Aditya Ramesh, a scientist at OpenAI, who created the firm's text-to-image model DALL-E [technologyreview.com].

But OpenAI is eyeing a product launch sometime in the future. As well as safety testers, the company is also sharing the model with a select group of video makers and artists to get feedback on how to make Sora as useful as possible to creative professionals. "The other goal is to show everyone what is on the horizon, to give a preview of what these models will be capable of," says Ramesh.

[...] OpenAI is well aware of the risks that come with a generative video model. We are already seeing the large-scale misuse of deepfake images [technologyreview.com]. Photorealistic video takes this to another level.

[...] The OpenAI team plans to draw on the safety testing it did last year for DALL-E 3. Sora already includes a filter that runs on all prompts sent to the model that will block requests for violent, sexual, or hateful images, as well as images of known people. Another filter will look at frames of generated videos and block material that violates OpenAI's safety policies.
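
OpenAI has not published how these filters are implemented, but the two-stage idea described here (screen the prompt before generation, then screen the generated frames) is easy to sketch. The Python snippet below is a minimal illustration, not OpenAI's actual pipeline: the prompt check uses OpenAI's public moderation endpoint, while frame_is_safe and generate_fn are hypothetical placeholders.

```python
# Sketch of a two-stage safety filter like the one described above:
# (1) reject disallowed prompts before generation, (2) screen generated frames.
# Only the text-moderation call uses a real, public API; the frame classifier
# is a hypothetical stand-in for an image-safety model.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment


def prompt_is_allowed(prompt: str) -> bool:
    """Return False if the prompt is flagged (violence, sexual content, hate, etc.)."""
    result = client.moderations.create(input=prompt).results[0]
    return not result.flagged


def frame_is_safe(frame_bytes: bytes) -> bool:
    """Hypothetical placeholder for a frame-level image-safety classifier."""
    raise NotImplementedError("plug in an image-safety model here")


def generate_video_safely(prompt: str, generate_fn) -> list[bytes] | None:
    if not prompt_is_allowed(prompt):
        return None                      # block the request up front
    frames = generate_fn(prompt)         # whatever video generator is in use
    if not all(frame_is_safe(f) for f in frames):
        return None                      # block output that violates policy
    return frames
```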

OpenAI says it is also adapting a fake-image detector developed for DALL-E 3 to use with Sora. And the company will embed industry-standard C2PA tags, metadata that states how an image was generated, into all of Sora's output. But these steps are far from foolproof. Fake-image detectors are hit-or-miss. Metadata is easy to remove, and most social media sites strip it from uploaded images by default.
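
How easily that metadata disappears is simple to demonstrate. The sketch below, assuming Pillow is installed and using a still image as a stand-in for video output, shows that a plain re-encode drops EXIF and other embedded metadata, which is essentially what most social platforms do on upload; embedded C2PA manifests are lost the same way unless a pipeline deliberately preserves them. The file names are hypothetical.

```python
# Demonstration that embedded metadata does not survive a plain re-encode.
# Uses a still image as a stand-in; the same caveat applies to provenance
# metadata (EXIF, XMP, C2PA manifests) in any media file.
from PIL import Image


def strip_by_reencoding(src_path: str, dst_path: str) -> None:
    """Re-save the image without passing its metadata through."""
    img = Image.open(src_path)
    img.save(dst_path)   # no exif argument, so embedded metadata is dropped


def has_exif(path: str) -> bool:
    return bool(Image.open(path).getexif())


if __name__ == "__main__":
    strip_by_reencoding("tagged.jpg", "stripped.jpg")        # hypothetical files
    print("original has EXIF:   ", has_exif("tagged.jpg"))
    print("re-encoded has EXIF: ", has_exif("stripped.jpg"))  # expected: False
```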

OpenAI Unveils A.I. That Instantly Generates Eye-Popping Videos [nytimes.com]:

In April, a New York start-up called Runway AI unveiled technology that let people generate videos, like a cow at a birthday party or a dog chatting on a smartphone, simply by typing a sentence into a box on a computer screen.

The four-second videos were blurry, choppy, distorted and disturbing. But they were a clear sign that artificial intelligence technologies would generate increasingly convincing videos in the months and years to come.

Just 10 months later, the San Francisco start-up OpenAI has unveiled a similar system that creates videos that look as if they were lifted from a Hollywood movie. A demonstration included short videos — created in minutes — of woolly mammoths trotting through a snowy meadow, a monster gazing at a melting candle and a Tokyo street scene seemingly shot by a camera swooping across the city.

OpenAI, the company behind the ChatGPT chatbot and the still-image generator DALL-E, is among the many companies racing to improve this kind of instant video generator, including start-ups like Runway and tech giants like Google and Meta, the owner of Facebook and Instagram. The technology could speed the work of seasoned moviemakers, while replacing less experienced digital artists entirely.

[Video] This video's A.I. prompt: "Beautiful, snowy Tokyo city is bustling. The camera moves through the bustling city street, following several people enjoying the beautiful snowy weather and shopping at nearby stalls. Gorgeous sakura petals are flying through the wind along with snowflakes." Credit: OpenAI

It could also become a quick and inexpensive way of creating online disinformation, making it even harder to tell what's real on the internet.

OpenAI introduces Sora, its text-to-video AI model [theverge.com]:

OpenAI is launching a new video-generation model, and it's called Sora. The AI company says Sora "can create realistic and imaginative scenes from text instructions." The text-to-video model allows users to create photorealistic videos up to a minute long — all based on prompts they've written.

Sora is capable of creating "complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background," according to OpenAI's introductory blog post. The company also notes that the model can understand how objects "exist in the physical world," as well as "accurately interpret props and generate compelling characters that express vibrant emotions."

The model can also generate a video based on a still image, as well as fill in missing frames on an existing video or extend it. The Sora-generated demos included in OpenAI's blog post include an aerial scene of California during the gold rush, a video that looks as if it were shot from the inside of a Tokyo train, and others. Many have some telltale signs of AI — like a suspiciously moving floor in a video of a museum — and OpenAI says the model "may struggle with accurately simulating the physics of a complex scene," but the results are overall pretty impressive.
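
There is no public Sora API, so the interface below is purely hypothetical: the class, method names, and parameters are invented here only to make the three modes just described (text-to-video, image-to-video, and extending or in-filling an existing clip) concrete.

```python
# Purely hypothetical interface; OpenAI has not published a Sora API.
# Everything here is invented for illustration of the capabilities in the article.
from dataclasses import dataclass


@dataclass
class VideoClip:
    frames: list[bytes]        # encoded frames
    frame_rate: float = 24.0


class HypotheticalSoraClient:
    def text_to_video(self, prompt: str, seconds: int = 60) -> VideoClip:
        """Generate a clip up to a minute long from a text prompt."""
        ...

    def image_to_video(self, image: bytes, prompt: str | None = None) -> VideoClip:
        """Animate a still image, optionally guided by a prompt."""
        ...

    def extend_video(self, clip: VideoClip, extra_seconds: int) -> VideoClip:
        """Extend an existing clip forward or backward in time."""
        ...

    def fill_missing_frames(self, clip: VideoClip,
                            gaps: list[tuple[int, int]]) -> VideoClip:
        """Fill in missing frame ranges (start, end indices) of an existing clip."""
        ...
```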

[...] Earlier this month, OpenAI announced it's adding watermarks to its text-to-image tool DALL-E 3, but notes that they can "easily be removed." Like its other AI products, OpenAI will have to contend with the consequences of fake, photorealistic AI videos being mistaken for the real thing.


Original Submission

Related Stories

Tyler Perry Puts $800 Million Studio Expansion on Hold Because of OpenAI's Sora 16 comments

https://arstechnica.com/information-technology/2024/02/i-just-dont-see-how-we-survive-tyler-perry-issues-hollywood-warning-over-ai-video-tech/

In an interview with The Hollywood Reporter published Thursday, filmmaker Tyler Perry spoke about his concerns related to the impact of AI video synthesis on entertainment industry jobs. In particular, he revealed that he has suspended a planned $800 million expansion of his production studio after seeing what OpenAI's recently announced AI video generator Sora can do.

"I have been watching AI very closely," Perry said in the interview. "I was in the middle of, and have been planning for the last four years... an $800 million expansion at the studio, which would've increased the backlot a tremendous size—we were adding 12 more soundstages. All of that is currently and indefinitely on hold because of Sora and what I'm seeing. I had gotten word over the last year or so that this was coming, but I had no idea until I saw recently the demonstrations of what it's able to do. It's shocking to me."

[...] "It makes me worry so much about all of the people in the business," he told The Hollywood Reporter. "Because as I was looking at it, I immediately started thinking of everyone in the industry who would be affected by this, including actors and grip and electric and transportation and sound and editors, and looking at this, I'm thinking this will touch every corner of our industry."

You can read the full interview at The Hollywood Reporter.

[...] Perry also looks beyond Hollywood and says that it's not just filmmaking that needs to be on alert, and he calls for government action to help retain human employment in the age of AI. "If you look at it across the world, how it's changing so quickly, I'm hoping that there's a whole government approach to help everyone be able to sustain."

Previously on SoylentNews:
OpenAI Teases a New Generative Video Model Called Sora - 20240222

Toys “R” Us Riles Critics With “First-Ever” AI-Generated Commercial Using Sora 11 comments

https://arstechnica.com/information-technology/2024/06/toys-r-us-riles-critics-with-first-ever-ai-generated-commercial-using-sora/

On Monday, Toys "R" Us announced that it had partnered with an ad agency called Native Foreign to create what it calls "the first-ever brand film using OpenAI's new text-to-video tool, Sora." OpenAI debuted Sora in February, but the video synthesis tool has not yet become available to the public. The brand film tells the story of Toys "R" Us founder Charles Lazarus using AI-generated video clips.

"We are thrilled to partner with Native Foreign to push the boundaries of Sora, a groundbreaking new technology from OpenAI that's gaining global attention," wrote Toys "R" Us on its website. "Sora can create up to one-minute-long videos featuring realistic scenes and multiple characters, all generated from text instruction. Imagine the excitement of creating a young Charles Lazarus, the founder of Toys "R" Us, and envisioning his dreams for our iconic brand and beloved mascot Geoffrey the Giraffe in the early 1930s."

Previously on SoylentNews:
Tyler Perry Puts $800 Million Studio Expansion on Hold Because of OpenAI's Sora - 20240225
OpenAI Teases a New Generative Video Model Called Sora - 20240222
Toys 'R' Us Files for Bankruptcy Protection in US - 20170919 (Toys 'R' Us is a "zombie brand" now; the Canadian entity has always been separate and still exists.)


Original Submission

This discussion was created by hubie (1068) for logged-in users only, but now has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • (Score: 3, Touché) by PiMuNu on Friday February 23 2024, @12:17PM (1 child)

    by PiMuNu (3823) on Friday February 23 2024, @12:17PM (#1345834)

    > can understand video, and understand all these very complex interactions of our world

    Deliberate mis-selling.

    "Can cut and paste video in a sufficiently obfuscated way" is the correct statement

    • (Score: 5, Insightful) by choose another one on Friday February 23 2024, @02:01PM

      by choose another one (515) Subscriber Badge on Friday February 23 2024, @02:01PM (#1345845)

      Maybe, but it's getting much better at it awfully rapidly.

End of the day, almost all video/film is edited and cut-and-pasted together (it used to be literally) to get the desired effect; that skill is now being automated.

I've already seen an analysis video of this which the YouTube algorithm threw into my list. I won't say it was scary, but it was sufficiently interesting and engaging that I actually carried on watching it, which is unusual for random algo recs. https://www.youtube.com/watch?v=NXpdyAWLDas [youtube.com]

I'm not a singularity fanboy or scaremonger, but I am now reasonably convinced that we are heading somewhere quite different to today, and heading there very fast.

  • (Score: 3, Informative) by looorg on Friday February 23 2024, @02:33PM

    by looorg (578) on Friday February 23 2024, @02:33PM (#1345848)

So is it on the level with Google's new Gemini? 'Cause that is apparently having a difficult relationship with history and reality when it comes to rendering things.

  • (Score: 2) by Rosco P. Coltrane on Friday February 23 2024, @04:31PM (3 children)

    by Rosco P. Coltrane (4757) on Friday February 23 2024, @04:31PM (#1345874)

    and you'll have a pretty good idea of what my face looks like when I read yet another fucking piece on fucking AI.

    • (Score: 2) by janrinok on Friday February 23 2024, @04:36PM (2 children)

      by janrinok (52) Subscriber Badge on Friday February 23 2024, @04:36PM (#1345878) Journal

      You might want to go outside for a walk on Saturday at least. We can only process the submissions that we get.

      --
      I am not interested in knowing who people are or where they live. My interest starts and stops at our servers.
      • (Score: 2) by Rosco P. Coltrane on Friday February 23 2024, @06:14PM (1 child)

        by Rosco P. Coltrane (4757) on Friday February 23 2024, @06:14PM (#1345914)

        It's not just here you know. You can't go to a public bathroom without hearing about AI lately.

        I vent here because why not.

        Also, I know you're just pushing submission to the front page. The criticism wasn't directed at SN: more to submitters, to submit a bit more diverse news for a change. There are interesting things other than AI...

        • (Score: 2) by janrinok on Friday February 23 2024, @06:32PM

          by janrinok (52) Subscriber Badge on Friday February 23 2024, @06:32PM (#1345917) Journal

          No offence taken, I can assure you! Editors have to have much thicker skin than that.

However, I will second your plea for more variety in submissions, please.

          --
          I am not interested in knowing who people are or where they live. My interest starts and stops at our servers.
  • (Score: 4, Informative) by DadaDoofy on Friday February 23 2024, @07:01PM (1 child)

    by DadaDoofy (23827) on Friday February 23 2024, @07:01PM (#1345925)

    Pro Tip: Be sure to do the tests that will save OpenAI the embarrassment of releasing an openly racist product, like Google Gemini.

    https://www.cnn.com/2024/02/22/tech/google-gemini-ai-image-generator/index.html [cnn.com]
