Sora Open AI: What You Need to Know

Sora Open AI

By Lightbringer Published 3 months ago • 3 min read

This is additional great news for In further uplifting advancement to generative artificial intelligence, the producer of ChatGPT on Tuesday launched a new tool that could instantly convert written text into short videos. Microsoft-backed OpenAI launched its latest product, Sora, which is a text-to-video generator. Such show-offs have also been done in this area by technology giants like Google and Meta.

In response to the announcement on social media about Sora software, one user told OpenAI CEO Sam Altman "Sam please don’t make me homeless", and he replied, "will generate you a video, what would you like?". The man suggested, "Hmmm, a monkey playing chess in a park". And immediately, Altman shared that masterpiece made by Sora on X platform.

Sora is not publicly available yet and OpenAI has revealed limited information about how it was built.

It is currently available for red teaming, which helps identify flaws in the AI system, as well as for use by visual artists, designers, and filmmakers to gain feedback on the model, the company said in a statement. According to the OpenAI, Sora may confuse the spatial details of a prompt and have difficulty in following a specific camera trajectory.

ChatGPT can now remember your conversation: Here's how it works

"Sora is able to generate complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background," the statement said, adding that it can create multiple shots within a single video.

Apart from generating videos from text prompts, Sora can animate a still image, the company said in a blogpost.

OpenAI hits landmark $2 billion revenue as growth zooms

OpenAI said it was also developing tools that can discern if a video was generated by Sora.

Last year, Meta Platforms beefed up its image generation model Emu to add two AI-based features that can edit and generate videos from text prompts.

What We Know About Sora

Sora is an AI model created by OpenAI that can generate high-quality video up to a minute long based on a text prompt.

“Sora is able to generate complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background,” OpenAI wrote in a blog post. “The model understands not only what the user has asked for in the prompt, but also how those things exist in the physical world.”

Some Issues With Sora

OpenAI says the model can struggle to correctly display cause and effect. For example, say it shows someone drinking water out of a glass, it may not show the water level lowering with every sip.

OpenAI’s engineers are also still working on improving its spatial understanding, things like left and right or forward motion.

Here’s an example provided by OpenAI where Sora shows someone running on a treadmill but instead of running facing the machine, the jogger runs in the opposite direction.

The model is not currently available to the public and the company has not shared a release date. It’s currently in a red teaming phase, where specialized users will test it for potential harm.

When you put AI and video together, the danger is massive. We’ve seen recently how quickly deep fake images can spread and how disturbing they can be. Video raises the stakes exponentially.

Here are some safeguards OpenAI has or plans to have in place:

A detection tool that can alert users when a video was generated by Sora
C2PA metadata to track the origin of an video

Sora is also designed to reject prompts that violate OpenAI’s content policy – for instance content that illustrates violence, sexual behavior or abuse, and public figures.

In addition to red teamers, a team of visual artists, designers, and filmmakers will also have access to Sora, to provide feedback on how it can enable creative work.

The model’s been met with both excitement and fear, as with everything AI-related. Some are excited about the possibilities for businesses and individuals alike to create quality images with simple prompts.

For businesses with limited creative resources, this could be a game changer.

However, some worry about the impact on the job market for creative roles, like designers and animators. Would their roles now become redundant?

Though that’s unlikely, this fear always rises to the surface whenever a new tool comes around. It’s just too early to know, especially since the model isn’t available to the public yet.

Is it as good as it seems from OpenAI’s promos? We’ll have to wait to find out.

What Can Sora Do?

According to Open Ai, they are teaching AI to understand and also know how to simulate the physical world in motion, with the goal of training models that help people solve problems that require real-world interaction.

According to them, the Sora can generate videos up to a minute long while maintaining visual quality and adherence to the user’s prompt.

Advice Guides

About the Creator

Lightbringer

Reader insights

Be the first to share your insights about this piece.

How does it work?

Add your insights

Comments

There are no comments for this story

Be the first to respond and start the conversation.

Keep reading

More stories from Lightbringer and writers in Writers and other communities.