Previous text-to-video was total shit and topped out at 1–2 seconds. This is really good, and up to a minute per generation. Imagine the storytelling possibilities, or the ads.
That’s actually completely inaccurate, and they’re giving it way too much credit. It cannot consistently portray physics realistically because it isn’t running a physics simulation at all. It’s just 2D images: no consistent world simulation, no 3D modeling, no world rendering.
What it’s actually doing is creating a visual portrayal of the prompt based on the relative scale of similar depictions in its training set, with persistence from frame to frame for up to one minute.
Because it’s based solely on visual relationships of scale, it can’t consistently and realistically depict physical phenomena across multiple generations. Sand, breaking glass, someone drinking or eating, or fluid poured between differently sized containers are all visuals it would struggle to recreate with any physical accuracy.
So while they’re telling the truth that its visual representation of physics is “intuitive” and “implicit,” that’s only because physics is baked into the training data. (Their training dataset also used video rendered in the Unreal game engine, with its built-in physics simulation.)
Physics is innate to real video footage, so the model’s replication of physical phenomena is basically just an artifact of image generation with frame-to-frame consistency across a one-minute video.
The model has no understanding of these phenomena and, strictly speaking, no ability to simulate them.
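The distinction between simulating physics and merely continuing visual patterns can be sketched with a toy example. This is purely illustrative (real video models are diffusion transformers, not frame extrapolators, and every number here is made up): a “visual” model that only repeats the motion it just saw has no concept of acceleration, so its guesses drift further and further from a trajectory governed by an actual equation of motion.

```python
# Toy contrast: an explicit physics step vs. a purely "visual" next-frame
# guess (repeating the last observed motion). Illustrative only.

G = 9.8  # gravitational acceleration, m/s^2

def physics_step(y, v, dt):
    """Advance a falling object using its governing equation (semi-implicit Euler)."""
    v = v - G * dt
    y = y + v * dt
    return y, v

def visual_step(y_prev, y_curr):
    """A 'model' with no physics state: just repeat the last frame-to-frame change."""
    return y_curr + (y_curr - y_prev)

dt = 0.1
# Ground-truth trajectory of a dropped object
ys, v = [100.0], 0.0
for _ in range(20):
    y, v = physics_step(ys[-1], v, dt)
    ys.append(y)

# "Visual" model conditioned on the first two true frames, then rolled out
guess = [ys[0], ys[1]]
for _ in range(19):
    guess.append(visual_step(guess[-2], guess[-1]))

# Constant-velocity guesses diverge from the accelerating truth
print(round(ys[-1], 2), round(guess[-1], 2))  # → 79.42 98.04
```

The extrapolator looks plausible for a frame or two, which is exactly why short clips can fool the eye, but over a longer rollout the missing dynamics become obvious.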