TRANSKRIPTEnglish

5 Levels of Prompting to Create ANY AI Video

16m 47s2,717 ord402 segmentsEnglish

FULLSTÄNDIGT TRANSKRIPT

0:00

There's five levels of prompting you

0:01

need to master AI video and 99% of

0:04

people are stuck at level one or two.

0:06

And here's the thing, each new level

0:08

unlocks incredible possibilities for the

0:10

type of content you can create and gives

0:13

you way more control over the AI videos

0:15

than what you probably think is possible

0:16

right now. So today, I'm breaking down

0:18

every level of prompting you need to go

0:20

from the complete beginner to a seasoned

0:23

pro. This might be the most valuable AI

0:25

guide I ever make. The first level of

0:28

prompting is simply describing the idea

0:30

you have. This is what most people do

0:32

when they first start making AI video.

0:34

So, for example, I've got some pictures

0:36

of myself and let's say I wanted to

0:38

generate a video with a huge fluffy dog

0:41

behind me coming outside the door. In

0:43

the prompt, I would just intuitively

0:45

describe what I want to happen. The

0:47

closet door behind the man opens and a

0:49

giant furry fluffy light blue dog crawls

0:52

out. At this level, you're just

0:54

describing the raw intent behind what

0:56

you want to happen. You don't have any

0:58

structure to your prompt. You don't have

1:00

any special techniques. You're just

1:02

telling the AI model in the most simple

1:04

way exactly what you want to happen.

1:06

Just because these prompts are super

1:08

simple and basic doesn't mean that the

1:11

quality of the video is going to be any

1:13

lower. In fact, the actual quality of

1:15

the video itself will roughly be the

1:18

same regardless of how complicated your

1:20

prompt is. Here are some examples from a

1:22

new sea dance model where these videos

1:24

were all generated using simple idea

1:27

prompts that are only one or two

1:28

sentences long. A massive cracking

1:30

attacks a pirate ship. The captain

1:32

slices it with his sword. Hyperrealistic

1:35

cinematic movie scene. And that's really

1:38

all you need to generate a super

1:39

highquality looking video. Here's a

1:42

video created with an even simpler one-s

1:44

sentence prompt. A nature documentary

1:47

about an otter flying an airplane.

1:50

This is the incredible story of the

1:52

pilot otter. So, if we can make a super

1:55

highquality video using just a super

1:57

simple prompt, what's the point of this

2:00

tutorial? Well, it's not just about

2:02

creating the high quality visuals inside

2:05

the video. It's about controlling what's

2:07

happening exactly the way you want. Even

2:10

though at level one of prompting you can

2:12

make some really good-looking stuff, the

2:13

main issue you're going to run into is

2:15

inconsistency between different video

2:17

generations where you feel like you need

2:19

to generate over and over and over again

2:22

until you finally get a video clip

2:24

that's worth using. Level two is

2:26

structured prompting. Instead of just

2:28

intuitively writing down the raw idea

2:31

for your video, you can instead use a

2:33

structured prompt formula and fit your

2:36

idea into that structure. Now, there's a

2:39

ton of different variations and guides

2:41

like this on the internet. These are

2:43

just structured formulas for writing

2:46

prompts that have been proven to work.

2:48

The key components of these prompts stay

2:50

the same. First is the subject, the

2:52

environment, and the action that they're

2:53

taking. These are all different

2:55

variations of subjects, environments,

2:57

and actions. The next key component of

3:00

these cinematic prompts is the camera

3:02

shot and camera movement. Camera shot

3:05

defines how the subject is framed, like

3:07

a close-up for emotion or a wide shot

3:09

for environment. Camera motion defines

3:11

how the camera moves, like a tracking

3:13

shot or a slow pushin. Together, they

3:16

turn a basic idea into something that

3:18

feels cinematic.

3:20

The third component of the cinematic

3:21

prompt structure, the visual style.

3:24

Visual style defines the overall look of

3:26

the scene, such as realistic anime or 3D

3:29

Pixar, shaping how the final video

3:30

feels. So, here's an example prompt with

3:34

this cinematic style structure. A 1980s

3:37

cinema grainy film, which is the visual

3:40

style, a medium shot, which is the

3:43

camera framing of a tired office worker

3:46

in Japan standing on an empty subway

3:48

platform, loosening his tie as a train

3:50

approaches in the distance, flickering

3:52

tunnel lights, and an analog

3:54

advertisement board glows fatally green.

4:07

Another common structured prompt that

4:10

you're going to run into is the JSON

4:12

prompt. So, a JSON file is just a simple

4:15

text format that's easy for code bases

4:17

to read and write. So, here's an example

4:20

of what a JSON prompt would look like. I

4:23

have my keywords, which are the subject,

4:26

the action, environment, camera, etc.,

4:29

and also a value assigned to each of the

4:31

keywords. And so in this case, the

4:34

subject is a lost hiker. The action is

4:36

struggling to walk through deep snow and

4:39

the environment is a blizzard in a

4:40

frozen mountain range. But the most

4:43

useful thing about these JSON prompts is

4:46

that it's super easy to organize and

4:48

swap out different keywords, especially

4:50

if you're working with multiple people

4:52

with all the keywords neatly laid out.

4:55

the results that you get when using a

4:57

JSON prompt are the same as using a

5:00

regular prompt as long as the

5:02

information that you include in both of

5:04

them are the same. So, here's a few

5:07

different comparisons of videos I have

5:09

where I generated them using a JSON

5:11

prompt and one where I didn't use a JSON

5:13

prompt format, but I included all the

5:16

same keywords and information inside the

5:18

prompt and the results are pretty much

5:20

the same. And I found this to be true

5:22

across tons of different tests. So, the

5:24

moral of the story is the JSON prompt a

5:27

great way to organize and structure your

5:28

prompts. It makes it super easy to work

5:30

with huge databases of prompts, but it's

5:33

not a magic pill that's going to

5:34

suddenly create amazing AI videos.

5:37

Another super important structure prompt

5:39

you need to know is the multi-shot

5:41

prompt, which is a single prompt that

5:43

defines multiple sequential shots, each

5:46

with its own camera angle, action, and

5:48

timing to create a full continuous

5:51

scene. Here's an example of a really

5:53

impressive cinematic sequence generated

5:56

using a multi-shot prompt. Inside the

5:58

prompt for this multi-shot sequence in

6:01

the different cuts are clearly defined

6:03

with a different camera shot and

6:04

movement and also exactly what's

6:07

happening inside of each cut.

6:22

When you write prompts that have these

6:23

structured templates, you'll find

6:25

yourself having way more control over

6:28

exactly what's inside the videos. The

6:30

third level of prompting is reference

6:32

control. So far in the first two levels,

6:35

we've been describing with our words

6:37

what we want to happen inside the AI

6:38

video. And using that, you can get

6:40

really far. But at level three, instead

6:42

of just describing with your words,

6:44

you're directly showing the AI video

6:47

with reference material. exactly how you

6:50

want your videos to be created. So,

6:52

previously we've seen a basic version of

6:55

this where I used an image of myself and

6:58

then prompted for a huge fluffy blue dog

7:01

behind me. In this case, the reference

7:03

is controlling what my appearance is

7:06

going to be. But we can use way more

7:08

than just a single image to guide the AI

7:10

video. In this example, two reference

7:12

images of a character and scene were

7:14

used. The first one with the main

7:16

character holding a dagger. In the

7:18

second image, it shows her and all the

7:19

defeated asalants on the ground. And

7:22

these two images were combined as part

7:24

of the prompt. The AI video was told to

7:27

generate a sequence of the woman

7:29

fighting the assassins with different

7:31

shot composition and perspectives thrown

7:34

in there. And the result is an extremely

7:36

dynamic fight scene all while preserving

7:38

the consistency of the appearance of the

7:40

woman. Beyond just image references, we

7:42

can now also use video references, audio

7:45

references, and of course, text

7:47

descriptions to guide the video

7:48

generation process. Here's an example

7:50

for an exciting AI generated fight scene

7:53

between two characters. And to create

7:55

this video, a bunch of different

LÅS UPP MER

Registrera dig gratis för att få tillgång till premiumfunktioner

INTERAKTIV VISARE

Titta på videon med synkroniserad undertext, justerbart överlägg och fullständig uppspelningskontroll.

REGISTRERA DIG GRATIS FÖR ATT LÅSA UPP

AI-SAMMANFATTNING

Få en omedelbar AI-genererad sammanfattning av videoinnehållet, nyckelpunkter och slutsatser.

REGISTRERA DIG GRATIS FÖR ATT LÅSA UPP

ÖVERSÄTT

Översätt transkriptet till över 100 språk med ett klick. Ladda ner i valfritt format.

REGISTRERA DIG GRATIS FÖR ATT LÅSA UPP

MIND MAP

Visualisera transkriptet som en interaktiv mind map. Förstå strukturen med ett ögonkast.

REGISTRERA DIG GRATIS FÖR ATT LÅSA UPP

CHATTA MED TRANSKRIPT

Ställ frågor om videoinnehållet. Få svar från AI direkt från transkriptet.

REGISTRERA DIG GRATIS FÖR ATT LÅSA UPP

FÅ UT MER AV DINA TRANSKRIPT

Registrera dig gratis och lås upp interaktiv visning, AI-sammanfattningar, översättningar, mind maps och mer. Inget kreditkort krävs.

    5 Levels of Pr… - Fullständigt Transkript | YouTubeTranscript.dev