TRANSCRIPTIONEnglish

MIT 6.S087: Foundation Models & Generative AI. IMAGE GENERATION

56m 27s10,038 mots1,413 segmentsEnglish

TRANSCRIPTION COMPLÈTE

0:01

okay welcome to the fourth lecture uh on

0:06

the course called fation Model intive

0:09

AI uh today we will do a very brief uh

0:14

managing of data

0:16

uh data is one of the key components in

0:19

this new type of AI and really deserves

0:21

its own course but we're going to talk a

0:23

little bit about it and then we're going

0:24

to cover stable diffusion which is a uh

0:28

text to image uh generation

0:31

ai dtive ai um okay so what do we have

0:36

left on this course so next week we are

0:38

going to have a lecture on emerging

0:41

Foundation models and and their

0:42

applications so this is foundation

0:44

models in the wild in the in the market

0:46

and in commercial settings especially

0:49

we'll have two uh guest speakers as well

0:52

uh so manoles will talk about AI in

0:55

genomics and Applied Biology and art

0:58

time will talk about autonomous agents

1:01

um and then the last lecture will be on

1:04

AI ethics and Regulation and then we'll

1:06

have a panel at the end of it then

1:07

discuss uh the ethical aspects of

1:12

AI okay so to summarize a little bit

1:15

right first lecture was an introduction

1:17

a very quick intuitive answer to what is

1:19

uh Foundation models and generative

1:21

AI we went on a little bit of a

1:24

philosophical digression and and the

1:27

history of AI the second lecture went

1:29

through all the different algorithms uh

1:32

from a high level perspective and then

1:34

we went into depth the last lecture on

1:37

chpt and right so what do we do in in uh

1:40

what's the key behind Foundation models

1:42

it's be able to learn from observations

1:45

you don't need human beings in a loop

1:47

and you can scale up as as much as you

1:49

want and that you get from this is a

1:51

very kind of contextual relational

1:53

understanding of meaning that mean is

1:54

defined by the company it keeps it's

1:56

self

1:57

referential right so dog is something

1:59

that's walk by owner with a leash it's

2:01

something that has anistic relationship

2:03

with cats it's something with that

2:04

chases F chases fris PES when those are

2:07

thrown this is how we understand what a

2:09

dog is right not actually your parent

2:11

labeling it or really you optimizing

2:15

certain goals like reinforcement

2:16

learning it's you observing dogs in

2:18

different context and correlating dogs

2:20

with other Concepts that's how you

2:22

understand what a dog is that's your

2:24

know that's the main

2:27

trick uh again uh

2:31

uh something that I think is you know

2:33

true when it comes to AI is that you

2:36

know there a very most understanding is

2:38

intuitive and is relies on all the

2:41

different relational uh edges so you

2:44

don't never really fully understand

2:46

something you can always become better

2:48

and it's a lot about just familiar

2:49

familiarizing yourself so if you don't

2:52

understand everything in lecture that's

2:53

fine just try to get some intuition and

2:55

familiarize familiarize yourself with it

2:58

and then kind of keep going

3:01

okay so uh data why is it so important

3:05

well I think a kind of uh complimentary

3:11

perspective on all the breakthroughs in

3:12

AI that we've been seeing is thinking

3:14

about in terms of data and actually how

3:17

the new AI just looks at data the old

3:20

type of data but looks at data in a new

3:21

type of way so it becomes more powerful

3:24

and can use more of it and that's

3:26

basically a very big part of what's

3:27

happening right now and the data is is

3:29

really really key

3:32

um and in some some ways I think data is

3:36

the you know if you want to apply AI in

3:38

actual settings you might care about

3:40

looking at the data understanding data

3:42

is going to be key for you and so that's

3:44

a very interdependent concept like Ai

3:47

and data go goes hand to hand it's very

3:49

very hard to develop better models if

3:50

you don't understand data and vice versa

3:53

so if you start applying AI to your own

3:55

settings understanding how the data uh

3:57

you know what data you have and also how

3:59

AI

4:00

how AI leverag data and what kind of

4:01

data he wants is going to be very very

4:03

important for you and if we take this

4:06

picture we had before about kind of you

4:08

look at the this new AI development as

4:11

some kind of Iceberg right the tip of

4:12

the iceberg are

4:15

uh chat GPT and stable diffusion for

4:18

example right hype stuff that that uh

4:21

people are talking about and then course

4:24

below that it's about understanding soft

4:26

Su press learning the training

4:28

methodology to get the these AIS right

4:31

foundation models and generative Ai and

4:33

then a really big you know significant

4:36

chunk below that the people don't talk

4:38

about as much is the data right that's

4:40

what feeds this whole

4:43

Revolution

4:46

and right and I

4:50

think

4:52

so let's look at open AI for example the

4:55

build chipt right maybe it's like 10 to

4:58

you know 10 engineers working for a year

5:00

or six months actually developing the

5:02

chbt version that we're using right now

5:05

right that's not a lot I mean that's why

5:08

you have a lot of startups now that try

5:09

to replicate CHP chbt just for a lot of

5:11

money in computer train on on data with

5:14

a small team and they can reproduce it

5:16

so that's like in itself I mean it's of

5:19

course impressive but it kind of pales

5:22

in comparison to how the internet was

5:23

created right so they they're able to

5:25

leverage all of internet download it and

5:27

and train on it but the inter was you

5:29

know we spent 20 years you know billions

5:32

of people putting a lot of data online

5:35

that that's a huge effort that you Cann

5:37

replicate right nobody can replicate

5:39

that uh process of creating the internet

5:42

from scratch no company can it would be

5:43

too too expensive and

5:46

so somehow the internet makes this all

5:50

this available and is basically the

5:51

greatest data collection effort in human

5:53

history and that's I think is even more

5:56

vital for CHP than any of the technology

5:59

right right so data is really really

6:02

key and again um I mean if I had to

6:06

choose between and for you as well if

6:08

you had to choose between having chtp or

6:10

having the internet you would rather

6:12

have the data and the internet because

6:14

you can retrain CHP you can create

6:16

better versions so the data is really

6:18

key and having access to it as well it's

6:20

also going to be problematic now with

6:22

copyright Etc where people want to say

6:24

like well stack Overflow but we have

6:26

this data now you you kind of out

6:28

competing us by using our data and

6:32

people are going to Value data more and

6:33

more now as well and it becomes very

6:35

interesting how it's going to affect the

6:37

development of AI because the internet

6:39

has been so easy to use and and people

6:41

haven't really

6:43

cared and again when you work for a

6:46

company whatever you do and you have

6:47

your own problems start looking at the

6:49

data the data has all the secrets so

6:51

it's a it's a very very important to

6:53

think about and not Overlook

6:57

um all right so

7:00

the small piece of philosophizing in

7:02

this class will be about data and so I

7:06

mean equally that that for chtp the

7:09

interesting thing is not the technology

7:12

right it's not the brain of CHP but it's

7:14

the internet maybe it's a similar

7:17

perspective of also human beings like

7:18

maybe we're not that impressive as

7:20

intelligence but actually the data that

7:22

we've created and the whole like get

7:24

biosphere has created that's what really

7:26

matters so maybe we're just more data

7:29

creators for the gene for example if you

7:31

know about this theory about the selfish

7:33

Gene trying to reproduce itself like

7:35

maybe we just collecting data for for

7:37

another purpose basically so right let's

7:40

say that an alien came to uh Earth and

7:43

discover earth and we be like oh they

7:45

probably want to kidnap us and look at

7:47

our brains and dissect us understand us

7:49

maybe they'll be like well no we don't

7:51

care about you guys we want your data we

7:53

just want like all the history of the

7:55

earth and what's going on here collect

7:57

that and then can they can derive their

DÉBLOQUER PLUS

Inscrivez-vous gratuitement pour accéder aux fonctionnalités premium

VISUALISEUR INTERACTIF

Regardez la vidéo avec des sous-titres synchronisés, une superposition réglable et un contrôle total de la lecture.

INSCRIVEZ-VOUS GRATUITEMENT POUR DÉBLOQUER

RÉSUMÉ IA

Obtenez un résumé instantané généré par l'IA du contenu de la vidéo, des points clés et des principaux enseignements.

INSCRIVEZ-VOUS GRATUITEMENT POUR DÉBLOQUER

TRADUIRE

Traduisez la transcription dans plus de 100 langues en un seul clic. Téléchargez dans n'importe quel format.

INSCRIVEZ-VOUS GRATUITEMENT POUR DÉBLOQUER

CARTE MENTALE

Visualisez la transcription sous forme de carte mentale interactive. Comprenez la structure en un coup d'œil.

INSCRIVEZ-VOUS GRATUITEMENT POUR DÉBLOQUER

DISCUTER AVEC LA TRANSCRIPTION

Posez des questions sur le contenu de la vidéo. Obtenez des réponses alimentées par l'IA directement à partir de la transcription.

INSCRIVEZ-VOUS GRATUITEMENT POUR DÉBLOQUER

TIREZ LE MEILLEUR PARTI DE VOS TRANSCRIPTIONS

Inscrivez-vous gratuitement et débloquez la visionneuse interactive, les résumés IA, les traductions, les cartes mentales, et plus encore. Aucune carte de crédit requise.

    MIT 6.S087: Fou… - Transcription Complète | YouTubeTranscript.dev