文本记录English

MIT 6.S087: Foundation Models & Generative AI. IMAGE GENERATION

56m 27s10,038 字数1,413 segmentsEnglish

完整文本记录

0:01

okay welcome to the fourth lecture uh on

0:06

the course called fation Model intive

0:09

AI uh today we will do a very brief uh

0:14

managing of data

0:16

uh data is one of the key components in

0:19

this new type of AI and really deserves

0:21

its own course but we're going to talk a

0:23

little bit about it and then we're going

0:24

to cover stable diffusion which is a uh

0:28

text to image uh generation

0:31

ai dtive ai um okay so what do we have

0:36

left on this course so next week we are

0:38

going to have a lecture on emerging

0:41

Foundation models and and their

0:42

applications so this is foundation

0:44

models in the wild in the in the market

0:46

and in commercial settings especially

0:49

we'll have two uh guest speakers as well

0:52

uh so manoles will talk about AI in

0:55

genomics and Applied Biology and art

0:58

time will talk about autonomous agents

1:01

um and then the last lecture will be on

1:04

AI ethics and Regulation and then we'll

1:06

have a panel at the end of it then

1:07

discuss uh the ethical aspects of

1:12

AI okay so to summarize a little bit

1:15

right first lecture was an introduction

1:17

a very quick intuitive answer to what is

1:19

uh Foundation models and generative

1:21

AI we went on a little bit of a

1:24

philosophical digression and and the

1:27

history of AI the second lecture went

1:29

through all the different algorithms uh

1:32

from a high level perspective and then

1:34

we went into depth the last lecture on

1:37

chpt and right so what do we do in in uh

1:40

what's the key behind Foundation models

1:42

it's be able to learn from observations

1:45

you don't need human beings in a loop

1:47

and you can scale up as as much as you

1:49

want and that you get from this is a

1:51

very kind of contextual relational

1:53

understanding of meaning that mean is

1:54

defined by the company it keeps it's

1:56

self

1:57

referential right so dog is something

1:59

that's walk by owner with a leash it's

2:01

something that has anistic relationship

2:03

with cats it's something with that

2:04

chases F chases fris PES when those are

2:07

thrown this is how we understand what a

2:09

dog is right not actually your parent

2:11

labeling it or really you optimizing

2:15

certain goals like reinforcement

2:16

learning it's you observing dogs in

2:18

different context and correlating dogs

2:20

with other Concepts that's how you

2:22

understand what a dog is that's your

2:24

know that's the main

2:27

trick uh again uh

2:31

uh something that I think is you know

2:33

true when it comes to AI is that you

2:36

know there a very most understanding is

2:38

intuitive and is relies on all the

2:41

different relational uh edges so you

2:44

don't never really fully understand

2:46

something you can always become better

2:48

and it's a lot about just familiar

2:49

familiarizing yourself so if you don't

2:52

understand everything in lecture that's

2:53

fine just try to get some intuition and

2:55

familiarize familiarize yourself with it

2:58

and then kind of keep going

3:01

okay so uh data why is it so important

3:05

well I think a kind of uh complimentary

3:11

perspective on all the breakthroughs in

3:12

AI that we've been seeing is thinking

3:14

about in terms of data and actually how

3:17

the new AI just looks at data the old

3:20

type of data but looks at data in a new

3:21

type of way so it becomes more powerful

3:24

and can use more of it and that's

3:26

basically a very big part of what's

3:27

happening right now and the data is is

3:29

really really key

3:32

um and in some some ways I think data is

3:36

the you know if you want to apply AI in

3:38

actual settings you might care about

3:40

looking at the data understanding data

3:42

is going to be key for you and so that's

3:44

a very interdependent concept like Ai

3:47

and data go goes hand to hand it's very

3:49

very hard to develop better models if

3:50

you don't understand data and vice versa

3:53

so if you start applying AI to your own

3:55

settings understanding how the data uh

3:57

you know what data you have and also how

3:59

AI

4:00

how AI leverag data and what kind of

4:01

data he wants is going to be very very

4:03

important for you and if we take this

4:06

picture we had before about kind of you

4:08

look at the this new AI development as

4:11

some kind of Iceberg right the tip of

4:12

the iceberg are

4:15

uh chat GPT and stable diffusion for

4:18

example right hype stuff that that uh

4:21

people are talking about and then course

4:24

below that it's about understanding soft

4:26

Su press learning the training

4:28

methodology to get the these AIS right

4:31

foundation models and generative Ai and

4:33

then a really big you know significant

4:36

chunk below that the people don't talk

4:38

about as much is the data right that's

4:40

what feeds this whole

4:43

Revolution

4:46

and right and I

4:50

think

4:52

so let's look at open AI for example the

4:55

build chipt right maybe it's like 10 to

4:58

you know 10 engineers working for a year

5:00

or six months actually developing the

5:02

chbt version that we're using right now

5:05

right that's not a lot I mean that's why

5:08

you have a lot of startups now that try

5:09

to replicate CHP chbt just for a lot of

5:11

money in computer train on on data with

5:14

a small team and they can reproduce it

5:16

so that's like in itself I mean it's of

5:19

course impressive but it kind of pales

5:22

in comparison to how the internet was

5:23

created right so they they're able to

5:25

leverage all of internet download it and

5:27

and train on it but the inter was you

5:29

know we spent 20 years you know billions

5:32

of people putting a lot of data online

5:35

that that's a huge effort that you Cann

5:37

replicate right nobody can replicate

5:39

that uh process of creating the internet

5:42

from scratch no company can it would be

5:43

too too expensive and

5:46

so somehow the internet makes this all

5:50

this available and is basically the

5:51

greatest data collection effort in human

5:53

history and that's I think is even more

5:56

vital for CHP than any of the technology

5:59

right right so data is really really

6:02

key and again um I mean if I had to

6:06

choose between and for you as well if

6:08

you had to choose between having chtp or

6:10

having the internet you would rather

6:12

have the data and the internet because

6:14

you can retrain CHP you can create

6:16

better versions so the data is really

6:18

key and having access to it as well it's

6:20

also going to be problematic now with

6:22

copyright Etc where people want to say

6:24

like well stack Overflow but we have

6:26

this data now you you kind of out

6:28

competing us by using our data and

6:32

people are going to Value data more and

6:33

more now as well and it becomes very

6:35

interesting how it's going to affect the

6:37

development of AI because the internet

6:39

has been so easy to use and and people

6:41

haven't really

6:43

cared and again when you work for a

6:46

company whatever you do and you have

6:47

your own problems start looking at the

6:49

data the data has all the secrets so

6:51

it's a it's a very very important to

6:53

think about and not Overlook

6:57

um all right so

7:00

the small piece of philosophizing in

7:02

this class will be about data and so I

7:06

mean equally that that for chtp the

7:09

interesting thing is not the technology

7:12

right it's not the brain of CHP but it's

7:14

the internet maybe it's a similar

7:17

perspective of also human beings like

7:18

maybe we're not that impressive as

7:20

intelligence but actually the data that

7:22

we've created and the whole like get

7:24

biosphere has created that's what really

7:26

matters so maybe we're just more data

7:29

creators for the gene for example if you

7:31

know about this theory about the selfish

7:33

Gene trying to reproduce itself like

7:35

maybe we just collecting data for for

7:37

another purpose basically so right let's

7:40

say that an alien came to uh Earth and

7:43

discover earth and we be like oh they

7:45

probably want to kidnap us and look at

7:47

our brains and dissect us understand us

7:49

maybe they'll be like well no we don't

7:51

care about you guys we want your data we

7:53

just want like all the history of the

7:55

earth and what's going on here collect

7:57

that and then can they can derive their

解锁更多

免费注册以访问高级功能

互动查看器

观看带有同步字幕、可调节叠加层和完整播放控制的视频。

免费注册以解锁

AI 摘要

获取由 AI 立即生成的视频内容摘要、要点和结论。

免费注册以解锁

翻译

一键将字幕翻译成 100 多种语言。以任何格式下载。

免费注册以解锁

思维导图

将字幕可视化为交互式思维导图。一目了然地了解结构。

免费注册以解锁

与字幕聊天

提出关于视频内容的问题。直接从字幕中获取由 AI 驱动的答案。

免费注册以解锁

从您的字幕中获得更多

免费注册并解锁交互式查看器、AI 摘要、翻译、思维导图等。无需信用卡。

    MIT 6.S087: Foundation Models &… - 完整文字记录 | YouTubeTranscript.dev