TRANSCRIPTEnglish

Build Your Own AI Assistant (Better Than OpenClaw) - No Subscription, No Tokens

15m 34s2,303 words308 segmentsEnglish

FULL TRANSCRIPT

0:07

Hi, this is Mike Barnes. Today I'm going

0:10

to talk to you about uh an alternative

0:13

to the very popular project called Open

0:16

Claw. Um this is something that uh I

0:19

built. I've been working uh installing

0:22

Open Claw and playing with it for uh two

0:25

weeks since it came out. And uh

0:28

basically I felt that the risks uh for

0:31

the system were just too expensive. Uh

0:34

as you can see on my slide here uh high

0:37

costs is one of the uh real big

0:39

problems. Uh I've been watching some of

0:42

the uh people on YouTube saying that

0:44

they're spending a couple hundred

0:45

dollars a day on uh token costs using

0:49

anthropic and there are some people

0:51

suggesting uh lower costs of doing it.

0:54

But uh these costs are pretty hard to

0:56

control because as the knowledge base

0:59

grows in the system, it continuously

1:02

uses more and more uh uh tokens and

1:05

these tokens uh charge up and you just

1:08

never know how much it's going to be.

1:10

And then of course we've all been

1:11

hearing about the privacy risk because

1:14

uh the system has virtual complete

1:16

control over your system and some people

1:18

are also buying separate computers like

1:21

uh the Mac minis to do it. the system is

1:25

sort of inflexible. There's a limited

1:27

amount of of customization and uh also

1:31

vendor lock lock in risks. Um so I tried

1:36

to use open claw with

1:40

um I was getting good results for a

1:42

while and then the system would break

1:43

and I continuously tried. I did go out

1:46

to outside models. The performance was

1:49

certainly much better, but uh I decided

1:52

that I was going to scrap uh openclaw

1:55

and build something myself. Uh and I was

1:58

going to you uh use uh GLM5, which is a

2:02

model uh that runs for free on on a

2:06

website called z.ai

2:08

to help me develop uh the code. So what

2:12

I wanted to do is to come up with a

2:14

self-improving system that offered a web

2:18

dashboard uh that protected privacy and

2:21

was free and open source with uh no

2:24

subscription required. So uh this is

2:28

probably underestimating what the cost

2:30

would be on a monthly basis. Uh but uh

2:33

you you at a minimum would be spending

2:36

between $15 and $100 to be able to use

2:40

uh the various APIs. This says J GBD4,

2:43

but obviously GPD4 isn't available

2:44

anymore. We'd be using, you know, 5.2 or

2:47

something like that. But after we set up

2:50

this system, uh there's no per token

2:53

charge, no subscription fees, and you

2:56

have unlimited queries forever.

2:59

And uh we're doing all the processing on

3:02

this one locally. Now um we're using

3:04

Alama as the base. So it is possible to

3:07

get uh tokens through the cloud. There's

3:10

a limited amount that resets every week

3:13

uh with Alama. And you can buy more

3:15

tokens. and Alama has some very capable

3:18

uh cloud-based tokens if you want to go

3:20

that way that would be far less

3:21

expensive uh than than using uh uh

3:25

Anthropics or OpenAI uh or even uh

3:29

Google. So the architecture basically is

3:33

is that we developed a web dashboard. We

3:36

have a llama runtime. uh the uh coding

3:39

which I've done for all the agents was

3:41

done on GLM5

3:43

and uh we have uh local fast models

3:46

running go out and query uh GLM5

3:50

on more advanced requirements I've built

3:53

in a document management system I've

3:55

created autolearning and knowledge

3:57

persistence so um here are some of the

4:01

foundations of how to set this up you

4:03

have to install NodeJS uh plus a lama

4:06

NodeJS is required to install Alama. You

4:10

want to pull the models that you're

4:11

going to use. Uh in this particular

4:13

case, I used OSS uh 20B. Uh I used uh

4:18

Kimmy 2.5 cloud, Lava and Gemma. Uh and

4:23

Lava and Jima are both u vision models

4:26

which would allow me to uh do OCR and

4:29

also identify um images.

4:33

When once we get our MPM start, we can

4:36

then use a browser uh and to use

4:39

localhost 3000 to to operate this

4:42

system. Uh we have a natural uh chat

4:45

interface through the browser as well as

4:48

through um I set this up to go with uh

4:51

telegraph.

4:53

I had been using WhatsApp with open call

4:56

but open call uh but when I used um

4:59

WhatsApp it interfered with my my work

5:02

because I do use it for work. Um I added

5:05

because of the work I do I work with the

5:07

federal government. I gave it the API

5:10

for SAM.gov. I put a capability to do

5:13

research there. I added a mo a model

5:17

selector. uh I created it so that uh it

5:21

improves itself. So I think that the

5:24

local process uh it wins because it's

5:27

cheaper, it's safer and better. And so

5:30

what I want to try to do here is to show

5:32

you what I built. Uh you can use this as

5:35

a reference uh in in doing your own

5:37

build. and uh you will wind up with

5:40

something I think will be much less

5:42

expensive, far more customized

5:45

um and much more secure than than using

5:48

open call. So I'm going to start out

5:51

right now showing you um uh the the

5:53

telegram. So, I'm I'm going to bring up

5:56

u the Telegram web interface.

6:01

>> And uh I set this up in this almost the

6:03

exact same way that you would do open

6:05

call, which winds up with my own bot, my

6:08

own bot here. I can now hit uh a slash

6:11

and hit help. And it gives me all my uh

6:15

capabilities here.

6:17

>> So, the model that I have running is

6:19

GPTOSS20B.

6:21

Uh it's uh roughly equivalent to the

6:23

original chat GPT, but it is running

6:26

locally. I have it set up so that I can

6:28

use uh Miniax M uh.2.5 on the cloud for

6:33

very complex tasks. I've used up all my

6:35

tokens uh for this week. So I have that

6:38

will reset next week. And for vision, I

6:41

have Lava uh 7B, but you can also use uh

6:45

one of the Gemma models as well. All of

6:47

these can run locally. And I have the

6:49

system set up. So that for different

6:52

tasks it will select different models I

6:55

have uh research for various uh topics

6:59

that I'm interested in uh space being

7:01

the industry that I work in uh AI uh my

7:05

my own hobby and then uh I can create a

7:08

list of documents uh and I can overwrite

7:12

or edit a document and then if I want

7:14

this the system to remember something I

7:17

can uh press remember I can get a status

7:20

of tokens. I can get a list of the uh

7:23

models that are available uh by typing

7:26

slashmodels

7:28

and I can also get a uh a status uh and

7:33

then while I'm in this area here I have

7:37

uh basically a chat uh capability.

7:41

So, uh, this is fine for when I'm, um,

7:44

outside of my home environment, but when

7:48

I'm in my home environment, I want to

7:50

have something that's a little bit, uh,

7:52

friendlier and a little bit easier to

7:54

use. So, um, what I did is I created

7:58

this this web system right here that you

8:01

see, and I called this, uh, Mike's

8:05

research assistant. Um, as you see here,

8:07

it's a very simple user interface. If

8:09

you've ever worked with chat GPT, uh

8:12

you'll you'll understand this. Uh I have

8:15

uh I believe my models right now. So

8:17

I'll just make sure that it's running uh

8:20

by uh typing hello and I should get some

8:22

sort of response if the uh Okay. So

8:25

hello Mike, can I support you? Uh

8:27

whether it's digging in trends for DAT,

8:29

which is a company I work for, scouting

8:31

new opportunities or just brainstorming

8:33

ideas. This is coming from the local

8:35

model. So I'm going to ask it to just

8:38

with a a button here to give me industry

8:41

news on space. So it's given here uh the

8:45

introduction and then I can uh get a

8:47

link. So uh if I uh press here, it'll

8:52

open up a new tab. The new tab will then

8:55

give me the article that it it's found

8:58

that uh uh it it believes that I would

9:01

be interested in. So, I can go back uh

9:05

to go right back to my local host. Now,

9:09

uh I have this set up here uh that it's

9:13

I've got the autolearn

9:15

and uh this means that as I'm

9:18

researching, I can I can uh learn uh

9:21

from these. Now, let's say I want to go

9:24

look at SAM.gov gov and I want to use

9:27

their API to see if there's any relevant

9:30

um opportunities for me to bid on. This

9:33

is going to go out. It's going to use

9:35

the keywords that I've given it and it's

9:37

going to give me a list of opportunities

9:40

here that I may want to uh to look at.

9:44

Um I can uh search on another topic here

9:49

which is uh AI and defense news. Um and

9:53

then uh I can get a a briefing on just

9:57

basically topics that I'm interested in

9:59

because these are things that I've been

10:02

looking at. Now in addition to this I've

10:06

uh what I I've decided to create a dream

10:09

team of agents and these uh dream teams

10:12

uh basically what they do is uh they

10:16

have certain uh capabilities or

10:18

pre-prompts that tell them how to do

10:21

certain things. So down here you'll see

10:23

the agent. I have a researcher, a

10:25

copywriter, a marketeteer, and a

10:27

strategist. And each one of them can be

10:31

assigned to a different LLM. They each

10:34

have their own prompt, and I can uh uh

10:37

ask them to do different things. So if I

10:40

click on uh the copywriter, for example,

10:43

uh I am now conversing with a different

10:46

personality than if I converse with the

10:49

researcher.

10:51

uh another capability. So uh these are

10:56

these here are my uh experts here. So I

10:59

can uh I have pre-built workflows for

11:02

proposal development uh doing

11:04

competition and uh uh and if I want to

11:09

go into doing uh a market entry, this is

11:12

all pre-prompted to let me to do that.

11:16

Um, I go back to the I can go back here

11:19

to the chat and uh I can uh have it do

11:23

with a uh slash research will do

11:26

research for me, but I have access to my

11:28

local model and I can have it do silly

11:31

things like u write a poem about space

11:35

debris

11:38

and just like chatg

11:40

it'll uh go off and and uh do whatever

11:43

task I want to. You can see the working

11:46

uh that that's going on right now. Um

11:51

as it's doing that uh let me show you

11:53

another capability I added here is let's

11:56

say I'm taking notes that I want it to

11:58

remember. I can uh come in here and say

12:02

uh please

12:04

uh you know please uh remember I have a

12:08

meeting at 3 p.m. on the 21st

12:16

and it will then uh add that to uh my

12:20

memory. Um, I also have the ability to

12:26

uh create documents that I can store.

12:29

And one of the other uh capabilities

12:32

I've had is to attach. So, this attach

12:36

will allow me to add a document or it

12:39

will allow me to um

12:42

uh to add an image. And if I'm using

12:46

let's say uh one of the vision models

12:50

like lava, I can then have that document

12:54

uh broken down in uh you know or OCR

12:57

scanned or I can have an object uh

12:59

described by attaching the op uh the

13:02

object.

13:04

>> So I built this uh using u uh basically

13:08

uh GLM uh 5.0.

13:11

Um, anyone really can outline their own

13:15

workflow uh to build this. Uh, uh, this

13:19

once again is running on my own, uh,

13:22

computer. And, um, let's see here. So,

13:27

here's here's the poem that was uh, uh,

13:30

written, you know, uh, uh, silent shr

13:33

uh, shreds of yesterday's ambition.

13:35

Titanium ghosts of Newton's dance. A

13:38

screwdriver spins through the endless

13:39

midnight where once the star is held a

13:42

watchful trance. So it's sort of amazing

13:45

that without spending any money on

13:47

tokens without uh paying uh for any

13:53

subscriptions I'm able to build this

13:56

system and this system uh with the

13:59

capabilities of the smart team I can add

14:02

other capabilities to this as well. So,

14:06

I offer this up as sort of a model that

14:10

can be used uh for people that don't

14:13

want all the limitations. Once again,

14:16

nothing is going outside of my own

14:19

environment. Uh this is running

14:22

completely on my computer using my own

14:25

uh RTX 4080 processor. I have a 5090 on

14:29

uh my laptop.

14:31

I can access this from outside of my own

14:34

premise. I offer this up as an

14:37

alternative to open claw. Uh leave it to

14:40

people who are more inclined to do

14:43

programming than I am uh to build upon

14:46

this idea. But I think this is a better

14:48

model than open claw.

14:51

Although I have to uh to give credit for

14:53

openclaw for kind of uh instigating me

14:56

into doing this project. And also want

14:59

to thank Randy Hill who um is my friend

15:02

and he's also uh the chief technology

15:04

officer for Govotics because many of

15:07

these concepts uh he developed on an

15:09

enterprise scale months ago on a a

15:12

project he called Japetto. So um this is

15:16

uh a home version uh but I think it's a

15:19

it's a very good model uh that can be

15:22

used uh for anyone who wants to build a

15:26

secure lowcost

15:29

uh assistant.

UNLOCK MORE

Sign up free to access premium features

INTERACTIVE VIEWER

Watch the video with synced subtitles, adjustable overlay, and full playback control.

SIGN UP FREE TO UNLOCK

AI SUMMARY

Get an instant AI-generated summary of the video content, key points, and takeaways.

SIGN UP FREE TO UNLOCK

TRANSLATE

Translate the transcript to 100+ languages with one click. Download in any format.

SIGN UP FREE TO UNLOCK

MIND MAP

Visualize the transcript as an interactive mind map. Understand structure at a glance.

SIGN UP FREE TO UNLOCK

CHAT WITH TRANSCRIPT

Ask questions about the video content. Get answers powered by AI directly from the transcript.

SIGN UP FREE TO UNLOCK

GET MORE FROM YOUR TRANSCRIPTS

Sign up for free and unlock interactive viewer, AI summaries, translations, mind maps, and more. No credit card required.