Full Transcript

·YouTLDR

How Export Controls Helped Not Hurt China & Power is the Bottleneck to AI | Perplexity CEO

1:35:09EnglishTranscribed Jun 16, 2026

Open in Studio

0:00

I have nothing to lose.

0:02

I came from nothing. I never even

0:04

imagined myself to be doing all this.

0:06

>> A $20 billion company,

0:08

>> $45 million users,

0:10

>> over a billion searches a month,

0:11

>> built in 3 years by 400 people.

0:14

>> These numbers like doesn't motivate me.

0:15

It's hard to get motivated by wealth.

0:17

You want to get motivated by impact.

0:19

>> This is perplexity with co-founder and

0:21

CEO Aravven Shrinus.

0:23

>> No one's ever in a comfortable position

0:25

that no one can relax.

0:26

>> They forced Google to redesign their

0:28

homepage. then bid $34 billion to buy

0:31

Chrome.

0:31

>> More than their own valuation.

0:33

>> Perplexity changed Google.com more than

0:35

any product manager at Google has ever

0:37

done. Now you look at AI mode, it looks

0:39

exactly like Perplexity.

0:40

>> He doesn't do defense. He doesn't do

0:42

comfortable. His words, attack, attack,

0:44

attack. That's my motto. Go all in and

0:46

try your best. Be on the offense all the

0:48

time.

0:49

>> You know what I hate with podcasts? When

0:50

people sit on the fence. Aravind has

0:53

really strong opinions in the show

0:55

today. He says that Micron will be more

0:57

valuable than better. He says that the

0:59

resistance to data centers will continue

1:01

and get worse. He says the biggest

1:03

problem today is a lack of power. He

1:05

claims that perplexity has changed

1:07

Google more than any Google PM. You want

1:10

opinions? This is the show for you.

1:13

Ready to go

1:25

Ara. Dude, I am so excited that we get

1:27

to do this. We've done one remote and

1:29

then we did one at Founders Forum last

1:31

year. So, thank you so much for joining

1:32

me in person.

1:33

>> Thanks a lot, Harry.

1:34

>> Dude, I It's a weird start, but just

1:36

roll with me on it. I asked this of the

1:39

best founders that I meet. Are you

1:40

motivated more by the fear of failing or

1:43

by the thrill of winning?

1:44

>> Thrill of winning.

1:46

>> Why? Because I have nothing to lose.

1:49

I came from nothing.

1:52

Like I I I never even imagined myself to

1:54

be doing all this. So my life has

1:57

already been extraordinary u beyond any

1:59

level of imagination. Um I was just in

2:02

India like you know doing my undergrad

2:05

and you know just just training neural

2:07

nets with graphics cards that people in

2:09

the labs were using for playing video

2:11

games. It was all for fun and um you

2:14

know my path led me all the way here. It

2:17

was never like a mo my for my mom just

2:19

getting a job was success because we

2:21

were not we were financially lower

2:24

middle class in India which is not even

2:28

like lower middle class in UK or the US

2:31

and so from there all we wanted to do

2:33

was get a job in Google being an

2:35

engineer at Google was considered a win

2:38

and so I'm I'm already doing remarkably

2:40

well compared to that ambition we had as

2:43

a family so there's really nothing for

2:47

me to lose. That's why anytime I try to

2:50

act like I'm trying to avoid failure,

2:52

I'm being on the defense. I remind

2:54

myself that like that's the stupidest

2:56

thing to do. Like you know, you you it's

2:59

better go all in and try your best. Be

3:02

on the offense all the time. Attack,

3:03

attack, attack.

3:04

>> When you review then what are you not

3:07

being aggressive enough on today? Maybe

3:10

in the early days we be very very loud

3:12

on social media talking about perplexity

3:15

versus Google and I used to do that

3:16

myself a lot. So and some people don't

3:19

like me for having done that. Today I'm

3:22

a lot more measured in how I talk about

3:24

our products or competitors and stuff

3:25

like that. But it's not a lack of

3:27

aggression or anything. Um it's just

3:30

that like it's that that is boring.

3:31

People already heard that enough from

3:33

me.

3:34

>> Do you regret the being so bold in your

3:36

messaging?

3:36

>> No. So it's not a nuance and maturation

3:41

of message. It's a that stale and I need

3:44

something new.

3:45

>> Not just that, I I kind of don't think

3:47

it's a relevant framing anymore. We

3:50

worked on search. Perplexity started out

3:52

as search. We built the first answer

3:54

engine in the world that people know

3:56

perplexity even today if you mention the

3:58

name perplexity people would think oh

3:59

that's an answer engine. We built a lot

4:02

more things after that. We built a lot

4:03

of agents, browser agents, deep

4:05

research, computer. We built so many

4:08

products after that but we're still

4:09

known for that first product and um the

4:13

mark has already been made the we

4:16

changed the road map of Google. You

4:18

could argue that I or the company

4:21

Perplexity changed Google.com more than

4:24

any product manager at Google has ever

4:26

done.

4:26

>> Make that argument for me.

4:28

>> Well, they never that nobody ever wanted

4:30

to ship an answer engine at Google.

4:32

Nobody like nobody wanted to tinker

4:36

anything on on the interface that made

4:38

them $250 billion a year. And uh and

4:42

then now you look at AI mode,

4:45

it looks exactly like perplexity.

4:48

There's there's not even any difference

4:49

like the font, the citations, the

4:53

specific bolding of inline text, inline

4:55

hyperlinks, um suggested follow-ups, the

4:59

whole experience is literally looking

5:01

like perplexity except it's still not as

5:04

good. And so

5:05

>> is that bad or good for you that they

5:07

learn from you and adapt?

5:09

>> It's it's it's it's both good and bad in

5:11

the sense, you know, you have to

5:13

obviously we I knew this like around end

5:15

of 2024 this is going to happen. So, it

5:18

never caught me by surprise at all. Um,

5:21

it was just a matter of time. I still am

5:23

I'm am surprised that the quality is

5:26

still not there cuz I I regularly test

5:29

every product out there. And u but I'm

5:32

I'm happy that honestly uh they they

5:35

they changed Google to be what it should

5:38

be. And um I believe that the frontier

5:42

is where the money is. The frontier in

5:45

AI is not about answering questions

5:47

anymore. It's about actually going and

5:49

doing work for you. We, you know, like

5:51

we still have the state-of-the-art deep

5:53

research the world will. And that's

5:56

actually where people subscribe to pay

5:58

for our pro or max products is not for

6:00

getting answers in in the in the

6:03

traditional way. They're asking for

6:05

sophisticated research reports. They're

6:07

asking for agents that go and do things

6:09

for you. And so we wouldn't have been

6:12

able to do all that if we were sitting

6:14

in 2024 thinking we have everything

6:17

settled here. We're we're good and

6:18

comfortable. No, we it the answer engine

6:21

was always a lead genen for the frontier

6:25

products we build. You need something,

6:27

right? Like think about it. Every

6:28

company needs to have one successful

6:31

product to build the next set of

6:33

products. And in AI, nobody can sit

6:37

comfortably thinking they have it all

6:39

sorted out, including Anthropic. If

6:42

Anthropic thinks cloud code is already a

6:44

win, in 6 or 12 months from now, they

6:47

won't even be around. And so that's the

6:49

uncomfortable it it's it's it's it's an

6:51

uncomfortable fact about the whole

6:53

field. Would you argue today, you just

6:56

told me, if you don't mind me quoting

6:58

you here, um, you just told me before we

7:00

started, that you think OpenAI isn't

7:02

ready for an IPO.

7:05

Um would you have believed you would be

7:07

in a position to say this two years ago

7:10

when nobody h wanted to deal with any

7:13

product other than chat GPT.

7:15

Think about it. So anyone even in such a

7:19

massive advantages position can be in a

7:23

can be put in a position where they're

7:24

no longer the kings. They're fighting

7:26

from behind. Right? So that's the state

7:29

of the field. That's just it's less

7:31

about perplexity or anthropic or open AI

7:34

not having modes or having modes.

7:35

>> Can I push back on you that I would I I

7:38

would stand by two years ago even when

7:40

they were a do and they are still a

7:42

dominant consumer product but I would

7:44

stand by it because I don't think they

7:45

are financially ready when you look at

7:47

the balance sheet of that that

7:49

>> maybe maybe I'll decouple that. I'll

7:51

decouple that.

7:52

>> Let's decouple that being like financial

7:54

readiness for an IPO

7:56

>> versus perception of a dominant leader.

7:59

Yeah.

8:00

>> Do you perceive them as a dominant

8:02

leader right now?

8:03

>> Yes.

8:04

>> In what?

8:05

>> Consumer search.

8:06

>> Well, except there's no money there,

8:08

right? Because it's been commoditized.

8:11

So, it's it's always a legion. Like, for

8:13

example, why why why are they going all

8:15

in on Codeex? Cuz that's where the money

8:16

is. And uh we're doing the same on

8:19

computer. Anthropic is doing the same on

8:21

cloud code. Google doesn't yet have a

8:23

product in this category, but I'm sure

8:25

they're going to come after that. Meta

8:27

is trying to launch Hatch for $200 a

8:29

month. You You see that? You see what's

8:31

happening, right? So, nobody

8:33

>> But there has to be more money than just

8:36

code codeex claw.

8:38

>> It's not about It's not about code.

8:40

That's the main thing. The the money at

8:42

least in non-advertising.

8:45

I'm not talking about advertising

8:46

revenue. In non-advertising subscription

8:49

or usage based revenue, the money is in

8:52

whatever is the frontier. And today the

8:55

frontier is about doing going out there

8:58

and doing things for you. And uh

9:00

>> do you not think then that there will be

9:02

a 100 to20 billion advertising business

9:06

for open AI? It's

9:08

>> yet to be proven. Let's let's work

9:09

through the categories of advertising.

9:12

Um who's the number one advertiser on

9:14

Google? Amazon. It's a number two.

9:18

Booking.com.

9:20

Number three or four I think Expedia.

9:22

So, um, how much do you think

9:24

Booking.com spends on Google? 16

9:26

billion, something like that. Something

9:28

some some some crazy amount like that.

9:31

Um, and, uh,

9:33

um, how do you book your hotels or

9:36

flights today? Do you book it on chat

9:38

GBT or do you book it on Google?

9:40

>> Google.

9:41

>> Why is that?

9:42

>> For me, I actually like discovery. I

9:44

would like to see the options.

9:45

>> Exactly. Right. So the interface the

9:48

interface is less about conversations

9:51

and more about exploration.

9:53

So when when the decision making is more

9:55

subjective and vibes based, you don't

9:59

need an objective

10:01

answer engine. And and so it's it's it's

10:04

and and and you think about the other

10:06

category of advertising, direct to

10:07

consumer products, fashion. Where is

10:10

most of that advertising budget going

10:12

into? It's going to meta, Instagram,

10:15

because you're just browsing. You're

10:18

just like doom scrolling or whatever

10:20

they call it, right? And so, uh, the

10:22

chat interface doesn't capture that user

10:26

intent, that user behavior right now,

10:29

which is why it was never a great fit

10:32

for advertising. And u, it also

10:35

fundamentally corrupts the trust that

10:37

people have when they go into a product

10:39

and they want the accurate answer, which

10:42

is what, you know, perplexity is known

10:43

for. Um, and then you're like, "Hey, by

10:46

the way, you know, you ask for the most

10:49

um highest like like mo best protein

10:51

shake, but by the way, these are good

10:54

protein shakes you can check out." Like

10:56

it it it kind of like hurts the trust

10:59

that people have in your platform and

11:00

your product. And so, um, that's another

11:04

reason why, if you think about it, like

11:06

like what Meta or like I think some

11:08

other companies in the past have tried

11:10

to put ads inside, um, messaging apps

11:14

and emails and it's never really worked

11:16

out. Um, it it works out in China

11:20

in WeChat because there's no other way

11:24

for them to fund the whole thing, you

11:27

know. So the the the whole economy and

11:30

and and user sentime user behavior has

11:33

been optimized around gamifying. It's

11:36

not how things work in America. So I I'm

11:39

I'm I'm bearish on advertising to really

11:43

take off in in the chat interface. I I'm

11:45

happy to be proven wrong there, but I'm

11:47

bearish on that.

11:48

>> There there are two areas that I want to

11:49

unpack there. The first and just taking

11:51

them kind of chronologically and how you

11:52

said them, money's in the frontier. The

11:55

more I hear this kind of the more I

11:57

question it because I think that we

11:59

dramatically overestimate

12:02

how important frontier models are to do

12:04

quite basic work.

12:05

>> Yeah. So frontier doesn't mean a

12:08

frontier model. Frontier just means

12:11

whatever is the frontier outcome you can

12:13

have right now with AI.

12:15

The Greg Brockman recently tweeted the

12:18

model is no longer the product, right?

12:20

Um, and it's funny because you you know

12:24

that as as a leader of a frontier lab,

12:27

he has all incentive to say the model is

12:29

the product and and that's what Google

12:31

people tell. I think one of the Google

12:33

people keeps tweeting that model is the

12:35

product. I forgot who. Um,

12:38

and so the reason Greg is right is

12:41

because um, if you take codeex or

12:45

perplexity computer clot code, what is

12:47

that? It's it's an orchestration system,

12:50

right? It it takes a model, pairs it

12:53

with an agent harness.

12:56

And what is an agent harness? Think of

12:58

it the simplest way of describing it is

12:59

like rules for how the agent loop should

13:02

run. What are all the skills and sub

13:05

agents and connectors and tools it

13:07

accesses? And uh without the harness you

13:11

don't necessarily capture and convert

13:15

the intrinsic intelligence in the model

13:18

into valuable output tokens.

13:21

The output tokens if you're if you're

13:23

literally just a reseller of model

13:26

tokens you have no business because the

13:28

model will get commoditized. So even if

13:30

you're a model builder, you don't have a

13:31

business. As an infra layer, you have

13:34

some business on serving those output

13:35

tokens. But as an application layer or a

13:38

model builder, you don't really have a

13:39

business. If you're just a reseller of

13:40

tokens that come directly out of the

13:42

model, you have business if you know how

13:46

to take the model ground it in valuable

13:50

context, orchestrated with a really good

13:53

agent harness

13:55

um connected to the right set of tools

13:56

and connectors whether it's personal

13:58

connectors or business connectors and

14:01

provide the experience to people in one

14:04

single unified system.

14:07

And uh the way we differentiate

14:09

ourselves at perplexity is we don't just

14:12

orchestrate across tools and files and

14:13

connectors. We also orchestrate across

14:17

models.

14:18

That is the differentiation that

14:20

anthropic and open AAI cannot claim

14:22

because you wouldn't find GPT5I

14:27

inside the cloud code harness. You

14:29

wouldn't find claw opus 47 or 8 inside

14:32

the codeex harness. These are competing

14:35

with each other, right? Whereas you

14:38

would find both these models inside

14:40

perplexity computer and that way we can

14:45

bring down the we can increase the token

14:47

value per watt per user. If you assume

14:52

that

14:53

if you assume that whatever decides the

14:55

dollar like the price the dollars is the

14:58

power watts fundamentally that's that's

15:00

the thing that nobody else can subsidize

15:03

other than the government. Um you know

15:06

that whoever pro you know provides the

15:09

most valuable output tokens with the

15:11

least amount of power expended to

15:14

produce them generates the greatest

15:17

value to the end user and has the most

15:20

pricing power has the most value and so

15:23

that that that is the orchestration

15:24

problem to solve who the one single the

15:27

most important metric in AI is token

15:30

value per what per user. What does it

15:33

mean for the value of open AI and

15:35

anthropic? If model is not the product

15:38

and it becomes a utility, something you

15:40

can switch into and switch out

15:41

>> interface.

15:43

Everyone thinks we're all building the

15:45

model layer or the race. We're not

15:47

actually. Um, I would even argue that

15:50

building models is a way to stay at the

15:53

frontier, but you have to own an

15:56

interface

15:57

in which valuable AI output tokens are

16:02

generated, the most valuable tokens. It

16:05

doesn't have to be the product. This is

16:07

the single most important thing to like

16:09

you know unlearn for most founders and I

16:12

had to do it too which is to be

16:14

successful in AI product layer whether

16:18

you're a model builder or not it's not

16:21

about building something that gets a

16:23

billion users that mentality has to

16:26

completely shift

16:28

there are a few power users who are

16:30

propelling this token economy right now

16:34

if you look at like all these um crazy

16:36

stories of how there's this one engineer

16:38

who got Amazon to spend like half a

16:40

billion dollars in a month because of

16:42

some stupid way they set up like agent

16:44

loop inside cloud code. Okay, maybe

16:46

that's a mistake, but there are real

16:47

engineers in meta in in other companies

16:50

spending like 10 million a year per

16:52

engineer on on on these, you know,

16:55

coding tools. There are users in

16:58

Perplexity Computer. Um, there's one

17:00

user, I think, who spends upwards of

17:02

like $10,000 a month, something like

17:05

that. Crazy. And and and not like

17:08

wasting it. They're not wasting money.

17:11

Their business runs using agent loops

17:15

that are running inside these harnesses.

17:17

And they use these products in

17:19

sophisticated ways that I I I couldn't

17:22

even conceive when we were building the

17:24

product ourselves. Even internally

17:26

inside our own company, there are some

17:29

people who have set up these kind of

17:30

like multi- aent hierarchy and agent

17:33

loops that looks like its own software

17:35

architecture.

17:37

And I often just ask these guys to come

17:39

explain to the rest of the company, hey,

17:41

like what are you doing with these

17:42

tools? Like you clearly are consuming it

17:44

way over, you know, what we thought the

17:47

average person in the company would do.

17:49

And the single biggest differentiation

17:51

between those who use agents a lot and

17:53

those who don't is whether they run

17:56

repetitive cron jobs

17:59

like whether you use AIS as one-off

18:02

tasks. You just delegate a task and then

18:04

it gets done. That's like kind of using

18:07

it for like deep research or like

18:08

whatever, right? Like one single task

18:10

versus the AI is like continuously

18:11

monitoring something for you. The AI is

18:14

continuously like triggering based on

18:16

certain events and going and doing

18:18

certain things, giving you alerts like

18:22

you set up workflows that keep running

18:23

for all the time. Every time you get an

18:26

inbound email like it triages or every

18:28

time there's a latency spike, it has to

18:31

identify which part of the codebase

18:34

caused that, it has to go and do the

18:36

root cause analysis and then identify

18:39

the right engineer. all these things the

18:42

this is where the frontier is and and so

18:44

uh going back to my main point.

18:47

These products are not going to be used

18:49

by uh you know 100 million people but

18:53

they will generate revenue that's going

18:56

to be higher than the advertising

18:57

revenue of Google or Meta. It's going to

19:00

happen.

19:00

>> Completely understand what you say

19:02

there. I do just want to focus in on a

19:04

specific element there when you were

19:06

saying like the power users because I

19:07

think one of the core numbers is

19:09

actually Mark Benov said they spend 300

19:11

million on anthropic which works out to

19:12

be about

19:13

>> it'll be interesting to know from him if

19:16

that 300 million came from you know what

19:18

is the distribution across employees

19:21

>> so it works out to be that was on

19:22

developers within Salesforce so it's

19:24

about 3.8% 8% of developer salaries.

19:26

What percent of developer salaries do

19:28

you think will be spent on tokens in 24

19:32

months time? Cuz that fundamentally

19:35

changes the value of open air and

19:36

anthropic. If it stays at 3.8%.

19:39

They will not be $5 trillion companies.

19:41

But if it's 100% like Brandon at Mccor

19:44

said it will be in a year, they they'll

19:46

be 10 trillion companies. Well, um I

19:49

think they can certainly beat $10

19:50

million companies whether it's going to

19:52

be um a full percent of the developer

19:55

payroll today or not because there's a

19:57

lot of non-developer

20:00

work that'll also be done with a agents.

20:04

Um and that's actually what we focus on

20:06

for Perplexity Computer. We're not going

20:08

after the developer market. We're going

20:10

after anything that developers don't

20:13

non-developers do. basically um your

20:16

your finance department or your corp dev

20:18

or your like um sales reps or your data

20:21

science teams um your your research

20:24

analysts I think that's actually even

20:27

bigger market that it's it's it's not

20:30

even like like think of it as like clot

20:33

code multiplied by 10 that that's the

20:35

size of that market

20:36

>> if I push you on developer salary spend

20:39

what percent of token spend as a portion

20:42

of salary do you think We'll see in 24

20:44

months.

20:45

>> It's hard to say.

20:47

I think the costs are going to go down.

20:49

That's why it's hard to say.

20:51

>> You think the cost will go down? Cuz

20:52

this is the kind of the challenge that

20:53

we've had. We thought when we went from

20:54

chat to agent that costs would go down

20:56

and token costs would go down. They've

20:58

gone up.

20:59

>> Yeah. For now.

21:00

>> Help me understand that and how that

21:02

changes.

21:03

>> I think in software um you kind you kind

21:06

of want to pay for the frontier. Um it's

21:10

kind of like if you know some engineer

21:12

is awesome. If you know you have like

21:14

the next Jeff Dean,

21:16

would you rather hire that person and

21:18

not hire five five people who are medium

21:21

engineers but not Jeff Dean level with

21:24

the same amount of budget you have? Yes.

21:26

Right. Let's say you had a million

21:28

dollars. You could hire five people

21:31

worth 200K or you could hire one Jeff

21:33

and pay them a million. What would you

21:35

do?

21:36

>> One Jeff.

21:37

>> Yeah. So I think you would pay for the

21:39

frontier. Um but what stays frontier

21:43

keeps changing. Um in in 12 months from

21:46

now let's say thought experiment there

21:49

is an open source model as good as Opus

21:51

48.

21:51

>> Mhm.

21:52

>> And um you still have to pay for

21:54

inference. You know nothing is truly

21:57

free but it's going to be like let's say

22:00

10 times cheaper than Opus 48. And when

22:02

you pair it with the right agent

22:04

harness,

22:06

you know, and all the connectors,

22:07

GitHub, everything, all your developer

22:10

workflows work fine, why would you um

22:13

assume that the token spend is going to

22:15

be still high? It's not going to be for

22:18

the same things you're doing today. It's

22:19

not going to be. But there might be a

22:21

different set of things you might do

22:24

with the frontier that you're not

22:26

conceiving today. Uh my prediction would

22:28

be soft a agents that are like

22:32

completely autonomous software

22:33

engineers. Today I think we we're all

22:36

using tools like cloud code or codeex to

22:40

write code but not as literal software

22:43

engineers. There is a large sway of

22:44

people that is now bearish on your

22:46

frontier models who open eyes and your

22:48

anthropics because they're realizing

22:49

that you can actually do a lot with open

22:51

models for a fraction of the price. What

22:53

you're saying is actually that is true

22:57

but

22:58

>> we will still pay for the frontier and

22:59

so they will still ac

23:02

and and I think this distinction

23:06

it feels like a contradiction. It's not

23:08

though. It feels like two things cannot

23:12

be true simultaneously.

23:14

But that that's not quite the case. In

23:16

fact, I would argue that the frontier is

23:21

increasingly going to be a thing that um

23:24

very few individuals might even want.

23:26

Like you could argue that after a point

23:28

like it's not even interesting that AI

23:30

can write software. You you we've

23:31

normalized it, right? Let's say let's

23:33

say that that's going to be the case.

23:35

Instead of companies being built with

23:36

like tens of thousands of software

23:38

engineers unlike the past, there'll be a

23:40

lot more companies with smaller software

23:43

teams and each of us will be using a lot

23:45

of AIS. So um that's actually good for

23:48

the world. We'll be seeing a lot of

23:50

different businesses. We'll be seeing

23:52

allocation of software labor in places

23:56

that was never even possible. and and uh

24:00

whatever is a frontier is going to be

24:02

things that kind of like AI is going and

24:05

designing chips, AI is designing drugs,

24:08

AI is figuring out how to build robots,

24:11

AI is figuring out how to cure cancer.

24:14

These are applications where you don't

24:17

have like 10 million users. It's like a

24:20

few companies, but the effect of that

24:24

work will touch a lot of human lives. I

24:27

think to me that that's where the

24:28

frontier is headed. Um you could also

24:31

see that from the moves that Frontier

24:33

Labs are making. Anthropic bought um a

24:38

wet lab could be for the talent could be

24:40

for the infrastructure to run like wet

24:42

lab experiments but imagine taking all

24:45

those tokens and putting it in the mid

24:47

training instead of just tokens from

24:49

GitHub right um so then that's going to

24:52

produce something interesting.

24:54

>> Don't laugh. Is there an asimtope to

24:56

frontier problems to be solved? I know

24:58

that sounds ridiculous, but if you are

25:00

continuously on the chase for the next

25:01

frontier problem, you get to cancer, you

25:04

get to climate change. And my word, I

25:06

hope they solve both in like heaven,

25:08

that's a huge amount to solve.

25:10

But if you're on the treadmill of

25:12

continuously, is there an asmtope to

25:14

that? Do you see?

25:15

>> There's no there's no mathematical

25:17

argument

25:18

to there being a cap on the amount of

25:23

economic value

25:25

one can create with with with um AGI or

25:30

ASI like systems. Um and Elon Elon has a

25:34

good argument for this like like where

25:36

he says money loses all meaning in a

25:39

post AGI economy

25:42

because you'll be producing

25:45

an abundance of energy and labor

25:48

and fundamentally the economy is

25:49

grounded to energy and labor. If you can

25:51

produce an abundance of them,

25:54

well what what meaning does money have?

25:57

Um and um and so I I don't think we run

26:01

out of things to solve at the frontier.

26:03

I think we're always going to be creat

26:05

like like why why would why did people

26:06

even want to understand the universe?

26:08

Like like why did we want to understand

26:10

subatomic particles, quantum physics,

26:13

black hole theory, um you know the

26:16

origins of the universe like what what

26:18

what is the purpose? But we still went

26:21

ahead and did it because that's kind of

26:23

what the purpose of humanity has always

26:26

been to understand the unknown. You

26:29

know, David Deutsch is famous for saying

26:30

this, right? Like we are the only

26:32

species capable of being curious about

26:35

what is already familiar. Like you can

26:37

stare at a fruit and you know that it's

26:39

a mango and like you know exactly like

26:42

how it tastes, you know how it looks,

26:43

you know the shape, you know what

26:45

seasons it grows and and and stuff, but

26:47

you can still look at it and ask one

26:49

more question about it that you haven't

26:52

asked before. Other animal species

26:54

cannot. Once they kind of once they have

26:57

it in their mental model, what it looks

26:58

like and touches and feels like, they're

27:00

going to ignore it. It's not it's no

27:01

longer interesting to them. Kashi, you

27:03

mentioned about agent usage and you said

27:06

if you do repetitive tasks versus one

27:08

off say chron jobs, you know, I think

27:10

Sam said it's we're going to have 24/7

27:12

AI and um yeah, they've talked about a

27:15

hardware product that's going to come

27:16

out.

27:17

>> Do you think we will have continuous

27:18

agents running?

27:20

>> Yeah. In so

27:21

>> and I think that's kind of why I believe

27:25

>> the orchestration problem I I talked

27:27

about maximizing the token value.

27:30

>> Can you just help me? Sorry. when you

27:31

say the orchestration problem.

27:32

>> Yeah. So, so okay. So there are like

27:34

four objectives

27:36

um accuracy, intelligence and accuracy

27:39

and then privacy and cost.

27:42

You know these are all competing with

27:44

each other. So you can you could argue

27:45

that um you could max out on

27:48

intelligence and accuracy by building

27:50

giant giant data centers and spending a

27:52

lot of power to uh you know run them and

27:56

u you could miss out on privacy and

27:59

costs cuz everything will be centralized

28:02

and and and you're going to be paying a

28:03

lot. Um, you could argue that everything

28:06

can run locally and so that'll be good

28:09

for privacy and cost but may not be

28:12

frontier intelligence, may not be

28:14

frontier accuracy. So the solution is to

28:18

figure out a sweet spot. You know, use

28:21

local models when necessary, use server

28:23

side models when necessary and

28:25

orchestrate across local models and

28:27

serverside models. Uh, grounded in

28:30

valuable personal context. Sometimes the

28:32

intelligence might already be there but

28:34

the system might not work that because

28:36

the harness isn't grounded in the right

28:38

set of tools right so build a worldass

28:42

harness that can even make an okayish

28:44

model appear great and be able to use

28:47

the right model for the right task and

28:49

the right part of the task sub aents and

28:52

and even like utilize the compute we all

28:55

have in our own devices all that that

28:57

you know doesn't need to be always on a

28:59

server that is an orchestration from a

29:01

router, an awesome router, a master

29:03

orchestrator router. Now, um, if you do

29:08

that, you can realize the vision of a

29:10

24/7 AI without people freaking out

29:14

about going bankrupt

29:16

because no one's going to be able to

29:18

afford a 24/7 AI, Frontier AI, running

29:21

on the server. Imagine you turned it on

29:24

and you you could never switch it off

29:26

unless something crazy happened. Um,

29:30

you're the the the the the thing that

29:31

most people worry about those AI is

29:33

like, "Oh, what if it does something

29:35

crazy?" But the real concern actually is

29:37

the cost. Um nobody's going to be able

29:39

to afford it a cron job at the fidelity

29:43

of few seconds, you know, um that that

29:46

runs all the time. And so, um the

29:49

bottleneck there is actually

29:51

orchestration and local compute. And so

29:54

I believe like like um one needs to

29:57

build a continuously learning

30:00

local model um that can save you on like

30:05

compaction

30:07

context windows. So you and and and try

30:10

to preserve as much compute locally and

30:13

rely on the server side frontier only

30:15

when necessary and keeps learning, keeps

30:17

adapting, keeps evolving. And that model

30:20

um is not just a model. It's a model

30:22

plus the harness plus the local chip and

30:25

the compute and the ecosystem of devices

30:27

it controls. That system um is going to

30:30

be your own intelligence. Essentially

30:32

the data center moved to your local

30:34

device and you you get to control it.

30:37

You get to own it. You don't get to

30:38

worry about somebody like you know

30:40

spying on you or looking at all your

30:42

tokens. Very valuable personal tokens.

30:45

Imagine you have like very sensitive

30:46

deal materials. Let's say you're doing a

30:48

deal um and then um a Frontier Lab has

30:52

all your tokens that you use to like

30:54

write a memo.

30:56

Imagine somebody could hack into that

30:57

server and steal your deal from you. You

31:00

wouldn't want that, right?

31:02

>> I'm going to be honest. I think there's

31:03

much more valuable things for people to

31:05

steal from

31:07

London based VC.

31:09

>> But yes, I can figure you're not just

31:10

yet another London VC. You have like a

31:12

$400 million fund last time read it. So

31:15

>> imagine like you know you're

31:17

>> already making your moves for the $4

31:18

billion fund right so so everyone has

31:22

certain levels of like you know

31:23

sensitive stuff and and so I think

31:25

that's where I believe that the 247

31:28

always on agent is going to be realized

31:32

by the company that wants to play the

31:34

role of the orchestrator not the model

31:37

builder not the frontier model builder

31:39

but the orchestrator and uh and I think

31:42

that's what that's what we want to do um

31:44

computers has been positioned explicitly

31:46

as the agent orchestrator. The the

31:49

musicians in the orchestra are these sub

31:52

agents that utilize these different

31:55

models. Think of them as the instruments

31:59

and uh the tools, the connectors, the

32:00

models. These are all the instruments

32:02

and the musicians are the sub aents and

32:05

the symphony is the work and the system

32:08

is the orchestra and and computer is the

32:11

orchestra conductor. that that that's

32:12

how it's been positioned. So what it

32:15

orchestrates

32:17

keeps evolving, right? It it changes. It

32:20

changes from,

32:22

you know, models to files to tools to

32:24

chips to devices. But but it doesn't

32:26

even matter like you don't care as long

32:28

as it orchestrates things correctly and

32:30

and and and maximizes the token value

32:33

for what per user. If you can solve this

32:36

problem, you will capture the most

32:39

economic value in AI long term.

32:41

Shortterm it might look like oh like

32:43

this other lab's revenue is growing you

32:45

know exponentially this that but long

32:47

term this is the one objective that

32:49

truly matters.

32:50

>> Who is best positioned to do that?

32:52

>> I believe it's us cuz you have the

32:54

incentive of not token maxing you have

32:56

the incentive of delivering the most

32:59

value to the user like we we every time

33:02

any part of the AI stack improves

33:06

our product improves. Um since the

33:09

beginning of the year, Anthropics models

33:10

have made tremendous progress. But

33:13

what's also true is that our revenue has

33:16

more than tripled since the beginning of

33:17

the year. Tripled since this beginning

33:19

of the year and uh we and a lot of

33:23

thanks to model progress made by

33:25

anthropic and we also brought our

33:27

burndown thanks to OpenAI competing with

33:30

them and bringing down the cost of the

33:33

same capability. And now with progress

33:36

in open source and local models and

33:37

local chips, we're going to move some of

33:39

the inference back to the local devices

33:41

and bring down the cost even more. So

33:45

every time any part of the AI stack,

33:47

whether it's chips, models, harnesses,

33:50

any of these gets better, our system

33:53

improves tremendously. And if our system

33:55

improves tremendously, our users love it

33:56

and they pay more. They spend more and

33:59

so our business grows. So I think to

34:02

your question of who's best positioned

34:04

to win in that world for that objective

34:07

of being an orchestrator is the one

34:10

whose product or business

34:13

benefits from other people's progress at

34:16

any layer of the stack. And so if Jensen

34:20

produces a better chip, it's great for

34:21

us. If Dario produces a better model,

34:23

it's great for us. If Apple produces a

34:26

better device, it's great for us. And

34:28

like I I love the fact that we are able

34:30

to be a very positive player at every

34:34

layer of the stack and not have to rely

34:38

on any one person to win. When we look

34:40

at the different providers that we said

34:44

kind of server side versus you on device

34:46

when we look at server side a lot of

34:49

people talk about an AI infrastructure

34:50

bubble which I think is funny stupid and

34:52

moronic. To what extent do we have a

34:54

data center supply problem today from

34:56

what you see? I think the biggest

34:58

problem is actually in power. So what

35:01

let's break down what is a data center.

35:03

Is it like that you just buy like a

35:05

bunch of chips from Dell or Super Micro

35:08

and No, that that's just one part of it.

35:11

You actually have to go secure land or

35:15

you have to lease something, lease a

35:16

property and uh you have to buy a bunch

35:19

of turbines to generate power or you

35:23

have to work with like power suppliers,

35:26

grid suppliers and uh you also have to

35:29

work on cooling. So there's a lot of

35:32

other work you got to put in that is far

35:34

far slower. you have to get permits to

35:36

do all these things and uh and so

35:39

usually what's happening is um there's a

35:42

lot of lead time to doing this and um

35:46

the models that are um already in use

35:49

today these have been trained in the

35:50

hopper generation so the blackwell

35:53

generation model I think the first model

35:55

that's blackwell generation category is

35:58

um mitos

36:01

and it's already scary like people are

36:02

already like freaking out about it So

36:05

imagine that um everyone pre-trains a

36:08

model um on like a million or like you

36:11

know hundreds of thousands of black

36:12

wells now those models are going to be

36:14

far more powerful than what exists today

36:16

and then the ver rubins are coming next

36:19

year in full capacity like like all the

36:21

data centers of wear rubons will be in

36:23

next uh you know used next year that

36:26

model will be even more powerful. So I

36:28

think we there is a certain physical

36:31

buildout time

36:33

that always bottlenecks frontier

36:37

capabilities. That's why there's a value

36:39

in that layer. Whoever knows how to do

36:42

this puts it puts together a bunch of

36:45

GPUs and chips and networking and power

36:47

and cooling and actually like

36:50

orchestrating all this software layer on

36:52

top and you know is able to convert that

36:55

into frontier output tokens. that that

36:57

that vertical integration has a lot of

36:59

value. So that's why the markets are

37:01

pricing infrastructure companies with a

37:04

higher uh PE ratio

37:07

>> than companies like Meta for example.

37:09

Even though Meta builds a lot of infra

37:11

is valued as a software company. When we

37:14

see like you know Meta's capex spend and

37:16

it wanting to increase in the last few

37:18

days and thinking about raising more and

37:19

more money to increase capex spend I get

37:21

it with a lot of the AI providers like

37:23

your open eyes or Anthropase because

37:25

they are making money from their AI

37:27

products. For Meta, the capex spend

37:29

correlates to increasing accuracy on

37:31

ads, which is like a six to eight% bump

37:34

in revenue. I get it. But for the capex

37:36

spend, it doesn't make sense.

37:38

>> Well, um I I I believe like they they

37:41

are understanding

37:43

what the market's saying. You know, I

37:44

don't think they're

37:47

dumb to not see what what what's being

37:50

said. I think they're introducing a lot

37:51

of subscription products um from what

37:54

I'm reading. So they're definitely going

37:55

to like basically the company needs to

37:58

not just be a social platform maximizing

38:00

engagement and turning that into ad

38:02

revenue, right? And I think um that

38:05

requires them to launch a lot of like

38:07

agents subscription based products and

38:10

maybe even a cloud meta cloud that that

38:14

rents out servers like what Elon's doing

38:15

at SpaceX and and maybe once they do

38:18

that

38:20

the the narrative might change, right?

38:23

But um to go back to my point, it might

38:26

not be inconceivable that

38:29

38:31

Micron, the supplier of HPMs, might be

38:35

more valuable than Meta in the next 6 to

38:37

12 months. It's already at like a

38:39

trillion and Meta is like 1.3 to 1.4

38:43

trillion.

38:44

>> Can you help me understand that? Because

38:46

memory is already a massive bottleneck.

38:48

It's increased 5x in price in terms of

38:50

the cogs, right? Um, but people are

38:53

going, "Wow, Micron is fully priced at

38:55

this point." Why is it not fully priced?

38:59

>> Because it's still the bottleneck.

39:01

Whatever is the bottleneck

39:04

will command the price.

39:07

Um, AMD is doing really well because

39:10

CPUs became a bottleneck again. Agent

39:13

loops, agent harnesses are all running

39:16

on CPUs. The tokens are produced by the

39:18

frontier models on GPUs. But whatever

39:21

work like let's say like claude

39:24

generates a coding script that decides

39:26

to download 500 files from different

39:28

websites and then you know munches a lot

39:31

of data and transforms it in certain

39:32

ways and generates a plot and then hosts

39:34

it on a website that you can you can

39:36

share with your people. All that

39:38

computers is running on CPUs.

39:41

Agents are using CPUs more than humans,

39:44

right? And so suddenly there's a rise in

39:47

enterprise CPUs and the beneficiaries of

39:50

these are like Intel and AMD. So then

39:54

they get to be the bottleneck like like

39:55

whoever's going to be the bottleneck

39:57

will win and and so infra is the

39:59

bottleneck right now because there's a

40:01

lot of demand and we just don't have the

40:03

supply and so whoever supplies memory

40:06

SSDs for storage CPU compute suddenly

40:10

these are all like interesting like um

40:13

they're more important than companies

40:15

that are just building data centers and

40:18

not knowing how to turn that into a

40:19

valuable outputs. Do you believe your

40:21

Nebius and your Core Weaves will be a

40:23

sustainable

40:26

multiund billion company in the future

40:28

or is it solving a short-term supply

40:30

problem?

40:31

>> Um I certainly think they can be

40:33

sustainable.

40:34

>> Yeah.

40:34

>> Um I think there are some I don't like

40:38

look I don't know particularly which of

40:40

those is going to win and there's also

40:42

other players like Cruso and um Firebird

40:46

and a bunch of companies. It's all about

40:49

being resourceful. You got to take power

40:51

from areas where there's a lot of

40:53

natural resources

40:55

and the cost to bring up the data center

40:57

is pretty cheap and the time to bring up

40:59

the data center is cheap and your

41:01

service is reliable. Like if somebody

41:03

commits to buying 100,000 GPUs from you,

41:06

um the service should be pretty good. Um

41:08

and uh you should be able to secure the

41:12

supply ahead of time. Plan well. Um and

41:15

I think some companies are even

41:16

innovating at the power layer. You know,

41:19

um generating their own power is one way

41:22

to bring down the margins. Uh and so I

41:26

think there's certainly like value in

41:28

that layer because um it's hard to

41:31

replicate work. That's how I see it. You

41:34

could argue that OpenAI can do all the

41:37

work that Core V was doing and that's

41:40

kind of what they wanted to do with

41:41

Stargate. But why is Corev more

41:44

successful at building data centers than

41:45

OpenAI? It's

41:46

>> hard to do. It's operationally

41:48

intensive.

41:49

>> Yeah, operationally intensive. You got

41:50

to focus. You got to like spend most of

41:52

your time um securing permits like

41:55

figuring out power, figuring out like

41:57

bottlenecks in the supply chain here and

41:59

there um and constantly plan ahead and

42:03

like test all these systems carefully.

42:06

deal with like random physical issues

42:08

that you know arise in like you know

42:11

running a data center there's something

42:13

called TCO you know cost of operations

42:17

>> you got to factor that in so uh that

42:19

said I I I don't think there's value um

42:22

if you're just like a server renter if

42:26

you're just a GPU server rack renter if

42:29

you're just leasing it to different

42:30

companies on certain hourly pricing

42:32

rates there's not a lot of value you

42:35

have to actually build some software on

42:39

top kind of like how AWS did. It's

42:42

called Amazon Web Services, not Amazon

42:44

servers, right? So, um you have to have

42:47

some software orchestration on top that

42:50

allows you to get software margins on

42:52

top of what you're doing. And I think

42:56

that's why you're seeing moves like NBS

42:59

43:00

like like going for the AI model

43:02

inference like you know taking open-

43:05

source models or hosting your models and

43:08

and and um that's a business model of

43:11

certain other companies like fireworks

43:12

and you know um ben and all that but you

43:15

could imagine neocloud just going for

43:17

that business.

43:18

>> That was exacting me my question. So, I

43:19

just had the co-founder of Nebius on the

43:21

show and the really clear takeaway was

43:23

the the challenge that he has, which is

43:24

there's a huge amount of money that

43:26

wants just capacity and compute.

43:28

>> Yeah.

43:28

>> With the awareness that he needs to

43:29

build a full stack product if he wants

43:31

to have a long-term sustainable

43:32

business. That was the core realization

43:34

for me. When I look at the inference

43:36

layer, like you said, fireworks or base

43:38

10, how do you think that plays out? Do

43:40

we have standalone hundred billion

43:42

dollar companies in inference alone or

43:44

do we see that commodity?

43:47

I mean you it's just it's all about

43:48

working backwards like what does it take

43:50

to build a hundred billion company

43:52

assume like

43:54

>> 10 billion in revenue

43:55

>> exactly 10 billion in revenue 30 to 40%

43:59

gross margins good amount of net income

44:01

good cash flow okay 10 billion in

44:03

revenue

44:05

um is not that inconceivable for for a

44:08

company that can both do AI hosted

44:11

inference and server capacity and data

44:14

center buildouts

44:16

very operationally well. It's all about

44:18

like you know there there are some

44:20

factors beyond their control like open

44:22

source models continuing to be awesome.

44:24

If open source models stop to actually

44:26

be good where the gap between them and

44:28

the frontier is like more than 12 months

44:30

or like 15 months 18 months then I don't

44:33

think these companies really have a

44:34

business model because um they're not

44:37

going to be able to host they're only

44:39

going to be able to

44:41

rent capacity to open Athropic and so um

44:46

that's exactly what Roman and Nebia said

44:48

he said if consolidation happens and

44:50

there's anthropic and open AAI or two or

44:52

three dominant providers that is the

44:54

biggest threat.

44:54

>> That's correct. Yeah. And so um but you

44:58

got to make a leap of faith assumption

45:00

that you know like the the models from

45:02

China or Nvidia is making good progress

45:04

on their models in Limatron. Um you got

45:07

so there's going to be enough factors in

45:09

the market to keep u consolidation

45:14

as an outcome from from like stopping

45:17

from happening. But you don't control

45:18

your own destiny if you're those

45:20

companies. That that that's basically

45:21

the problem. Totally get that. Okay. So,

45:24

we can have standalone companies that

45:26

are hundred billion dollars in inference

45:27

alone. So, I'm just pillaging you for

45:30

your knowledge. When we look at the

45:31

model selection companies like an open

45:33

router or like factory AI just released

45:36

that kind of model selection or model

45:37

rooting product which did very well on

45:40

launch, is there hundred billion dollar

45:42

companies in the model selection and

45:43

routting business?

45:45

>> Probably not.

45:47

Um I think you can't just be a

45:51

provider of router. You have to use the

45:53

router to produce something meaningful.

45:56

Um actually most of the business value

46:00

of open router is less than the router

46:02

even though the product is called open

46:03

router. It's not routing across models

46:06

there. It's actually just routing across

46:09

different endpoints of the same model.

46:12

So um why okay so maybe let let's let's

46:15

let's ask this question. If you wanted

46:17

to use claude opus or

46:22

um I don't know like GBD55 developer why

46:26

would you not want to just use it with

46:27

your own API key versus using it inside

46:30

open router? Number one argument. The

46:34

single simplest argument as to why you

46:35

would want to do that is

46:38

model fallbacks. Sometimes your API keys

46:40

might not have the rate limits or even

46:43

if you have the rate limits it might

46:45

there might be an error on open AI

46:47

servers that you know don't guarantee

46:49

you the response time you need to run

46:51

your application and uh open router

46:55

would go and earn the uh you know they

46:58

would pay for capacity for like one year

47:00

ahead uh with the funding they have and

47:03

secure the rate limits and multiple

47:05

endpoints across multiple different

47:07

providers of openi models be it Bedrock

47:10

or Azure or OpenAI themselves. And so

47:13

that routing is valuable. It's

47:15

essentially an infra problem they're

47:18

solving which is reliable token supply.

47:21

It's not actually oh like they're

47:24

lowering the cost by deciding if this

47:26

prompt should go to like GPT or clot or

47:28

something like that. That's not what

47:29

they're actually selling to the

47:30

developer. That's not actually the

47:32

business model. And uh and then for a

47:36

lot of these Chinese open source models,

47:38

there's not you probably don't want um

47:41

your API tokens from going to like let's

47:43

say you don't want your API tokens going

47:44

to China. Um and so you and and let's

47:49

say you don't have the bandwidth to work

47:51

with like different inference providers

47:53

or verify who's good and you know who's

47:55

not. You're just thrusting open router

47:57

to take care of all that and then you

47:59

know they're going to like supply the

48:00

tokens to you. So it it's it's routing

48:03

not at the level of like oh like

48:05

deciding which model is cheap for what

48:07

task. It's more like um a reliable token

48:10

supply and I think there's some value in

48:13

that layer definitely uh otherwise they

48:15

wouldn't have these many um users and

48:17

these many trillions of tokens being

48:19

routed a month but it's um it's not like

48:23

you know high gross margins business

48:26

it's the way the business model works

48:27

for them is actually um they would

48:30

secure a discount from the model

48:32

providers by guaranteeing a lot of

48:34

supply but they would still charge the

48:37

user their listing price on the API and

48:40

that difference is their margins. Do you

48:43

understand?

48:43

>> I I I totally get you. We spoke about

48:45

bottlenecks and you said about HPM, high

48:47

performance memory and micron and the

48:48

value that you know they have to say and

48:51

what it can be. What bottleneck will we

48:54

have in 3 years that we're not

48:55

discussing today?

48:57

>> I think power will remain the

48:59

bottleneck. Yeah,

49:00

>> I think it's it feels like that to me.

49:03

Unless something dramatically changes in

49:05

the way data center buildouts happen. Uh

49:09

I actually believe that there will be a

49:11

lot of resistance to building data

49:12

centers. It's because people incorrectly

49:15

think that data centers consume a lot of

49:17

water or eat up a lot of power which is

49:21

both both are untrue. Satya even made

49:24

the statement that it it's like a can of

49:26

water or something uh in terms of how

49:28

efficient these companies are.

49:30

>> Do you think that's why they're putting

49:32

up resistance to them? I don't. I think

49:33

it's cuz it's a symbol of job losses. Uh

49:36

increasing wealth in

49:37

>> it's a lot of things. It's a lot of

49:39

things. Um it's a lot of apprehensions

49:43

um fear about like what's going to

49:45

happen channelizing in so many different

49:48

ways. Um, sometimes it's channelizing

49:50

through hatred for wealth inequality and

49:52

like wanting to tax people. Sometimes

49:54

it's channeling through like concerns

49:56

for the environment and like climate

49:57

change. Um, sometimes it's uh

50:00

channelizing in a way where um you're

50:03

all like oh like the price of the grid

50:05

is going up because you guys are

50:07

building all these data centers and then

50:08

or like I'm paying more for my phones

50:10

and laptops now because the RAM prices

50:13

have gone up because you guys went and

50:14

bought all of it. So I think there's a

50:17

lot of different ways in which it's

50:18

getting channelized but the common

50:20

sentiment is um like like a pretty bad

50:24

sentiment about AI.

50:25

>> Do you think it will be meaningful

50:28

to the development of those data

50:29

centers? I think right now 40 out of 100

50:32

are not being developed because of

50:34

public resistance.

50:35

>> Yeah. So um that that's where the power

50:38

bottleneck is and um you could see maybe

50:42

certain countries sees the opportunity

50:44

for this and um

50:47

um allow these model builders to go

50:50

build data centers there. Um Elon's

50:54

going to space to do that. Um so that's

50:56

going to be an interesting experiment.

50:58

Um because there's a lot of energy from

51:01

the sun that can be harnessed there. and

51:05

uh there's a lot of natural resources in

51:08

other countries. Regulations might be

51:10

more friendly.

51:12

So, we're still going to see data center

51:13

buildouts. It might not happen in the

51:16

US. And um but but the fact that you

51:20

have to solve physical problems like you

51:24

actually have to deal with the supply

51:25

chain, the permits, securing power, like

51:28

making sure like things work and getting

51:31

the lead times lower and lower. You're

51:34

not solving problems like cloning some

51:37

SAS apps here, right? You're or like

51:39

you're building a go to market team or

51:41

like um doing better marketing against

51:44

the competitor's products. Like yes,

51:46

those are also hard problems, but these

51:48

are like much harder problems where like

51:51

you're not in full control of your

51:53

destiny and you need a lot of capital

51:54

and connections and like the right

51:56

people uh sometimes even like political

51:59

help to unlock progress. And so that's

52:03

why this will continue to remain the

52:04

bottleneck in my opinion. And there's a

52:07

lot of risk as well because if you do

52:11

encounter another Deep Seek moment here

52:14

where there's a vastly more efficient

52:16

model that's been built with a very

52:18

different vertically integrated

52:20

architecture

52:21

and you built out all this capacity and

52:24

you're like, damn, that's I overbuilt.

52:27

there's something far more efficient

52:28

that can run on on on people's local

52:30

devices, their MacBooks, their Windows

52:32

PCs.

52:34

Yeah. Like you're probably freaking out

52:36

then. And so you hope that

52:38

>> How likely do you think that is though?

52:40

>> It's probably like 20% 30% chance. The

52:43

reason I think there's some possibility

52:45

is that because of the export controls

52:48

um you are so the deepseek is not

52:53

building with the Nvidia stack. they're

52:56

building with the Huawei stack

52:59

and because there are export controls on

53:03

not just the Nvidia GPUs but also on

53:05

HPMs

53:07

these um architectures that that DeepS

53:10

building are far more like memory

53:13

efficient they made innovations on the

53:16

KV cache to be really small enough that

53:19

you can host it on the SSDs

53:22

and you don't need high bandwidth memory

53:25

for inference time and they're going to

53:26

have a completely different architecture

53:28

for inference, completely different

53:30

architecture for storage cuz they're not

53:33

allowed to use the 3D nans. So their

53:36

architecture is going to look it's not

53:39

just the model architecture. The model

53:40

architecture is already pretty

53:42

different. They've made innovations on

53:43

the attention layer. They made

53:45

innovations on like the the training

53:47

algorithm so that it doesn't consume a

53:50

lot of interconnect capacity. So they

53:52

they've made a lot of they basically

53:53

their whole stack is getting vertically

53:55

integrated to their hardware and their

53:58

chips and their fabs and so on and so

54:00

that's a very different bet from what

54:03

America's making.

54:04

>> Do you think the export controls have

54:06

helped or hurt us? jury still a lot

54:10

shortterm it's helping because the only

54:12

reason I my belief the only reason where

54:16

why there's even like a 12 month gap

54:20

between open source and frontier is

54:22

export controls it's definitely helped

54:24

and and and definitely like companies

54:26

like anthropic lobbyed very hard for it

54:29

but um there is a chance that because of

54:33

that they now get really good at the

54:35

physical layer and One advantage they

54:38

have is they can actually build data

54:40

centers

54:42

a lot a lot a lot faster. Power is not a

54:45

problem. Permits are not a problem.

54:47

People are not a problem. Labor is not a

54:49

problem. Expertise is not a problem. And

54:51

so by forcing them to go out there and

54:54

build all this, you're converting them

54:56

into a far more

54:58

like potent competitor.

55:00

>> Do you think we still dramatically

55:02

underestimate China's capabilities?

55:04

>> I think so. Because if AI is like not

55:06

just digital, that's also physical AI.

55:10

You got to build fabs, robots, chips and

55:14

harness the energy really well and um

55:17

package it into local devices.

55:20

I think they have a lot more advantages

55:22

than America.

55:24

>> How important is it that we have our own

55:25

TSMC in the US? So TSMC is actually

55:30

there is a fab of TSMC in Arizona. Like

55:34

not a lot of people talk about this but

55:36

TSMC is investing like $150 billion into

55:39

that into into building American fabs

55:42

and um they've already invested $40

55:45

billion or something like that. 60

55:46

billion last time I checked. So there is

55:49

a TSMC in Arizona that's coming up.

55:52

There's also um Intel and that's why you

55:55

know American government owns 10% of

55:57

Intel. U Nvidia and SoftPank own 5%

56:00

each. So there is a lot of investment

56:02

going into an American fab as well as

56:05

TSMC is investing into its American

56:07

fabs. Elon's building terra fab like I

56:10

think it people have woken up to the

56:12

importance of building fabs but um this

56:16

is also why China is particularly very

56:19

very competent because given the

56:21

capabilities of China that we just

56:23

mentioned there really articulately I

56:25

know it's a ridiculous question but sort

56:29

it um if I were to say to you your job

56:31

is to make sure America stays

56:33

competitive

56:34

what would you do to ensure that you

56:37

retained competitiveness in an

56:40

increasingly strong China.

56:41

>> I think I think take physical

56:44

infrastructure a lot more seriously and

56:47

continue funding it um and not like have

56:51

all these

56:52

you know I I wouldn't say meaningless.

56:55

It's more like not propagate fake news

56:59

around data centers

57:01

um about how data centers are polluting

57:03

and contaminating water or like they're

57:06

sucking up all the water um and and and

57:08

actually be fact driven and so you know

57:11

I hope our product helps there like you

57:13

you can you can go to perplexity and ask

57:15

any question and get fact checked on

57:18

your assumptions but yeah like it's very

57:20

important that we educate the public

57:25

um about what's actually going on in in

57:27

a language they easily understand and

57:30

not fear-monger, okay? Like not be like,

57:33

oh, like all their jobs are going to go

57:34

away like this, that like there's going

57:36

to be lots of amazing companies that are

57:38

going to get built with far fewer people

57:41

getting multi-billion dollar, multiund

57:43

million valuations with like 20, 30

57:45

people and propelling like trillions of

57:49

dollars of new GDP. Like let's talk

57:51

about how to enable that. let's talk

57:53

about how to build that and create a

57:56

more positive future together, right? Uh

58:00

instead of, oh, like 90% of the jobs are

58:03

going to be gone. Like you're all going

58:04

to get screwed over by our models and

58:07

like and and it's it's our it's our

58:09

moral duty to tell you all this like

58:11

blah blah blah. Like that doesn't make

58:12

any sense to me. Like you can't win by

58:15

saying that and also like complaining

58:17

about not being able to build data

58:18

centers fast. Do you think we've done a

58:20

complete disservice by having the

58:22

marketing message that Dario has had

58:24

that all jobs are going and it's all

58:26

doom and gloom?

58:27

>> Yeah,

58:28

I think so. I mean I think you know they

58:32

have contradictory messages in their own

58:34

58:36

uh different social

58:40

engagement so far where the most recent

58:43

one I heard was there is no evidence

58:45

that AI is taking over jobs.

58:48

But so I I I think there there needs to

58:51

be a consistent communication around

58:53

this. And

58:57

I also think that um very little is

59:00

being spoken about how AIs can help you

59:04

build companies in a very very different

59:06

way like the current AIS AI.

59:10

It it's already true that so many things

59:12

you would hire people for you can do it

59:14

with agents. But one way of looking at

59:18

it is like oh like what happens to all

59:20

the jobs. But the other way of looking

59:21

at it is like, hey, like I can I never

59:24

had the chance to go build out a company

59:26

on this idea that I've been having all

59:28

this all this while and maybe me and a

59:31

group of friends can come together and

59:33

build this and can you guys figure out a

59:35

way to give us compute credits or you

59:38

know Amazon gave a lot of compute

59:40

credits to a lot of startups like when

59:41

we started Perplexity

59:44

we had like around $200,000 worth of

59:46

Amazon credits and GCP credits and Azure

59:49

credits.

59:50

um that almost like together

59:52

cumulatively this was worth like a

59:53

million dollars in computer credits. Now

59:58

in today's world it's going to be like a

59:59

million dollars of computer credits. And

1:00:01

we're doing that like we we're funding

1:00:02

this thing called a billion dollar build

1:00:04

where we're giving a million dollars of

1:00:06

computer credits to any group of people

1:00:08

who have a credible path to building a

1:00:10

billion dollar company and I want like

1:00:12

thousand such companies to be built.

1:00:13

>> What did you think of Sam Alman giving

1:00:15

$2 million of tokens to YC companies

1:00:17

initially?

1:00:17

>> I think we should do more of that.

1:00:20

Yeah, that that's the right thing to do.

1:00:22

Like we should do a lot more of this

1:00:25

because you want new companies to be

1:00:29

built. Um and and and even if they're

1:00:32

worth multi00 million, right, it's good.

1:00:37

If there are thousands of them, like

1:00:39

that's a lot of new GDP. I I spoke to an

1:00:42

Betski before the show and she said how

1:00:44

AI pilled the team is for you. How big

1:00:47

is the team today?

1:00:48

>> It's like 400 people.

1:00:50

>> 400 people. How big will it be in two

1:00:52

years time?

1:00:53

>> I don't know. It's hard to say. Maybe

1:00:55

800 or,000.

1:00:56

>> So, will companies follow the same

1:00:59

headcount trajectory that they have

1:01:00

always followed and we will just solve

1:01:02

new problems. Or will they be

1:01:03

dramatically more efficient with a much

1:01:05

fewer number of people?

1:01:07

>> Definitely, they'll be dramatically more

1:01:08

efficient. Right. and and that's why I I

1:01:12

I am a believer in building a lot more

1:01:16

efficient companies

1:01:18

not and being an example for all these

1:01:20

companies ourselves like like people

1:01:22

should look at perplexity and be like oh

1:01:24

like with 400 people um you can build

1:01:28

like a multi like like I don't know like

1:01:30

20 billion $20 billion company um and so

1:01:35

that means with like 40 people I could

1:01:38

probably build a billion dollar or $2

1:01:39

billion

1:01:41

you know, and that that that's totally

1:01:43

doable. Totally doable. And um and and

1:01:48

so for us, maybe that means is with

1:01:49

4,000 people, we could be worth 200

1:01:52

billion. We we could be worth $2

1:01:54

trillion with like 10,000 people.

1:01:57

You know, I think I think that doesn't

1:01:59

mean it's bad for all the um hundred,000

1:02:03

people we did not hire for a typical $2

1:02:06

trillion company.

1:02:08

I would rather have those 100,000 people

1:02:10

be split into groups of like hundred

1:02:12

thousand groups like that and each of

1:02:14

those thousand groups are worth a few

1:02:16

billion dollars. That's awesome. And I

1:02:19

think a lot more people need to be

1:02:20

entrepreneurial.

1:02:22

Um there are people who would be bad

1:02:25

employees in any company because they're

1:02:27

just like difficult to work with. They

1:02:29

they don't listen to like instructions.

1:02:32

So like they don't follow like road maps

1:02:35

or not they're not like easy to

1:02:36

collaborate with. But maybe the the flip

1:02:39

side of that is those those are the kind

1:02:40

of qualities that founders typically

1:02:42

have.

1:02:42

>> Ain there is a population and a very

1:02:44

large population that are not AI native

1:02:46

people that are not using AI to improve

1:02:49

workflows, improve efficiency.

1:02:51

What would you advise them?

1:02:53

>> Get started. First steps get started and

1:02:56

and and channelize your curiosity.

1:02:59

Right. Um, you don't need to use AIS to

1:03:03

do your existing work. If that's if your

1:03:06

existing work is boring to you, you

1:03:09

probably won't enjoy it even if you use

1:03:10

AIs to do it.

1:03:12

>> You got a lot of heat for saying people

1:03:14

don't like their job. So,

1:03:16

>> I I didn't say if you actually listen to

1:03:18

my interview, I did not say that. So,

1:03:22

people want clickbait articles and they

1:03:24

take something I said in one sentence

1:03:26

and out of context and make it into a

1:03:28

headline. What did you say? I I

1:03:30

specifically said this. Hey, like there

1:03:33

are a lot of people who don't enjoy

1:03:34

their jobs. Does any like By the way,

1:03:36

the fact that that that that thing went

1:03:39

viral is not because I was completely

1:03:42

wrong. I think a lot of people resonated

1:03:43

with the fact that I was actually honest

1:03:46

in saying a lot of people don't enjoy

1:03:48

their jobs. And that has nothing to do

1:03:50

with your economic position or standing

1:03:52

in society. You might even be like

1:03:54

really wealthy but doing a job that you

1:03:56

completely don't enjoy and like

1:03:58

destroying like the peak years of your

1:04:00

adult life working on something that is

1:04:03

horrible or like like depressing. So

1:04:05

like my point is that if that's you and

1:04:10

if the reason you could never leave your

1:04:12

job is because you were always worried

1:04:15

if you how would you build a company

1:04:17

from scratch? Like there are all these

1:04:19

things to figure out how you have to

1:04:21

hire a lot of people. Oh, you have to

1:04:22

like set up an office this that. Like

1:04:24

that's changed. For the first time in

1:04:27

history, you can get started on an idea

1:04:31

with like one or two other friends and

1:04:34

and and and maybe have a real genuine

1:04:36

shot at building a billion dollar

1:04:38

company. Totally get that. Everything

1:04:40

that we've discussed today has been on

1:04:42

the back of unprecedented demand up and

1:04:45

to the right. We need more memory. We

1:04:47

need more data center supply. We need on

1:04:49

demand and server side. Everything's

1:04:51

like up and to the right. Seeing some

1:04:53

cracks in and Uber's saying, "I'm not

1:04:56

sure I'm getting the productivity gains

1:04:57

that I thought." Microsoft aligning with

1:04:59

them, putting a $1,500 token budget. Do

1:05:03

you think we will have a continuous up

1:05:05

and to the right acceptance that

1:05:07

productivity gains are unwavering? We

1:05:09

have to do this, or will there be

1:05:11

falterings along the way? I mean, I'm

1:05:14

sure there's going to be falterings

1:05:15

along the way and people are rightfully

1:05:18

freaking out about token maxing, which

1:05:20

is why I think you need some form of

1:05:22

hybrid agenic inference. You need some

1:05:25

amount of inference compute to run

1:05:27

locally that you're not paying for

1:05:30

tokens on um unmetered intelligence

1:05:33

essentially.

1:05:34

>> How will the best companies of the

1:05:35

future structure token budgets? My hope

1:05:38

is that they don't have to understand

1:05:39

that

1:05:41

they will be able to work with an

1:05:43

orchestrator who does it for them. It's

1:05:44

not going to be easy for you to

1:05:46

constantly keep track of like which

1:05:47

models are the best at what things and

1:05:49

how do you allocate oh this is the

1:05:51

budget for coding, this is the budget

1:05:52

for finance or like like how do you even

1:05:55

understand like which models are good at

1:05:56

each of those things and like how much

1:05:58

do you spend on each of these divisions?

1:05:59

You're not going to be able to keep

1:06:00

track.

1:06:01

>> I I had a friend on the show the other

1:06:03

day say that Google will be the token

1:06:05

king. They can produce the lowest cost

1:06:08

tokens out of anyone. They own full

1:06:09

stack TPUs, data centers, networking,

1:06:11

power, procurement.

1:06:13

Do you think that's true that they will

1:06:15

be the lowest cost token producer?

1:06:18

>> They have advantages all all advantages

1:06:21

one needs to have to be that. But they

1:06:23

underestimated the importance of coding

1:06:25

models and so they far behind the

1:06:27

frontier right now. So again, they could

1:06:29

catch up, totally capable, totally

1:06:32

competent team, but today they're not

1:06:35

quite at the frontier.

1:06:36

>> I was shocked the other day. I saw the

1:06:38

Cloudflare announcement that um now

1:06:41

agent traffic has overtaken human

1:06:43

traffic for them.

1:06:45

>> Why? Why are you shocked?

1:06:46

>> It was quicker than I thought.

1:06:49

>> Okay.

1:06:50

>> Personally, I thought that would happen,

1:06:52

but in two years, maybe not now.

1:06:56

How does the world change when agent

1:06:59

traffic far exceeds human traffic?

1:07:01

>> I think people are just going to have a

1:07:02

lot more agency.

1:07:04

That's it.

1:07:06

>> Do websites go away? Does design not

1:07:08

matter? Does the advertising model of

1:07:10

the internet die completely?

1:07:12

>> No, it doesn't. Because my belief is

1:07:16

that

1:07:17

the advertising model around like travel

1:07:21

or shopping or like u fashion are not

1:07:26

getting disrupted by agents because the

1:07:28

judgment is not objective.

1:07:31

Any anything where the judgment is

1:07:34

objective, the transaction is based on

1:07:36

objective judgment that's going to get

1:07:37

disrupted by agents.

1:07:40

anything where the transaction is more

1:07:42

subjective like the decisions are more

1:07:44

subjective like like what is the best

1:07:46

piece of furniture inside this this this

1:07:48

podcast like why this particular table

1:07:51

or like those kind of things

1:07:53

>> probably for the mic you would buy an

1:07:55

objective decision the table

1:07:57

>> you probably are caring about the

1:07:59

aesthetics of the room I think I I think

1:08:01

that's kind of how I I feel the world

1:08:03

will split and subjective things will

1:08:06

still be ad based objective things will

1:08:08

be agent based

1:08:10

I watched your commencement speech on

1:08:11

the back of speaking to Samir at Excel

1:08:13

and he said I had to watch it. So

1:08:15

obviously I watched it. Um and one of

1:08:17

the points you made was the defining

1:08:18

skill of the area is asking better

1:08:20

questions.

1:08:21

>> Yeah.

1:08:22

>> What question is no one asking today

1:08:25

that maybe everyone should be asking?

1:08:27

>> I think people need to ask more about

1:08:29

like okay assuming I have a lot of

1:08:32

agency available to me what do I do?

1:08:36

Imagine like I gave you a headcount of

1:08:38

like 100,000 people or 10,000 people

1:08:42

and and and

1:08:44

you know enough computer credits to run

1:08:46

those agents. What would you do?

1:08:50

Like let's say I ask you Harry like you

1:08:52

know let's say I you have suddenly like

1:08:55

10,000 agents at your disposal. What

1:08:57

would you do? Like I I remember you

1:09:00

telling me or not me but but in some

1:09:03

episode of yours where

1:09:05

>> you said you're only you only did this

1:09:07

podcasting because you felt like you

1:09:09

didn't have an arbitrage to go win deals

1:09:12

>> 100%. Yeah. It's why I still do it. I

1:09:15

mean I love what I do but yeah.

1:09:17

>> Okay. So you've gotten some amount of

1:09:19

distribution. So now assuming that you

1:09:21

can you you have let's say you could

1:09:23

spend $100 million on a genenic

1:09:25

inference and grounded with all the

1:09:28

connectors and stuff and it's all

1:09:30

working. What would you do to to with

1:09:33

with that capability to um further your

1:09:36

goals? Like what what what should your

1:09:37

goals even be then? I I think that's the

1:09:40

question I would ask assuming that in

1:09:42

the next three to five years you're

1:09:44

going to be able to like delegate

1:09:46

whatever digital task you want and and

1:09:49

with the right harness and agents and

1:09:50

like be able to delegate that.

1:09:52

>> Fundamentally it would be to build aic

1:09:54

infrastructure to be able to find,

1:09:58

identify, outreach, set up, win great

1:10:02

investments and have the media sit on

1:10:05

top and power that. that is intensely

1:10:07

difficult to do and would be the holy

1:10:10

grail to investing but like

1:10:12

>> that would power what my end goal

1:10:14

ambition is.

1:10:15

>> Yeah. So your goal is to be the you know

1:10:18

run like a 10 to 100x larger fund right

1:10:20

that that's basically what I'm hearing

1:10:22

from you.

1:10:23

>> So that's like let's assume it's like a

1:10:25

$40 billion fund from 400 million. Um

1:10:30

then all you got to ask is like assuming

1:10:33

I have all the headcount I need to do

1:10:35

this like how much faster can I do it? I

1:10:38

I think that's how I would frame this

1:10:40

question. I think Elon has like a

1:10:42

similar thing he spoke about once where

1:10:46

okay assume that a task somebody tells

1:10:48

you a task is going to take 10 years. Um

1:10:51

ask the question what would it take to

1:10:53

do it in 10 months?

1:10:57

Maybe it's impossible to do it in 10

1:10:59

months, but you'll probably get pretty

1:11:01

far asking those questions

1:11:05

compared to somebody who takes it for

1:11:07

granted that it's it's going to take 10

1:11:09

years.

1:11:10

>> All right, interviewer, we put it on

1:11:12

you. What's your 10year and how does

1:11:15

that look in a 10-month time frame?

1:11:18

>> I think our our mission beyond any level

1:11:21

of capitalism is to make the planet more

1:11:23

curious.

1:11:25

You know the product is always intended

1:11:27

to helping people ask the next question

1:11:30

and uh my goal is to truly realize that

1:11:34

like that level of agency

1:11:37

that needs to exist in this world is

1:11:40

quite not there.

1:11:40

>> I think I think that needs to be

1:11:42

grounded in numbers dude to make it like

1:11:45

possible. It's like me saying oh I want

1:11:46

the best investments

1:11:48

like which is why a $40 billion fund is

1:11:51

helpful.

1:11:51

>> Sure. I can say the same thing like 2

1:11:53

trillion you know it doesn't matter

1:11:54

right like 100x 10x 1000x these are all

1:11:57

like um motivational milestones

1:12:01

>> do you think will be a trillion dollar

1:12:03

company

1:12:04

>> yeah anyone can be a trillion dollar

1:12:06

company SKH and Samsung are worth a

1:12:08

trillion last last couple of weeks did

1:12:12

you know Samsung started off as a

1:12:13

grocery store did you know that you

1:12:16

didn't know that okay so it's true they

1:12:18

started selling dried

1:12:21

Seriously. Um, Heinix was um SK the SK

1:12:26

group started off as um um textiles

1:12:29

company. So, anyone can be worth a

1:12:32

trillion dollar company and like you

1:12:34

just have to work your way towards that.

1:12:36

I mean, the exact same logic for you

1:12:40

that you laid out for how can a company

1:12:42

be worth um hundred billion. Okay, you

1:12:46

said you need to make a $10 billion in

1:12:47

revenue. Isn't that the same for

1:12:50

trillion? Like you need to make a

1:12:51

hundred billion in revenue.

1:12:52

>> Mhm.

1:12:54

>> And there was actually some very

1:12:55

interesting data that CO2 revealed. I

1:12:57

don't know if you saw it recently, which

1:12:58

basically says about the probability of

1:13:00

reaching the next level of value is much

1:13:03

higher. So when you're at a billion,

1:13:04

it's like much more likely to reach 10

1:13:06

billion. 10 billion much more likely.

1:13:07

>> Yeah, that's true actually for even

1:13:08

people. Like it's way more likely for a

1:13:12

person with $100 million in in liquid

1:13:15

net worth to become a billionaire than

1:13:18

someone with $10 million.

1:13:19

>> Are you not worried about the wealth

1:13:20

inequality? Aaron, if we being blunt, we

1:13:23

both are very lucky now to live in kind

1:13:25

of nice worlds and rarified airs. Are

1:13:28

you not worried by just how much money a

1:13:30

very small number of people have and how

1:13:32

[ __ ] hard it is for everyone else and

1:13:35

that gap is getting bigger?

1:13:37

I think the way to like ensure that

1:13:39

that's not that doesn't remain the case

1:13:42

is to distribute the benefits more

1:13:45

widely. You got you got to let anyone by

1:13:48

the way the people who are using our

1:13:50

tools like I had an Uber driver I'm not

1:13:53

even like making this thing up so um as

1:13:56

honest as it can get. an Uber driver in

1:13:59

San Francisco um once told me that he

1:14:03

watched one of my uh YouTube interviews

1:14:06

where I explain how you can build a

1:14:09

product or a web app with an AI from

1:14:12

scratch, went on to do it and um um used

1:14:16

AIS to add like billing and all that and

1:14:19

that makes more passive income for him

1:14:22

than um driving Ubers and so he actually

1:14:25

reduced the amount of time he's driving

1:14:27

Uber because He he loves wipe coding new

1:14:30

apps and u that that already tells you

1:14:34

that

1:14:35

for the person with agency and a

1:14:38

positive outlook for the future,

1:14:40

anything is possible. And so if you keep

1:14:43

communicating

1:14:45

all the negative things you can about AI

1:14:47

and wealth inequality all the time and

1:14:49

that's the only thing news uh and press

1:14:52

writes about,

1:14:54

I think it'll perpetuate and people will

1:14:56

only think the bad things. And so it's

1:14:58

it's it's very essential that if you

1:15:01

think you're already doing well, it's

1:15:02

very essential that you talk about what

1:15:05

are all the things that can go well and

1:15:08

give hopes to people who were once upon

1:15:10

a time like you like you you were you

1:15:12

didn't you you started this um

1:15:15

podcasting circuit like when you had

1:15:17

nothing, right? So

1:15:18

>> nothing.

1:15:18

>> Exactly. So it's possible. So you got

1:15:20

you got to talk more about that than be

1:15:22

like oh I feel so guilty that I made it

1:15:24

and now I'm like you know what about all

1:15:26

these people who haven't made it like

1:15:27

you can also make it.

1:15:28

>> I I think I have a more pessimistic view

1:15:30

of actual general public which is I

1:15:32

don't think that many people have

1:15:33

agency. I think a lot of people have

1:15:35

victim mentality.

1:15:36

>> You got you got to help them. Like I

1:15:37

think that's that's the most important

1:15:38

thing.

1:15:38

>> I think they got to help themselves.

1:15:40

>> Sure. But people will help themselves

1:15:42

once they see that okay like I kind of

1:15:46

want to be like this guy. let me let me

1:15:49

work hard. You need an example, right?

1:15:52

Um it's not like nobody can become um

1:15:56

get in shape. Like it it takes

1:15:58

discipline.

1:15:59

>> Takes discipline. You got to get rid of

1:16:01

bad habits. And so

1:16:03

>> and now is the best time ever to change

1:16:05

your life in 12 months. Like the ability

1:16:07

to go from nothing to to actually

1:16:09

billionaire in 12 months is now possible

1:16:12

in some respect.

1:16:13

>> Yes. And so I look, I'm not saying

1:16:16

everyone's going to make it and

1:16:17

everyone's going to be worth a billion

1:16:18

dollars.

1:16:18

>> Isn't that the caption from this this

1:16:20

show, Arvin? Everyone's going to make

1:16:22

it.

1:16:22

>> Anyone has the potential to make it.

1:16:26

>> So it's it's it's as likely for

1:16:29

Perplexity to become worth $2 trillion

1:16:32

as as a founder who's yet to secure your

1:16:34

funding to be worth a billion dollars.

1:16:38

So it's it's equally hard. Equally hard.

1:16:40

And um and I think you just have to give

1:16:44

yourself, you know, shots at the goal

1:16:47

and um be curious. That's that's the

1:16:49

message from the commencement speech. Be

1:16:51

curious.

1:16:51

>> We have SpaceX. We have Anthropic. We

1:16:53

have Open AI going public. It feels like

1:16:55

someone's kind of shot the gun and the

1:16:57

race is on. Is there enough money

1:17:01

to fund three such large IP?

1:17:04

>> There will be some reallocation for

1:17:05

sure.

1:17:07

like um there there might be some

1:17:10

holders of like SAS stocks who would put

1:17:12

it into anthropic or something. Let's

1:17:16

say you believe that enterprise AI is

1:17:18

going to take off. You might want to

1:17:20

hedge between having a lot of Microsoft

1:17:22

stock and Salesforce stock versus like

1:17:25

putting some of that into anthropic. So

1:17:28

let's say like Vanguard or Black Rockck

1:17:30

own like you know cumulatively they own

1:17:32

like $200 billion of Microsoft and

1:17:34

Salesforce. They might be like, "Okay,

1:17:35

I'm going to take 30 40 billion of that

1:17:37

and put it into anthropic."

1:17:39

Fine. You know, not a bad bit to make.

1:17:42

>> What happens to all the enterprise SAS

1:17:44

companies that are public growing? Yeah.

1:17:47

Fine.

1:17:49

>> They have to they have to weather the

1:17:51

storm.

1:17:52

>> Is it a storm or is it a continuous

1:17:54

precipitation?

1:17:56

>> I think you have to bring down the costs

1:17:58

and produce new value.

1:18:01

Salesforce has done well because they

1:18:04

always went and bought the next thing.

1:18:06

If you're just selling the same

1:18:07

software, you're probably not going to

1:18:09

be around. IBM is still around because

1:18:10

they went and bought Red Hat and

1:18:13

Hashikarp and now they're buying

1:18:15

Confluent. So, there are ways for these

1:18:17

companies to stay alive and extend their

1:18:20

lifespans and stuff. It's obviously

1:18:22

going to be hard to preserve a brand

1:18:24

that's as relevant

1:18:27

like like I don't think the IBM brand is

1:18:29

that relevant anymore in terms of like

1:18:32

evoking an emotion and people to go use

1:18:33

their products but as a business it's

1:18:36

going to be awesome you know it's going

1:18:37

to be fine.

1:18:37

>> I have to finish on you said IPO in

1:18:40

2028.

1:18:42

I had to ask this. I woke up to this in

1:18:43

my like you know um group. We have a

1:18:46

team WhatsApp and it's like ask IPO

1:18:49

2028. I hope I hope it can be sooner

1:18:51

than that.

1:18:52

>> When do you know when you're ready? Like

1:18:54

is there like a billionaire? Are you at

1:18:56

500 million er now?

1:18:58

>> More than that. Far far more than that

1:19:00

actually.

1:19:00

>> Really?

1:19:01

>> We we're not ready to share it, but um

1:19:06

growing really fast.

1:19:08

>> Revenue growth matters much more to you

1:19:09

than profitability

1:19:11

>> today. I think in general, by the way,

1:19:13

you can look at public markets. Um

1:19:17

people want topline growth.

1:19:19

more than um bottom line efficiency

1:19:22

right now because it's very hard. It's

1:19:24

it's rare.

1:19:25

>> Well, you definitely need one.

1:19:28

>> Of course, sustainable businesses,

1:19:30

>> you need to have a model in place to get

1:19:35

the bottom line efficiency when that

1:19:36

becomes the objective and and you need

1:19:38

to also like have a path to getting

1:19:40

there.

1:19:40

>> Where are you cost inefficient today

1:19:42

where you expect to be significantly

1:19:43

better in 2 to 3 years? We're training

1:19:45

our own models, post- trainining it on

1:19:48

top of amazing open source models, and

1:19:51

that will bring down the cost that we

1:19:54

currently spend on Frontier model

1:19:56

tokens.

1:19:57

We expect to continue to use Frontier

1:20:00

models for designing new experiences and

1:20:02

new capabilities that do not exist today

1:20:04

in our products. But whatever exists

1:20:06

today in our products right now, we

1:20:09

expected to completely re rely on like

1:20:12

models we own and serve ourselves. And

1:20:13

that's

1:20:15

going to be the best way to bring down

1:20:16

the costs and increase our margins.

1:20:17

>> Will the largest enterprises in the

1:20:19

world all be fine-tuning open models to

1:20:22

have tailored models that are much more

1:20:23

specific to them?

1:20:24

>> Absolutely. Because it's in your

1:20:27

incentives to bring down the cost.

1:20:28

>> Does that not provide another bad case

1:20:30

for the large frontier model providers?

1:20:32

>> Frontier model providers will only

1:20:34

remain relevant if they remain at the

1:20:36

frontier. If for 6 months you're not

1:20:39

seeing a new capability,

1:20:41

it's bad for them.

1:20:43

And so that's the uncomfortable nature

1:20:46

of this field. You no one's ever in a

1:20:49

comfortable position. Like I said in the

1:20:51

start, no one's no one can relax. This

1:20:54

1:20:55

>> [ __ ] a hard business. It's got

1:20:57

harder.

1:20:58

>> It's going to get even harder. And and

1:21:01

um that's the nature. This is the the

1:21:04

price is too big. Like you've never seen

1:21:07

like like take entropic. I think it's um

1:21:10

worth like one to one and a half

1:21:11

trillion some something in that range.

1:21:14

That's basically the valuation of meta

1:21:18

and and and this all was created in like

1:21:20

6 years.

1:21:22

Meta took like 20 years to build. So the

1:21:26

price is so big and so no one can um no

1:21:31

one can be comfortable

1:21:33

and and and and and anyone who's winning

1:21:35

today can lose tomorrow including

1:21:39

including the mod providers.

1:21:41

Previous this year, there was like a

1:21:43

three-month period where people were

1:21:44

like, "Oh, Py, what's happening with

1:21:46

Pacity?" Do you pay attention? Do you

1:21:48

give a [ __ ]

1:21:49

>> Of course, I pay attention to all that.

1:21:50

>> Do you care? There was one in particular

1:21:52

in San Francisco. Do you remember where

1:21:53

they were like, "Oh, what's the company

1:21:54

you had short?"

1:21:55

>> Yeah, we were voted the most likely to

1:21:56

fail. Cursor was voted the second most

1:21:59

likely to fail. Open AAI was voted the

1:22:01

third or something. Um,

1:22:02

>> you didn't give a [ __ ]

1:22:04

>> I I feel like we're all doing well.

1:22:07

Curser, I think, is getting sold. SpaceX

1:22:10

Open AI

1:22:12

>> is

1:22:14

>> going public soon.

1:22:15

>> We tripled our revenue since that

1:22:18

>> since that uh judgment was made. So

1:22:21

brought down the burn by more than 50%.

1:22:25

So I I don't know like my my my sense is

1:22:27

that um I also feel most of those people

1:22:31

who sit on these like meetups and vote

1:22:35

don't actually build anything useful.

1:22:41

Yeah.

1:22:41

>> Uh, okay. We're going to do a quick fire

1:22:43

around because, uh, I could talk to you

1:22:45

all day. Um, first one, what's one

1:22:48

widely held belief that you think is

1:22:51

completely wrong?

1:22:52

>> I think a lot of people are obsessed

1:22:54

about like, you know, identifying a mode

1:22:56

in the first year or two of their

1:22:57

company. But, um, I think like the only

1:23:00

shot you have is move fast. like veloc

1:23:03

in my mind like moving fast is a way of

1:23:06

expressing humility because you you're

1:23:08

constantly making contact with the world

1:23:10

and trying to question your assumptions

1:23:11

all the time.

1:23:12

>> Where are you still moving too slow

1:23:14

internally today?

1:23:15

>> I think we can be even more AI.

1:23:18

It's insane I'm saying this because we

1:23:20

are building some of the most

1:23:22

interesting AI products and internal

1:23:25

adoption of our own products, our

1:23:27

competitors products can be even higher

1:23:29

and and um this is despite us being

1:23:31

extremely

1:23:33

um agent build internally and trying to

1:23:36

delegate as much to agents. Yeah, that's

1:23:39

where that that's like a big area for my

1:23:41

my hope is that we can turn this company

1:23:43

almost into an AGI and and um have that

1:23:46

doesn't mean no humans work here, but

1:23:50

there will be an AGI that has all the

1:23:52

context it needs to run different

1:23:57

divisions of the company

1:23:59

in a semi-autonomous way with some

1:24:01

scaffolding provided by humans here and

1:24:04

there. And and that's not that's going

1:24:06

to feel that's not going to feel scary

1:24:08

at all. We'll normalize that that

1:24:11

feeling very fast. It's just going to

1:24:14

feel like 10, you know, uh the 10x

1:24:17

engineers running certain aspects of the

1:24:19

company.

1:24:19

>> If I gave you unlimited money, what

1:24:21

would you do today that you're not

1:24:23

doing?

1:24:24

>> I would build data centers.

1:24:25

>> You would?

1:24:26

>> Yeah.

1:24:27

>> In space?

1:24:28

>> I don't have expertise to do that, but I

1:24:31

would I would start with land on Earth.

1:24:34

You know, I think there's a lot of land

1:24:37

and and maybe you can be resourceful in

1:24:39

securing permits and power in different

1:24:41

countries, but I would start there. I

1:24:43

think it's, you know, I like I said,

1:24:46

like I think physical infrastructure

1:24:48

buildouts is like the return of the

1:24:51

industrial age again

1:24:54

like like the the the forefathers who

1:24:56

built the industrial revolution,

1:24:58

um oil pipelines, steel bridges,

1:25:02

factories producing cars, all these

1:25:04

things that we take for

1:25:07

granted today were built by people who,

1:25:10

you know, spend a lot of time thinking

1:25:12

about how to scale these things in a

1:25:14

costefficient way and so um we need to

1:25:17

do that a lot for AI and um yeah that's

1:25:21

what I would do of course you you you

1:25:23

cannot just be building infra you need

1:25:26

to be able to utilize all that infra to

1:25:29

producing valuable output tokens to the

1:25:31

user

1:25:32

but um we're already good at doing that

1:25:35

so infra is the thing I would focus on

1:25:38

you can buy and hold for 10 years SpaceX

1:25:42

anthropic or open AI the three IPOs

1:25:46

coming in the next few months which you

1:25:48

buy and hold for 10 years and why

1:25:50

>> SpaceX

1:25:52

why it's an end of one company

1:25:56

like Enthropic and OpenAI can claim they

1:25:59

do whatever each other does but u um

1:26:04

SpaceX is the only company building

1:26:07

space infrastructure for connectivity.

1:26:10

Have you been on a flight with Starlink?

1:26:12

No,

1:26:13

>> you should.

1:26:15

You will hate being on a flight without

1:26:16

Starlink after that. Imagine we can

1:26:20

record this. I can watch this podcast

1:26:22

while flying on a plane.

1:26:25

Starlink lets you do that. That's just

1:26:27

one aspect of the business. That's just

1:26:29

one aspect of

1:26:30

>> one small aspect of the business.

1:26:31

>> Yeah. Like there's a lot of like I'm

1:26:33

excited about possibilities to travel

1:26:35

from Australia to like uh San Francisco

1:26:38

in like 30 minutes. You know, all this

1:26:40

feels like sci-fi, but I'm excited about

1:26:42

all these possibilities.

1:26:43

>> What job does not exist today that will

1:26:45

be incredibly common in 5 years time?

1:26:48

>> I think it already exists. So, if the

1:26:50

forward deployed engineer is definitely

1:26:52

on the rise, I guess like people with a

1:26:54

really good sense of like um quality

1:26:57

control.

1:26:59

Maybe a better uh way to answer this is

1:27:02

like most jobs that exist like valuable

1:27:04

jobs that exist are usually like

1:27:06

reincarnations of something that already

1:27:08

existed.

1:27:09

Like so I don't think we're going to see

1:27:10

completely new things. It's just going

1:27:12

to reincarnate in different ways.

1:27:13

>> You can advise your little sibling who's

1:27:16

finishing university today and just done

1:27:19

a computer science degree. One thing,

1:27:22

what would you advise them?

1:27:24

>> Stay curious.

1:27:27

Don't don't don't give into like FOMO

1:27:29

and trying to max out on something here

1:27:31

in the short term. Like don't go to

1:27:34

Twitter and feel like a loser that

1:27:36

people on Frontier Labs are getting so

1:27:39

rich and like you everything feels

1:27:41

hopeless to you or something like there

1:27:43

is so much more to build like we are

1:27:45

just getting started like the the the

1:27:47

application layer era or like

1:27:50

infrastructure buildouts there there's

1:27:52

like a lot of opportunities we are

1:27:54

seeing more spinouts from open AI

1:27:56

anthropic you name it every single day

1:27:59

do we have hundreds of these neolabs

1:28:02

in vertical models.

1:28:04

>> No, not not a big believer in too many

1:28:08

of them. I think you got to produce some

1:28:10

differentiation. That's the most

1:28:12

important thing. Um like like I I if

1:28:16

would you call Deep Seek a Neolab?

1:28:18

>> No.

1:28:19

>> Why?

1:28:20

>> I think very stupidly for me I don't

1:28:22

call it a Neolab because I I attribute

1:28:24

Neolabs like spinouts from larger labs.

1:28:27

>> I see. and and kind of verticalized

1:28:29

which is probably wrong on both par like

1:28:32

axes

1:28:33

>> but it's horizontal and it's not a spin

1:28:35

out.

1:28:36

>> Yeah. I mean I kind of like the idea of

1:28:38

labs taking a differentiated bet. Okay.

1:28:40

If somebody really questions the

1:28:42

transformer architecture itself or

1:28:45

somebody really questions needing to

1:28:46

build on Nvidia GPUs or something like

1:28:48

that like like foundational bets makes

1:28:51

or or somebody questions or somebody

1:28:53

goes out and builds for robotics models.

1:28:55

I think I think that's like somewhat

1:28:57

uncorrelated and different and that

1:28:58

makes sense for a lab, but I feel like

1:29:00

they're like just labs for the sake of

1:29:02

being lab and I don't think they're

1:29:03

going to make it.

1:29:04

>> Can you paint for me? I I do like this

1:29:07

one. What's the most plausible story

1:29:10

whereasty becomes a trillion dollar

1:29:13

company? What do you do then? The

1:29:15

orchestration layer.

1:29:17

>> I mean, accuracy and orchestration is is

1:29:19

is like two goals that have been

1:29:21

consistently true since the beginning of

1:29:22

our company. So I think we'll continue

1:29:25

to do that. We'll be orchestrating

1:29:27

across devices, chips, models, tools,

1:29:29

files, connectors, everything. Right. So

1:29:31

what would I do once that happens? I

1:29:34

don't know. We we'll chart our path to

1:29:36

10 trillion.

1:29:37

>> Are you happy now? But are you are you

1:29:39

enjoying this?

1:29:40

>> Of course. I mean, I wouldn't like

1:29:43

there's so many things I could be doing

1:29:45

if not for this. And um I think the

1:29:49

process is is is what motivates you. So

1:29:52

you asked me I think somewhere in

1:29:53

between you need to give me a number of

1:29:55

where you want. I don't I don't work

1:29:57

like that actually. I I like for example

1:30:00

like the these numbers like getting to

1:30:02

two trillion or 20 trillion

1:30:04

are are exciting but like that that

1:30:07

doesn't motivate me. It's it's hard to

1:30:09

get motivated by wealth. You you you

1:30:11

want to get motivated by impact. Who's

1:30:14

the smartest person you've met? Final

1:30:16

one. You've met Jensen Hang. You've met

1:30:19

the best of the best. I've been

1:30:20

fortunate enough to Who's the smartest?

1:30:23

>> People are smart in their own ways. It's

1:30:25

hard to compare. Like I met Jensen,

1:30:27

Elon, all these guys and like Bezos.

1:30:30

>> What was it like meeting Elon?

1:30:33

>> Amazing. I mean Elon's like a very uh

1:30:37

focused person like like that. He might

1:30:39

not appear that way on Twitter but you

1:30:42

know with a lot of like random tweets

1:30:43

but he's extremely laser sharp focused

1:30:46

on whatever he's doing at that moment in

1:30:48

time. Actually the the one skill that as

1:30:51

an entrepreneur that I would really like

1:30:52

to take like build from somebody like

1:30:55

him like take from somebody like him and

1:30:57

have it for myself is that ability to

1:30:58

just zone out of all the other things

1:31:01

that's happening in your business or

1:31:04

other businesses and just focus on that

1:31:07

limiting problem right now like the

1:31:09

bottleneck problem and and and and

1:31:11

ignore everything else. It's very hard

1:31:13

to do like even within perplexity I

1:31:16

cannot just focus on like one part of

1:31:18

the business alone. It's very difficult

1:31:22

like I'm I'm always looking at other

1:31:23

things simultaneously. And um his style

1:31:28

is to just always look at the limiting

1:31:30

problem and just ignore everything else.

1:31:32

And uh that's very hard to do because

1:31:35

you you actually have to be really good

1:31:37

at concentration.

1:31:39

You you have to be really good at

1:31:40

ignoring

1:31:42

even important things which are

1:31:45

distractions to your core objective

1:31:46

right now.

1:31:47

>> Was Jensen hang what you thought he'd

1:31:49

be?

1:31:50

>> Far better.

1:31:51

>> Really?

1:31:51

>> Yeah. Jensen is so truth seeeking. It's

1:31:54

insane.

1:31:55

I think he or somebody else told me or

1:31:58

read in a book that he

1:32:00

um is so intense that he wakes up every

1:32:03

day and tells himself that he sucks

1:32:06

and goes in like like he's so intense

1:32:10

that he tells everybody around him that

1:32:13

they're 30 days away from going out of

1:32:15

business. Think about it, right? $5

1:32:18

trillion

1:32:20

um guaranteed to make $500 billion in

1:32:23

revenue in the next two years. Um

1:32:27

and um has the most advanced chips in

1:32:29

the world and and and he operates with

1:32:32

that mentality that he could be 30 days

1:32:34

away from going out of business. That is

1:32:37

what it takes to be Jensen Huang. And uh

1:32:41

there's so much to learn from these

1:32:43

guys. There's so much to learn. I think

1:32:45

uh there's one aspect of like you know

1:32:48

being comfortable where you are thinking

1:32:50

you made it uh you know that that feels

1:32:54

good to get here so far but these guys

1:32:57

are not stopping like I I don't think

1:32:58

Elon wants to stop at uh if you look at

1:33:01

his pay package for SpaceX it's

1:33:03

structured around um creating a colony

1:33:06

in Mars with a million inhabitants and

1:33:09

uh building enough comput in space so

1:33:11

that's why it's not like motivating to

1:33:14

1:33:15

worth a 10 trillion in net worth or

1:33:17

something. you know, if if he does these

1:33:20

things, I'm sure he's going to get

1:33:21

there, but um it's more motivated around

1:33:25

like making the impossible things happen

1:33:29

and and and having like that long-term

1:33:31

outlook like you um I think that has

1:33:34

been the biggest thing to learn from

1:33:36

maybe these two individuals in

1:33:37

particular is um

1:33:40

a lot of people view this like

1:33:42

entrepreneurship as like oh if it wins

1:33:45

if if I win and I have a great outcome

1:33:48

and I sell my company. I would have like

1:33:51

generational money. I don't have to work

1:33:55

ever again. And then what you end up

1:33:58

like like just staying at home and like

1:34:00

your your your kids will obviously have

1:34:02

like trust funds and they're not going

1:34:05

to get inspired watching their dad play

1:34:08

paddle.

1:34:09

>> Yeah. You know, you're not going to set

1:34:11

the right example for them. and they're

1:34:13

not going to be able to take your wealth

1:34:14

and multiply it cuz they they didn't

1:34:17

watch somebody who actually did that.

1:34:20

You they you did it before

1:34:23

they they they were like adults. And so

1:34:26

um I think you always need to be doing

1:34:29

something. Um like like Jensen said some

1:34:33

recently that he hopes to die on the job

1:34:34

or something like that. Like that's the

1:34:37

attitude you need to have. Like you got

1:34:39

you you need to work forever. I was so

1:34:41

upset though when Jensen said, "If I'd

1:34:43

known how hard it was going to be, I

1:34:45

wouldn't have done it." When he did, I

1:34:47

don't know if you saw that into I was

1:34:48

like, "Oh,

1:34:51

>> yeah. I think it's pretty hard, but you

1:34:53

you don't do it because you do it

1:34:57

despite that." I think I think that's

1:34:58

how it works.

1:34:59

>> Arvin, listen, this has been so

1:35:00

fantastic to do. I so appreciate you

1:35:02

taking the time while you're in London.

1:35:03

So, thank you so much for joining me.

1:35:05

>> Appreciate it.

More transcripts

Explore other videos transcribed with YouTLDR.

Get the TLDR of any YouTube video

Transcribe, summarize, and repurpose videos in 125+ languages — free, no signup required.

Try YouTLDR Free