Full Transcript

·YouTLDR

How Export Controls Helped Not Hurt China & Power is the Bottleneck to AI | Perplexity CEO

1:35:091,454 summary words · ~7 min readEnglishTranscribed Jun 16, 2026
Summary

The AI value proposition is shifting from commoditized model pre-training weights to smart orchestration layers where the critical metric is token value per watt per user. Meanwhile, hardware-software co-design and regional power grid accessibility have replaced simple chip supply as the dominant strategic bottlenecks of the technology explosion.

This shift proves that physical energy infrastructure access, client-edge hybrid routing, and deep compiler/attention optimizations under constraint will determine the ultimate geopolitical and commercial winners of the intelligence race.

Section summaries

0:00-3:34

The Offense-Only Mindset of Founders

optional

Aravind Srinivas explains his personal motivation, noting his upbringing in a lower-middle-class family in India. He explains that coming from nothing leaves him with no fear of failure, leading to a philosophy of persistent offensive movement. He reflects on his past aggressive social media communication style, noting that while it served its early branding purpose, he has transitioned to a more measured approach because the constant noise became stale.

  • Defensive strategies are structurally counterproductive for early-stage companies trying to challenge incumbents.
  • Brand communication must shift from provocative social positioning to disciplined execution as products scale.

Deals mostly with founder psychology and personal history rather than deep architectural or geopolitical insights.

3:34-8:08

How Perplexity Redesigned Google's Interface Roadmap

watch

Srinivas argues that Perplexity has altered Google's main interface design more than any internal Google product manager could have. He walks through Google's AI model integration, noting that their font, citation style, and Hyperlink formatting directly copy Perplexity's interface. He asserts that basic answer engines have been commoditized, forcing innovative players to focus on deep research and agent execution rather than simple search.

  • The standard Q&A search layout has been commoditized across major tech platforms.
  • Incumbent organizations struggle to innovate on cash-cow interfaces until competitive forces threaten their user base directly.

Provides essential context on why simple search retrieval is commoditizing, driving the industry toward active agents.

8:08-11:48

The Economic Incompatibility of LLM Chat and Direct Ads

watch

Aravind analyzes why conversational chat interfaces are fundamentally ill-suited for traditional advertising models. He points out that Google's largest advertisers are built around travel, fashion, and subjective exploration, which require browsing and vibes rather than a single direct answer. Furthermore, injecting ad placements directly into objective answers degrades user trust in the accuracy of the system, mimicking the failures of email-based advertising.

  • Subjective decisions like commerce and fashion rely on exploratory UI layouts rather than direct-response text boxes.
  • Sponsored search results inside high-accuracy answer interfaces degrade the core user value proposition of objective trust.
  • WeChat's ad-gamification models are unique to the Chinese economic ecosystem and do not map directly to Western consumer behaviors.

Deconstructs the structural business model failures of attempts to place traditional search advertising within LLM platforms.

11:48-15:33

The Agentic Orchestration Harness

watch

The discussion defines the technical difference between raw foundational models and an orchestration harness. Srinivas defines the harness as the set of tools, local connectors, sub-agents, and routing rules that make model intelligence useful to end-users. He explains that by orchestrating across different models (like Claude and GPT models simultaneously), third-party engines can optimize token value relative to the local power expended.

  • The software harness surrounding raw AI models represents the true source of competitive differentiation and user value.
  • To win the value capture war, platforms must orchestrate across rival model vendors to optimize token delivery per watt.

Introduces the critical concept of the orchestration harness and the primary unit economic metric of agent-based systems.

15:33-20:45

Continuous Loops and the Enterprise Token Explosion

watch

Srinivas details the emergence of high-spending enterprise power users running automated background loops inside system harnesses. He describes internal workflows where multi-agent structures run constant checks, triage network issues, and perform root-cause analysis without human interaction. This shift from transactional queries to background loops will drive a token economy larger than the aggregate ad revenues of Google and Meta.

  • High-frequency background cron-jobs generate far more token volume than reactive, human-facing chat applications.
  • The most profitable AI workflows are run by small groups of highly leveraged humans managing autonomous agent hierarchies.

Crucial explanation of how enterprise agent systems actually create high-volume economic value.

20:45-34:40

Edge Computations, Local Chips, and Data Custody

watch

To prevent enterprises from going bankrupt due to constant cloud API costs, Aravind advocates for a hybrid model. This setup routes lightweight task analysis and data compaction to local device chips, reserving high-cost server-side frontier models for complex tasks. This edge-focused design secures highly sensitive intellectual property locally, protecting it from potential cloud server compromises.

  • Hybrid local-server routing is economically necessary to sustain continuous multi-agent processing workflows.
  • Storing context locally prevents sensitive enterprise transaction records from being exposed during third-party server hacks.

Explains the technical division of labor between edge devices and cloud-based frontier model servers.

34:40-43:47

Physical Grid Permitting, Chip Limits, and CPU Demand

watch

The discussion pivots to the physical realities of the AI compute economy. Srinivas argues that grid power allocations, cooling systems, and physical construction permits are the true constraints, not raw chip counts. He suggests memory suppliers like Micron could capture more value than top social platforms, and explains how background agent loops use surprisingly high amounts of CPU compute to handle file sorting and system actions.

  • Physical energy constraints and bureaucratic permitting are the core bottlenecks limiting AI scaling velocity.
  • Running background software tasks causes agentic workloads to rely heavily on CPUs, reviving demand for traditional hardware layers.
  • Memory-bandwidth constraints give component providers immense ecosystem margin capture.

Breaks down the physical limits, infrastructure margins, and memory-CPU hardware bottlenecks of AI scaling.

43:47-52:04

Inference Clouds, Routing Realities, and Hardware AWS Models

optional

Srinivas analyzes the financial viability of alternative GPU cloud platforms like Nebius and CoreWeave. He clarifies that model routing tools like Open Router are fundamentally solving infrastructure reliability and rate-limit fallbacks rather than algorithmic optimization. He advises specialized clouds to build software services on top of raw rack space to avoid commoditization and capture software-level margins.

  • Pure infrastructure players must develop proprietary software orchestration layers to protect their margins from hardware commodity cycles.
  • API routing networks monetize by bulk-purchasing tokens to provide rate-limit fallbacks and reliable endpoints for enterprise workflows.

Explores the business models of cloud providers and API aggregators without breaking new scientific ground.

52:04-55:52

How Export Controls Accidentally Accelerated Chinese AI

watch

Aravind discusses how US hardware export controls forced Chinese developers to innovate around hardware limits. Lacking access to Nvidia chips and HBM, Chinese labs like DeepSeek vertically integrated their codebases directly into alternative domestic silicon, using SSD storage for KV-caches and optimizing attention layers. He warns that this forced self-reliance has created a highly optimized competitor stack backed by swift physical infrastructure permitting.

  • Export controls forced Chinese AI developers to innovate on software architecture rather than relying on raw hardware scaling.
  • DeepSeek optimized attention layers and KV-caches to run on SSD storage rather than high-cost High-Bandwidth Memory (HBM).
  • China's rapid permitting and state-aligned energy grids bypass the physical deployment delays crippling US data centers.

Provides a brilliant geopolitical and structural analysis of the unexpected consequences of Western technology embargoes.

55:52-1:35:05

The Billion Dollar Build, Labor Realities, and Leadership Lessons

watch

The final segment reviews the economic and organizational shifts of the AI age. Aravind details Perplexity's 'Billion Dollar Build' initiative to kickstart ultra-efficient, low-headcount startups with massive compute credits. He argues that rather than causing mass job losses, AI will enable tiny teams to build multi-billion dollar companies. He concludes by studying the focus of Elon Musk and the urgent survival mindset of Jensen Huang.

  • Incentivizing the developer ecosystem with compute credits seeds a decentralized network of high-leverage startups.
  • Relentless deployment speed acts as a vital tool to correct flawed corporate assumptions about product-market fit.
  • Elite leadership requires identifying the single limiting bottleneck of a system while ignoring distracting sub-problems.

Synthesizes low-level founder techniques with broad macro-level organizational scaling principles.

Key points

  • Token Value per Watt per User as the Metric of AGI Economics — Value does not reside in generic model parameters but in the orchestration layer that routes complex execution tasks across local client chips and server-side model nodes to optimize energy consumption.
  • Geopolitical Backfire of Hardware Export Controls — Western export bans on Nvidia GPUs and high-bandwidth memory (HBM) forced Chinese developers like DeepSeek to optimize underlying software stacks directly for restricted domestic hardware platforms.
  • Continuous Background Cron-Jobs vs. Reactive User Search — The transition from single conversational search queries to continuous, autonomous multi-agent background loops is driving exponential enterprise token consumption.
  • Power Access and NIMBYism as AI Scaling Limits — Terrestrial data center buildouts are primarily bottlenecked by physical power allocation, regulatory permits, and public resistance based on misinformed energy-grid and water resource anxieties.
the single most important metric in AI is token value per watt per user Aravind Srinivas
moving fast is a way of expressing humility because you you're constantly making contact with the world and trying to question your assumptions Aravind Srinivas

AI-generated from the transcript. May contain errors.

0:00

I have nothing to lose.

0:02

I came from nothing. I never even

0:04

imagined myself to be doing all this.

0:06

>> A $20 billion company,

0:08

>> $45 million users,

0:10

>> over a billion searches a month,

0:11

>> built in 3 years by 400 people.

0:14

>> These numbers like doesn't motivate me.

0:15

It's hard to get motivated by wealth.

0:17

You want to get motivated by impact.

0:19

>> This is perplexity with co-founder and

0:21

CEO Aravven Shrinus.

0:23

>> No one's ever in a comfortable position

0:25

that no one can relax.

0:26

>> They forced Google to redesign their

0:28

homepage. then bid $34 billion to buy

0:31

Chrome.

0:31

>> More than their own valuation.

0:33

>> Perplexity changed Google.com more than

0:35

any product manager at Google has ever

0:37

done. Now you look at AI mode, it looks

0:39

exactly like Perplexity.

0:40

>> He doesn't do defense. He doesn't do

0:42

comfortable. His words, attack, attack,

0:44

attack. That's my motto. Go all in and

0:46

try your best. Be on the offense all the

0:48

time.

0:49

>> You know what I hate with podcasts? When

0:50

people sit on the fence. Aravind has

0:53

really strong opinions in the show

0:55

today. He says that Micron will be more

0:57

valuable than better. He says that the

0:59

resistance to data centers will continue

1:01

and get worse. He says the biggest

1:03

problem today is a lack of power. He

1:05

claims that perplexity has changed

1:07

Google more than any Google PM. You want

1:10

opinions? This is the show for you.

1:13

Ready to go

1:25

Ara. Dude, I am so excited that we get

1:27

to do this. We've done one remote and

1:29

then we did one at Founders Forum last

1:31

year. So, thank you so much for joining

1:32

me in person.

1:33

>> Thanks a lot, Harry.

1:34

>> Dude, I It's a weird start, but just

1:36

roll with me on it. I asked this of the

1:39

best founders that I meet. Are you

1:40

motivated more by the fear of failing or

1:43

by the thrill of winning?

1:44

>> Thrill of winning.

1:46

>> Why? Because I have nothing to lose.

1:49

I came from nothing.

1:52

Like I I I never even imagined myself to

1:54

be doing all this. So my life has

1:57

already been extraordinary u beyond any

1:59

level of imagination. Um I was just in

2:02

India like you know doing my undergrad

2:05

and you know just just training neural

2:07

nets with graphics cards that people in

2:09

the labs were using for playing video

2:11

games. It was all for fun and um you

2:14

know my path led me all the way here. It

2:17

was never like a mo my for my mom just

2:19

getting a job was success because we

2:21

were not we were financially lower

2:24

middle class in India which is not even

2:28

like lower middle class in UK or the US

2:31

and so from there all we wanted to do

2:33

was get a job in Google being an

2:35

engineer at Google was considered a win

2:38

and so I'm I'm already doing remarkably

2:40

well compared to that ambition we had as

2:43

a family so there's really nothing for

2:47

me to lose. That's why anytime I try to

2:50

act like I'm trying to avoid failure,

2:52

I'm being on the defense. I remind

2:54

myself that like that's the stupidest

2:56

thing to do. Like you know, you you it's

2:59

better go all in and try your best. Be

3:02

on the offense all the time. Attack,

3:03

attack, attack.

3:04

>> When you review then what are you not

3:07

being aggressive enough on today? Maybe

3:10

in the early days we be very very loud

3:12

on social media talking about perplexity

3:15

versus Google and I used to do that

3:16

myself a lot. So and some people don't

3:19

like me for having done that. Today I'm

3:22

a lot more measured in how I talk about

3:24

our products or competitors and stuff

3:25

like that. But it's not a lack of

3:27

aggression or anything. Um it's just

3:30

that like it's that that is boring.

3:31

People already heard that enough from

3:33

me.

3:34

>> Do you regret the being so bold in your

3:36

messaging?

3:36

>> No. So it's not a nuance and maturation

3:41

of message. It's a that stale and I need

3:44

something new.

3:45

>> Not just that, I I kind of don't think

3:47

it's a relevant framing anymore. We

3:50

worked on search. Perplexity started out

3:52

as search. We built the first answer

3:54

engine in the world that people know

3:56

perplexity even today if you mention the

3:58

name perplexity people would think oh

3:59

that's an answer engine. We built a lot

4:02

more things after that. We built a lot

4:03

of agents, browser agents, deep

4:05

research, computer. We built so many

4:08

products after that but we're still

4:09

known for that first product and um the

4:13

mark has already been made the we

4:16

changed the road map of Google. You

4:18

could argue that I or the company

4:21

Perplexity changed Google.com more than

4:24

any product manager at Google has ever

4:26

done.

4:26

>> Make that argument for me.

4:28

>> Well, they never that nobody ever wanted

4:30

to ship an answer engine at Google.

4:32

Nobody like nobody wanted to tinker

4:36

anything on on the interface that made

4:38

them $250 billion a year. And uh and

4:42

then now you look at AI mode,

4:45

it looks exactly like perplexity.

4:48

There's there's not even any difference

4:49

like the font, the citations, the

4:53

specific bolding of inline text, inline

4:55

hyperlinks, um suggested follow-ups, the

4:59

whole experience is literally looking

5:01

like perplexity except it's still not as

5:04

good. And so

5:05

>> is that bad or good for you that they

5:07

learn from you and adapt?

5:09

>> It's it's it's it's both good and bad in

5:11

the sense, you know, you have to

5:13

obviously we I knew this like around end

5:15

of 2024 this is going to happen. So, it

5:18

never caught me by surprise at all. Um,

5:21

it was just a matter of time. I still am

5:23

I'm am surprised that the quality is

5:26

still not there cuz I I regularly test

5:29

every product out there. And u but I'm

5:32

I'm happy that honestly uh they they

5:35

they changed Google to be what it should

5:38

be. And um I believe that the frontier

5:42

is where the money is. The frontier in

5:45

AI is not about answering questions

5:47

anymore. It's about actually going and

5:49

doing work for you. We, you know, like

5:51

we still have the state-of-the-art deep

5:53

research the world will. And that's

5:56

actually where people subscribe to pay

5:58

for our pro or max products is not for

6:00

getting answers in in the in the

6:03

traditional way. They're asking for

6:05

sophisticated research reports. They're

6:07

asking for agents that go and do things

6:09

for you. And so we wouldn't have been

6:12

able to do all that if we were sitting

6:14

in 2024 thinking we have everything

6:17

settled here. We're we're good and

6:18

comfortable. No, we it the answer engine

6:21

was always a lead genen for the frontier

6:25

products we build. You need something,

6:27

right? Like think about it. Every

6:28

company needs to have one successful

6:31

product to build the next set of

6:33

products. And in AI, nobody can sit

6:37

comfortably thinking they have it all

6:39

sorted out, including Anthropic. If

6:42

Anthropic thinks cloud code is already a

6:44

win, in 6 or 12 months from now, they

6:47

won't even be around. And so that's the

6:49

uncomfortable it it's it's it's it's an

6:51

uncomfortable fact about the whole

6:53

field. Would you argue today, you just

6:56

told me, if you don't mind me quoting

6:58

you here, um, you just told me before we

7:00

started, that you think OpenAI isn't

7:02

ready for an IPO.

7:05

Um would you have believed you would be

7:07

in a position to say this two years ago

7:10

when nobody h wanted to deal with any

7:13

product other than chat GPT.

7:15

Think about it. So anyone even in such a

7:19

massive advantages position can be in a

7:23

can be put in a position where they're

7:24

no longer the kings. They're fighting

7:26

from behind. Right? So that's the state

7:29

of the field. That's just it's less

7:31

about perplexity or anthropic or open AI

7:34

not having modes or having modes.

7:35

>> Can I push back on you that I would I I

7:38

would stand by two years ago even when

7:40

they were a do and they are still a

7:42

dominant consumer product but I would

7:44

stand by it because I don't think they

7:45

are financially ready when you look at

7:47

the balance sheet of that that

7:49

>> maybe maybe I'll decouple that. I'll

7:51

decouple that.

7:52

>> Let's decouple that being like financial

7:54

readiness for an IPO

7:56

>> versus perception of a dominant leader.

7:59

Yeah.

8:00

>> Do you perceive them as a dominant

8:02

leader right now?

8:03

>> Yes.

8:04

>> In what?

8:05

>> Consumer search.

8:06

>> Well, except there's no money there,

8:08

right? Because it's been commoditized.

8:11

So, it's it's always a legion. Like, for

8:13

example, why why why are they going all

8:15

in on Codeex? Cuz that's where the money

8:16

is. And uh we're doing the same on

8:19

computer. Anthropic is doing the same on

8:21

cloud code. Google doesn't yet have a

8:23

product in this category, but I'm sure

8:25

they're going to come after that. Meta

8:27

is trying to launch Hatch for $200 a

8:29

month. You You see that? You see what's

8:31

happening, right? So, nobody

8:33

>> But there has to be more money than just

8:36

code codeex claw.

8:38

>> It's not about It's not about code.

8:40

That's the main thing. The the money at

8:42

least in non-advertising.

8:45

I'm not talking about advertising

8:46

revenue. In non-advertising subscription

8:49

or usage based revenue, the money is in

8:52

whatever is the frontier. And today the

8:55

frontier is about doing going out there

8:58

and doing things for you. And uh

9:00

>> do you not think then that there will be

9:02

a 100 to20 billion advertising business

9:06

for open AI? It's

9:08

>> yet to be proven. Let's let's work

9:09

through the categories of advertising.

9:12

Um who's the number one advertiser on

9:14

Google? Amazon. It's a number two.

9:18

Booking.com.

9:20

Number three or four I think Expedia.

9:22

So, um, how much do you think

9:24

Booking.com spends on Google? 16

9:26

billion, something like that. Something

9:28

some some some crazy amount like that.

9:31

Um, and, uh,

9:33

um, how do you book your hotels or

9:36

flights today? Do you book it on chat

9:38

GBT or do you book it on Google?

9:40

>> Google.

9:41

>> Why is that?

9:42

>> For me, I actually like discovery. I

9:44

would like to see the options.

9:45

>> Exactly. Right. So the interface the

9:48

interface is less about conversations

9:51

and more about exploration.

9:53

So when when the decision making is more

9:55

subjective and vibes based, you don't

9:59

need an objective

10:01

answer engine. And and so it's it's it's

10:04

and and and you think about the other

10:06

category of advertising, direct to

10:07

consumer products, fashion. Where is

10:10

most of that advertising budget going

10:12

into? It's going to meta, Instagram,

10:15

because you're just browsing. You're

10:18

just like doom scrolling or whatever

10:20

they call it, right? And so, uh, the

10:22

chat interface doesn't capture that user

10:26

intent, that user behavior right now,

10:29

which is why it was never a great fit

10:32

for advertising. And u, it also

10:35

fundamentally corrupts the trust that

10:37

people have when they go into a product

10:39

and they want the accurate answer, which

10:42

is what, you know, perplexity is known

10:43

for. Um, and then you're like, "Hey, by

10:46

the way, you know, you ask for the most

10:49

um highest like like mo best protein

10:51

shake, but by the way, these are good

10:54

protein shakes you can check out." Like

10:56

it it it kind of like hurts the trust

10:59

that people have in your platform and

11:00

your product. And so, um, that's another

11:04

reason why, if you think about it, like

11:06

like what Meta or like I think some

11:08

other companies in the past have tried

11:10

to put ads inside, um, messaging apps

11:14

and emails and it's never really worked

11:16

out. Um, it it works out in China

11:20

in WeChat because there's no other way

11:24

for them to fund the whole thing, you

11:27

know. So the the the whole economy and

11:30

and and user sentime user behavior has

11:33

been optimized around gamifying. It's

11:36

not how things work in America. So I I'm

11:39

I'm I'm bearish on advertising to really

11:43

take off in in the chat interface. I I'm

11:45

happy to be proven wrong there, but I'm

11:47

bearish on that.

11:48

>> There there are two areas that I want to

11:49

unpack there. The first and just taking

11:51

them kind of chronologically and how you

11:52

said them, money's in the frontier. The

11:55

more I hear this kind of the more I

11:57

question it because I think that we

11:59

dramatically overestimate

12:02

how important frontier models are to do

12:04

quite basic work.

12:05

>> Yeah. So frontier doesn't mean a

12:08

frontier model. Frontier just means

12:11

whatever is the frontier outcome you can

12:13

have right now with AI.

12:15

The Greg Brockman recently tweeted the

12:18

model is no longer the product, right?

12:20

Um, and it's funny because you you know

12:24

that as as a leader of a frontier lab,

12:27

he has all incentive to say the model is

12:29

the product and and that's what Google

12:31

people tell. I think one of the Google

12:33

people keeps tweeting that model is the

12:35

product. I forgot who. Um,

12:38

and so the reason Greg is right is

12:41

because um, if you take codeex or

12:45

perplexity computer clot code, what is

12:47

that? It's it's an orchestration system,

12:50

right? It it takes a model, pairs it

12:53

with an agent harness.

12:56

And what is an agent harness? Think of

12:58

it the simplest way of describing it is

12:59

like rules for how the agent loop should

13:02

run. What are all the skills and sub

13:05

agents and connectors and tools it

13:07

accesses? And uh without the harness you

13:11

don't necessarily capture and convert

13:15

the intrinsic intelligence in the model

13:18

into valuable output tokens.

13:21

The output tokens if you're if you're

13:23

literally just a reseller of model

13:26

tokens you have no business because the

13:28

model will get commoditized. So even if

13:30

you're a model builder, you don't have a

13:31

business. As an infra layer, you have

13:34

some business on serving those output

13:35

tokens. But as an application layer or a

13:38

model builder, you don't really have a

13:39

business. If you're just a reseller of

13:40

tokens that come directly out of the

13:42

model, you have business if you know how

13:46

to take the model ground it in valuable

13:50

context, orchestrated with a really good

13:53

agent harness

13:55

um connected to the right set of tools

13:56

and connectors whether it's personal

13:58

connectors or business connectors and

14:01

provide the experience to people in one

14:04

single unified system.

14:07

And uh the way we differentiate

14:09

ourselves at perplexity is we don't just

14:12

orchestrate across tools and files and

14:13

connectors. We also orchestrate across

14:17

models.

14:18

That is the differentiation that

14:20

anthropic and open AAI cannot claim

14:22

because you wouldn't find GPT5I

14:27

inside the cloud code harness. You

14:29

wouldn't find claw opus 47 or 8 inside

14:32

the codeex harness. These are competing

14:35

with each other, right? Whereas you

14:38

would find both these models inside

14:40

perplexity computer and that way we can

14:45

bring down the we can increase the token

14:47

value per watt per user. If you assume

14:52

that

14:53

if you assume that whatever decides the

14:55

dollar like the price the dollars is the

14:58

power watts fundamentally that's that's

15:00

the thing that nobody else can subsidize

15:03

other than the government. Um you know

15:06

that whoever pro you know provides the

15:09

most valuable output tokens with the

15:11

least amount of power expended to

15:14

produce them generates the greatest

15:17

value to the end user and has the most

15:20

pricing power has the most value and so

15:23

that that that is the orchestration

15:24

problem to solve who the one single the

15:27

most important metric in AI is token

15:30

value per what per user. What does it

15:33

mean for the value of open AI and

15:35

anthropic? If model is not the product

15:38

and it becomes a utility, something you

15:40

can switch into and switch out

15:41

>> interface.

15:43

Everyone thinks we're all building the

15:45

model layer or the race. We're not

15:47

actually. Um, I would even argue that

15:50

building models is a way to stay at the

15:53

frontier, but you have to own an

15:56

interface

15:57

in which valuable AI output tokens are

16:02

generated, the most valuable tokens. It

16:05

doesn't have to be the product. This is

16:07

the single most important thing to like

16:09

you know unlearn for most founders and I

16:12

had to do it too which is to be

16:14

successful in AI product layer whether

16:18

you're a model builder or not it's not

16:21

about building something that gets a

16:23

billion users that mentality has to

16:26

completely shift

16:28

there are a few power users who are

16:30

propelling this token economy right now

16:34

if you look at like all these um crazy

16:36

stories of how there's this one engineer

16:38

who got Amazon to spend like half a

16:40

billion dollars in a month because of

16:42

some stupid way they set up like agent

16:44

loop inside cloud code. Okay, maybe

16:46

that's a mistake, but there are real

16:47

engineers in meta in in other companies

16:50

spending like 10 million a year per

16:52

engineer on on on these, you know,

16:55

coding tools. There are users in

16:58

Perplexity Computer. Um, there's one

17:00

user, I think, who spends upwards of

17:02

like $10,000 a month, something like

17:05

that. Crazy. And and and not like

17:08

wasting it. They're not wasting money.

17:11

Their business runs using agent loops

17:15

that are running inside these harnesses.

17:17

And they use these products in

17:19

sophisticated ways that I I I couldn't

17:22

even conceive when we were building the

17:24

product ourselves. Even internally

17:26

inside our own company, there are some

17:29

people who have set up these kind of

17:30

like multi- aent hierarchy and agent

17:33

loops that looks like its own software

17:35

architecture.

17:37

And I often just ask these guys to come

17:39

explain to the rest of the company, hey,

17:41

like what are you doing with these

17:42

tools? Like you clearly are consuming it

17:44

way over, you know, what we thought the

17:47

average person in the company would do.

17:49

And the single biggest differentiation

17:51

between those who use agents a lot and

17:53

those who don't is whether they run

17:56

repetitive cron jobs

17:59

like whether you use AIS as one-off

18:02

tasks. You just delegate a task and then

18:04

it gets done. That's like kind of using

18:07

it for like deep research or like

18:08

whatever, right? Like one single task

18:10

versus the AI is like continuously

18:11

monitoring something for you. The AI is

18:14

continuously like triggering based on

18:16

certain events and going and doing

18:18

certain things, giving you alerts like

18:22

you set up workflows that keep running

18:23

for all the time. Every time you get an

18:26

inbound email like it triages or every

18:28

time there's a latency spike, it has to

18:31

identify which part of the codebase

18:34

caused that, it has to go and do the

18:36

root cause analysis and then identify

18:39

the right engineer. all these things the

18:42

this is where the frontier is and and so

18:44

uh going back to my main point.

18:47

These products are not going to be used

18:49

by uh you know 100 million people but

18:53

they will generate revenue that's going

18:56

to be higher than the advertising

18:57

revenue of Google or Meta. It's going to

19:00

happen.

19:00

>> Completely understand what you say

19:02

there. I do just want to focus in on a

19:04

specific element there when you were

19:06

saying like the power users because I

19:07

think one of the core numbers is

19:09

actually Mark Benov said they spend 300

19:11

million on anthropic which works out to

19:12

be about

19:13

>> it'll be interesting to know from him if

19:16

that 300 million came from you know what

19:18

is the distribution across employees

19:21

>> so it works out to be that was on

19:22

developers within Salesforce so it's

19:24

about 3.8% 8% of developer salaries.

19:26

What percent of developer salaries do

19:28

you think will be spent on tokens in 24

19:32

months time? Cuz that fundamentally

19:35

changes the value of open air and

19:36

anthropic. If it stays at 3.8%.

19:39

They will not be $5 trillion companies.

19:41

But if it's 100% like Brandon at Mccor

19:44

said it will be in a year, they they'll

19:46

be 10 trillion companies. Well, um I

19:49

think they can certainly beat $10

19:50

million companies whether it's going to

19:52

be um a full percent of the developer

19:55

payroll today or not because there's a

19:57

lot of non-developer

20:00

work that'll also be done with a agents.

20:04

Um and that's actually what we focus on

20:06

for Perplexity Computer. We're not going

20:08

after the developer market. We're going

20:10

after anything that developers don't

20:13

non-developers do. basically um your

20:16

your finance department or your corp dev

20:18

or your like um sales reps or your data

20:21

science teams um your your research

20:24

analysts I think that's actually even

20:27

bigger market that it's it's it's not

20:30

even like like think of it as like clot

20:33

code multiplied by 10 that that's the

20:35

size of that market

20:36

>> if I push you on developer salary spend

20:39

what percent of token spend as a portion

20:42

of salary do you think We'll see in 24

20:44

months.

20:45

>> It's hard to say.

20:47

I think the costs are going to go down.

20:49

That's why it's hard to say.

20:51

>> You think the cost will go down? Cuz

20:52

this is the kind of the challenge that

20:53

we've had. We thought when we went from

20:54

chat to agent that costs would go down

20:56

and token costs would go down. They've

20:58

gone up.

20:59

>> Yeah. For now.

21:00

>> Help me understand that and how that

21:02

changes.

21:03

>> I think in software um you kind you kind

21:06

of want to pay for the frontier. Um it's

21:10

kind of like if you know some engineer

21:12

is awesome. If you know you have like

21:14

the next Jeff Dean,

21:16

would you rather hire that person and

21:18

not hire five five people who are medium

21:21

engineers but not Jeff Dean level with

21:24

the same amount of budget you have? Yes.

21:26

Right. Let's say you had a million

21:28

dollars. You could hire five people

21:31

worth 200K or you could hire one Jeff

21:33

and pay them a million. What would you

21:35

do?

21:36

>> One Jeff.

21:37

>> Yeah. So I think you would pay for the

21:39

frontier. Um but what stays frontier

21:43

keeps changing. Um in in 12 months from

21:46

now let's say thought experiment there

21:49

is an open source model as good as Opus

21:51

48.

21:51

>> Mhm.

21:52

>> And um you still have to pay for

21:54

inference. You know nothing is truly

21:57

free but it's going to be like let's say

22:00

10 times cheaper than Opus 48. And when

22:02

you pair it with the right agent

22:04

harness,

22:06

you know, and all the connectors,

22:07

GitHub, everything, all your developer

22:10

workflows work fine, why would you um

22:13

assume that the token spend is going to

22:15

be still high? It's not going to be for

22:18

the same things you're doing today. It's

22:19

not going to be. But there might be a

22:21

different set of things you might do

22:24

with the frontier that you're not

22:26

conceiving today. Uh my prediction would

22:28

be soft a agents that are like

22:32

completely autonomous software

22:33

engineers. Today I think we we're all

22:36

using tools like cloud code or codeex to

22:40

write code but not as literal software

22:43

engineers. There is a large sway of

22:44

people that is now bearish on your

22:46

frontier models who open eyes and your

22:48

anthropics because they're realizing

22:49

that you can actually do a lot with open

22:51

models for a fraction of the price. What

22:53

you're saying is actually that is true

22:57

but

22:58

>> we will still pay for the frontier and

22:59

so they will still ac

23:02

and and I think this distinction

23:06

it feels like a contradiction. It's not

23:08

though. It feels like two things cannot

23:12

be true simultaneously.

23:14

But that that's not quite the case. In

23:16

fact, I would argue that the frontier is

23:21

increasingly going to be a thing that um

23:24

very few individuals might even want.

23:26

Like you could argue that after a point

23:28

like it's not even interesting that AI

23:30

can write software. You you we've

23:31

normalized it, right? Let's say let's

23:33

say that that's going to be the case.

23:35

Instead of companies being built with

23:36

like tens of thousands of software

23:38

engineers unlike the past, there'll be a

23:40

lot more companies with smaller software

23:43

teams and each of us will be using a lot

23:45

of AIS. So um that's actually good for

23:48

the world. We'll be seeing a lot of

23:50

different businesses. We'll be seeing

23:52

allocation of software labor in places

23:56

that was never even possible. and and uh

24:00

whatever is a frontier is going to be

24:02

things that kind of like AI is going and

24:05

designing chips, AI is designing drugs,

24:08

AI is figuring out how to build robots,

24:11

AI is figuring out how to cure cancer.

24:14

These are applications where you don't

24:17

have like 10 million users. It's like a

24:20

few companies, but the effect of that

24:24

work will touch a lot of human lives. I

24:27

think to me that that's where the

24:28

frontier is headed. Um you could also

24:31

see that from the moves that Frontier

24:33

Labs are making. Anthropic bought um a

24:38

wet lab could be for the talent could be

24:40

for the infrastructure to run like wet

24:42

lab experiments but imagine taking all

24:45

those tokens and putting it in the mid

24:47

training instead of just tokens from

24:49

GitHub right um so then that's going to

24:52

produce something interesting.

24:54

>> Don't laugh. Is there an asimtope to

24:56

frontier problems to be solved? I know

24:58

that sounds ridiculous, but if you are

25:00

continuously on the chase for the next

25:01

frontier problem, you get to cancer, you

25:04

get to climate change. And my word, I

25:06

hope they solve both in like heaven,

25:08

that's a huge amount to solve.

25:10

But if you're on the treadmill of

25:12

continuously, is there an asmtope to

25:14

that? Do you see?

25:15

>> There's no there's no mathematical

25:17

argument

25:18

to there being a cap on the amount of

25:23

economic value

25:25

one can create with with with um AGI or

25:30

ASI like systems. Um and Elon Elon has a

25:34

good argument for this like like where

25:36

he says money loses all meaning in a

25:39

post AGI economy

25:42

because you'll be producing

25:45

an abundance of energy and labor

25:48

and fundamentally the economy is

25:49

grounded to energy and labor. If you can

25:51

produce an abundance of them,

25:54

well what what meaning does money have?

25:57

Um and um and so I I don't think we run

26:01

out of things to solve at the frontier.

26:03

I think we're always going to be creat

26:05

like like why why would why did people

26:06

even want to understand the universe?

26:08

Like like why did we want to understand

26:10

subatomic particles, quantum physics,

26:13

black hole theory, um you know the

26:16

origins of the universe like what what

26:18

what is the purpose? But we still went

26:21

ahead and did it because that's kind of

26:23

what the purpose of humanity has always

26:26

been to understand the unknown. You

26:29

know, David Deutsch is famous for saying

26:30

this, right? Like we are the only

26:32

species capable of being curious about

26:35

what is already familiar. Like you can

26:37

stare at a fruit and you know that it's

26:39

a mango and like you know exactly like

26:42

how it tastes, you know how it looks,

26:43

you know the shape, you know what

26:45

seasons it grows and and and stuff, but

26:47

you can still look at it and ask one

26:49

more question about it that you haven't

26:52

asked before. Other animal species

26:54

cannot. Once they kind of once they have

26:57

it in their mental model, what it looks

26:58

like and touches and feels like, they're

27:00

going to ignore it. It's not it's no

27:01

longer interesting to them. Kashi, you

27:03

mentioned about agent usage and you said

27:06

if you do repetitive tasks versus one

27:08

off say chron jobs, you know, I think

27:10

Sam said it's we're going to have 24/7

27:12

AI and um yeah, they've talked about a

27:15

hardware product that's going to come

27:16

out.

27:17

>> Do you think we will have continuous

27:18

agents running?

27:20

>> Yeah. In so

27:21

>> and I think that's kind of why I believe

27:25

>> the orchestration problem I I talked

27:27

about maximizing the token value.

27:30

>> Can you just help me? Sorry. when you

27:31

say the orchestration problem.

27:32

>> Yeah. So, so okay. So there are like

27:34

four objectives

27:36

um accuracy, intelligence and accuracy

27:39

and then privacy and cost.

27:42

You know these are all competing with

27:44

each other. So you can you could argue

27:45

that um you could max out on

27:48

intelligence and accuracy by building

27:50

giant giant data centers and spending a

27:52

lot of power to uh you know run them and

27:56

u you could miss out on privacy and

27:59

costs cuz everything will be centralized

28:02

and and and you're going to be paying a

28:03

lot. Um, you could argue that everything

28:06

can run locally and so that'll be good

28:09

for privacy and cost but may not be

28:12

frontier intelligence, may not be

28:14

frontier accuracy. So the solution is to

28:18

figure out a sweet spot. You know, use

28:21

local models when necessary, use server

28:23

side models when necessary and

28:25

orchestrate across local models and

28:27

serverside models. Uh, grounded in

28:30

valuable personal context. Sometimes the

28:32

intelligence might already be there but

28:34

the system might not work that because

28:36

the harness isn't grounded in the right

28:38

set of tools right so build a worldass

28:42

harness that can even make an okayish

28:44

model appear great and be able to use

28:47

the right model for the right task and

28:49

the right part of the task sub aents and

28:52

and even like utilize the compute we all

28:55

have in our own devices all that that

28:57

you know doesn't need to be always on a

28:59

server that is an orchestration from a

29:01

router, an awesome router, a master

29:03

orchestrator router. Now, um, if you do

29:08

that, you can realize the vision of a

29:10

24/7 AI without people freaking out

29:14

about going bankrupt

29:16

because no one's going to be able to

29:18

afford a 24/7 AI, Frontier AI, running

29:21

on the server. Imagine you turned it on

29:24

and you you could never switch it off

29:26

unless something crazy happened. Um,

29:30

you're the the the the the thing that

29:31

most people worry about those AI is

29:33

like, "Oh, what if it does something

29:35

crazy?" But the real concern actually is

29:37

the cost. Um nobody's going to be able

29:39

to afford it a cron job at the fidelity

29:43

of few seconds, you know, um that that

29:46

runs all the time. And so, um the

29:49

bottleneck there is actually

29:51

orchestration and local compute. And so

29:54

I believe like like um one needs to

29:57

build a continuously learning

30:00

local model um that can save you on like

30:05

compaction

30:07

context windows. So you and and and try

30:10

to preserve as much compute locally and

30:13

rely on the server side frontier only

30:15

when necessary and keeps learning, keeps

30:17

adapting, keeps evolving. And that model

30:20

um is not just a model. It's a model

30:22

plus the harness plus the local chip and

30:25

the compute and the ecosystem of devices

30:27

it controls. That system um is going to

30:30

be your own intelligence. Essentially

30:32

the data center moved to your local

30:34

device and you you get to control it.

30:37

You get to own it. You don't get to

30:38

worry about somebody like you know

30:40

spying on you or looking at all your

30:42

tokens. Very valuable personal tokens.

30:45

Imagine you have like very sensitive

30:46

deal materials. Let's say you're doing a

30:48

deal um and then um a Frontier Lab has

30:52

all your tokens that you use to like

30:54

write a memo.

30:56

Imagine somebody could hack into that

30:57

server and steal your deal from you. You

31:00

wouldn't want that, right?

31:02

>> I'm going to be honest. I think there's

31:03

much more valuable things for people to

31:05

steal from

31:07

London based VC.

31:09

>> But yes, I can figure you're not just

31:10

yet another London VC. You have like a

31:12

$400 million fund last time read it. So

31:15

>> imagine like you know you're

31:17

>> already making your moves for the $4

31:18

billion fund right so so everyone has

31:22

certain levels of like you know

31:23

sensitive stuff and and so I think

31:25

that's where I believe that the 247

31:28

always on agent is going to be realized

31:32

by the company that wants to play the

31:34

role of the orchestrator not the model

31:37

builder not the frontier model builder

31:39

but the orchestrator and uh and I think

31:42

that's what that's what we want to do um

31:44

computers has been positioned explicitly

31:46

as the agent orchestrator. The the

31:49

musicians in the orchestra are these sub

31:52

agents that utilize these different

31:55

models. Think of them as the instruments

31:59

and uh the tools, the connectors, the

32:00

models. These are all the instruments

32:02

and the musicians are the sub aents and

32:05

the symphony is the work and the system

32:08

is the orchestra and and computer is the

32:11

orchestra conductor. that that that's

32:12

how it's been positioned. So what it

32:15

orchestrates

32:17

keeps evolving, right? It it changes. It

32:20

changes from,

32:22

you know, models to files to tools to

32:24

chips to devices. But but it doesn't

32:26

even matter like you don't care as long

32:28

as it orchestrates things correctly and

32:30

and and and maximizes the token value

32:33

for what per user. If you can solve this

32:36

problem, you will capture the most

32:39

economic value in AI long term.

32:41

Shortterm it might look like oh like

32:43

this other lab's revenue is growing you

32:45

know exponentially this that but long

32:47

term this is the one objective that

32:49

truly matters.

32:50

>> Who is best positioned to do that?

32:52

>> I believe it's us cuz you have the

32:54

incentive of not token maxing you have

32:56

the incentive of delivering the most

32:59

value to the user like we we every time

33:02

any part of the AI stack improves

33:06

our product improves. Um since the

33:09

beginning of the year, Anthropics models

33:10

have made tremendous progress. But

33:13

what's also true is that our revenue has

33:16

more than tripled since the beginning of

33:17

the year. Tripled since this beginning

33:19

of the year and uh we and a lot of

33:23

thanks to model progress made by

33:25

anthropic and we also brought our

33:27

burndown thanks to OpenAI competing with

33:30

them and bringing down the cost of the

33:33

same capability. And now with progress

33:36

in open source and local models and

33:37

local chips, we're going to move some of

33:39

the inference back to the local devices

33:41

and bring down the cost even more. So

33:45

every time any part of the AI stack,

33:47

whether it's chips, models, harnesses,

33:50

any of these gets better, our system

33:53

improves tremendously. And if our system

33:55

improves tremendously, our users love it

33:56

and they pay more. They spend more and

33:59

so our business grows. So I think to

34:02

your question of who's best positioned

34:04

to win in that world for that objective

34:07

of being an orchestrator is the one

34:10

whose product or business

34:13

benefits from other people's progress at

34:16

any layer of the stack. And so if Jensen

34:20

produces a better chip, it's great for

34:21

us. If Dario produces a better model,

34:23

it's great for us. If Apple produces a

34:26

better device, it's great for us. And

34:28

like I I love the fact that we are able

34:30

to be a very positive player at every

34:34

layer of the stack and not have to rely

34:38

on any one person to win. When we look

34:40

at the different providers that we said

34:44

kind of server side versus you on device

34:46

when we look at server side a lot of

34:49

people talk about an AI infrastructure

34:50

bubble which I think is funny stupid and

34:52

moronic. To what extent do we have a

34:54

data center supply problem today from

34:56

what you see? I think the biggest

34:58

problem is actually in power. So what

35:01

let's break down what is a data center.

35:03

Is it like that you just buy like a

35:05

bunch of chips from Dell or Super Micro

35:08

and No, that that's just one part of it.

35:11

You actually have to go secure land or

35:15

you have to lease something, lease a

35:16

property and uh you have to buy a bunch

35:19

of turbines to generate power or you

35:23

have to work with like power suppliers,

35:26

grid suppliers and uh you also have to

35:29

work on cooling. So there's a lot of

35:32

other work you got to put in that is far

35:34

far slower. you have to get permits to

35:36

do all these things and uh and so

35:39

usually what's happening is um there's a

35:42

lot of lead time to doing this and um

35:46

the models that are um already in use

35:49

today these have been trained in the

35:50

hopper generation so the blackwell

35:53

generation model I think the first model

35:55

that's blackwell generation category is

35:58

um mitos

36:01

and it's already scary like people are

36:02

already like freaking out about it So

36:05

imagine that um everyone pre-trains a

36:08

model um on like a million or like you

36:11

know hundreds of thousands of black

36:12

wells now those models are going to be

36:14

far more powerful than what exists today

36:16

and then the ver rubins are coming next

36:19

year in full capacity like like all the

36:21

data centers of wear rubons will be in

36:23

next uh you know used next year that

36:26

model will be even more powerful. So I

36:28

think we there is a certain physical

36:31

buildout time

36:33

that always bottlenecks frontier

36:37

capabilities. That's why there's a value

36:39

in that layer. Whoever knows how to do

36:42

this puts it puts together a bunch of

36:45

GPUs and chips and networking and power

36:47

and cooling and actually like

36:50

orchestrating all this software layer on

36:52

top and you know is able to convert that

36:55

into frontier output tokens. that that

36:57

that vertical integration has a lot of

36:59

value. So that's why the markets are

37:01

pricing infrastructure companies with a

37:04

higher uh PE ratio

37:07

>> than companies like Meta for example.

37:09

Even though Meta builds a lot of infra

37:11

is valued as a software company. When we

37:14

see like you know Meta's capex spend and

37:16

it wanting to increase in the last few

37:18

days and thinking about raising more and

37:19

more money to increase capex spend I get

37:21

it with a lot of the AI providers like

37:23

your open eyes or Anthropase because

37:25

they are making money from their AI

37:27

products. For Meta, the capex spend

37:29

correlates to increasing accuracy on

37:31

ads, which is like a six to eight% bump

37:34

in revenue. I get it. But for the capex

37:36

spend, it doesn't make sense.

37:38

>> Well, um I I I believe like they they

37:41

are understanding

37:43

what the market's saying. You know, I

37:44

don't think they're

37:47

dumb to not see what what what's being

37:50

said. I think they're introducing a lot

37:51

of subscription products um from what

37:54

I'm reading. So they're definitely going

37:55

to like basically the company needs to

37:58

not just be a social platform maximizing

38:00

engagement and turning that into ad

38:02

revenue, right? And I think um that

38:05

requires them to launch a lot of like

38:07

agents subscription based products and

38:10

maybe even a cloud meta cloud that that

38:14

rents out servers like what Elon's doing

38:15

at SpaceX and and maybe once they do

38:18

that

38:20

the the narrative might change, right?

38:23

But um to go back to my point, it might

38:26

not be inconceivable that

38:29

um

38:31

Micron, the supplier of HPMs, might be

38:35

more valuable than Meta in the next 6 to

38:37

12 months. It's already at like a

38:39

trillion and Meta is like 1.3 to 1.4

38:43

trillion.

38:44

>> Can you help me understand that? Because

38:46

memory is already a massive bottleneck.

38:48

It's increased 5x in price in terms of

38:50

the cogs, right? Um, but people are

38:53

going, "Wow, Micron is fully priced at

38:55

this point." Why is it not fully priced?

38:59

>> Because it's still the bottleneck.

39:01

Whatever is the bottleneck

39:04

will command the price.

39:07

Um, AMD is doing really well because

39:10

CPUs became a bottleneck again. Agent

39:13

loops, agent harnesses are all running

39:16

on CPUs. The tokens are produced by the

39:18

frontier models on GPUs. But whatever

39:21

work like let's say like claude

39:24

generates a coding script that decides

39:26

to download 500 files from different

39:28

websites and then you know munches a lot

39:31

of data and transforms it in certain

39:32

ways and generates a plot and then hosts

39:34

it on a website that you can you can

39:36

share with your people. All that

39:38

computers is running on CPUs.

39:41

Agents are using CPUs more than humans,

39:44

right? And so suddenly there's a rise in

39:47

enterprise CPUs and the beneficiaries of

39:50

these are like Intel and AMD. So then

39:54

they get to be the bottleneck like like

39:55

whoever's going to be the bottleneck

39:57

will win and and so infra is the

39:59

bottleneck right now because there's a

40:01

lot of demand and we just don't have the

40:03

supply and so whoever supplies memory

40:06

SSDs for storage CPU compute suddenly

40:10

these are all like interesting like um

40:13

they're more important than companies

40:15

that are just building data centers and

40:18

not knowing how to turn that into a

40:19

valuable outputs. Do you believe your

40:21

Nebius and your Core Weaves will be a

40:23

sustainable

40:26

multiund billion company in the future

40:28

or is it solving a short-term supply

40:30

problem?

40:31

>> Um I certainly think they can be

40:33

sustainable.

40:34

>> Yeah.

40:34

>> Um I think there are some I don't like

40:38

look I don't know particularly which of

40:40

those is going to win and there's also

40:42

other players like Cruso and um Firebird

40:46

and a bunch of companies. It's all about

40:49

being resourceful. You got to take power

40:51

from areas where there's a lot of

40:53

natural resources

40:55

and the cost to bring up the data center

40:57

is pretty cheap and the time to bring up

40:59

the data center is cheap and your

41:01

service is reliable. Like if somebody

41:03

commits to buying 100,000 GPUs from you,

41:06

um the service should be pretty good. Um

41:08

and uh you should be able to secure the

41:12

supply ahead of time. Plan well. Um and

41:15

I think some companies are even

41:16

innovating at the power layer. You know,

41:19

um generating their own power is one way

41:22

to bring down the margins. Uh and so I

41:26

think there's certainly like value in

41:28

that layer because um it's hard to

41:31

replicate work. That's how I see it. You

41:34

could argue that OpenAI can do all the

41:37

work that Core V was doing and that's

41:40

kind of what they wanted to do with

41:41

Stargate. But why is Corev more

41:44

successful at building data centers than

41:45

OpenAI? It's

41:46

>> hard to do. It's operationally

41:48

intensive.

41:49

>> Yeah, operationally intensive. You got

41:50

to focus. You got to like spend most of

41:52

your time um securing permits like

41:55

figuring out power, figuring out like

41:57

bottlenecks in the supply chain here and

41:59

there um and constantly plan ahead and

42:03

like test all these systems carefully.

42:06

deal with like random physical issues

42:08

that you know arise in like you know

42:11

running a data center there's something

42:13

called TCO you know cost of operations

42:17

>> you got to factor that in so uh that

42:19

said I I I don't think there's value um

42:22

if you're just like a server renter if

42:26

you're just a GPU server rack renter if

42:29

you're just leasing it to different

42:30

companies on certain hourly pricing

42:32

rates there's not a lot of value you

42:35

have to actually build some software on

42:39

top kind of like how AWS did. It's

42:42

called Amazon Web Services, not Amazon

42:44

servers, right? So, um you have to have

42:47

some software orchestration on top that

42:50

allows you to get software margins on

42:52

top of what you're doing. And I think

42:56

that's why you're seeing moves like NBS

42:59

um

43:00

like like going for the AI model

43:02

inference like you know taking open-

43:05

source models or hosting your models and

43:08

and and um that's a business model of

43:11

certain other companies like fireworks

43:12

and you know um ben and all that but you

43:15

could imagine neocloud just going for

43:17

that business.

43:18

>> That was exacting me my question. So, I

43:19

just had the co-founder of Nebius on the

43:21

show and the really clear takeaway was

43:23

the the challenge that he has, which is

43:24

there's a huge amount of money that

43:26

wants just capacity and compute.

43:28

>> Yeah.

43:28

>> With the awareness that he needs to

43:29

build a full stack product if he wants

43:31

to have a long-term sustainable

43:32

business. That was the core realization

43:34

for me. When I look at the inference

43:36

layer, like you said, fireworks or base

43:38

10, how do you think that plays out? Do

43:40

we have standalone hundred billion

43:42

dollar companies in inference alone or

43:44

do we see that commodity?

43:47

I mean you it's just it's all about

43:48

working backwards like what does it take

43:50

to build a hundred billion company

43:52

assume like

43:54

>> 10 billion in revenue

43:55

>> exactly 10 billion in revenue 30 to 40%

43:59

gross margins good amount of net income

44:01

good cash flow okay 10 billion in

44:03

revenue

44:05

um is not that inconceivable for for a

44:08

company that can both do AI hosted

44:11

inference and server capacity and data

44:14

center buildouts

44:16

very operationally well. It's all about

44:18

like you know there there are some

44:20

factors beyond their control like open

44:22

source models continuing to be awesome.

44:24

If open source models stop to actually

44:26

be good where the gap between them and

44:28

the frontier is like more than 12 months

44:30

or like 15 months 18 months then I don't

44:33

think these companies really have a

44:34

business model because um they're not

44:37

going to be able to host they're only

44:39

going to be able to

44:41

rent capacity to open Athropic and so um

44:46

that's exactly what Roman and Nebia said

44:48

he said if consolidation happens and

44:50

there's anthropic and open AAI or two or

44:52

three dominant providers that is the

44:54

biggest threat.

44:54

>> That's correct. Yeah. And so um but you

44:58

got to make a leap of faith assumption

45:00

that you know like the the models from

45:02

China or Nvidia is making good progress

45:04

on their models in Limatron. Um you got

45:07

so there's going to be enough factors in

45:09

the market to keep u consolidation

45:14

as an outcome from from like stopping

45:17

from happening. But you don't control

45:18

your own destiny if you're those

45:20

companies. That that that's basically

45:21

the problem. Totally get that. Okay. So,

45:24

we can have standalone companies that

45:26

are hundred billion dollars in inference

45:27

alone. So, I'm just pillaging you for

45:30

your knowledge. When we look at the

45:31

model selection companies like an open

45:33

router or like factory AI just released

45:36

that kind of model selection or model

45:37

rooting product which did very well on

45:40

launch, is there hundred billion dollar

45:42

companies in the model selection and

45:43

routting business?

45:45

>> Probably not.

45:47

Um I think you can't just be a

45:51

provider of router. You have to use the

45:53

router to produce something meaningful.

45:56

Um actually most of the business value

46:00

of open router is less than the router

46:02

even though the product is called open

46:03

router. It's not routing across models

46:06

there. It's actually just routing across

46:09

different endpoints of the same model.

46:12

So um why okay so maybe let let's let's

46:15

let's ask this question. If you wanted

46:17

to use claude opus or

46:22

um I don't know like GBD55 developer why

46:26

would you not want to just use it with

46:27

your own API key versus using it inside

46:30

open router? Number one argument. The

46:34

single simplest argument as to why you

46:35

would want to do that is

46:38

model fallbacks. Sometimes your API keys

46:40

might not have the rate limits or even

46:43

if you have the rate limits it might

46:45

there might be an error on open AI

46:47

servers that you know don't guarantee

46:49

you the response time you need to run

46:51

your application and uh open router

46:55

would go and earn the uh you know they

46:58

would pay for capacity for like one year

47:00

ahead uh with the funding they have and

47:03

secure the rate limits and multiple

47:05

endpoints across multiple different

47:07

providers of openi models be it Bedrock

47:10

or Azure or OpenAI themselves. And so

47:13

that routing is valuable. It's

47:15

essentially an infra problem they're

47:18

solving which is reliable token supply.

47:21

It's not actually oh like they're

47:24

lowering the cost by deciding if this

47:26

prompt should go to like GPT or clot or

47:28

something like that. That's not what

47:29

they're actually selling to the

47:30

developer. That's not actually the

47:32

business model. And uh and then for a

47:36

lot of these Chinese open source models,

47:38

there's not you probably don't want um

47:41

your API tokens from going to like let's

47:43

say you don't want your API tokens going

47:44

to China. Um and so you and and let's

47:49

say you don't have the bandwidth to work

47:51

with like different inference providers

47:53

or verify who's good and you know who's

47:55

not. You're just thrusting open router

47:57

to take care of all that and then you

47:59

know they're going to like supply the

48:00

tokens to you. So it it's it's routing

48:03

not at the level of like oh like

48:05

deciding which model is cheap for what

48:07

task. It's more like um a reliable token

48:10

supply and I think there's some value in

48:13

that layer definitely uh otherwise they

48:15

wouldn't have these many um users and

48:17

these many trillions of tokens being

48:19

routed a month but it's um it's not like

48:23

you know high gross margins business

48:26

it's the way the business model works

48:27

for them is actually um they would

48:30

secure a discount from the model

48:32

providers by guaranteeing a lot of

48:34

supply but they would still charge the

48:37

user their listing price on the API and

48:40

that difference is their margins. Do you

48:43

understand?

48:43

>> I I I totally get you. We spoke about

48:45

bottlenecks and you said about HPM, high

48:47

performance memory and micron and the

48:48

value that you know they have to say and

48:51

what it can be. What bottleneck will we

48:54

have in 3 years that we're not

48:55

discussing today?

48:57

>> I think power will remain the

48:59

bottleneck. Yeah,

49:00

>> I think it's it feels like that to me.

49:03

Unless something dramatically changes in

49:05

the way data center buildouts happen. Uh

49:09

I actually believe that there will be a

49:11

lot of resistance to building data

49:12

centers. It's because people incorrectly

49:15

think that data centers consume a lot of

49:17

water or eat up a lot of power which is

49:21

both both are untrue. Satya even made

49:24

the statement that it it's like a can of

49:26

water or something uh in terms of how

49:28

efficient these companies are.

49:30

>> Do you think that's why they're putting

49:32

up resistance to them? I don't. I think

49:33

it's cuz it's a symbol of job losses. Uh

49:36

increasing wealth in

49:37

>> it's a lot of things. It's a lot of

49:39

things. Um it's a lot of apprehensions

49:43

um fear about like what's going to

49:45

happen channelizing in so many different

49:48

ways. Um, sometimes it's channelizing

49:50

through hatred for wealth inequality and

49:52

like wanting to tax people. Sometimes

49:54

it's channeling through like concerns

49:56

for the environment and like climate

49:57

change. Um, sometimes it's uh

50:00

channelizing in a way where um you're

50:03

all like oh like the price of the grid

50:05

is going up because you guys are

50:07

building all these data centers and then

50:08

or like I'm paying more for my phones

50:10

and laptops now because the RAM prices

50:13

have gone up because you guys went and

50:14

bought all of it. So I think there's a

50:17

lot of different ways in which it's

50:18

getting channelized but the common

50:20

sentiment is um like like a pretty bad

50:24

sentiment about AI.

50:25

>> Do you think it will be meaningful

50:28

to the development of those data

50:29

centers? I think right now 40 out of 100

50:32

are not being developed because of

50:34

public resistance.

50:35

>> Yeah. So um that that's where the power

50:38

bottleneck is and um you could see maybe

50:42

certain countries sees the opportunity

50:44

for this and um

50:47

um allow these model builders to go

50:50

build data centers there. Um Elon's

50:54

going to space to do that. Um so that's

50:56

going to be an interesting experiment.

50:58

Um because there's a lot of energy from

51:01

the sun that can be harnessed there. and

51:05

uh there's a lot of natural resources in

51:08

other countries. Regulations might be

51:10

more friendly.

51:12

So, we're still going to see data center

51:13

buildouts. It might not happen in the

51:16

US. And um but but the fact that you

51:20

have to solve physical problems like you

51:24

actually have to deal with the supply

51:25

chain, the permits, securing power, like

51:28

making sure like things work and getting

51:31

the lead times lower and lower. You're

51:34

not solving problems like cloning some

51:37

SAS apps here, right? You're or like

51:39

you're building a go to market team or

51:41

like um doing better marketing against

51:44

the competitor's products. Like yes,

51:46

those are also hard problems, but these

51:48

are like much harder problems where like

51:51

you're not in full control of your

51:53

destiny and you need a lot of capital

51:54

and connections and like the right

51:56

people uh sometimes even like political

51:59

help to unlock progress. And so that's

52:03

why this will continue to remain the

52:04

bottleneck in my opinion. And there's a

52:07

lot of risk as well because if you do

52:11

encounter another Deep Seek moment here

52:14

where there's a vastly more efficient

52:16

model that's been built with a very

52:18

different vertically integrated

52:20

architecture

52:21

and you built out all this capacity and

52:24

you're like, damn, that's I overbuilt.

52:27

there's something far more efficient

52:28

that can run on on on people's local

52:30

devices, their MacBooks, their Windows

52:32

PCs.

52:34

Yeah. Like you're probably freaking out

52:36

then. And so you hope that

52:38

>> How likely do you think that is though?

52:40

>> It's probably like 20% 30% chance. The

52:43

reason I think there's some possibility

52:45

is that because of the export controls

52:48

um you are so the deepseek is not

52:53

building with the Nvidia stack. they're

52:56

building with the Huawei stack

52:59

and because there are export controls on

53:03

not just the Nvidia GPUs but also on

53:05

HPMs

53:07

these um architectures that that DeepS

53:10

building are far more like memory

53:13

efficient they made innovations on the

53:16

KV cache to be really small enough that

53:19

you can host it on the SSDs

53:22

and you don't need high bandwidth memory

53:25

for inference time and they're going to

53:26

have a completely different architecture

53:28

for inference, completely different

53:30

architecture for storage cuz they're not

53:33

allowed to use the 3D nans. So their

53:36

architecture is going to look it's not

53:39

just the model architecture. The model

53:40

architecture is already pretty

53:42

different. They've made innovations on

53:43

the attention layer. They made

53:45

innovations on like the the training

53:47

algorithm so that it doesn't consume a

53:50

lot of interconnect capacity. So they

53:52

they've made a lot of they basically

53:53

their whole stack is getting vertically

53:55

integrated to their hardware and their

53:58

chips and their fabs and so on and so

54:00

that's a very different bet from what

54:03

America's making.

54:04

>> Do you think the export controls have

54:06

helped or hurt us? jury still a lot

54:10

shortterm it's helping because the only

54:12

reason I my belief the only reason where

54:16

why there's even like a 12 month gap

54:20

between open source and frontier is

54:22

export controls it's definitely helped

54:24

and and and definitely like companies

54:26

like anthropic lobbyed very hard for it

54:29

but um there is a chance that because of

54:33

that they now get really good at the

54:35

physical layer and One advantage they

54:38

have is they can actually build data

54:40

centers

54:42

a lot a lot a lot faster. Power is not a

54:45

problem. Permits are not a problem.

54:47

People are not a problem. Labor is not a

54:49

problem. Expertise is not a problem. And

54:51

so by forcing them to go out there and

54:54

build all this, you're converting them

54:56

into a far more

54:58

like potent competitor.

55:00

>> Do you think we still dramatically

55:02

underestimate China's capabilities?

55:04

>> I think so. Because if AI is like not

55:06

just digital, that's also physical AI.

55:10

You got to build fabs, robots, chips and

55:14

harness the energy really well and um

55:17

package it into local devices.

55:20

I think they have a lot more advantages

55:22

than America.

55:24

>> How important is it that we have our own

55:25

TSMC in the US? So TSMC is actually

55:30

there is a fab of TSMC in Arizona. Like

55:34

not a lot of people talk about this but

55:36

TSMC is investing like $150 billion into

55:39

that into into building American fabs

55:42

and um they've already invested $40

55:45

billion or something like that. 60

55:46

billion last time I checked. So there is

55:49

a TSMC in Arizona that's coming up.

55:52

There's also um Intel and that's why you

55:55

know American government owns 10% of

55:57

Intel. U Nvidia and SoftPank own 5%

56:00

each. So there is a lot of investment

56:02

going into an American fab as well as

56:05

TSMC is investing into its American

56:07

fabs. Elon's building terra fab like I

56:10

think it people have woken up to the

56:12

importance of building fabs but um this

56:16

is also why China is particularly very

56:19

very competent because given the

56:21

capabilities of China that we just

56:23

mentioned there really articulately I

56:25

know it's a ridiculous question but sort

56:29

it um if I were to say to you your job

56:31

is to make sure America stays

56:33

competitive

56:34

what would you do to ensure that you

56:37

retained competitiveness in an

56:40

increasingly strong China.

56:41

>> I think I think take physical

56:44

infrastructure a lot more seriously and

56:47

continue funding it um and not like have

56:51

all these

56:52

you know I I wouldn't say meaningless.

56:55

It's more like not propagate fake news

56:59

around data centers

57:01

um about how data centers are polluting

57:03

and contaminating water or like they're

57:06

sucking up all the water um and and and

57:08

actually be fact driven and so you know

57:11

I hope our product helps there like you

57:13

you can you can go to perplexity and ask

57:15

any question and get fact checked on

57:18

your assumptions but yeah like it's very

57:20

important that we educate the public

57:25

um about what's actually going on in in

57:27

a language they easily understand and

57:30

not fear-monger, okay? Like not be like,

57:33

oh, like all their jobs are going to go

57:34

away like this, that like there's going

57:36

to be lots of amazing companies that are

57:38

going to get built with far fewer people

57:41

getting multi-billion dollar, multiund

57:43

million valuations with like 20, 30

57:45

people and propelling like trillions of

57:49

dollars of new GDP. Like let's talk

57:51

about how to enable that. let's talk

57:53

about how to build that and create a

57:56

more positive future together, right? Uh

58:00

instead of, oh, like 90% of the jobs are

58:03

going to be gone. Like you're all going

58:04

to get screwed over by our models and

58:07

like and and it's it's our it's our

58:09

moral duty to tell you all this like

58:11

blah blah blah. Like that doesn't make

58:12

any sense to me. Like you can't win by

58:15

saying that and also like complaining

58:17

about not being able to build data

58:18

centers fast. Do you think we've done a

58:20

complete disservice by having the

58:22

marketing message that Dario has had

58:24

that all jobs are going and it's all

58:26

doom and gloom?

58:27

>> Yeah,

58:28

I think so. I mean I think you know they

58:32

have contradictory messages in their own

58:34

like

58:36

uh different social

58:40

engagement so far where the most recent

58:43

one I heard was there is no evidence

58:45

that AI is taking over jobs.

58:48

But so I I I think there there needs to

58:51

be a consistent communication around

58:53

this. And

58:57

I also think that um very little is

59:00

being spoken about how AIs can help you

59:04

build companies in a very very different

59:06

way like the current AIS AI.

59:10

It it's already true that so many things

59:12

you would hire people for you can do it

59:14

with agents. But one way of looking at

59:18

it is like oh like what happens to all

59:20

the jobs. But the other way of looking

59:21

at it is like, hey, like I can I never

59:24

had the chance to go build out a company

59:26

on this idea that I've been having all

59:28

this all this while and maybe me and a

59:31

group of friends can come together and

59:33

build this and can you guys figure out a

59:35

way to give us compute credits or you

59:38

know Amazon gave a lot of compute

59:40

credits to a lot of startups like when

59:41

we started Perplexity

59:44

we had like around $200,000 worth of

59:46

Amazon credits and GCP credits and Azure

59:49

credits.

59:50

um that almost like together

59:52

cumulatively this was worth like a

59:53

million dollars in computer credits. Now

59:58

in today's world it's going to be like a

59:59

million dollars of computer credits. And

1:00:01

we're doing that like we we're funding

1:00:02

this thing called a billion dollar build

1:00:04

where we're giving a million dollars of

1:00:06

computer credits to any group of people

1:00:08

who have a credible path to building a

1:00:10

billion dollar company and I want like

1:00:12

thousand such companies to be built.

1:00:13

>> What did you think of Sam Alman giving

1:00:15

$2 million of tokens to YC companies

1:00:17

initially?

1:00:17

>> I think we should do more of that.

1:00:20

Yeah, that that's the right thing to do.

1:00:22

Like we should do a lot more of this

1:00:25

because you want new companies to be

1:00:29

built. Um and and and even if they're

1:00:32

worth multi00 million, right, it's good.

1:00:37

If there are thousands of them, like

1:00:39

that's a lot of new GDP. I I spoke to an

1:00:42

Betski before the show and she said how

1:00:44

AI pilled the team is for you. How big

1:00:47

is the team today?

1:00:48

>> It's like 400 people.

1:00:50

>> 400 people. How big will it be in two

1:00:52

years time?

1:00:53

>> I don't know. It's hard to say. Maybe

1:00:55

800 or,000.

1:00:56

>> So, will companies follow the same

1:00:59

headcount trajectory that they have

1:01:00

always followed and we will just solve

1:01:02

new problems. Or will they be

1:01:03

dramatically more efficient with a much

1:01:05

fewer number of people?

1:01:07

>> Definitely, they'll be dramatically more

1:01:08

efficient. Right. and and that's why I I

1:01:12

I am a believer in building a lot more

1:01:16

efficient companies

1:01:18

not and being an example for all these

1:01:20

companies ourselves like like people

1:01:22

should look at perplexity and be like oh

1:01:24

like with 400 people um you can build

1:01:28

like a multi like like I don't know like

1:01:30

20 billion $20 billion company um and so

1:01:35

that means with like 40 people I could

1:01:38

probably build a billion dollar or $2

1:01:39

billion

1:01:41

you know, and that that that's totally

1:01:43

doable. Totally doable. And um and and

1:01:48

so for us, maybe that means is with

1:01:49

4,000 people, we could be worth 200

1:01:52

billion. We we could be worth $2

1:01:54

trillion with like 10,000 people.

1:01:57

You know, I think I think that doesn't

1:01:59

mean it's bad for all the um hundred,000

1:02:03

people we did not hire for a typical $2

1:02:06

trillion company.

1:02:08

I would rather have those 100,000 people

1:02:10

be split into groups of like hundred

1:02:12

thousand groups like that and each of

1:02:14

those thousand groups are worth a few

1:02:16

billion dollars. That's awesome. And I

1:02:19

think a lot more people need to be

1:02:20

entrepreneurial.

1:02:22

Um there are people who would be bad

1:02:25

employees in any company because they're

1:02:27

just like difficult to work with. They

1:02:29

they don't listen to like instructions.

1:02:32

So like they don't follow like road maps

1:02:35

or not they're not like easy to

1:02:36

collaborate with. But maybe the the flip

1:02:39

side of that is those those are the kind

1:02:40

of qualities that founders typically

1:02:42

have.

1:02:42

>> Ain there is a population and a very

1:02:44

large population that are not AI native

1:02:46

people that are not using AI to improve

1:02:49

workflows, improve efficiency.

1:02:51

What would you advise them?

1:02:53

>> Get started. First steps get started and

1:02:56

and and channelize your curiosity.

1:02:59

Right. Um, you don't need to use AIS to

1:03:03

do your existing work. If that's if your

1:03:06

existing work is boring to you, you

1:03:09

probably won't enjoy it even if you use

1:03:10

AIs to do it.

1:03:12

>> You got a lot of heat for saying people

1:03:14

don't like their job. So,

1:03:16

>> I I didn't say if you actually listen to

1:03:18

my interview, I did not say that. So,

1:03:22

people want clickbait articles and they

1:03:24

take something I said in one sentence

1:03:26

and out of context and make it into a

1:03:28

headline. What did you say? I I

1:03:30

specifically said this. Hey, like there

1:03:33

are a lot of people who don't enjoy

1:03:34

their jobs. Does any like By the way,

1:03:36

the fact that that that that thing went

1:03:39

viral is not because I was completely

1:03:42

wrong. I think a lot of people resonated

1:03:43

with the fact that I was actually honest

1:03:46

in saying a lot of people don't enjoy

1:03:48

their jobs. And that has nothing to do

1:03:50

with your economic position or standing

1:03:52

in society. You might even be like

1:03:54

really wealthy but doing a job that you

1:03:56

completely don't enjoy and like

1:03:58

destroying like the peak years of your

1:04:00

adult life working on something that is

1:04:03

horrible or like like depressing. So

1:04:05

like my point is that if that's you and

1:04:10

if the reason you could never leave your

1:04:12

job is because you were always worried

1:04:15

if you how would you build a company

1:04:17

from scratch? Like there are all these

1:04:19

things to figure out how you have to

1:04:21

hire a lot of people. Oh, you have to

1:04:22

like set up an office this that. Like

1:04:24

that's changed. For the first time in

1:04:27

history, you can get started on an idea

1:04:31

with like one or two other friends and

1:04:34

and and and maybe have a real genuine

1:04:36

shot at building a billion dollar

1:04:38

company. Totally get that. Everything

1:04:40

that we've discussed today has been on

1:04:42

the back of unprecedented demand up and

1:04:45

to the right. We need more memory. We

1:04:47

need more data center supply. We need on

1:04:49

demand and server side. Everything's

1:04:51

like up and to the right. Seeing some

1:04:53

cracks in and Uber's saying, "I'm not

1:04:56

sure I'm getting the productivity gains

1:04:57

that I thought." Microsoft aligning with

1:04:59

them, putting a $1,500 token budget. Do

1:05:03

you think we will have a continuous up

1:05:05

and to the right acceptance that

1:05:07

productivity gains are unwavering? We

1:05:09

have to do this, or will there be

1:05:11

falterings along the way? I mean, I'm

1:05:14

sure there's going to be falterings

1:05:15

along the way and people are rightfully

1:05:18

freaking out about token maxing, which

1:05:20

is why I think you need some form of

1:05:22

hybrid agenic inference. You need some

1:05:25

amount of inference compute to run

1:05:27

locally that you're not paying for

1:05:30

tokens on um unmetered intelligence

1:05:33

essentially.

1:05:34

>> How will the best companies of the

1:05:35

future structure token budgets? My hope

1:05:38

is that they don't have to understand

1:05:39

that

1:05:41

they will be able to work with an

1:05:43

orchestrator who does it for them. It's

1:05:44

not going to be easy for you to

1:05:46

constantly keep track of like which

1:05:47

models are the best at what things and

1:05:49

how do you allocate oh this is the

1:05:51

budget for coding, this is the budget

1:05:52

for finance or like like how do you even

1:05:55

understand like which models are good at

1:05:56

each of those things and like how much

1:05:58

do you spend on each of these divisions?

1:05:59

You're not going to be able to keep

1:06:00

track.

1:06:01

>> I I had a friend on the show the other

1:06:03

day say that Google will be the token

1:06:05

king. They can produce the lowest cost

1:06:08

tokens out of anyone. They own full

1:06:09

stack TPUs, data centers, networking,

1:06:11

power, procurement.

1:06:13

Do you think that's true that they will

1:06:15

be the lowest cost token producer?

1:06:18

>> They have advantages all all advantages

1:06:21

one needs to have to be that. But they

1:06:23

underestimated the importance of coding

1:06:25

models and so they far behind the

1:06:27

frontier right now. So again, they could

1:06:29

catch up, totally capable, totally

1:06:32

competent team, but today they're not

1:06:35

quite at the frontier.

1:06:36

>> I was shocked the other day. I saw the

1:06:38

Cloudflare announcement that um now

1:06:41

agent traffic has overtaken human

1:06:43

traffic for them.

1:06:45

>> Why? Why are you shocked?

1:06:46

>> It was quicker than I thought.

1:06:49

>> Okay.

1:06:50

>> Personally, I thought that would happen,

1:06:52

but in two years, maybe not now.

1:06:56

How does the world change when agent

1:06:59

traffic far exceeds human traffic?

1:07:01

>> I think people are just going to have a

1:07:02

lot more agency.

1:07:04

That's it.

1:07:06

>> Do websites go away? Does design not

1:07:08

matter? Does the advertising model of

1:07:10

the internet die completely?

1:07:12

>> No, it doesn't. Because my belief is

1:07:16

that

1:07:17

the advertising model around like travel

1:07:21

or shopping or like u fashion are not

1:07:26

getting disrupted by agents because the

1:07:28

judgment is not objective.

1:07:31

Any anything where the judgment is

1:07:34

objective, the transaction is based on

1:07:36

objective judgment that's going to get

1:07:37

disrupted by agents.

1:07:40

anything where the transaction is more

1:07:42

subjective like the decisions are more

1:07:44

subjective like like what is the best

1:07:46

piece of furniture inside this this this

1:07:48

podcast like why this particular table

1:07:51

or like those kind of things

1:07:53

>> probably for the mic you would buy an

1:07:55

objective decision the table

1:07:57

>> you probably are caring about the

1:07:59

aesthetics of the room I think I I think

1:08:01

that's kind of how I I feel the world

1:08:03

will split and subjective things will

1:08:06

still be ad based objective things will

1:08:08

be agent based

1:08:10

I watched your commencement speech on

1:08:11

the back of speaking to Samir at Excel

1:08:13

and he said I had to watch it. So

1:08:15

obviously I watched it. Um and one of

1:08:17

the points you made was the defining

1:08:18

skill of the area is asking better

1:08:20

questions.

1:08:21

>> Yeah.

1:08:22

>> What question is no one asking today

1:08:25

that maybe everyone should be asking?

1:08:27

>> I think people need to ask more about

1:08:29

like okay assuming I have a lot of

1:08:32

agency available to me what do I do?

1:08:36

Imagine like I gave you a headcount of

1:08:38

like 100,000 people or 10,000 people

1:08:42

and and and

1:08:44

you know enough computer credits to run

1:08:46

those agents. What would you do?

1:08:50

Like let's say I ask you Harry like you

1:08:52

know let's say I you have suddenly like

1:08:55

10,000 agents at your disposal. What

1:08:57

would you do? Like I I remember you

1:09:00

telling me or not me but but in some

1:09:03

episode of yours where

1:09:05

>> you said you're only you only did this

1:09:07

podcasting because you felt like you

1:09:09

didn't have an arbitrage to go win deals

1:09:12

>> 100%. Yeah. It's why I still do it. I

1:09:15

mean I love what I do but yeah.

1:09:17

>> Okay. So you've gotten some amount of

1:09:19

distribution. So now assuming that you

1:09:21

can you you have let's say you could

1:09:23

spend $100 million on a genenic

1:09:25

inference and grounded with all the

1:09:28

connectors and stuff and it's all

1:09:30

working. What would you do to to with

1:09:33

with that capability to um further your

1:09:36

goals? Like what what what should your

1:09:37

goals even be then? I I think that's the

1:09:40

question I would ask assuming that in

1:09:42

the next three to five years you're

1:09:44

going to be able to like delegate

1:09:46

whatever digital task you want and and

1:09:49

with the right harness and agents and

1:09:50

like be able to delegate that.

1:09:52

>> Fundamentally it would be to build aic

1:09:54

infrastructure to be able to find,

1:09:58

identify, outreach, set up, win great

1:10:02

investments and have the media sit on

1:10:05

top and power that. that is intensely

1:10:07

difficult to do and would be the holy

1:10:10

grail to investing but like

1:10:12

>> that would power what my end goal

1:10:14

ambition is.

1:10:15

>> Yeah. So your goal is to be the you know

1:10:18

run like a 10 to 100x larger fund right

1:10:20

that that's basically what I'm hearing

1:10:22

from you.

1:10:23

>> So that's like let's assume it's like a

1:10:25

$40 billion fund from 400 million. Um

1:10:30

then all you got to ask is like assuming

1:10:33

I have all the headcount I need to do

1:10:35

this like how much faster can I do it? I

1:10:38

I think that's how I would frame this

1:10:40

question. I think Elon has like a

1:10:42

similar thing he spoke about once where

1:10:46

okay assume that a task somebody tells

1:10:48

you a task is going to take 10 years. Um

1:10:51

ask the question what would it take to

1:10:53

do it in 10 months?

1:10:57

Maybe it's impossible to do it in 10

1:10:59

months, but you'll probably get pretty

1:11:01

far asking those questions

1:11:05

compared to somebody who takes it for

1:11:07

granted that it's it's going to take 10

1:11:09

years.

1:11:10

>> All right, interviewer, we put it on

1:11:12

you. What's your 10year and how does

1:11:15

that look in a 10-month time frame?

1:11:18

>> I think our our mission beyond any level

1:11:21

of capitalism is to make the planet more

1:11:23

curious.

1:11:25

You know the product is always intended

1:11:27

to helping people ask the next question

1:11:30

and uh my goal is to truly realize that

1:11:34

like that level of agency

1:11:37

that needs to exist in this world is

1:11:40

quite not there.

1:11:40

>> I think I think that needs to be

1:11:42

grounded in numbers dude to make it like

1:11:45

possible. It's like me saying oh I want

1:11:46

the best investments

1:11:48

like which is why a $40 billion fund is

1:11:51

helpful.

1:11:51

>> Sure. I can say the same thing like 2

1:11:53

trillion you know it doesn't matter

1:11:54

right like 100x 10x 1000x these are all

1:11:57

like um motivational milestones

1:12:01

>> do you think will be a trillion dollar

1:12:03

company

1:12:04

>> yeah anyone can be a trillion dollar

1:12:06

company SKH and Samsung are worth a

1:12:08

trillion last last couple of weeks did

1:12:12

you know Samsung started off as a

1:12:13

grocery store did you know that you

1:12:16

didn't know that okay so it's true they

1:12:18

started selling dried

1:12:21

Seriously. Um, Heinix was um SK the SK

1:12:26

group started off as um um textiles

1:12:29

company. So, anyone can be worth a

1:12:32

trillion dollar company and like you

1:12:34

just have to work your way towards that.

1:12:36

I mean, the exact same logic for you

1:12:40

that you laid out for how can a company

1:12:42

be worth um hundred billion. Okay, you

1:12:46

said you need to make a $10 billion in

1:12:47

revenue. Isn't that the same for

1:12:50

trillion? Like you need to make a

1:12:51

hundred billion in revenue.

1:12:52

>> Mhm.

1:12:54

>> And there was actually some very

1:12:55

interesting data that CO2 revealed. I

1:12:57

don't know if you saw it recently, which

1:12:58

basically says about the probability of

1:13:00

reaching the next level of value is much

1:13:03

higher. So when you're at a billion,

1:13:04

it's like much more likely to reach 10

1:13:06

billion. 10 billion much more likely.

1:13:07

>> Yeah, that's true actually for even

1:13:08

people. Like it's way more likely for a

1:13:12

person with $100 million in in liquid

1:13:15

net worth to become a billionaire than

1:13:18

someone with $10 million.

1:13:19

>> Are you not worried about the wealth

1:13:20

inequality? Aaron, if we being blunt, we

1:13:23

both are very lucky now to live in kind

1:13:25

of nice worlds and rarified airs. Are

1:13:28

you not worried by just how much money a

1:13:30

very small number of people have and how

1:13:32

[ __ ] hard it is for everyone else and

1:13:35

that gap is getting bigger?

1:13:37

I think the way to like ensure that

1:13:39

that's not that doesn't remain the case

1:13:42

is to distribute the benefits more

1:13:45

widely. You got you got to let anyone by

1:13:48

the way the people who are using our

1:13:50

tools like I had an Uber driver I'm not

1:13:53

even like making this thing up so um as

1:13:56

honest as it can get. an Uber driver in

1:13:59

San Francisco um once told me that he

1:14:03

watched one of my uh YouTube interviews

1:14:06

where I explain how you can build a

1:14:09

product or a web app with an AI from

1:14:12

scratch, went on to do it and um um used

1:14:16

AIS to add like billing and all that and

1:14:19

that makes more passive income for him

1:14:22

than um driving Ubers and so he actually

1:14:25

reduced the amount of time he's driving

1:14:27

Uber because He he loves wipe coding new

1:14:30

apps and u that that already tells you

1:14:34

that

1:14:35

for the person with agency and a

1:14:38

positive outlook for the future,

1:14:40

anything is possible. And so if you keep

1:14:43

communicating

1:14:45

all the negative things you can about AI

1:14:47

and wealth inequality all the time and

1:14:49

that's the only thing news uh and press

1:14:52

writes about,

1:14:54

I think it'll perpetuate and people will

1:14:56

only think the bad things. And so it's

1:14:58

it's it's very essential that if you

1:15:01

think you're already doing well, it's

1:15:02

very essential that you talk about what

1:15:05

are all the things that can go well and

1:15:08

give hopes to people who were once upon

1:15:10

a time like you like you you were you

1:15:12

didn't you you started this um

1:15:15

podcasting circuit like when you had

1:15:17

nothing, right? So

1:15:18

>> nothing.

1:15:18

>> Exactly. So it's possible. So you got

1:15:20

you got to talk more about that than be

1:15:22

like oh I feel so guilty that I made it

1:15:24

and now I'm like you know what about all

1:15:26

these people who haven't made it like

1:15:27

you can also make it.

1:15:28

>> I I think I have a more pessimistic view

1:15:30

of actual general public which is I

1:15:32

don't think that many people have

1:15:33

agency. I think a lot of people have

1:15:35

victim mentality.

1:15:36

>> You got you got to help them. Like I

1:15:37

think that's that's the most important

1:15:38

thing.

1:15:38

>> I think they got to help themselves.

1:15:40

>> Sure. But people will help themselves

1:15:42

once they see that okay like I kind of

1:15:46

want to be like this guy. let me let me

1:15:49

work hard. You need an example, right?

1:15:52

Um it's not like nobody can become um

1:15:56

get in shape. Like it it takes

1:15:58

discipline.

1:15:59

>> Takes discipline. You got to get rid of

1:16:01

bad habits. And so

1:16:03

>> and now is the best time ever to change

1:16:05

your life in 12 months. Like the ability

1:16:07

to go from nothing to to actually

1:16:09

billionaire in 12 months is now possible

1:16:12

in some respect.

1:16:13

>> Yes. And so I look, I'm not saying

1:16:16

everyone's going to make it and

1:16:17

everyone's going to be worth a billion

1:16:18

dollars.

1:16:18

>> Isn't that the caption from this this

1:16:20

show, Arvin? Everyone's going to make

1:16:22

it.

1:16:22

>> Anyone has the potential to make it.

1:16:26

>> So it's it's it's as likely for

1:16:29

Perplexity to become worth $2 trillion

1:16:32

as as a founder who's yet to secure your

1:16:34

funding to be worth a billion dollars.

1:16:38

So it's it's equally hard. Equally hard.

1:16:40

And um and I think you just have to give

1:16:44

yourself, you know, shots at the goal

1:16:47

and um be curious. That's that's the

1:16:49

message from the commencement speech. Be

1:16:51

curious.

1:16:51

>> We have SpaceX. We have Anthropic. We

1:16:53

have Open AI going public. It feels like

1:16:55

someone's kind of shot the gun and the

1:16:57

race is on. Is there enough money

1:17:01

to fund three such large IP?

1:17:04

>> There will be some reallocation for

1:17:05

sure.

1:17:07

like um there there might be some

1:17:10

holders of like SAS stocks who would put

1:17:12

it into anthropic or something. Let's

1:17:16

say you believe that enterprise AI is

1:17:18

going to take off. You might want to

1:17:20

hedge between having a lot of Microsoft

1:17:22

stock and Salesforce stock versus like

1:17:25

putting some of that into anthropic. So

1:17:28

let's say like Vanguard or Black Rockck

1:17:30

own like you know cumulatively they own

1:17:32

like $200 billion of Microsoft and

1:17:34

Salesforce. They might be like, "Okay,

1:17:35

I'm going to take 30 40 billion of that

1:17:37

and put it into anthropic."

1:17:39

Fine. You know, not a bad bit to make.

1:17:42

>> What happens to all the enterprise SAS

1:17:44

companies that are public growing? Yeah.

1:17:47

Fine.

1:17:49

>> They have to they have to weather the

1:17:51

storm.

1:17:52

>> Is it a storm or is it a continuous

1:17:54

precipitation?

1:17:56

>> I think you have to bring down the costs

1:17:58

and produce new value.

1:18:01

Salesforce has done well because they

1:18:04

always went and bought the next thing.

1:18:06

If you're just selling the same

1:18:07

software, you're probably not going to

1:18:09

be around. IBM is still around because

1:18:10

they went and bought Red Hat and

1:18:13

Hashikarp and now they're buying

1:18:15

Confluent. So, there are ways for these

1:18:17

companies to stay alive and extend their

1:18:20

lifespans and stuff. It's obviously

1:18:22

going to be hard to preserve a brand

1:18:24

that's as relevant

1:18:27

like like I don't think the IBM brand is

1:18:29

that relevant anymore in terms of like

1:18:32

evoking an emotion and people to go use

1:18:33

their products but as a business it's

1:18:36

going to be awesome you know it's going

1:18:37

to be fine.

1:18:37

>> I have to finish on you said IPO in

1:18:40

2028.

1:18:42

I had to ask this. I woke up to this in

1:18:43

my like you know um group. We have a

1:18:46

team WhatsApp and it's like ask IPO

1:18:49

2028. I hope I hope it can be sooner

1:18:51

than that.

1:18:52

>> When do you know when you're ready? Like

1:18:54

is there like a billionaire? Are you at

1:18:56

500 million er now?

1:18:58

>> More than that. Far far more than that

1:19:00

actually.

1:19:00

>> Really?

1:19:01

>> We we're not ready to share it, but um

1:19:06

growing really fast.

1:19:08

>> Revenue growth matters much more to you

1:19:09

than profitability

1:19:11

>> today. I think in general, by the way,

1:19:13

you can look at public markets. Um

1:19:17

people want topline growth.

1:19:19

more than um bottom line efficiency

1:19:22

right now because it's very hard. It's

1:19:24

it's rare.

1:19:25

>> Well, you definitely need one.

1:19:28

>> Of course, sustainable businesses,

1:19:30

>> you need to have a model in place to get

1:19:35

the bottom line efficiency when that

1:19:36

becomes the objective and and you need

1:19:38

to also like have a path to getting

1:19:40

there.

1:19:40

>> Where are you cost inefficient today

1:19:42

where you expect to be significantly

1:19:43

better in 2 to 3 years? We're training

1:19:45

our own models, post- trainining it on

1:19:48

top of amazing open source models, and

1:19:51

that will bring down the cost that we

1:19:54

currently spend on Frontier model

1:19:56

tokens.

1:19:57

We expect to continue to use Frontier

1:20:00

models for designing new experiences and

1:20:02

new capabilities that do not exist today

1:20:04

in our products. But whatever exists

1:20:06

today in our products right now, we

1:20:09

expected to completely re rely on like

1:20:12

models we own and serve ourselves. And

1:20:13

that's

1:20:15

going to be the best way to bring down

1:20:16

the costs and increase our margins.

1:20:17

>> Will the largest enterprises in the

1:20:19

world all be fine-tuning open models to

1:20:22

have tailored models that are much more

1:20:23

specific to them?

1:20:24

>> Absolutely. Because it's in your

1:20:27

incentives to bring down the cost.

1:20:28

>> Does that not provide another bad case

1:20:30

for the large frontier model providers?

1:20:32

>> Frontier model providers will only

1:20:34

remain relevant if they remain at the

1:20:36

frontier. If for 6 months you're not

1:20:39

seeing a new capability,

1:20:41

it's bad for them.

1:20:43

And so that's the uncomfortable nature

1:20:46

of this field. You no one's ever in a

1:20:49

comfortable position. Like I said in the

1:20:51

start, no one's no one can relax. This

1:20:54

is

1:20:55

>> [ __ ] a hard business. It's got

1:20:57

harder.

1:20:58

>> It's going to get even harder. And and

1:21:01

um that's the nature. This is the the

1:21:04

price is too big. Like you've never seen

1:21:07

like like take entropic. I think it's um

1:21:10

worth like one to one and a half

1:21:11

trillion some something in that range.

1:21:14

That's basically the valuation of meta

1:21:18

and and and this all was created in like

1:21:20

6 years.

1:21:22

Meta took like 20 years to build. So the

1:21:26

price is so big and so no one can um no

1:21:31

one can be comfortable

1:21:33

and and and and and anyone who's winning

1:21:35

today can lose tomorrow including

1:21:39

including the mod providers.

1:21:41

Previous this year, there was like a

1:21:43

three-month period where people were

1:21:44

like, "Oh, Py, what's happening with

1:21:46

Pacity?" Do you pay attention? Do you

1:21:48

give a [ __ ]

1:21:49

>> Of course, I pay attention to all that.

1:21:50

>> Do you care? There was one in particular

1:21:52

in San Francisco. Do you remember where

1:21:53

they were like, "Oh, what's the company

1:21:54

you had short?"

1:21:55

>> Yeah, we were voted the most likely to

1:21:56

fail. Cursor was voted the second most

1:21:59

likely to fail. Open AAI was voted the

1:22:01

third or something. Um,

1:22:02

>> you didn't give a [ __ ]

1:22:04

>> I I feel like we're all doing well.

1:22:07

Curser, I think, is getting sold. SpaceX

1:22:10

Open AI

1:22:12

>> is

1:22:14

>> going public soon.

1:22:15

>> We tripled our revenue since that

1:22:18

>> since that uh judgment was made. So

1:22:21

brought down the burn by more than 50%.

1:22:25

So I I don't know like my my my sense is

1:22:27

that um I also feel most of those people

1:22:31

who sit on these like meetups and vote

1:22:35

don't actually build anything useful.

1:22:41

Yeah.

1:22:41

>> Uh, okay. We're going to do a quick fire

1:22:43

around because, uh, I could talk to you

1:22:45

all day. Um, first one, what's one

1:22:48

widely held belief that you think is

1:22:51

completely wrong?

1:22:52

>> I think a lot of people are obsessed

1:22:54

about like, you know, identifying a mode

1:22:56

in the first year or two of their

1:22:57

company. But, um, I think like the only

1:23:00

shot you have is move fast. like veloc

1:23:03

in my mind like moving fast is a way of

1:23:06

expressing humility because you you're

1:23:08

constantly making contact with the world

1:23:10

and trying to question your assumptions

1:23:11

all the time.

1:23:12

>> Where are you still moving too slow

1:23:14

internally today?

1:23:15

>> I think we can be even more AI.

1:23:18

It's insane I'm saying this because we

1:23:20

are building some of the most

1:23:22

interesting AI products and internal

1:23:25

adoption of our own products, our

1:23:27

competitors products can be even higher

1:23:29

and and um this is despite us being

1:23:31

extremely

1:23:33

um agent build internally and trying to

1:23:36

delegate as much to agents. Yeah, that's

1:23:39

where that that's like a big area for my

1:23:41

my hope is that we can turn this company

1:23:43

almost into an AGI and and um have that

1:23:46

doesn't mean no humans work here, but

1:23:50

there will be an AGI that has all the

1:23:52

context it needs to run different

1:23:57

divisions of the company

1:23:59

in a semi-autonomous way with some

1:24:01

scaffolding provided by humans here and

1:24:04

there. And and that's not that's going

1:24:06

to feel that's not going to feel scary

1:24:08

at all. We'll normalize that that

1:24:11

feeling very fast. It's just going to

1:24:14

feel like 10, you know, uh the 10x

1:24:17

engineers running certain aspects of the

1:24:19

company.

1:24:19

>> If I gave you unlimited money, what

1:24:21

would you do today that you're not

1:24:23

doing?

1:24:24

>> I would build data centers.

1:24:25

>> You would?

1:24:26

>> Yeah.

1:24:27

>> In space?

1:24:28

>> I don't have expertise to do that, but I

1:24:31

would I would start with land on Earth.

1:24:34

You know, I think there's a lot of land

1:24:37

and and maybe you can be resourceful in

1:24:39

securing permits and power in different

1:24:41

countries, but I would start there. I

1:24:43

think it's, you know, I like I said,

1:24:46

like I think physical infrastructure

1:24:48

buildouts is like the return of the

1:24:51

industrial age again

1:24:54

like like the the the forefathers who

1:24:56

built the industrial revolution,

1:24:58

um oil pipelines, steel bridges,

1:25:02

factories producing cars, all these

1:25:04

things that we take for

1:25:07

granted today were built by people who,

1:25:10

you know, spend a lot of time thinking

1:25:12

about how to scale these things in a

1:25:14

costefficient way and so um we need to

1:25:17

do that a lot for AI and um yeah that's

1:25:21

what I would do of course you you you

1:25:23

cannot just be building infra you need

1:25:26

to be able to utilize all that infra to

1:25:29

producing valuable output tokens to the

1:25:31

user

1:25:32

but um we're already good at doing that

1:25:35

so infra is the thing I would focus on

1:25:38

you can buy and hold for 10 years SpaceX

1:25:42

anthropic or open AI the three IPOs

1:25:46

coming in the next few months which you

1:25:48

buy and hold for 10 years and why

1:25:50

>> SpaceX

1:25:52

why it's an end of one company

1:25:56

like Enthropic and OpenAI can claim they

1:25:59

do whatever each other does but u um

1:26:04

SpaceX is the only company building

1:26:07

space infrastructure for connectivity.

1:26:10

Have you been on a flight with Starlink?

1:26:12

No,

1:26:13

>> you should.

1:26:15

You will hate being on a flight without

1:26:16

Starlink after that. Imagine we can

1:26:20

record this. I can watch this podcast

1:26:22

while flying on a plane.

1:26:25

Starlink lets you do that. That's just

1:26:27

one aspect of the business. That's just

1:26:29

one aspect of

1:26:30

>> one small aspect of the business.

1:26:31

>> Yeah. Like there's a lot of like I'm

1:26:33

excited about possibilities to travel

1:26:35

from Australia to like uh San Francisco

1:26:38

in like 30 minutes. You know, all this

1:26:40

feels like sci-fi, but I'm excited about

1:26:42

all these possibilities.

1:26:43

>> What job does not exist today that will

1:26:45

be incredibly common in 5 years time?

1:26:48

>> I think it already exists. So, if the

1:26:50

forward deployed engineer is definitely

1:26:52

on the rise, I guess like people with a

1:26:54

really good sense of like um quality

1:26:57

control.

1:26:59

Maybe a better uh way to answer this is

1:27:02

like most jobs that exist like valuable

1:27:04

jobs that exist are usually like

1:27:06

reincarnations of something that already

1:27:08

existed.

1:27:09

Like so I don't think we're going to see

1:27:10

completely new things. It's just going

1:27:12

to reincarnate in different ways.

1:27:13

>> You can advise your little sibling who's

1:27:16

finishing university today and just done

1:27:19

a computer science degree. One thing,

1:27:22

what would you advise them?

1:27:24

>> Stay curious.

1:27:27

Don't don't don't give into like FOMO

1:27:29

and trying to max out on something here

1:27:31

in the short term. Like don't go to

1:27:34

Twitter and feel like a loser that

1:27:36

people on Frontier Labs are getting so

1:27:39

rich and like you everything feels

1:27:41

hopeless to you or something like there

1:27:43

is so much more to build like we are

1:27:45

just getting started like the the the

1:27:47

application layer era or like

1:27:50

infrastructure buildouts there there's

1:27:52

like a lot of opportunities we are

1:27:54

seeing more spinouts from open AI

1:27:56

anthropic you name it every single day

1:27:59

do we have hundreds of these neolabs

1:28:02

in vertical models.

1:28:04

>> No, not not a big believer in too many

1:28:08

of them. I think you got to produce some

1:28:10

differentiation. That's the most

1:28:12

important thing. Um like like I I if

1:28:16

would you call Deep Seek a Neolab?

1:28:18

>> No.

1:28:19

>> Why?

1:28:20

>> I think very stupidly for me I don't

1:28:22

call it a Neolab because I I attribute

1:28:24

Neolabs like spinouts from larger labs.

1:28:27

>> I see. and and kind of verticalized

1:28:29

which is probably wrong on both par like

1:28:32

axes

1:28:33

>> but it's horizontal and it's not a spin

1:28:35

out.

1:28:36

>> Yeah. I mean I kind of like the idea of

1:28:38

labs taking a differentiated bet. Okay.

1:28:40

If somebody really questions the

1:28:42

transformer architecture itself or

1:28:45

somebody really questions needing to

1:28:46

build on Nvidia GPUs or something like

1:28:48

that like like foundational bets makes

1:28:51

or or somebody questions or somebody

1:28:53

goes out and builds for robotics models.

1:28:55

I think I think that's like somewhat

1:28:57

uncorrelated and different and that

1:28:58

makes sense for a lab, but I feel like

1:29:00

they're like just labs for the sake of

1:29:02

being lab and I don't think they're

1:29:03

going to make it.

1:29:04

>> Can you paint for me? I I do like this

1:29:07

one. What's the most plausible story

1:29:10

whereasty becomes a trillion dollar

1:29:13

company? What do you do then? The

1:29:15

orchestration layer.

1:29:17

>> I mean, accuracy and orchestration is is

1:29:19

is like two goals that have been

1:29:21

consistently true since the beginning of

1:29:22

our company. So I think we'll continue

1:29:25

to do that. We'll be orchestrating

1:29:27

across devices, chips, models, tools,

1:29:29

files, connectors, everything. Right. So

1:29:31

what would I do once that happens? I

1:29:34

don't know. We we'll chart our path to

1:29:36

10 trillion.

1:29:37

>> Are you happy now? But are you are you

1:29:39

enjoying this?

1:29:40

>> Of course. I mean, I wouldn't like

1:29:43

there's so many things I could be doing

1:29:45

if not for this. And um I think the

1:29:49

process is is is what motivates you. So

1:29:52

you asked me I think somewhere in

1:29:53

between you need to give me a number of

1:29:55

where you want. I don't I don't work

1:29:57

like that actually. I I like for example

1:30:00

like the these numbers like getting to

1:30:02

two trillion or 20 trillion

1:30:04

are are exciting but like that that

1:30:07

doesn't motivate me. It's it's hard to

1:30:09

get motivated by wealth. You you you

1:30:11

want to get motivated by impact. Who's

1:30:14

the smartest person you've met? Final

1:30:16

one. You've met Jensen Hang. You've met

1:30:19

the best of the best. I've been

1:30:20

fortunate enough to Who's the smartest?

1:30:23

>> People are smart in their own ways. It's

1:30:25

hard to compare. Like I met Jensen,

1:30:27

Elon, all these guys and like Bezos.

1:30:30

>> What was it like meeting Elon?

1:30:33

>> Amazing. I mean Elon's like a very uh

1:30:37

focused person like like that. He might

1:30:39

not appear that way on Twitter but you

1:30:42

know with a lot of like random tweets

1:30:43

but he's extremely laser sharp focused

1:30:46

on whatever he's doing at that moment in

1:30:48

time. Actually the the one skill that as

1:30:51

an entrepreneur that I would really like

1:30:52

to take like build from somebody like

1:30:55

him like take from somebody like him and

1:30:57

have it for myself is that ability to

1:30:58

just zone out of all the other things

1:31:01

that's happening in your business or

1:31:04

other businesses and just focus on that

1:31:07

limiting problem right now like the

1:31:09

bottleneck problem and and and and

1:31:11

ignore everything else. It's very hard

1:31:13

to do like even within perplexity I

1:31:16

cannot just focus on like one part of

1:31:18

the business alone. It's very difficult

1:31:22

like I'm I'm always looking at other

1:31:23

things simultaneously. And um his style

1:31:28

is to just always look at the limiting

1:31:30

problem and just ignore everything else.

1:31:32

And uh that's very hard to do because

1:31:35

you you actually have to be really good

1:31:37

at concentration.

1:31:39

You you have to be really good at

1:31:40

ignoring

1:31:42

even important things which are

1:31:45

distractions to your core objective

1:31:46

right now.

1:31:47

>> Was Jensen hang what you thought he'd

1:31:49

be?

1:31:50

>> Far better.

1:31:51

>> Really?

1:31:51

>> Yeah. Jensen is so truth seeeking. It's

1:31:54

insane.

1:31:55

I think he or somebody else told me or

1:31:58

read in a book that he

1:32:00

um is so intense that he wakes up every

1:32:03

day and tells himself that he sucks

1:32:06

and goes in like like he's so intense

1:32:10

that he tells everybody around him that

1:32:13

they're 30 days away from going out of

1:32:15

business. Think about it, right? $5

1:32:18

trillion

1:32:20

um guaranteed to make $500 billion in

1:32:23

revenue in the next two years. Um

1:32:27

and um has the most advanced chips in

1:32:29

the world and and and he operates with

1:32:32

that mentality that he could be 30 days

1:32:34

away from going out of business. That is

1:32:37

what it takes to be Jensen Huang. And uh

1:32:41

there's so much to learn from these

1:32:43

guys. There's so much to learn. I think

1:32:45

uh there's one aspect of like you know

1:32:48

being comfortable where you are thinking

1:32:50

you made it uh you know that that feels

1:32:54

good to get here so far but these guys

1:32:57

are not stopping like I I don't think

1:32:58

Elon wants to stop at uh if you look at

1:33:01

his pay package for SpaceX it's

1:33:03

structured around um creating a colony

1:33:06

in Mars with a million inhabitants and

1:33:09

uh building enough comput in space so

1:33:11

that's why it's not like motivating to

1:33:14

be

1:33:15

worth a 10 trillion in net worth or

1:33:17

something. you know, if if he does these

1:33:20

things, I'm sure he's going to get

1:33:21

there, but um it's more motivated around

1:33:25

like making the impossible things happen

1:33:29

and and and having like that long-term

1:33:31

outlook like you um I think that has

1:33:34

been the biggest thing to learn from

1:33:36

maybe these two individuals in

1:33:37

particular is um

1:33:40

a lot of people view this like

1:33:42

entrepreneurship as like oh if it wins

1:33:45

if if I win and I have a great outcome

1:33:48

and I sell my company. I would have like

1:33:51

generational money. I don't have to work

1:33:55

ever again. And then what you end up

1:33:58

like like just staying at home and like

1:34:00

your your your kids will obviously have

1:34:02

like trust funds and they're not going

1:34:05

to get inspired watching their dad play

1:34:08

paddle.

1:34:09

>> Yeah. You know, you're not going to set

1:34:11

the right example for them. and they're

1:34:13

not going to be able to take your wealth

1:34:14

and multiply it cuz they they didn't

1:34:17

watch somebody who actually did that.

1:34:20

You they you did it before

1:34:23

they they they were like adults. And so

1:34:26

um I think you always need to be doing

1:34:29

something. Um like like Jensen said some

1:34:33

recently that he hopes to die on the job

1:34:34

or something like that. Like that's the

1:34:37

attitude you need to have. Like you got

1:34:39

you you need to work forever. I was so

1:34:41

upset though when Jensen said, "If I'd

1:34:43

known how hard it was going to be, I

1:34:45

wouldn't have done it." When he did, I

1:34:47

don't know if you saw that into I was

1:34:48

like, "Oh,

1:34:51

>> yeah. I think it's pretty hard, but you

1:34:53

you don't do it because you do it

1:34:57

despite that." I think I think that's

1:34:58

how it works.

1:34:59

>> Arvin, listen, this has been so

1:35:00

fantastic to do. I so appreciate you

1:35:02

taking the time while you're in London.

1:35:03

So, thank you so much for joining me.

1:35:05

>> Appreciate it.

More transcripts

Explore other videos transcribed with YouTLDR.

Get the TLDR of any YouTube video

Transcribe, summarize, and repurpose videos in 125+ languages — free, no signup required.

Try YouTLDR Free