AI Is Now Making Your MacBook More Expensive

Thanks for watching!

welcome again to another episode of cloud

unplugged we have two big topics uh well

one big topic one smaller one um we

have anthropic uh accusing alibaba of

basically a distillation attack against

their model so reverse engineering um what

they can work out how the model was

built um what type of features and

capabilities it's providing

to accelerate their own AI growth.

And then we also have the fact that

Mapbox just got twenty percent more

expensive because of AI.

So if you're out to buy a MacBook,

you better go and get a bank loan.

So how are things going with the Anthropic

story, Salman?

What was going on there?

Well, John, here's the question.

Why would you want to spend hundreds of

millions of dollars to train a model when

you could just learn from one that already

exists?

And that is the question Alibaba asked.

And what they did,

what Anthropic realized that between April

and June,

so I think the timelines are between the

twenty second of April and the fifth of

June.

Anthropic identified that there's a

coordinated campaign by Alibaba's Qwenn AI

models.

You know,

the Qwenn AI models that perhaps you've

used or perhaps you like them or not.

They used almost twenty five thousand

accounts and they conducted around twenty

million interactions with Claude and we'll

come to what an interaction is in a

second.

The goal was to systematically extract

Claude's capabilities.

how it clothes reasons,

how it uses the tools, how, you know,

the coding abilities to train their own

smaller coin model.

So this is what Anthropic found out.

And what they did was as soon as

they identified,

and they called it corporate espionage

because it is corporate espionage,

what they found out is then they went

to the U.S.

senators, a couple of U.S.

senators, and told them about it, that,

you know, this is what's happened.

And they sent a letter to the Senate

Banking Committee.

And then two days later,

once they identified it and sent the

letter to the Banking Committee,

you remember when Mythos and Fable got

banned from non US use

That ban got put in place.

But the initial thing is that Alibaba was

using these models, the Claude models,

to train their own coin models.

That's the headline story, John.

And of course,

there's a lot to it than just what's

happened.

But this is what they found out.

Do you think,

because I don't know if you saw,

but Anthropic...

have obviously beaten OpenAI on the

market.

I think OpenAI dropped to like twenty

seven percent of market share for

enterprises.

But it sounds like maybe because how many

subscriptions were they getting from

China?

So we're saying that basically that twenty

percent,

that additional twenty percent that made

them forty seven percent market share was

really coming from China.

And maybe there is no,

maybe they're on a par.

I mean, were they making money from China?

We don't know.

I mean, that's right, right, John?

Look, you know, we said that Airbnb,

not Airbnb, Uber came around and said,

well,

we're not really seeing a return on

investment.

Well, Alibaba did.

They're training their models.

They've got really good ROI.

They've got the best ROI.

Yeah, the best ROI out of that.

So, John, it's like,

if you look at an interaction,

it's anywhere between five hundred to a

thousand tokens, right?

Roughly.

Just talking about roughly five hundred to

a thousand tokens.

Let's just say a thousand tokens.

And there's about twenty nine million

interactions.

So it's about, you can say,

between ten to twenty billion tokens that

were used, right?

Roughly, roughly.

It's just rough, rough estimates.

is a reasonable amount of money,

like maybe in hundreds of thousands of

dollars.

So perhaps not a lot in the grand

scheme of things.

And we're like, turn it back on.

We've got an IPO coming up.

Come on, guys.

Come on, China.

Get some more subscriptions going.

We're about to IPO.

So the thing here is that, of course,

this is not the first time that's

happened.

They also found out in the past that

Deep Seek carried out about one hundred

and fifty thousand of these interactions.

Moonshot AI, again,

another model from China,

about three point four million

extractions, right exchanges.

And there was another one called Minimax,

about thirteen million.

I mean, there's still still a lot,

but not at this level of almost thirty

million interactions.

Right.

So this is.

this has happened in the past of course

anthropic every time they find it they

improve their anomaly detection they've

been trying to figure out how they can

make it more secure because this is one

of the things that people have been

talking about this model distillation

where people can use your model to learn

the secrets that you're using but then you

know do you know how they do it

though as in like i mean i i

let's put you on the spot because you

may or may not know but i'm not

sure how

how do they use that information to then

improve their own model and accelerate the

learnings through another model?

Yeah, I think,

I don't know the exact details, John,

but it's feeding whatever output of those

specific questions that they have and bake

it into the model patterns, right?

So they ask specific questions,

specific patterns,

and then just feed that into the model

itself, perhaps.

Right, they're basically prompting.

So they've got a load of,

specific tailored specific that are then

used as a mechanism to then obviously

extract data in a format that then helps

them with the model that then they can

then accelerate their own training using

the app of another model correct and yeah

that's that that's correct this feels like

a sense of um

karma isn't it isn't that called karma

where you train a model based on every

other people's work but you're not happy

that somebody else trains their model and

your model that was trained on other

people's work just so i understand the

slight hypocrisy here so they're not happy

that somebody else is getting ahead based

on the data that they train their model

on which obviously wasn't theirs um but

they're not happy that somebody else does

it to them is that is that what

i'm hearing here

That's exactly what you're hearing.

That's exactly what some people are

saying.

They are saying that, well,

isn't what anthropic getting the taste of

the bit of the taste of their own

medicine because they trained it on data

that already exists, right?

Of course, it's open.

But this comes under John, of course,

with all of these commercial things,

fair usage policy, right?

So this is protecting their IP to some

extent.

Because if you use fake accounts,

bots or whatever,

about twenty five hundred twenty five

thousand of those i think legally it does

it you know it does count as corporate

espionage but this is why this is why

anthropic went to the senate banking

committee and the us the the commerce

department to restrict maybe to put in

some of these uh some

some of these guard rails around to

protect themselves.

I think it's a bit more geopolitical than

technical, but that's what's happened.

What do you think about all of this?

I just find it ironic.

I mean, I don't know if it's ironic,

but just hypocritical.

Just the hypocrisy levels of when it

happens to others, we're okay with that.

When it happens to me,

I'm really not okay with that.

Like the double standards as a business.

You know,

it does feel a little bit ridiculous that

they have an issue with their own business

potentially getting risked because they

don't work for the state.

The state hasn't got investment in

Anthropic.

It's not a government model, right?

So it's a pure commercial risk.

Let's be honest.

Because there is no,

you don't need to go to the US

Senate for

something that is a real commercial threat

potentially it doesn't become a national

threat because somebody can produce a

model like yours because say the argument

was that their quen model was on a

on a par uh with fable or mythos

then what would they have done like you

wouldn't need to block because they've put

the investment in they've trained the

model to the same standard so it's just

equal measure so there's nothing to do

right to anthropic anthropic couldn't do

anything

in that situation where China's built a

comparable model to them, right?

Yeah.

But I think, John,

based on the back of this,

the ban on this export control of Mythos

and Fable shows that this is being treated

the same way as other technology.

That's an export ban that doesn't get sold

to every country.

So this is where this is going.

Yes, the government is not involved,

but the government sees the potential on

what these models can be used.

Don't forget that Anthropic are the ones

who denied US government usage of their

models with the removal of guardrails so

they can use it for whatever they wanted

to use it for.

So it's the same company.

And then they went back on it.

But here's the thing, though, John,

if you compare these models,

I was just looking at the cost.

these models right i don't know i've never

used any of the alibaba's quen models but

the entropic if you just take opus opus

is five dollars per million input tokens

and twenty five dollars per million output

tokens right now the mythos and fable of

course they were more expensive yeah do

you want to guess how much quen is

oh i mean is it done in dollars

you've done it in dollars you haven't like

i'm just doing a dollar now i'm not

changing it to yen or anything like that

yeah okay so that's good um

I don't know.

I guess what are we talking?

Probably I'd be annoying if I'm right as

well, which is kind of the risk,

isn't it?

That I fluke myself.

I fluke myself into the number.

I'm going to say ten.

Ten dollars.

Ten dollars.

Okay.

It's five cents per million input tokens

and twenty cents per million output

tokens.

Why are we not using Quen then?

Why are we doing nonthropic then Salman?

So what I'm telling you is that there

are some models out there which are way

more because you could train based on

other models.

I don't know because

The market,

the way they present themselves is that

this is more affordable model to use.

Of course,

I've never checked the capabilities.

There's a bunch of benchmarks out there

that you can see always put Claude Opus

to the top.

But okay, fair enough.

Claude is the best Opus.

The Opus model are the best at the

moment.

But do you need that?

Maybe you can get away with Quinn.

It's five cents.

John,

this is like hundred times cheaper than

the models that we use every day.

Outside of the cost,

which is obviously ridiculous,

that there's such a disparity between the

costs.

But you're in a global economic situation,

right?

We've got globalization.

Companies now have a global footprint.

You've got people in Europe using

Anthropic, et cetera, et cetera.

Obviously,

there's a whole sovereignty thing going

on, which we've mentioned several times.

So I think everybody's looking at everyone

on the risk of not owning something,

especially a frontier model,

where if for some reason the adoption –

kind of goes through the roof and the

ROI does start to get recognized.

And then there is a disruption to

employment.

Let's just assume that path gets taken.

You don't own anything.

All of that value has gone to a

specific company.

And that company has nothing to do with

your country, right?

It's not European tax.

It's not going back into Europe or any

European country.

It's going into the States and it's

obviously grown the US economy.

And so knowing that you're in a globalised

position and that you've got a competitor,

Quen,

that's then sneakily trying to accelerate

its AI adoption.

Is it a little bit more underhand?

Is it like someone maybe taking steroids

in the Olympics?

Maybe.

Maybe it is a little bit like that.

But they're all competing.

They're all competing at the end of the

day,

and you can't suddenly start be okay on

one hand.

When we make the rules, it's okay.

And we're controlling the globe.

That's okay.

And so long as you follow our pattern,

that's fine.

But if you try and do it as

well, that's kind of not okay, guys.

It's not okay that you improve your mind.

It's not okay that you can monetize yours.

Do you know what I mean?

It just feels a little bit like that.

And you're charging five cents.

What are you doing?

Yeah,

charging five cents is a bit outrageous.

They can't make any money out of that,

surely.

But, John, it's the scale, right?

So it depends how they train,

how they run these models.

And, of course, here's the thing, though.

Sorry, just a quick one, though, on that.

Yeah.

Does Alibaba Cloud have an open source

model as well?

Have they bothered with that or they

don't?

They're just all commercial?

I have no idea.

Yeah,

I know you've got obviously some open

source, but I don't think they do,

do they?

At the moment,

the only country that's doing

doing a lot of open source model is

China, right?

So China's like deep seeks coming out as

open source.

There's so many models that come out.

That was it.

That's where the thread was.

Yeah,

deep seek is coming out as open source.

You've obviously got Gemma as well,

which is Google, isn't it?

And I have just fact checked, John,

because we can do that with the use

of internet.

Quinn is most widely adopted open source

AI families.

That was it.

I thought it was open source.

You're right.

So I guess the risk is then beyond

commercial because it basically means,

well, if this local model,

open source model gets more widely

adopted, you know,

then how do you make money?

If it's on a par and you can

run it yourself,

then we're going to go out of business.

So we need to stop it.

So, yeah.

you know another thing is right so they

caught this attack but they caught it

after the fact right yeah so you mean

has it already got better so what you're

saying is john we need to get quen

it's now we didn't get quen john i'm

telling you it's basically you know went

up right now let's go and get it

Yeah.

You know, in Scooby-Doo,

at the end of it,

when they catch the perpetrator and they

take the mask off, they take, like,

the Quinn's mask off.

Actually, underneath it's just Opus.

So I think that's what it is.

Oh, yeah, underneath it.

You know, it's one of them.

So it's just Opus.

Oh, no, it'll be Mythos.

That's the illusion,

because we don't know Mythos,

because we've never heard of it.

The mask illusion, that is Mythos.

And you actually see, like, Kaiser Soze.

Yeah, exactly.

But you know, the thing here is, John,

yes, over time,

they've improved their anomaly detection,

trying to figure out because it's kind of

hard to figure out these attacks and the

patterns that they exist.

So they can see how they how they

catch these people, right?

People were doing this.

But imagine all the other people who are

getting away with distillation.

All the other sports.

I'm not saying it's happening.

Perhaps some of the other competitors.

That sounds so guilty right now.

That was such a,

the way you phrased that was like a

real,

it was like you really wanted to confess,

but had to hold yourself back on the

fact that you've got something going down.

Is that what you've been up to?

Listen,

I'm not going to say any of the

SAR models that are going to come out.

I'm not talking about any of that.

You'll find out later on.

I'm just telling you, John.

SAR AI.

I'm just telling you that this kind of

attack is... But again,

the model is there for the usage.

If you stick within fair usage policy,

you're learning from the teacher, John.

Is that legal?

Yeah, they learned from the best.

They copied their model and was like, oh,

you can steal and train.

Yeah, let's follow suit.

I do think, though,

somebody was using Quen.

That's why in the back of my mind,

because I remember having a conversation

around somebody was using the Quen open

source model for quite a lot of their

coding and then there was some internal

discussions not to go off on too much

of a tangent but there's internal

conversations around actually you could

conserve your money by running something

like quen to kind of get going to

a certain extent say we're going to write

some code or whatever you're going to do

you could use your local quen model get

it so far you know to iterate to

a certain point and then use the polish

and the rest of it maybe like the

getting it reviewed,

making those changes with something then

like Anthropic so that actually you can

kind of save your costs by trying to

get to a certain point.

So you don't obviously hemorrhage tokens.

And actually some people have already

started to do that as a bit of

a way of working,

which is kind of interesting.

you know what john there's probably a very

good topic to discuss at a future uh

cloud and plug right okay i get it

i can i take the hint i take

the hits on them and that's all right

how do we how do we i'll drop

that no sorry sorry um i'll drop that

just for another time

How do we efficiently use our tokens,

right?

Because you're absolutely right.

It's all about the tokens, John.

And if you can use a cheaper model,

it's all about them tokens.

I need to mark the token max.

If we can use some of these cheaper

models to do most of the groundwork, then,

of course, why we can't polish it.

I kind of do the same with...

With Claude, I use Haiku,

which is much cheaper than Opus to get

going with a bunch of stuff.

And then I switch over to Opus when

Haiku starts to lag a little bit and

it's not really doing what it's supposed

to do.

Because the Claude models,

they're not five cents per million input

tokens, right, John?

So I have to do something about that.

Yeah.

But what about, I guess,

moving on to MacBooks?

And the fact that they're going to cost

twenty percent more.

I mean, they weren't cheap.

MacBooks are more on the expensive end of

the price spectrum,

but they're going to go up by roughly

fifteen to twenty percent.

Fifteen to twenty five percent this week,

they said.

Um, all because of the AI demand,

obviously the, the,

the memory and the chips that are in

your laptop also go into the AI servers.

Now you've got the competition obviously

happening on a shortage, um,

of like memory, et cetera.

And that's obviously put the demand up and

obviously supply and demand it's then made

the cost higher.

And therefore this is a trickle down into,

um, into obviously the, the,

the laptop market or in this case,

MacBook, but.

What do you think about that as in

now consumers are starting to feel the

pinch on this AI hype?

Yeah, John, supply and demand, right?

Because the hyperscalers are buying up all

this memory that goes in not just

MacBooks, by the way, it's also iPads,

iPad Pros and all of them.

Everything has gone up by like,

twenty percent of the cost.

So the RAM

hyperscalers are buying everything right

so they're buying the gpu chips they're

buying the rams so supply and demand

there's not enough memory available in the

market for to be put into these these

machines it's costing a bit more money for

apple to build it and the problem here

is it's not just

just Apple,

pretty soon we'll see this trickle down

into other devices, too,

and probably phones as well at some point.

So unless the bubble bursts and people

start realizing that bubble is expanding

way more than it should,

I think the price is just going to

keep going up.

So this is basically what we're doing,

John, you and I.

If we buy an iPad or a Mac,

we're subsidising the arms race.

That's what we're doing.

You and I, John.

We are responsible for this.

The booze.

Do you remember the last time you were

saying that they spoke to students and

obviously with the job market and job

market threat and I think it was Eric

Schmidt, wasn't it?

Or something was doing his talk and

everyone was booing him every time he

mentioned AI.

And now...

just the general public is going to start

booing because they'll be like, right,

I'm going to get the new iPhone or

I'm going to go and get the new

MacBook or I'm going to go and whatever

device.

And then they're going to realize it's

like three or four hundred pounds more

expensive than it was last year.

And then they're going to be booing and

be like, yeah,

that's

That's because of AI, mate.

They were like, well,

you can use AI to create all the

funny images that you wanted, but sorry,

unfortunately,

you're going to have to pay a bit

more.

Exactly.

Now you're feeling the pinch.

Yeah,

it's a bit crazy when it starts to

hit consumers like that,

as in the trickle down effect that's kind

of happening now because of the demand.

And that's a little bit.

Crazy, really.

Correct.

I mean, we also see the models themselves.

Slightly over time,

they'll get more expensive.

We've seen co-pilot go into usage-based

consumption.

Well,

that's okay because you are in the market

for using the model, so you understand,

okay,

I have to pay a little bit more.

But people who are buying iPads and

laptops,

they're not in the market for models.

Not everybody, John,

as much as you and I like to

think, not everybody's using AI.

Exactly.

Do you know, though,

how you can make your money back on

all of this if you want to make

some money?

You're a content guy,

so you can produce your content.

You can now monetize it.

You can now monetize your content.

And you can make your money back.

So all the AI bots that are scraping

your content and stealing it,

you can gate it now and get some

payment.

Get that money's coming in for all that

content to pay for your new MacBook that

the AI was stealing to begin with.

Or maybe you're using AI to make the

content that's then making your MacBook

expensive to obviously create it.

So then you've obviously got to get the

money back to pay for it.

You caught me.

You caught me, John.

You caught me.

It's an interesting time.

Yeah.

Interesting times and the money game.

But yeah.

Sorry about that.

Amazon, yeah,

have also released something because

they've been in New York talking about

that.

So I just thought it was a bit

of a slight interlink between costs and

money and things like that.

I thought I'd weave that one in.

um so i guess really we need to

get on quen um asap i think is

the the summary save your anthropic money

go and give it to alibaba because they've

pretty much stolen everything from

anthropic anyway so don't even need to you

don't need to be spending all that cash

now on uh anthropic you can now invest

that into a five cents price point with

uh with using quen and um

save that cash so you can buy a

MacBook.

Exactly.

That money has to come from somewhere,

John.

The extra two,

three hundred pounds has to come from

somewhere.

So might as well stop using the Opus

models and use the Quen models.

I have used the deep seek models before.

Absolutely fine.

But I haven't used them in anger as

much.

Perhaps it's time for me to have a

look into the Quen models.

Yeah, to be fair, I haven't used.

I do know people that have been,

but I haven't used it either.

cool so the next episode i think we

said though we were going to break down

a story and we've cheated and we've

basically done news again i'm pretty sure

in the last episode we said next episode

we're gonna you know go deep on a

topic and then we've like next episode

arrives and we haven't so apologies for

that that's just us uh being a bit

short for time but i think next time

we are going to get into a very

specific topic i think we're talking about

um

the trust of an AI agent and what

goes into it and how can you,

from a security perspective,

people are using it today,

What does it really mean when you're

giving it access to things that you have

and what's the imperative risk there to

you?

And then what does it mean to a

business as well when you have not just

you as an individual,

but your entire workforce all in mass

using AI and connecting things up?

And is it quite a risky thing to

just roll out or are there ways of

protecting it so you can get the value

out of it?

So that will be probably what we're going

to discuss next time, all being well.

No promises, though.

But that's where we want to be for

the next episode.

Cool.

Cool.

All right.

Well, a bit short and sweet today,

but we should be back next week with

a bit of a deep dive around Agentic.

Cool.

Thank you.

Bye-bye.

Creators and Guests

Salman Iqbal
Host
Salman Iqbal
Salman is an experienced Cloud, Data and AI leader, lover of all things AI, Cloud, Platform Engineering and Development tooling.
AI Is Now Making Your MacBook More Expensive
Broadcast by