EU Cloud Wars, AI Copyright laws and Chaos

Thanks for watching!

Hey,

welcome to another episode of Cloud

Unplugged.

We have a few stories.

I know you're going to cover these,

Salman, some of these.

We've got the EU sovereign cloud contract

for one hundred and eighty million to talk

through.

We've got the big copyright settlement

against Anthropic called the BART versus

Anthropic.

I know we have some random stories,

two random stories.

I think you've got one, I've got one.

We'll save those for the end as a

little bit of a teaser and a bit

random.

So yeah,

shall we crack on or anything you want

to add?

Yeah,

I'm just going to say a bunch of

companies keep getting hacked.

So if we get to it,

if we have time,

this is just business as usual,

just companies getting hacked.

So maybe we can talk about those as

well.

But yeah,

let's just jump into the first one.

Exactly.

Just another day, another hack.

That's it.

That's how it rolls nowadays in twenty

twenty six.

So, yes,

I guess the sovereign cloud award,

the one hundred eighty million not gone to

a US based company.

Do you want to talk that through and

then maybe why you think that's happened?

Yeah,

so the EU has awarded a hundred and

eighty million euros to a bunch of

European companies like Thales,

S three and S,

Scaleway and a few other companies for a

six year contract to build

european sovereign cloud and microsoft aws

and azure google currently holds seventy

percent of the cloud market in in eu

as well so they didn't get any piece

of that and that's perhaps to the point

that you know the us cloud act that

came out that allows the american

authorities to access data uh but that's

held by us companies no matter where it

is

So, you know, this is EU has said,

why don't we start building our own cloud?

So, you know, legal compliance,

data protection,

and all of that can be transparent rather

than relying on an American company.

So that's come out in the last couple

of weeks, which is big news.

I don't know what your take on that

is.

Yeah,

I think because there was talk around them

setting up a company,

like a European-based company,

and then that didn't really make any

difference because the holding company was

still going to be,

could be like Alphabet or Amazon.

So actually,

it still came under the same jurisdiction

and really didn't really make much

difference.

I believe.

So then, yeah,

I guess this was then they've decided not

to go US at all.

I don't know if you've heard of the

FedRAMP stuff where there was like a big

investigation into how Microsoft do their

cloud.

So it's like federal risk and something

management program.

I can't really remember.

But basically the US government basically

gatekeeper for cloud security and they did

loads of investigation and they were like

trying to figure out how Microsoft did

things and I think they spent something

like five years or some insane amount of

time trying to work things out on like

how secure

their cloud was,

but actually in the end they just gave

up because they couldn't really get to the

bottom of it and then just announced that

it's kind of fine anyway and you can

kind of use it.

And then I think they figured out that

they had like,

I think something like the people in China

maybe were having access to things or

something or other.

There was those things that kind of came

out of the woodwork.

following it on um but yeah so maybe

that's a part of it too like essentially

can't really trust even if you did do

a secure cloud i guess there's a question

on how secure secure really is for

government data and whether truly um

they're being transparent and honest

anyway but yeah yeah

think the interesting thing also yeah i

agree with about transparency but the

interesting thing here which i found is

mistral ai i don't know if you come

across them as a french ai company yeah

so they are in in their consortium that

won this hundred and sixty million euros

to do this so i think europe is

signaling that

We're trying to build our own sovereign AI

stack as well because now, yes,

cloud is an important thing,

but having your own GPUs so you can

train your own models and be in charge

of your own destiny is probably also very

important because we spoke about it

previously how AWS and how Google and

Microsoft are trying to break the NVIDIA

monopoly by having their own chips.

So I guess it's good for consumers as

well that these companies are hopefully

will be building the...

cloud that lives in european sovereign

land i know a lot of people have

to say this and and i think going

to these conferences and speaking to

people within europe the um the feeling is

that we should have our own well as

europe should have our own uh cloud in

which we can host our

I'd be transparent about how things are

being hosted.

So I think it's... I mean,

the thing is, right,

the EU is putting up €EU €EU to

build our own sovereign club and reduce

reliance on US providers.

Now, people might say,

is one hundred and eighty million euros

enough to rival the cloud giants.

And well,

let's just say it's like trying to out

pizza the hut.

Right.

But you have to start somewhere,

maybe with a sovereign slice.

So that's basically what's happening.

So we Europe is taking a stance and

starting up.

And of course,

there's a lot of other smaller companies

and large sized companies as well that are

out there.

But I don't know if we ever see

like a hyperscaler type of provider come

out of Europe.

I don't know what you think.

I think it's going to be a bit

tricky to see because they hold a lot

of real estate where all this stuff is

hosted.

I don't know if we'll see a European

come out.

I don't really understand how it's going

to work, though, because to build a cloud,

what does that mean?

Do you know what they mean by build

a cloud?

Are all these independent companies...

going to somehow come together and unify

in like a common way of

like managing and upgrading and deploying

things to the cloud or they've just

basically diversified the pot and you're

going to have five different types of

solutions to maybe work with that are all

different.

What's the end game exactly?

This is the Airbus equivalent of cloud,

John.

You know how Airbus,

we have all the European companies come

together and make a plane that can rival

Boeing.

from europe that's the same so they will

work together of course they will have

some uh some competing technologies but

that's the same as what was done with

aerospace monopoly the that boeing used to

hold so it's a it's an interesting time

And it is an exciting time that there's

more people entering the market.

So I'm looking forward to what comes out

of it.

We'll keep an eye on it.

Of course, John,

we can let people know how it progresses,

perhaps in the next few episodes,

maybe in the fifteenth or so.

We'll let you know where we are with

this.

I mean, yeah,

we should probably move on a little bit,

but I just don't know.

Why don't they just invest in?

It's a bit controversial for me to say,

but why not just invest in one company?

to create an Amazon or a GCP equivalent,

but high investment.

And then poach loads of the people from

Amazon, Google, and Azure,

who obviously have driven data centers and

know it and make the package really

compelling,

sponsor them to come over here.

Because in the end, it's talent-based.

not necessarily company based.

So it's like, you know,

you probably want to get the talent in

that's got the understanding of cloud and

the cloud services and certain things,

not just necessarily the companies that

hold the talent,

if you see what I mean.

Yeah,

but then some of these companies have

expertise in some of these areas already.

Why wouldn't you bring them together and

use them so you can create one solution,

I guess?

But giving it to one provider,

one company,

probably be a bigger gamble than dividing

it between six or seven.

And I understand what you're saying.

I think it's more of a gamble.

Really, I think the reverse.

Only because...

like they're all going to be trying to

have a slice of the pie.

They're probably looking at like,

you know,

a hundred and eighty million isn't that

much.

I mean,

like one department inside of government

spends more than that in one of the

cloud providers a year anyway.

So you're like you've basically put like a

single department in the UK spends more

than that in a year with Amazon or

whatever.

Right.

So you're like, well,

you've basically divided up of quite a

small amount of financial value.

And they're all going to be picking at

that pie and trying to work out how

to proportionate it,

all trying to work out what the

contractual model is around it.

And how do we all work together?

What's the commercial legalities around it

all?

Who owns what?

Who's prime in this contract is like a

prime that's managing it all?

Or we all are equal?

And how do decisions get made?

So in the end, it's like,

how does it legally commercially work for

me is way more complicated than saying

maybe as Europe,

we all take a little bit of the

EU's funding, put it into some big bucket.

So we all contribute.

We set up like an independent company that

we've all got a stake in proportionally.

And then we just steal those of talent.

I mean, literally,

my idea is probably crap as well and

crazy,

but it just feels like this is going

to be harder for people to agree

commercially, I mean,

not necessarily on the technicality of

what needs to be done,

but the commercial element around it feels

harder to agree upon.

Yeah,

but this is the EU way of doing

things, right?

Like we have these companies that are

split out.

Perhaps maybe, John,

you want your company to win this contract

and do the whole thing.

But I think this is just a statement.

We are in this.

So yeah,

that's why I'm saying this on this podcast

today,

because I'm a little bit bitter about it,

actually.

I was joking.

I'm just having a prejudice.

Okay,

so you kept it all in until we

got into the podcast and just started.

Yeah, I'm messing up.

It's just a statement of intent, John.

As you correctly pointed out,

€EU €EU is not a lot of money,

but it's a statement of intent.

So I think it's a good start.

It's a good start.

It is.

So another story, which is quite a big,

I think this is quite a big one.

I think this is going to be happening

a lot, I imagine.

And there's probably going to be other

lawsuits.

But the BART versus Anthropic.

So one point five billion copyright

settlement has been kind of finalized.

It's like one is like three hundred

million has to be paid.

And another four hundred and fifty million

gets paid in September twenty,

twenty seven.

This is to do with like Libgen and

Polybi,

which is like these pirated book

repositories that were online.

Apparently the model was trained on these

basically other authors books through like

a pirated route, I guess,

to basically get the data.

And then it's basically a new class of

action was then put in place against the

Meta as well.

And Mark Zuckerberg personally,

I believed,

I think he was named personally because he

gave authorization for LibGen scraping,

basically, I believe.

And so there's like a personal one,

the one against Meta.

But anyway,

essentially what they're saying is you've

stolen

from us, really, through nefarious means,

and trained your model on it,

and now you're competing with us from our

work, really.

What do you think about that?

So, of course,

we should say before we start,

we don't have any legal expertise, right?

Because this is a legal matter.

So there's two legal terms here.

Well,

but the lawyers have got an opinion on

it because that's what's happening.

So there's two terms which are being used,

fair use and transformative use.

Fair use is if you legally purchase a

book and you use that book for training

purposes, sharing in a course,

writing a review about it, that's allowed.

The other one is a transformative use,

which is you take your book and you

create something else out of it which does

not stop the sales of the original book.

Inspiration.

Take inspiration from it.

Yeah, exactly.

Exactly.

It's like sampling a track.

I took a track.

You can sample a track and you can

create a new one.

So what the judges said is the fair

use, the books that you bought...

The books that you bought and you train

the model on, that's kind of fair use.

Some of the people are not happy with

it.

The publisher is not happy with them

because what you're not doing with that is

stopping the sales of the original book.

You took the information,

you trained yourself.

Basically,

you turned it into the AI learning process

was considered as transformative use.

So it analyzed the pattern,

it created something new.

You can't go to ChatGPT or Claude and

say, can I read the War and Peace?

It's not going to give you the text

to read the War and Peace.

It'll give you a summary,

which is allowed.

So

Which is fine.

I don't know if some of the publishers

are not happy with that.

But the thing about these pirated books,

that's the thing that's caused the issue

for them to say, right,

now you're allowing pirated books to be

downloaded and trained.

That's not allowed.

But it still doesn't answer the question,

is training your model using books,

is that fair use or not?

The court says it is.

People don't really agree that this is

fair news.

This is something new.

It's kind of like two things are being

mixed up.

You might remember Napster, right?

I know it's before your time, John,

but Napster back in the day where you

can turn songs illegally and listen.

And then Spotify came in and then they

said, okay, you know what?

You can stream the music,

but you have to pay like ten pounds

a month.

And the singers will only get like zero

point zero five cents of every stream,

every million things that they stream.

it's yes it's been things have been agreed

upon that payouts need to be given up

but i don't know this opinion i don't

know if this is gonna go away anytime

soon so basically wasn't though the main

issue that they basically used pirated

yeah as in like it's which is technically

like you just said with like napster or

like or any torrent bit torrent you know

I mean,

BitTorrent itself obviously is not the

issue,

but basically it allows people to download

the software,

obviously using other software, sorry,

download the assets using other software,

obviously published by others on servers,

BitTorrent servers,

which then means most of those would be

like movies or books or whatever else that

people decide to share to save people

money.

But they've done it as a company.

Apparently they then trained on pirated

information which they hadn't bought.

And so that is obviously surely is illegal

because that is illegal.

But the case when it started wasn't just

about the pirated companies, right?

It's about the fair use as well.

But that question is still open because

according to that case,

transforming the information to something

new that's allowed.

So the publishers somehow still need to

get their information back.

They still need to get their investment

back.

So yes, the pirated ones,

that's absolutely bad.

But I don't know if we got to

this is the final time,

the last time you're going to hear about

this.

What we've seen is that AI companies may

be allowed to train on copyrighted books

under fair use,

but only when you obtain the book legally.

However, if you buy it...

Didn't New York Times have the same thing?

Didn't New York Times win a court case?

I'm sure they had one as well.

They probably did.

So if you get the book legally,

you are allowed to train your models.

But buying the book and teaching your AI

from it might be okay and downloading it.

But if you download it from

totallynotpiratedbooks.com,

that's not allowed.

So just buy a book, one book,

and train your model.

Yeah,

that does seem a bit unfair because

training a model,

like me training a model on a book,

not so much risk.

Just as a person, I mean,

say I want to do model building.

It's only me that uses that model.

which is the same as me referring to

the book manually myself anyway,

there's not very much risk.

Me though, publishing the book,

say I then took that book,

then published that book on the internet

for others to then read at high scale

um because i've like say this podcast is

like going through the roof and we've got

bazillions of followers right me then

publishing the book or narrating the book

on this podcast with our bazillion viewers

surely can i do that can i can

i read each chapter out loud to bazillions

of people on a podcast and that's not

copyright

That's not fair use, right?

So this is you falling into fair use.

But the model that's trained on the book,

which is a service for lots of people

to then use,

that's very different to a single model

that an individual is using.

Yeah, because now, according to the court,

they decided that's transformative way,

right?

So you've transformed the information.

So that's fine.

The only thing they got caught upon is

downloading the pirated books.

So basically,

buying the book legally and training your

AI on it is fine.

But if you download it from...

piratebay.com that's not fine and train

your model so you can still you can

train the model yeah that's that's what

they came up with but i don't think

this is the last we're going to hear

of this we will still it will still

keep yeah there'll be loads of this and

we also got a little confession from you

for using napster back in the day to

download uh software you see you've seen

very important i've seen uh i've seen i've

seen the the the social what is that

movie but the facebook movie

oh the social dilemma is it or something

the social network the social that's the

one it is actually a dilemma having us

uh having having some of these social

media is a dilemma but anyway yeah that's

that's what that is yeah so uh i

guess the random story getting into the a

little bit of a random story what what

is your uh

Random story of the week.

My random story,

which is not too random anymore,

is that Stanford and Imperial College

carried out an Internet Archive study,

and they found that by the middle of

last year,

about thirty five percent of the newly

published websites were generated or

assisted and almost twenty percent were

fully generated.

So basically,

the research is linked to what they call

a dead Internet theory,

that the idea that bots and AI are

dominating the online content.

So imagine this.

Imagine this.

You go into a library.

And you start looking at the books.

Every third book is written by somebody

who doesn't sleep, doesn't eat,

doesn't pay taxes,

doesn't get writer blocks.

How original it is, I don't know.

But this is what's happening right now.

Now,

do you care if the information that you

get is useful or

transform your life i'm not sure does this

provide value that's fine but a lot of

the information online is now going to be

ai generated i've seen a lot of posts

and comments on the posts that are

generated by ai and people actually

accepting that they

put a AI generated comment,

like their bots reply to LinkedIn posts.

So this is a reality is not so

random.

But this is a reality.

That's what's mental.

It didn't.

There's loads of like,

there's some people that were MPs here

that I think also wrote a book,

or maybe might have been the US I

can't really remember.

they basically wrote a book with um with

ai it was like it was really obvious

everyone was then commenting on it saying

this is like the most ai written book

ever published um because it's super super

obvious that it is written by ai so

yeah i guess but it's just going to

be not very the worth or value of

something that can produce something

really easily it's like

It's not high value, is it really?

I don't know how popular those things are

going to be.

I wouldn't want to read an AI generated

book.

Well,

if you don't know it's an AI generated

book and you read it and you liked

it, what would you do?

You probably never see me again.

You'd be like, where did John go?

I'd just be in the house all the

time,

paranoid that people can tell when they

see me that that guy likes AI books.

Somehow it would translate maybe in my

eyes.

When people look at me,

they'll be able to see.

So I'd probably never be able to leave

the house, I think.

I'd be locked up inside.

Would you accept textbooks like,

for example, a match textbook?

There was a there was a thing going

around where somebody published a match

textbook for a school and there was an

equation that they're helping us solve.

And there's a table underneath.

It says, oh,

if you want me to explain a different

way, let me know.

Right.

So they just copied and pasted whatever

the model said.

So would you accept a textbook perhaps for

like school?

Maybe.

Yeah,

I think I think anything that's I think

creativity.

is something i mean obviously figuring

things out and that's that's not something

necessarily we specifically have created

it's that it's it's the calculation of the

things around us it's the things that

already existed nature or you know the

parameters different but something that

somebody else has created or manifested

you know through creativity is a very

personal thing that is affiliated with

that individual their perception of the

world that

as another human being should be very

relatable.

And so the characters or the person,

wherever that stemmed from,

including music or art or whatever else,

I do feel there is a protection because

it's got a sense of being mocked when

AI does it.

There's a mockery to it that's like it

doesn't really understand any of those

things that are quite humanistic.

So therefore the value of it's like a

little bit insulting when something

pretends to understand those things and

then just can knock it up rather than

factual things.

But yeah, I've got a really quick one.

I'll get it out of the way.

But there's basically a wolf in Korea that

escaped from a zoo and was on the

rampage.

And some guy basically created an AI

generated image of this wolf in his area

and then posted it,

I think on social media.

And then everybody freaked out and all the

schools closed in the area.

And I think hospitals were obviously

panicking because of the wolf and all

these other things.

So I think there was a lot of

police and people reacting to it.

And they basically put him in jail.

He's gone to jail in South Korea for

basically causing a disruption by using AI

to pretend that the wolf was in an

area that it wasn't really in.

The rule of law around AI is coming

for people nowadays.

It's hit or miss.

You never know what's going to happen.

They're probably looking for the wolf on

the wall street.

All they found was a wolf made by

AI on the streets.

Some guy.

I mean,

I suppose if I dressed in a wolf

outfit and was prowling around the

streets...

might not get arrested.

You know, how convincing you are,

like pretending to be a wolf.

John,

no one's ever seen a six foot three

wolf, right?

So if you dressed up as a wolf.

Not that they haven't, but they soon will.

They soon will, my friend.

John in a six foot,

John in a wolf costume.

I'd love to see that.

yeah i'll have to uh but yeah ai

ai uh generated um wolves are a no-go

in south korea so don't even think about

it um fair enough but anyway so i

know it's a bit of a a quicker

one for us uh this week um but

but um back at back in action since

my holiday but

Yeah,

we'll be tuning in next week as well.

And we shall speak to everybody soon.

Thanks for listening.

Speak to you soon.

Bye-bye.

Cheers, bye.

Creators and Guests

Salman Iqbal
Host
Salman Iqbal
Salman is an experienced Cloud, Data and AI leader, lover of all things AI, Cloud, Platform Engineering and Development tooling.
EU Cloud Wars, AI Copyright laws and Chaos
Broadcast by