There’s a disconnect in the conversation about AI regulation. While governments focus on labeling AI-generated output (like deepfakes), creative professionals are concerned about the copyrighted input used to train these models without permission or compensation.
You’ll learn:
- the latest AI content labeling laws in India, China, and the EU.
- breaking news from Australia, which is rejecting a “fair use” style exemption for AI training data.
- the difference between the government focus (labeling, misinformation) and the creative-pro focus (unauthorized data scraping).
- about the 82+ copyright lawsuits filed against AI companies.
Sources:
Australia status:
- https://ministers.ag.gov.au/media-centre/albanese-government-ensure-australia-prepared-future-copyright-challenges-emerging-ai-26-10-2025
- https://www.abc.net.au/news/2025-10-27/labor-rules-out-ai-training-copyright-exceptions/105935740
India status:
- https://indianexpress.com/article/business/creators-mandatorily-declare-upload-ai-content-online-draft-rules-10320467/
China status:
- https://mlq.ai/news/china-implements-sweeping-mandatory-ai-content-labeling-law/
- https://harris-sliwoski.com/chinalawblog/chinas-new-ai-labeling-rules-what-every-china-business-needs-to-know/
- https://www.chinalegalexperts.com/news/china-deep-synthesis-regulation
EU status:
- https://www.ibm.com/think/topics/eu-ai-act
- https://digital-strategy.ec.europa.eu/en/policies/regulatory-framework-ai
- https://news.northeastern.edu/2024/06/13/eu-ai-act-regulation-law/
Canada status:
- https://creativefirst.film/canadas-creative-sector-uneasily-awaits-the-carney-governments-next-steps-on-ai-training/
- https://ised-isde.canada.ca/site/innovation-better-canada/en/artificial-intelligence-and-data-act-aida-companion-document
- https://cdec-cdce.org/wp-content/uploads/2025/09/EN_CDCE_Priorities_October_8.pdf
Lawsuits:
Davie504:
Washington Post
- https://www.washingtonpost.com/technology/2025/10/22/ai-deepfake-sora-platforms-c2pa/
- https://archive.is/58ubo
Sora guardrails:
https://www.nbcnews.com/tech/tech-news/openai-sora-2-guardrails-sag-aftra-bryan-cranston-rcna238715
Video about Sora and video generation:
https://www.youtube.com/watch?v=eCcVA94N7L8
==========================
About and Support
==========================
Written, edited, and hosted by Jen deHaan.
Find this show on YouTube at https://youtube.com/@humaninternettheory
Subscribe to this show's newsletter for additional resources and a free 3-page workbook when you join https://humaninternettheory.com
Produced by Jen deHaan of StereoForest https://stereoforest.com
Contact Jen at https://jendehaan.com
==========================
Connect on Socials
- https://linkedin.com/in/jdehaan
- https://tiktok.com/@jendehaan
- https://instagram.com/jendehaan_
- https://jendehaan.bsky.social
==========================
Support
Your support will help this show continue. Funds will go towards hosting and music licensing for this show and others on StereoForest. This show is produced by an independent HUMAN artist directly affected by the state of the industry. StereoForest does not have any funding or additional support.
If you find value in our shows, please consider supporting them with a one time donation at https://stereoforest.com/tip
We love our podcast host Captivate.fm! Contact me anytime to ask me anything. You can support my shows by signing up with Captivate here: https://www.captivate.fm/signup?ref=yzjiytz
==========================
About Jen
Jen's professional background is in web software technology (audio/video/web and graphics), working for many years in Silicon Valley. She has worked in instructional design, writing, marketing, and education in the creative space. She was also a quality engineer for a while.
Jen became involved in performing, acting, and improv in 2015. She taught dance fitness classes (despite beginning with two left feet), performed in community theatre, and taught and coached improv comedy and acting at several theatres. Jen was also the Online School Director and Director of Marketing at WGIS.
Jen's website: https://jendehaan.com
This podcast is a StereoForest production. Made and produced in British Columbia, Canada.
Transcript
::(upbeat music)
::- Governments are mostly focused
::on labeling AI-generated content.
::Creative professionals like you and me
::are more focused oftentimes on the copyrighted data
::being used to train the LLMs.
::So today we're going to look at the very differing concerns,
::what countries have made, what legislative measures,
::and what all of this means for creatives like us.
::Now there's a pretty big disconnect
::in the global conversation about artificial intelligence.
::So governments in India, the EU, and the US
::are primarily focused on regulating the output of AI.
::And these regions are acting on potential societal
::or business and political harms of things like deep fakes,
::misinformation, and other forms of deceptive content.
::But creative professionals, we are often focused
::on what's going into the LLMs,
::like what's being scraped basically from the internet.
::We are pretty concerned with the unauthorized
::and very uncompensated scraping of copyrighted work
::to train AI models in the first place
::for these corporations to profit from.
::And in this episode, we're going to look at this input
::versus output gap in the conversation.
::I'll cover some of the latest legislation aimed
::at regulating AI, which is usually not addressing
::the primary economic and ethical concerns of creators,
::but at least doing something to help identify
::what is synthetic versus what was created by a human
::is good.
::And just this week, a couple of days ago,
::we got a major update from Australia that shows
::that some governments are finally starting to listen
::to the people impacted by all of our IP
::being scraped into LLMs.
::Hi, I'm Jen, and this is the Human Internet Theory.
::And in this show and podcast, I talk about changes
::to human creative content on the internet
::and how that is affecting the creative professional industry.
::What we need to know and when possible,
::what we can do in response.
::Before we get to the core issue for creators,
::we will first look at what governments around the world
::are actually doing right now.
::So let's start with India.
::The government there has just established rules
::for what it calls Synthetically Generated Information, or SGI.
::And this is defined as information that is artificially
::or algorithmically created, generated, modified,
::or altered using a computer resource in a way
::that appears reasonably authentic or true.
::This framework puts a burden on both the user
::and the platforms that are displaying this content.
::So users of those platforms have to declare
::if their upload is SGI.
::That's kind of one issue right there.
::Then the platforms must use technical tools
::to verify that declaration.
::So if the content is SGI, it has to be clearly marked.
::And that clear mark is covering 10% of the surface area
::for visual content, or the first 10%
::of the duration for the audio.
::So this is applying to all content
::from creative work to those more malicious deep fakes.
::Now, China has also implemented new requirements.
::AI generated content must be labeled with a watermark
::that covers at least 5% of the shortest side
::of the content, along with metadata
::that tags that content.
::So the tech platforms there are legally required
::to enforce this measure.
::The EU has its AI Act now,
::which applies to any provider in the EU market.
::So most generated content is falling
::into what they call a limited risk category.
::And according to IBM,
::this category imposes transparency requirements,
::which means disclosing that content that's AI generated.
::And I'll put that article in the description.
::So this obligation is on the provider of the AI system,
::while the disclosure obligation for deployed content,
::say deepfakes,
::is on the person who is deploying that content.
::So the USA and UK have very little in place right now.
::Now in Canada, a new minister of AI has been appointed,
::but new legislation is still just in the planning stage.
::It seems like old legislation was kind of scrapped
::when this change occurred.
::So this is now restarting with the new Canadian government.
::This kind of inaction or slow action
::from many of the Western countries leads to many court cases.
::That's what happens when there isn't really
::legislation in place.
::So that's what makes the recent news from Australia
::pretty important and interesting.
::On October 27th, just a couple of days ago
::from the time of recording,
::the Australian government confirmed
::that it is ruling out a fair use style of exemption
::for AI training, for that ingesting of all of IP
::into LLMs.
::And that exemption of fair use
::has been what corporations have been pushing for.
::They want that exemption so they can use all the stuff.
::So this is a really positive sign
::for creative professionals and all creatives anywhere.
::So the government's position is that AI models
::using copyrighted works for training
::already require a license
::under current existing Australian law.
::So it doesn't sound like Australia is really interested
::in creating a new loophole for the tech companies to use.
::And Australia is moving in a direction
::that supports and protects artists and creative professionals.
::So this is a good thing,
::which is why it's also so important to voice concerns
::and advocate for creative protections of IP
::remaining in place for the future.
::So the global regulatory focus is mostly,
::not entirely, but mostly on labeling AI content,
::on transparency and declaration,
::if anything is in place at all.
::But that focus really kind of misses
::the main issue for creators.
::So a good example of what's happening right now
::is the musician, the YouTuber, Davie504.
::He discovered that an AI service,
::which was kind of implied in those videos,
::I'll put the links in the description,
::you can check them out,
::allowed users to upload his copyrighted recordings
::into the service.
::And then they used those stolen songs
::to generate new derivative songs.
::And this example is just one of many
::of why creative groups are lobbying governments.
::So in Canada, the Coalition for Diversity
::of Cultural Expressions, or the CDCE,
::is pushing for three measures.
::And one of these measures is a requirement
::for AI developers to disclose the data used
::to train their systems.
::And I'll put that source in the description.
::So we see this sort of thing happening in the US as well.
::So the WGA strike that was fairly recent
::led to contract protections for creatives.
::SAG-AFTRA and the RIAA have been lobbying hard as well.
::SAG-AFTRA for protections against deep fakes
::and the RIAA for copyright protections.
::And the WGA strike, for example,
::won those protections that AI cannot be used
::as a source material or to rewrite scripts.
::And this protects writers' credits
::and compensation as well.
::So speaking of writing, join my free newsletter
::at humaninternettheory.com
::and I'll send you some real human writing.
::Mine, and I'm working on improving my segue.
::So you're gonna see different ones all the time.
::So because we don't have many or any regulations
::in place for ingesting all of the data
::and creative assets that lead to a whole bunch
::of unauthorized derivative work,
::creators are often taking matters into their own hands.
::And as such, this has led to a whole bunch
::of class action lawsuits.
::According to the website
::"ChatGPT Is Eating the World,"
::as of late, 82 copyright lawsuits have been filed against AI companies worldwide.
::And 56 of those are apparently cases just in the US.
::I'll put a link to that in the description, of course.
::So clear labeling of synthetic content is important.
::That helps the humans consuming content.
::That's all of us.
::It helps us identify human-made creations
::when it's now getting really hard to tell what is what
::and what's real, what's not, what's a deep fake,
::what was created by hand versus computer.
::Now that copying is so accessible and easy
::to everyone to do.
::The labeling also demonstrates to the internet people
::consuming all of this stuff,
::just how much content is being generated
::without much of any human oversight.
::And some estimates are that it's already over half
::of the entire internet.
::And this is going to allow people, the humans,
::to make choices about what content they even engage with.
::Like if you can't be bothered to even read
::what you've pasted onto the internet,
::why do I want to consume it, right?
::But a regulatory focus on labeling,
::that misses a couple of things.
::Like will people even use or declare what they're creating?
::For example, watermark removers are very common,
::very popular in the app stores right now.
::And will the platforms even disclose what is declared?
::An investigation by The Washington Post
::tested eight major social platforms
::by uploading a video containing the standard metadata
::to flag synthetic content.
::And only one platform correctly added a warning label
::to let people know that the content was generated by AI.
::So there's still a lot of issues
::in the labeling conversation.
::So that makes the more pressing and pertinent issue
::in all of this the copyright part of it.
::Derivative works are being created for material
::that was never intended to have derivatives.
::And access to something on the internet, as we know,
::does not give anyone the right to create derivatives from it.
::And even if derivatives were allowed in the licensing,
::the current usage is often incorrect
::because credit is often required as part of that license.
::And then it has to be released under the same licenses again.
::So the 82 lawsuits show what happens
::when regulation isn't in place or the regulation fails.
::And this is exactly why the Australian government's
::decision to enforce existing law
::is pretty darn refreshing to hear this week.
::So the copyright owners are being forced
::to take on this fight themselves in the courts
::because lawmakers are kind of focusing
::on the output side of the thing
::and are not working on the input, the copyright issue.
::And as a creator, this input issue is what I'm concerned about.
::Obviously, by this point, the part where our assets
::are being used by these LLM corporations without asking,
::and those LLMs are allowing all of these derivatives to happen.
::And that is the real fight here.
::AI companies are building their models on an opt-out basis
::quite often, even if they're allowing the opt-out,
::even if that opt-out works, which is why we're seeing
::how the Sora 2 app came out with the opt-out
::and there's a whole bunch of derivatives
::still being created from that material.
::And I'll put a link in the description
::to the video I did regarding that.
::So this is where the LLMs are kind of taking all the work
::and allowing these derivatives
::and the creatives need to fight instead
::to have it removed from the dataset after the fact.
::That's broken.
::And that's not how things generally work.
::So I believe we need to be advocating for this opt-in model,
::like we're seeing Australia support,
::which is the standard for all of our licensing in the past.
::One more thing to think about is how addressing the input,
::the ingestion, the scraping, the copyright stuff,
::that also could help what the lawmakers are trying
::to address with this labeling.
::Lawmakers want to stop the harms on the output,
::like the misinformation and the deep fakes.
::That's what they want.
::And that could be accomplished to some degree at least
::by controlling or checking the training data
::that's being used on the way in.
::And that work can involve checking for the bias
::or any dangerous material that causes the output harms.
::Anyways, all of this work can be opted into ethically
::and controlled more.
::And these kinds of models can then be used
::to create highly targeted, useful applications.
::And then those tools can be made
::that are very specific and useful and benefit humans
::and are on a smaller scale and use less energy
::and all that kind of thing,
::as opposed to the services now
::that reduce the number of apps and tools
::that are available to us for creativity.
::'Cause there's services that just like replace
::the tools entirely.
::But let's get back to us creative professionals now.
::So unions and organizations such as WGA and RIAA
::have the money and the resources to fight
::for the artists that they represent.
::And those are big organizations
::with collective bargaining power and money.
::So what about independent YouTubers?
::And individual writers and visual artists or podcasters?
::What about all of us that don't have those resources
::or organizations behind us?
::We can't afford to launch one of those 82 lawsuits.
::So therefore the input, what the LLMs are taking
::from us as well is kind of like the small guy
::versus big guy situation.
::And that's why we need regulation from the top down
::like in Australia.
::The current system of forcing the fight
::onto the court systems through these lawsuits
::only helps larger artists and people who can afford it.
::And many of us independent creators
::are kind of left out of the conversation.
::And I mean, assuming that the court cases
::actually succeed in the end, right?
::So right now the government, some governments,
::if anything are focused on the outcome, the symptom,
::the deceptive content like the deep fakes,
::but us creators are focused on what could be
::an actual solution to that,
::which is bringing the unauthorized
::training data under control.
::And this is a solution that at least one government
::is starting to recognize and that's great.
::And I'll be back soon with another episode.
::Bye for now.
::Episodes are written, directed, edited and produced
::by Jen deHaan of StereoForest.com
::Find out more about this podcast
::and join our free newsletter for additional resources
::at humaninternettheory.com.
::Find additional videos at the YouTube channel
::called Human Internet Theory.
::Links are also in the show notes.
::(upbeat music)
::(upbeat music)