The Times's lawsuit does allege, in part, that (1) the training data was not licensed and therefore OpenAI has committed copyright infringement, and (2) the resultant model is a copy or derivative work of The Times's body of copyrighted articles.
The regurgitation is merely evidence of those two claims, so putting a filter on the output explicitly does not resolve the case.
The question is whether, legally, you need a license merely to view copyrighted material. Training doesn't copy anything; I think this is where people are confused. People assume that is how training works because they have a false intuition about how LLMs must work.
LLMs are not data archives; I don't know how many times this has to be repeated.
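To make that concrete, here is a minimal sketch of a single next-token training step, in PyTorch, using a toy stand-in model and random placeholder token ids (illustrative only, not anyone's actual training pipeline). The text enters only as a gradient signal and is discarded after the step; what persists is the updated set of float parameters.

```python
# Illustrative only: one next-token training step.
# A toy model and random token ids stand in for a real transformer and corpus.
import torch
import torch.nn as nn

vocab_size, dim = 50_000, 64
model = nn.Sequential(
    nn.Embedding(vocab_size, dim),   # token ids -> vectors
    nn.Linear(dim, vocab_size),      # vectors -> next-token logits
)
optimizer = torch.optim.SGD(model.parameters(), lr=1e-2)

tokens = torch.randint(0, vocab_size, (1, 128))   # placeholder for a text chunk
inputs, targets = tokens[:, :-1], tokens[:, 1:]

logits = model(inputs)
loss = nn.functional.cross_entropy(
    logits.reshape(-1, vocab_size), targets.reshape(-1)
)
loss.backward()        # the text influences the weights only through this gradient
optimizer.step()       # parameters get nudged ...
optimizer.zero_grad()
# ... and the token batch itself is discarded; the model keeps only updated
# weights, not a stored copy of the document.
```

Whether those weights nonetheless end up memorizing enough of particular articles to count as a copy is, of course, what the regurgitation examples are being used to argue.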
> Defendants’ generative artificial intelligence (“GenAI”) tools rely on large-language models (“LLMs”) that were built by copying and using millions of The Times’s copyrighted news articles, in-depth investigations, opinion pieces, reviews, how-to guides, and more.
They need to be confused because they need the judge to be confused too.
NYT isn't suing because LLMs will print out NYT articles for free. They are suing because LLMs are poised to be better/more favorable news reporters than them. It's a long-term survival case, not a copyright one (despite copyright being the weapon used in the fight).
I agree. That said, if AI puts journalism out of business, then AI will quickly run out of content to train on and to report. I think this is a situation where technology has gotten way out ahead of the law.
So if they fix this problem, you’d then be OK with generative AI?
BTW, plenty of humans have memorized copyrighted material, such as song lyrics. Do you think that should be prohibited? Maybe the difference isn’t as great as you think.
This is it, isn’t it? That’s why all the effort pours in to make these machines produce Rembrandt and Tolstoy copies. And why I still have to do my taxes by hand rather than the machine handling them with speed and accuracy.
It’s the core of it all: jealousy of a creative spirit.
Artists are not machines, but living souls.
If you were remotely open enough to see for yourself, you wouldn’t struggle to engage with the world in a creative manner, and you wouldn’t feel that jealousy but rather encouragement at what you see pouring out of your fellow humans as a reflection of each other.
No machine will grant you that understanding; you just have to engage directly.
It will never succeed in supplanting it, no matter the billions of dollars burned to try.
AI doesn't make art, it makes images; it's like a camera in this way. Art is in the composition, the message, and the aesthetic of the person using the tool to create an image.
Using an AI tool to create an image of a painting betrays the person who seeks to be an “artist” by short-circuiting the practice that leads the prospect down their path of enlightenment through mastery.
In our constricted 3D world, there is no circumstance in which an algorithmically generated image of a painting can equally serve the prospective artist in the procedural, internal work that the practice performs on them. There is no other pursuit in art, and anyone pursuing it will come to that conclusion in any number of ways, but always through submission to the course of mastery (for which there is no shortcut).
Worse, the companies at the helm of this side of the technology are pushing it in order to stand as middlemen to humankind’s modus operandi: to create.
Keep your mind open to perspectives beyond the software industry.
I do think a new way of thinking about copyright is needed for AI. Allow tech firms to train on all material, but there should be an AI tax that serves as compensation back to the public commons for what was taken from it and privatized.
The status quo favors large players who can navigate the legal system.
Training on copyrighted content as research is entirely fair use. Just require that the fair-use defense hinge on that research being public, i.e. that it's only a defense of open-weight models.
A tax on AI is stupid because the big players can dodge taxes well and have the ear of power now, so any regulation would favor them. It would only serve to prevent challengers to their dominance.
The controversy here is that LibGen doesn't legally distribute its content. Mass-scale training on pirated content is... legally murky, to say the least.