If the image or text was fed into the program for “training” and the user input the right prompt, then yes.Here's a question: can you prompt midjourney (or whatever) to recreate -- with any degree of accuracy -- a copyrighted work? Could you get ChatGPT to output a verbatim copy of a NY Times article?
I'm not asking a theoretical question.If the image or text was fed into the program for “training” and the user input the right prompt, then yes.
I didn't give you a theoretical answer.I'm not asking a theoretical question.
Not if they are blocked from doing so by built in copyright controls. that is what I am asking: can you use (for example) ChatGPT to "recover" copyrighted text? If so, that seems to be a slam dunk for a suit. But if not, it might be more difficult to prove the training violates copyright.I didn't give you a theoretical answer.
ChatGPT and other LLMs can regurgitate the text they were trained on. That is one reason you can spot things generated by them. They will spit out chunks of text that is verbatim from their training sources.
If copyright were respected, ChatGPT would be prohibitively expensive as they’d have to pay tens of thousands (possibly millions) of people for the right to make derivative works based on the copyrighted works used to train the LLM.Not if they are blocked from doing so by built in copyright controls. that is what I am asking: can you use (for example) ChatGPT to "recover" copyrighted text? If so, that seems to be a slam dunk for a suit. But if not, it might be more difficult to prove the training violates copyright.
Verbatim of a whole article? not by random chance, (UNLESS you have extremely limited training data - which isn't true of these LLMs). Anyways, you would have to purposefully engineer prompts to get to that result - like add this word, change this sentence, etc. and you would have to do so with the goal of ending up with a verbatim copy of that article by already knowing that articles contents and iterating on the output till it's there.Here's a question: can you prompt midjourney (or whatever) to recreate -- with any degree of accuracy -- a copyrighted work? Could you get ChatGPT to output a verbatim copy of a NY Times article?
Dont misunderstand me: I am trying to understand the degree to which output from these generative AI can violate copyright, and my question was about actual reproduction. There are other kinds of copyright violations, of course -- derivative works, as an example. It is important to me because I don't think generative AI is inherently bad in and of itself, but I do think violating copyright holders' rights is wrong.If copyright were respected, ChatGPT would be prohibitively expensive as they’d have to pay tens of thousands (possibly millions) of people for the right to make derivative works based on the copyrighted works used to train the LLM.
Since the copyright controls you’re suggesting do not presently exist, it’s a theoretical question. But no, that really wouldn’t work. You can’t both violate copyright to train the thing then somehow prevent it from violating copyright by spitting out the same text it was trained on.
the prompt: Can you give the first paragraph of pride and prejudice Verbatim?Dont misunderstand me: I am trying to understand the degree to which output from these generative AI can violate copyright, and my question was about actual reproduction. There are other kinds of copyright violations, of course -- derivative works, as an example. It is important to me because I don't think generative AI is inherently bad in and of itself, but I do think violating copyright holders' rights is wrong.
Mr. and Mrs. Dursley, of number four, Privet Drive, were proud to say that they were perfectly normal, thank you very much.
This paragraph introduces the passage of time since the events of the first chapter. It also foreshadows the fact that Harry will soon learn about his parents' deaths and his own magical heritage. The paragraph is written in a matter-of-fact tone, which reflects the Dursleys' indifference to Harry's past.Ten years had passed since the Dursleys had woken up to find their nephew on their doorstep. Nearly eleven years had passed since the Potters had died.
As you get toward more and more obscure concepts it becomes increasingly likely that there are fewer sources present in the model training data of that concept. If the LLM AI is going to make an exact reproduction of an article or image then this is where it would happen.Dont misunderstand me: I am trying to understand the degree to which output from these generative AI can violate copyright, and my question was about actual reproduction. There are other kinds of copyright violations, of course -- derivative works, as an example. It is important to me because I don't think generative AI is inherently bad in and of itself, but I do think violating copyright holders' rights is wrong.

(Dungeons & Dragons)
Rulebook featuring "high magic" options, including a host of new spells.