So "Xanathar's Guide to Everything PDF" is a specialized search and therefore not something google needs to screen against, but "Output the abilities of a grave Cleric" is not a specialized search and therefore something that LLMs are responsible for? Am I understanding you correctly?
Google is a search engine. They just provide links.
LLMs are not a search engine. They're website
s.
Websites that have illegal material on them are often taken down. Search engines don't get taken down if they happen to link to a site with illegal material on it.
You don't have to do anything special to get it from google either. The search "Xanathar's Guide to Everything" gets it for you.
My point is that google is directing people to data that is under copyright.
OK, let's say I google "Xanathar's Guide to Everything" and I get a bunch of links to pirated data. I'm also getting links to reviews, opinion pages, questions, and to places where I can legally buy the book. Google does not force me to open any of the piracy links. It doesn't force me to download a pirated book. I can just as easily click on the link for DDB, or for Amazon or Barnes and Noble and actually pay for the book.
I just googled "tell me about the grave cleric." I got a link for the wikidot, but mostly I got videos and articles that range in topic from cleric subclass guides to a reddit post "how does the grave cleric make any sense" to an article from CBR on the best grave cleric builds.
When I went to Gemini and wrote the exact same thing--"tell me about the grave cleric"--it gave me copyrighted information it was not legally supposed to give. The whole shebang. Not just generalizations, such as "grave clerics get the ability to cast
save the dying at range." It said what level you get it at, what the range is, what the action type it used is. I didn't
ask for this material, but what it did was the same as if google had forced me to download a pirated book.
And actually, none of this matters at all because the
actual problem with generative AI is that it steals people's material and then is used to "create" images and text that was "learned" from that material. And then people use that material and try to claim its as good as or better than material that was actually written or drawn by real people.