The Firebird
Commoner
> I'm confused. Isn't this thread about a judge making a legal decision based on the work of a lawyer who used LLMs? So not everyone is pointing and laughing.
"Recent case" refers to the Grok meltdown, not the legal case.
> And this is too cute. It relies on the assumption that people just kind of blindly follow what the LLMs tell them. But the recent case shows people don't. They just point and laugh, the same dynamic that has occurred with print and news media for years.
I don't think you're being very realistic, and your own comment re: "news and print media" should point that out to you.
> I'm confused. Isn't this thread about a judge making a legal decision based on the work of a lawyer who used LLMs? So not everyone is pointing and laughing.
Exactly. It's very easy to say "Elon Musk said he'd improved Grok and then Grok said Hitler was cool and also it was MechaHitler, look, he biased it!", but when we're seeing actual judges taken in by entirely fictional arrays of cases (honestly the judge in question should be disbarred or at least er... disjudged?), we clearly have a problem beyond flippant "Omg its ez 2 tell" responses.
> recent case refers to the Grok meltdown, not the legal case.
The legal case is also a recent case, and is literally a case.
> The legal case is also a recent case, and is literally a case.
That was not what I said.
It's silly to point to two recent situations, one in which a judge was tricked and one in which a famously petty man messed with his own LLM, and then say "nobody takes LLMs seriously!"
> That was not what I said.
It kind of is though.
> It relies on the assumption that people just kind of blindly follow what the LLMs tell them. But the recent experience with Grok shows people don't.
People clearly do blindly believe information that has come from LLMs, directly or indirectly. Maybe it's not a huge percentage of people yet, maybe it never will be more than low double-digits. Probably techbros who think they can mind-control everyone are idiots. But that is part of the reason LLMs are being pushed, and this case with the judge shows that it can cause real-world problems, and that people knowingly use it for wrongdoing (don't believe for a second the lawyer didn't know these cases were made up - even as a legal researcher I got to the point where I knew certain cases would come up with certain legal topics).
> I don't think you're being very realistic, and your own comment re: "news and print media" should point that out to you.
Agree w/rt your comments on news media. I do not believe LLMs are categorically different.
> Countless people absolutely blindly follow what LLMs tell them - just not snarky people on BlueSky/Twitter.
Disagree. There were some good posts previously about adoption/skepticism to new technology. I'll bet MechaHitler helps in the sense that it makes accuracy concerns more salient.
> It kind of is though.
The point was "if LLMs go off the deep end w/rt bias, no one will take them seriously". Prestigious news site A can be 1) taken seriously 2) factually accurate 3) biased in what it chooses to report on and 4) required to maintain (2) to get (1). LLMs are confronted with the same, but the challenge is greater because their scope is broader. Anti-semitic rants or not, if you can't recommend a good dishwasher people will stop using your product.
> The point was "if LLMs go off the deep end w/rt bias, no one will take them seriously". Prestigious news site A can be 1) taken seriously 2) factually accurate 3) biased in what it chooses to report on and 4) required to maintain (2) to get (1). LLMs are confronted with the same. Anti-semitic rants or not, if you can't recommend a good dishwasher people will stop using your product.
That's exactly the issue.
> This doesn't follow. Grok being a black box did not stop us from detecting its bias
That’s because it was given its orders by someone with the subtlety of a ten megaton nuke. But the next version will be better.
