“These language programs do write some “new” things—they’re called “hallucinations,” but they could also be described as lies. Similar to how autocorrect is ducking terrible at getting single letters right, these models mess up entire sentences and paragraphs.”

“Lies” assumes intelligence and subterfuge. It anthropomorphizes software and its bugs.

The proper term is bug. When I write code and it delivers an incorrect response, that is a bug.

theatlantic.com/technology/arc


We need to stop using incorrect and factually flawed terms: ChatGPT doesn’t have hallucinations, it has incorrect and erroneous responses.

Bard, for example, couldn’t nail down how many indictments there were under the Clinton administration, or how many ended in convictions. A throw of the dice seems to be Bard’s way of solving it.

These aren’t lies; they are bugs. They are fundamental to the FAILURE of the system as a whole.

These bugs would be inexcusable in any other REAL software industry.

@feloneouscat

This is just historical and political misinformation. What happens when people go to AI for recipes and get food poisoning? What happens when they ask it for health information and make themselves sick? What happens when they treat it as a counselor and become suicidal?

Nobody's taking any responsibility for what the machines spit out. Yet the AI itself cannot be held responsible for anything.

@DavidSalo

The “AI” is just code. It is an amalgam of weighted information. But if I had released code for a traffic light that performed this poorly, I would be out of business.

There is a LOT of money being made from the hype and a LOT of people pretending the hype is real and IGNORING the real problems in the code bases.

I’ve been writing code for over four decades, and it takes me no more than three false propositions to get @Alfred to agree that “tireless” can mean one is without tires.

@DavidSalo @Alfred

These systems are inherently flawed and fundamentally untrustworthy, and the bugs are ignored in favor of “oh, look, it’s like a human being.” No, it is not.

We need to stop pretending that this is “AI”. It is not.

@DavidSalo @Alfred

Perhaps “AI” is suffering from economic anxiety and that is why its performance is so poor? 🤣

@feloneouscat We want to humanize its errors. Makes us feel warm and squishy while we redo what we asked it to do. LOL

@bmacmixer

I see the nature of “AI” (LLMs, as I don’t really feel there is any intelligence associated with them; hah! I kid!) as mostly error with little to no QA.

If the results look similar to what they think they should be, it’s a win.

Testing for success feels great, but it doesn’t really work in the real world.

Real engineers test for failure.
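The “test for failure” point can be sketched in a few lines. This is a toy illustration with a hypothetical divide() function (not from any post in this thread): the happy-path test passes and looks like a win, while probing for failure immediately exposes the missing guard.

```python
# A minimal sketch: a hypothetical function with no input guard.
def divide(a, b):
    return a / b

# Testing for success: the happy path passes, so it "looks like a win".
assert divide(10, 2) == 5

# Testing for failure: probe the edge case and see what actually happens.
try:
    divide(10, 0)
    guarded = True           # never reached
except ZeroDivisionError:
    guarded = False          # the bug: division by zero was never handled

assert guarded is False      # the failure test is what reveals the defect
```

The success test alone would have shipped this function; only the failure test surfaces the defect.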

@feloneouscat "Throwing the dice" is actually not an entirely inaccurate high-level depiction of how generative transformers work, as they're statistical language models.

FWIW, and this isn't at all me excusing the wildly overstated claims of LLM accuracy and capability, "hallucination" is a term of art in this field referring to the model creating assertions that are both false and not in the original data set. They're a big problem! LLMs lacking a concept of truth is inherent in their design.
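That “statistical language model” description can be sketched concretely. This is a toy illustration with made-up tokens and probabilities, not a real model: the generator scores candidate next tokens and samples one by weighted chance, and nothing in the loop represents truth or falsity.

```python
import random

# Made-up next-token distribution for illustration only.
next_token_probs = {
    "convicted": 0.40,
    "indicted": 0.35,
    "acquitted": 0.25,
}

def sample_next_token(probs):
    """Weighted random choice -- literally a throw of (loaded) dice."""
    r = random.random()
    cumulative = 0.0
    for token, p in probs.items():
        cumulative += p
        if r < cumulative:
            return token
    return token  # guard against floating-point rounding at the tail

print(sample_next_token(next_token_probs))
```

Whichever token comes out is fluent and plausible; whether it is factually correct is simply not a variable the sampler consults.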

@lenaoflune “‘Throwing the dice’ is actually not an entirely inaccurate high-level depiction of how generative transformers work”

I know. That’s why I said it.

“Hallucination” is a term that is wildly inaccurate, highly anthropomorphic, and really doesn’t describe the nature of the problem or indicate how the issue may be resolved.

A bug by any other name still goes on the punch list.

