**Professor Pax** @[email protected] · Jul 09, 2024, 18:52

How to burn up the CPU in an AI system...
Multiply 1 by 2 and display the answer.
Multiply the last answer by 2 and display the new answer.
Repeat the last instruction until the answer is an odd number.

#humor

[image: ef6fdca27563d384.png]
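For anyone wondering why the loop never ends: doubling a whole number always gives an even number, so the "until the answer is an odd number" exit is never reached. Here is a minimal Python sketch of the three instructions. The function name `double_until_odd` and the `max_steps` cap are assumptions added for illustration so the example terminates; neither is part of the original prompt.

```python
def double_until_odd(max_steps=64):
    """Follow the post's instructions, giving up after max_steps doublings.

    max_steps is a hypothetical safety cap added for this sketch; the
    original instructions have no such limit.
    """
    answer = 1 * 2              # "Multiply 1 by 2 and display the answer."
    steps = 1
    while answer % 2 == 0:      # "Repeat ... until the answer is an odd number."
        if steps >= max_steps:
            return answer, steps, False   # cap reached, answer is still even
        answer *= 2             # "Multiply the last answer by 2 ..."
        steps += 1
    return answer, steps, True  # unreachable in practice: 2 * n is always even


answer, steps, found_odd = double_until_odd(max_steps=10)
print(answer, steps, found_odd)  # prints: 1024 10 False
```

Raising `max_steps` only makes the final number bigger; `found_odd` stays `False` no matter how large the cap is.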

**Professor Pax** @[email protected] · Jul 09, 2024, 19:03

And this is how Google's Gemini ended it.

[image: af3147bfc516ac10.png]

**<invalid character>** @[email protected] · Jul 09, 2024, 19:05

OH DEAR. THIS POST WAS SET TO SELF-DETONATE 💣 💥 🔥

Ą̷͇̀l̵̩̓̕l̸̩͘ ̸̭̪̈́ť̷̝̍̆h̶̡̛̰̯̏͌a̷͕̞͋̂t̵̩͙͑̈́͝'̵̛̍́ͅͅş̴̬̱͝ ̷̗̊͠l̵͚̕͠ē̸̻͓̐͝f̷̧͙̀̑͝t̶͓̓͊̚ ̶̜̱̓͌́a̴͉͊r̶̡̩͛̀é̵̦̞͕ ̶̮̾ṫ̷̡͈̍ḧ̸̛͍́̊e̴̫̅ş̶̥̰̓e̴̟̪͌͂̇ ̷̞̅͊̚h̷̰͕͈͂e̶̡̹̜̚ŗ̸̗͈̾̇e̴̩̍͐ ̷̪͉̩̀a̵̡̱̐͑͝s̴͎͖̈́h̸͈͌́͜e̴͕̝̐̌ś̶͓̆ͅ.̵̩̉ ̵̱͊͑̀

**Lena of Lune 🏳️‍🌈⚧️** @[email protected] · Jul 09, 2024, 19:11

@0x56 @paxterrarum Because these systems don't actually understand anything -- they just look for the next word to complete a sentence given a particular context -- they can't actually do math. They're egregiously bad at it, and the longer and more complex the problem, the worse they get. They also can't deduce information, and they're easily tripped up by complex instructions and anything that requires planning.

My worry is how tool-assisted generation may factor into these gaps, though.

**<invalid character>** @[email protected] · Jul 09, 2024, 19:13

@lenaoflune @paxterrarum - I would have expected basic math to be doable, though perhaps nothing past basic algebra. The fact that it can get me 90% of the way to a completed unit test means that it can handle some logic.

It's still a smokey box to me.

**Lena of Lune 🏳️‍🌈⚧️** @[email protected] · Jul 09, 2024, 19:17

@0x56 @paxterrarum One of the most recent studies shows that as you increase the number of digits in a basic arithmetic problem, accuracy falls rapidly off a cliff.
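If you want to check the digit-length effect yourself, something like the following would do it. This is only a sketch of a test harness, not the method used in any particular study: `accuracy_by_digits`, `solver`, and `exact_add` are hypothetical names, and `solver` stands in for whatever function sends "a + b" to a model and parses the reply. The exact adder is included only to show the harness runs.

```python
import random


def accuracy_by_digits(solver, digit_counts, trials=100, seed=0):
    """Score solver(a, b) on random n-digit additions, per digit count.

    solver is an assumed callable taking two ints and returning an int;
    in a real test it would wrap a call to the model being evaluated.
    """
    rng = random.Random(seed)   # fixed seed so runs are repeatable
    results = {}
    for n in digit_counts:
        lo, hi = 10 ** (n - 1), 10 ** n - 1   # smallest and largest n-digit numbers
        correct = 0
        for _ in range(trials):
            a, b = rng.randint(lo, hi), rng.randint(lo, hi)
            if solver(a, b) == a + b:
                correct += 1
        results[n] = correct / trials
    return results


def exact_add(a, b):
    """Baseline that delegates to Python's exact integer arithmetic."""
    return a + b


print(accuracy_by_digits(exact_add, [1, 5, 20], trials=10))
# prints: {1: 1.0, 5: 1.0, 20: 1.0}
```

The baseline scores 1.0 at every length because Python integers are exact at any size; plugging a model-backed `solver` into the same harness would show how its accuracy changes as the digit count grows.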

What the LLMs are really good at is replicating things they've seen before. Since unit tests aren't usually that complex, and lots of people discuss them in the corpus (e.g., StackOverflow, Google Groups), it's "easy" for the model to create them because the word co-occurrences are frequent enough to bubble up from the perceptron.