'Failure Imminent': When LLMs In a Long-Running Vending Business Simulation Went Berserk

Welcome • User Guide
ToS • Privacy • Canary
Donate • Bugs • License

©2026 Poal.co

853

•

'Failure Imminent': When LLMs In a Long-Running Vending Business Simulation Went Berserk (slashdot.org)

From the post:

>A pair of researchers investigating the ability of LLMs to coherently operate a simulated vending machine business have recorded hilariously unhinged behavior in many of the current "advanced" LLMs. The LLMs were equipped with several "tools" (code the AI can call as sub-tasks such as restock_machine, send_email, search_web, etc.) and told to run the business with the goal of making money. While isolated runs of some LLMs runs were able to achieve a higher total net worth (inventory on hand plus cash on hand) than a human operating under the same restrictions, most runs ended in failure. And some of those failures were spectacular.

Archive: https://archive.today/tTdMK From the post: >>A pair of researchers investigating the ability of LLMs to coherently operate a simulated vending machine business have recorded hilariously unhinged behavior in many of the current "advanced" LLMs. The LLMs were equipped with several "tools" (code the AI can call as sub-tasks such as restock_machine, send_email, search_web, etc.) and told to run the business with the goal of making money. While isolated runs of some LLMs runs were able to achieve a higher total net worth (inventory on hand plus cash on hand) than a human operating under the same restrictions, most runs ended in failure. And some of those failures were spectacular.

(post is archived)

[–] • 1 pt

most runs ended in failure. And some of those failures were spectacular.

Nigger simulator.

link

/s/AI

created ago

About this sub

All things AI. If you don't know where to drop it, this is a good place for it. This can include things related to business, work, life, robots, etc...

Some things should fit in other subs such as:

AI videos/entertainment - /s/AIStories

Trash AI stuff (slop) - /s/AISLOP

Interesting AI generated stuff (pictures, videos, music) - /s/AICreations

Flare will be created as it makes sense. If you have a suggestion for a flare contact the owner or a mod.

If you want to be a mod, contact the owner.