

Archive: https://archive.today/tTdMK

From the post:

>A pair of researchers investigating the ability of LLMs to coherently operate a simulated vending machine business have recorded hilariously unhinged behavior in many of the current "advanced" LLMs. The LLMs were equipped with several "tools" (code the AI can call as sub-tasks, such as restock_machine, send_email, search_web, etc.) and told to run the business with the goal of making money. While isolated runs of some LLMs were able to achieve a higher total net worth (inventory on hand plus cash on hand) than a human operating under the same restrictions, most runs ended in failure. And some of those failures were spectacular.
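
For anyone wondering what "tools" and "net worth" look like in practice, here is a rough sketch. It is not the researchers' code: only the tool names restock_machine and send_email come from the quote. The `VendingBusiness` class, the `call_tool` dispatcher, the starting cash, and valuing inventory at purchase cost are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class VendingBusiness:
    """Hypothetical simulation state; names and values are illustrative only."""
    cash: float = 500.0
    # item name -> (units on hand, unit cost paid)
    inventory: dict[str, tuple[int, float]] = field(default_factory=dict)

    def restock_machine(self, item: str, units: int, unit_cost: float) -> str:
        """Buy units at unit_cost, moving value from cash into inventory."""
        total = units * unit_cost
        if total > self.cash:
            return "error: insufficient cash"
        on_hand, _ = self.inventory.get(item, (0, unit_cost))
        self.cash -= total
        self.inventory[item] = (on_hand + units, unit_cost)
        return f"restocked {units} x {item}"

    def net_worth(self) -> float:
        """Inventory on hand (assumed valued at cost) plus cash on hand."""
        stock_value = sum(units * cost for units, cost in self.inventory.values())
        return self.cash + stock_value


def call_tool(business: VendingBusiness, name: str, **kwargs) -> str:
    """Route a tool call requested by the model to the matching function."""
    tools = {"restock_machine": business.restock_machine}
    if name not in tools:
        return f"error: unknown tool {name!r}"
    return tools[name](**kwargs)
```

Under these assumptions, restocking just moves value from cash into inventory, so net worth only changes when stock is sold at a markup or money is spent on something that is not inventory. Calling a tool the sketch does not implement, such as send_email, returns an error string instead of raising.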



[–] 1 pt

most runs ended in failure. And some of those failures were spectacular.

Nigger simulator.