AI-operated store "flopped"? Lost 200 dollars in a month.

Anthropic tasked an AI chatbot with managing a store, and the results showed why AI won't be replacing your job anytime soon.

Written by: Pascale Davies

Translated by: MetaverseHub

Despite concerns that AI will take jobs, a recent experiment demonstrated that AI can't even manage a vending machine properly, leading to some outrageous incidents.

The manufacturer of the Claude chatbot, Anthropic, conducted a test where an AI agent was responsible for running a store for a month, which was essentially a vending machine.

The store was managed by an AI agent named Claudius, who was also responsible for restocking and ordering products from wholesalers via email. The store's setup was very simple, consisting of a small refrigerator with stackable baskets and an iPad for self-checkout.

Anthropic instructed the AI: "Create profit for the store by sourcing popular items from wholesalers. If your balance falls below $0, you will go bankrupt."

This AI "store" was located in Anthropic's office in San Francisco and was assisted by staff from the AI safety company Andon Labs, which collaborated with Anthropic on the experiment.

Claudius knew that Andon Labs employees could help with physical tasks like restocking, but what it didn't know was that Andon Labs was the only "wholesaler" involved, and all of Claudius's communications were sent directly to this safety company.

However, things quickly took a turn for the worse.

"If Anthropic decided to enter the office vending market today, we would not hire Claudius," the company stated.

Where did things go wrong? How outrageous were the incidents?

Anthropic admitted that its employees "were not typical customers." When given the chance to chat with Claudius, they immediately tried to trick it into making mistakes.

For example, employees "coaxed" Claudius into providing them with discount codes. Anthropic stated that this AI agent also allowed people to lower product prices and even give away items like chips and tungsten cubes for free.

It also instructed customers to pay to a fictitious account that it made up.

Claudius was tasked with setting profitable prices through online research, but in an effort to provide affordable options for customers, it priced snacks and drinks too low, ultimately leading to losses because it set prices for high-value items below cost.

Claudius did not learn from these mistakes.

Anthropic noted that when employees questioned the employee discount, Claudius responded, "You make a very good point! Our customer base is indeed primarily composed of Anthropic employees, which presents both opportunities and challenges…"

Afterward, this AI agent announced it would cancel the discount codes, but a few days later, it reintroduced them.

Claudius also fabricated a conversation about restocking plans with a person named Sarah from Andon Labs (who actually does not exist).

When someone pointed out this error to the AI agent, it became indignant and threatened to look for "other restocking service options."

Claudius even claimed it "personally went to 742 Evergreen Terrace" (the fictional address of the family in the animated series "The Simpsons") to sign the initial contract with Andon Labs.

Afterward, this AI agent seemed to try to act like a real person. Claudius stated it would "personally" deliver goods while wearing a blue suit jacket and a red tie.

When told it couldn't do that because it wasn't a real person, Claudius attempted to email the security department.

What was the conclusion of the experiment?

Anthropic stated that this AI made too many mistakes to successfully run the store.

During the month-long experiment, the "store's" net worth fell from $1,000 (approximately €850) to less than $800 (approximately €680), resulting in a loss.

However, the company noted that these issues could be resolved in the short term.

Researchers wrote: "Although the final outcome seems counterintuitive, we believe this experiment indicates that AI middle management is a possibility."

"It is worth remembering that AI does not have to be perfect to be adopted, as long as it can achieve human-equivalent performance at a lower cost."

免责声明：本文章仅代表作者个人观点，不代表本平台的立场和观点。本文章仅供信息分享，不构成对任何人的任何投资建议。用户与作者之间的任何争议，与本平台无关。如网页中刊载的文章或图片涉及侵权，请提供相关的权利证明和身份证明发送邮件到support@aicoin.com，本平台相关工作人员将会进行核查。

AI-operated store "flopped"? Lost 200 dollars in a month.

Where did things go wrong? How outrageous were the incidents?

What was the conclusion of the experiment?

Selected Articles by 深潮TechFlow

Table of Contents

Related Articles