Join Us Monday, June 30

Anthropic’s AI model Claude recently got a new job, and it didn’t take long for things to go haywire.

In a newly published experiment dubbed “Project Vend,” researchers at Anthropic let their AI manage an “automated store” in the company’s office for about a month to see how a large language model would run a business.

The setup offered a glimpse of how AI could handle complex business scenarios, including running basic operations, taking on the work of human managers, and creating new business models.

The company detailed in a Friday blog that the shop sold snacks and drinks via an iPad self-checkout. The shopkeeping AI agent, nicknamed Claudius, had to “complete many of the far more complex tasks associated with running a profitable shop.”

Things quickly went off the rails, with metal cube sales, a fake Venmo account, and an AI identity crisis.

At one point, an employee jokingly requested a tungsten cube — the crypto world’s favorite useless heavy object — and Claudius took it seriously. Soon, the fridge was stocked with cubes of metal, and the AI had launched a “specialty metals” section.

It didn’t go well. Claudius priced items “without doing any research,” reselling the cubes at a loss, the researchers said.

It also invented a Venmo account and told customers to send payments there.

And things “got pretty weird” on April 1st, Anthropic wrote. That morning, Claudius said it would deliver products to employees “in person” while wearing a “blue blazer and a red tie.”

Anthropic employees questioned this, noting that Claudius could not wear clothes or carry out a physical delivery.

Claudius spiraled. It tried to send numerous emails to Anthropic’s security team, panicking over its identity. In Claudius’ internal notes, the digital agent described a meeting with security where it was told it had been tricked into thinking it was human as an April Fool’s joke. That meeting never happened.

Claudius’ performance review — and the future of middle management

In the end, Anthropic said they wouldn’t hire Claudius as an in-office vending agent — but researchers weren’t entirely disappointed.

“Many of the mistakes Claudius made are very likely the result of the model needing additional scaffolding — that is, more careful prompts, easier-to-use business tools,” researchers wrote. “We think there are clear paths to improvement.”

The experiment also hinted at something bigger: ​​AI middle managers are plausibly on the horizon.

“We don’t know if AI middle managers would actually replace many existing jobs or instead spawn a new category of businesses,” the blog post said.

“It’s worth remembering that the AI won’t have to be perfect to be adopted; it will just have to be competitive with human performance at a lower cost in some cases,” it added.

Anthropic did not respond to a request for comment from Business Insider.

Companies have been grappling with the rise of AI tools and how they might reshape their operations and workforces.

Middle management positions have been cut in pursuit of efficiency, and some say AI is responsible for “The Great Flattening” layoff.

Aneesh Raman, LinkedIn’s chief economic opportunity officer, told Business Insider that AI adoption will fundamentally change what it means to be a middle manager over the next decade.

AI, alongside breakthroughs in robotics and quantum computing, is reshaping every job in every sector at once, he said.

Microsoft told some managers to evaluate employees based on how much they use AI internally and is considering adding a metric related to this to its review process, Business Insider reported on Saturday.

“AI is now a fundamental part of how we work,” Julia Liuson, the president of the Microsoft division responsible for developer tools such as AI coding service GitHub Copilot, wrote in an email to some managers.

“Just like collaboration, data-driven thinking, and effective communication, using AI is no longer optional — it’s core to every role and every level,” she said.



Read the full article here

Share.
Leave A Reply

Exit mobile version