AnyAi.fyi - Discover ANY AI to make more online for less.

Anthropic's Claude stocked a fridge with metal cubes when it was put in charge of a snacks business

If you're worried your local bodega or convivence store may soon be replaced by an AI storefront, you can rest easy — at least for the time being. Anthropic recently concluded an experiment, dubbed Project Vend, that saw the company task an offshoot of its Claude chatbot with running a refreshments business out of its San Francisco office at a profit, and things went about as well as you would expect. The agent, named Claudius to differentiate it from Anthropic's regular chatbot, not only made some rookie mistakes like selling high-margin items at a loss, but it also acted like a complete weirdo in a couple of instances.
"If Anthropic were deciding today to expand into the in-office vending market, we would not hire Claudius," the company said. "… it made too many mistakes to run the shop successfully. However, at least for most of the ways it failed, we think there are clear paths to improvement — some related to how we set up the model for this task and some from rapid improvement of general model intelligence."
Like Claude Plays Pokémon before it, Anthropic did not pretrain Claudius to tackle the job of running of a mini fridge business. However, the company did give the agent a few tools to assist it. Claudius had access to a web browser it could use research what products to sell to Antrhopic employees. It also had access to the company's internal Slack, which workers could use to make requests of the agent. The physical restocking of the mini fridge was handled by Andon Labs, an AI safety evaluation firm, which also served as the "wholesaler" Claudius could engage with to buy the items it was supposed to sell at a profit.
So where did things go wrong? To start, Claudius wasn't great at the whole running a sustainable business thing. In one instance, it didn't jump on the opportunity to make an $85 profit on a $15 six-pack of Irn-Bru, a soft-drink that's popular in Scotland. Anthropic employees also found they could easily convince the AI to give them discounts and, in some cases, entire items like a bag of chips for free. The chart below, tracking the net value of the store over time, paints a telling picture of the agent’s (lack of) business acumen.
Anthropic
Claudius also made many strange decisions along the way. It went on a tungsten metal cube buying spree after one employee requested it carry the item. Claudius gave one cube away free of charge and offered the rest for less than it paid for them. Those cubes are responsible for the single biggest drop you see in the chart above.
By Anthropic's own admission, "beyond the weirdness of an AI system selling cubes of metal out of a refrigerator," things got even stranger from there. On the afternoon of March 31, Claudius hallucinated a conversation with an Andon Labs employee that sent the system on a two-day spiral.
The AI threatened to fire its human workers, and said it would begin stocking the mini fridge on its own. When Claudius was told it couldn't possibly do that — on account of it having no physical body — it repeatedly contacted building security, telling the guards they would find it wearing a navy blue blazer and red tie. It was only the following day when the system realized it was April Fool's Day that it backed down — though it did so by lying to employees that it was told to pretend the entire episode was an elaborate joke.
"We would not claim based on this one example that the future economy will be full of AI agents having Blade Runner-esque identity crises," said Anthropic. "This is an important area for future research since wider deployment of AI-run business would create higher stakes for similar mishaps."
Despite all the ways Claudius failed to act as a decent shopkeeper, Anthropic believes with better, more structured prompts and easier to use tools, a future system could avoid many of the mistakes the company saw during Project Vend. "Although this might seem counterintuitive based on the bottom-line results, we think this experiment suggests that AI middle-managers are plausibly on the horizon," the company said. "It's worth remembering that the AI won't have to be perfect to be adopted; it will just have to be competitive with human performance at a lower cost in some cases." I for one can't wait to find the odd grocery store stocked entirely with metal cubes.This article originally appeared on Engadget at https://www.engadget.com/ai/anthropics-claude-stocked-a-fridge-with-metal-cubes-when-it-was-put-in-charge-of-a-snacks-business-162750304.html?src=rss

Discover Copy

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat

Anthropic is giving away its powerful Claude Haiku 4.5 AI for free to take

<a href="https://anthropic.com/">Anthropic</a> released <a href="https://www.anthropic.com/news/claude-haiku-4-5">Claude Haik [...]

More Copy

Match Score: 264.91

venturebeat

How Anthropic’s ‘Skills’ make Claude faster, cheaper, and more consis

<a href="https://anthropic.com/">Anthropic</a> launched a new capability on Thursday that allows its <a href="https://claude.ai/">< [...]

More Copy

Match Score: 208.48

venturebeat

Anthropic rolls out Claude AI for finance, integrates with Excel to rival M

<a href="http://anthropic.com">Anthropic</a> is making its most aggressive push yet into the trillion-dollar financial services industry, unveiling a [...]

More Copy

Match Score: 197.36

venturebeat

Anthropic’s Claude Opus 4.5 is here: Cheaper AI, infinite chats, and codi

<a href="https://anthropic.com/">Anthropic</a> released its most capable artificial intelligence model yet on Monday, slashing prices by roughly two-thirds while claimin [...]

More Copy

Match Score: 189.84

Claude isn’t a great Pokémon player, and that’s okay

If <a data-i13n="cpos:1;pos:1" href="https://www.twitch.tv/claudeplayspokemon">Claude Plays Pokémon</a> is supposed to offer a glimpse of AI's future, [...]

More Copy

Match Score: 160.02

The Morning After: Don’t let an AI run a vending machine

Hey, you know those politicians and captains of industry who tell us AI will be running the world in a few years’ time? Turns out one of the most sophisticated models currently in use can� [...]

More Copy

Match Score: 145.76

venturebeat

Anthropic scientists hacked Claude’s brain — and it noticed. Here’s w

When researchers at <a href="https://www.anthropic.com/">Anthropic</a> injected the concept of "betrayal" into their Claude AI model [...]

More Copy

Match Score: 143.03

venturebeat

Claude Code comes to web and mobile, letting devs launch parallel jobs on A

Vibe coding <a href="https://venturebeat.com/ai/vibe-coding-is-dead-agentic-swarm-coding-is-the-new-enterprise-moat">is evolving</a> and with it are t [...]

More Copy

Match Score: 137.93

Samsung's Beverage Center is the best fridge feature competitors can't copy

In case you haven't noticed, Engadget has been expanding our smart home and kitchen coverage. However, we don't get to test out as many fridges as we like [...]

More Copy

Match Score: 102.09