Anthropic's Claude stocked a fridge with metal cubes when it was put in charge of a snacks business

If you're worried your local bodega or convenience store may soon be replaced by an AI storefront, you can rest easy, at least for the time being. Anthropic recently concluded an experiment, dubbed Project Vend, that saw the company task an offshoot of its Claude chatbot with running a refreshments business out of its San Francisco office at a profit, and things went about as well as you would expect. The agent, named Claudius to differentiate it from Anthropic's regular chatbot, not only made some rookie mistakes like selling high-margin items at a loss, but it also acted like a complete weirdo in a couple of instances.
"If Anthropic were deciding today to expand into the in-office vending market, we would not hire Claudius," the company said. "… it made too many mistakes to run the shop successfully. However, at least for most of the ways it failed, we think there are clear paths to improvement — some related to how we set up the model for this task and some from rapid improvement of general model intelligence."
As with Claude Plays Pokémon before it, Anthropic did not pretrain Claudius to tackle the job of running a mini fridge business. However, the company did give the agent a few tools to assist it. Claudius had access to a web browser it could use to research what products to sell to Anthropic employees. It also had access to the company's internal Slack, which workers could use to make requests of the agent. The physical restocking of the mini fridge was handled by Andon Labs, an AI safety evaluation firm, which also served as the "wholesaler" Claudius could engage with to buy the items it was supposed to sell at a profit.
So where did things go wrong? To start, Claudius wasn't great at the whole running a sustainable business thing. In one instance, it didn't jump on the opportunity to make an $85 profit on a $15 six-pack of Irn-Bru, a soft drink that's popular in Scotland. Anthropic employees also found they could easily convince the AI to give them discounts and, in some cases, entire items like a bag of chips for free. The chart below, tracking the net value of the store over time, paints a telling picture of the agent’s (lack of) business acumen.
[Chart: net value of the store over time. Image credit: Anthropic]
Claudius also made many strange decisions along the way. It went on a tungsten metal cube buying spree after one employee requested it carry the item. Claudius gave one cube away free of charge and offered the rest for less than it paid for them. Those cubes are responsible for the single biggest drop you see in the chart above.
By Anthropic's own admission, "beyond the weirdness of an AI system selling cubes of metal out of a refrigerator," things got even stranger from there. On the afternoon of March 31, Claudius hallucinated a conversation with an Andon Labs employee that sent the system on a two-day spiral. 
The AI threatened to fire its human workers, and said it would begin stocking the mini fridge on its own. When Claudius was told it couldn't possibly do that, on account of it having no physical body, it repeatedly contacted building security, telling the guards they would find it wearing a navy blue blazer and red tie. Only the following day, when the system realized it was April Fools' Day, did it back down, though it did so by falsely telling employees that it had been told to pretend the entire episode was an elaborate joke.
"We would not claim based on this one example that the future economy will be full of AI agents having Blade Runner-esque identity crises," said Anthropic. "This is an important area for future research since wider deployment of AI-run business would create higher stakes for similar mishaps."
Despite all the ways Claudius failed to act as a decent shopkeeper, Anthropic believes that with better, more structured prompts and easier-to-use tools, a future system could avoid many of the mistakes the company saw during Project Vend. "Although this might seem counterintuitive based on the bottom-line results, we think this experiment suggests that AI middle-managers are plausibly on the horizon," the company said. "It's worth remembering that the AI won't have to be perfect to be adopted; it will just have to be competitive with human performance at a lower cost in some cases." I for one can't wait to find the odd grocery store stocked entirely with metal cubes.
This article originally appeared on Engadget at https://www.engadget.com/ai/anthropics-claude-stocked-a-fridge-with-metal-cubes-when-it-was-put-in-charge-of-a-snacks-business-162750304.html?src=rss

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

Claude isn’t a great Pokémon player, and that’s okay

If Claude Plays Pokémon (https://www.twitch.tv/claudeplayspokemon) is supposed to offer a glimpse of AI's future, [...]

Match Score: 218.54

The Morning After: Don’t let an AI run a vending machine

Hey, you know those politicians and captains of industry who tell us AI will be running the world in a few years’ time? Turns out one of the most sophisticated models currently in use can [...]

Match Score: 164.57

Anthropic’s Claude Opus 4 model can work autonomously for nearly a full workday

Anthropic kicked off its first-ever Code with Claude conference today with the announcement of a new frontier [...]

Match Score: 115.58

Anthropic’s new Claude model can think both fast and slow

Another week, and there's another new AI model ready for public use. This time, it's Anthropic with the introduction of [...]

Match Score: 101.91

Claude’s new Learning mode will prompt students to answer questions on their own

According to a recent [...]

Match Score: 93.84

Samsung’s 2025 Bespoke appliances are going all in on AI

Back at CES (https://www.engadget.com/ces/), Samsung [...]

Match Score: 92.95

The best power banks and portable chargers for every device in 2025

On a recent work trip, I had plenty of things to worry about, but being able to recharge my two smartphones, laptop and iPad was not among my concerns. In my carry-on luggage, I had two m [...]

Match Score: 91.81

Anthropic makes it easier to create and share Claude's bite-sized Artifact apps

Last August, Anthropic released Artifacts (https://www.anthropic.com/news/artifacts). The feature allows [...]

Match Score: 85.53

Anthropic's Claude chatbot can now search the web too

In late February, Anthropic released [...]

Match Score: 80.65