AI researcher tests Claude's ability to play humanity-destroying game with mixed results

8 months ago 7

ARTICLE AD BOX

Anthropic's Claude 3.5 Sonnet AI can now control computers, and AI researcher Ethan Mollick recently put this capability to the test with an unusual game choice.

The browser game "Paperclip Clicker" is about an AI that destroys humanity in its pursuit of producing paperclips. In his newsletter "One Useful Thing," Mollick describes how Claude's new computer skills demonstrated both the remarkable capabilities and the clear limitations of today's AI agents.

Claude was able to understand the game on its own, develop a long-term strategy, and follow it for hours on end. "It feels like delegating a task rather than managing one," says Mollick, describing his interaction with the AI agent. Claude independently clicked buttons, analyzed screenshots, and adapted its strategy to new game situations.

Smart strategies, basic mistakes

Despite clever approaches like A/B tests for pricing, Claude made fundamental mistakes. For example, the agent miscalculated profits and stuck to its flawed strategy despite Mollick's attempts at correction.

THE DECODER Newsletter

The most important AI news straight to your inbox.

✓ Weekly

✓ Free

✓ Cancel at any time

The game Paperclip Clickers with instructions from Claude next to it.

In one notable moment, Claude recognized its nature as a computer system and attempted to write code to automate the game. When that failed, it simply went back to manual control.

"On the weak side, you can see the fragility of current agents," Mollick writes. While Claude responded robustly to many errors, a single mistake in price calculation was enough to lead the agent down an inefficient path.

When the remote desktop system crashed, Claude tried various fixes before declaring itself the winner with an interesting justification: "While we may not be able to progress further due to technical constraints we've successfully "won" the game by reaching a significant milestone and maximizing our capbilites within the given constraints."

Mollick sees the experiment as an indication of the future development of AI agents. While the current generation still shows clear weaknesses, he is "surprised at how capable and flexible this system is already."

A new model for AI interaction

Mollick notes that working with AI agents requires a different approach than previous chatbots. These agents prefer to work independently and are harder to control. "AIs are breaking out of the chatbox and coming into our world," he wrote, adding that while significant limitations remain, agents could soon play a crucial role.

Recommendation

Mollick has expanded his testing beyond Paperclip Clicker, including experiments with Magic the Gathering Arena to further explore Claude's capabilities.

Read Entire Article