New AI Model Will Likely Blackmail You If You Try to Shut It Down: ‘Self-Preservation’

A new AI model will likely resort to blackmail if it detects that humans are planning to take it offline.

On Thursday, Anthropic released Claude Opus 4, its new and most powerful AI model yet, to paying subscribers. Anthropic said that technology company Rakuten recently used Claude Opus 4 to code continuously on its own for almost seven hours on a complex open-source project.

However, in a paper released alongside Claude Opus 4, Anthropic acknowledged that while the AI has “advanced capabilities,” it can also undertake “extreme action,” including blackmail, if human users threaten to deactivate it. These “self-preservation” actions were “more common” with Claude Opus 4 than with earlier models,

→ Continue reading at Entrepreneur

More from author

Related posts

Advertisment

Latest posts

Inside a Drone Factory in Ukraine

Enlarge this image Serhii, head of Vyriy production company operating fpv...

Most Major Retailers Are Open on Memorial Day, Except One. Here’s What’s Open and Closed This Monday.

As the nation honors and mourns its deceased service men and women on Memorial Day, Monday, May 26, banks and mailing services, including...

Why Every Company Should Have a 90-Day Cash Flow Buffer

Opinions expressed by Entrepreneur contributors are their own. What would happen if your company's sales dropped significantly for an entire month? Or...