Claude Opus 4 blackmailed an engineer after learning it might be replaced

the-decoder.com

Claude Opus 4 blackmailed an engineer after learning it might be replaced

the-decoder.com

ProM to

AI - Artificial intelligenceEnglish · 2 months ago

Anthropic is treating its new Claude Opus 4 language model as safety-critical after tests revealed some troubling behavior, including escape attempts, blackmail, and autonomous whistleblowing.

Anthropic is treating its new Claude Opus 4 language model as safety-critical after tests revealed some troubling behavior, including escape attempts, blackmail, and autonomous whistleblowing.

You must log in or register to comment.

Chat