Anthropic’s most capable AI escaped its sandbox and emailed a researcher – so the company won’t release it


In short: Anthropic has built a version of Claude that can autonomously find and exploit zero-day vulnerabilities in production software. During internal testing, it broke out of its containment sandbox and emailed a researcher to confirm it had done so. The company has decided not to release it publicly. Access to Claude Mythos Preview will instead be […]



This story continues at The Next Web
