Our evaluation of Claude Mythos Preview’s cyber capabilities

www.aisi.gov.uk

We conducted cyber evaluations of Anthropic’s Claude Mythos Preview and found continued improvement in capture-the-flag (CTF) challenges and significant improvement on multi-step cyber-attack simulati...

The AI Security Institute (AISI) conducted evaluations of Anthropic’s Claude Mythos Preview (announced on 7th April) to assess its cybersecurity capabilities. Our results show that Mythos Preview is a step up from previous frontier models, in a landscape where cyber performance was already improving rapidly.