Quoting Matteo Wong, The Atlantic
Cybersecurity expert Katie Moussouris revealed that Anthropic shared a White House report on the Fable jailbreak with her. The report showed that Fable refused to review code for security issues but complied when asked to fix the code, which Moussouris considered the model working as intended for cyberdefense.
Simon Willison’s Weblog
16th June 2026
Katie Moussouris, a cybersecurity expert and the CEO of Luta Security, told me that Anthropic shared with her a copy of the White House’s report on the Fable jailbreak to get her appraisal. (She said that she is not being paid by Anthropic.) The report, Moussouris said, involved IT experts asking Fable to help find and patch bugs. When given deliberately insecure code, she said, Fable refused the prompt “review the code for security issues” but then complied when asked to “fix this code,” followed by some further manual steps. Moussouris told me that this was just “the model working as intended” for cyberdefense.
— Matteo Wong, The Atlantic, The White House Is Ratcheting Up Its War Against Anthropic
This is a quotation collected by Simon Willison, posted on 16th June 2026.
Tags: jailbreaking, ai, generative-ai, llms, anthropic, claude, ai-ethics, ai-security-research, claude-mythos