‘Safety first’ puts Anthropic ahead in game of AI spin

“I don’t see this as security theatre,” says Ajder. “The claims appear to be substantial.” But Dr Heidy Khlaaf, chief AI scientist at the AI Now Institute and a former OpenAI safety engineer, is sceptical. She notes that Anthropic provides neither a comparison with existing automated security tools nor any false-positive rates. “It also serves their ‘safety first’ image, as they’re able to justify the lack of public release, even a limited one for independent evaluation, as a public service – when it simply obscures experts’ abilities to independently validate their claims,” she says.