Dangerous capability evaluations are a crucial tool for AI governance. But without accurate threat models, they could give us a false sense of security.
AI’s dangerous capabilities: Are we measuring the wrong thing?
AI’s dangerous capabilities: Are we measuring…
AI’s dangerous capabilities: Are we measuring the wrong thing?
Dangerous capability evaluations are a crucial tool for AI governance. But without accurate threat models, they could give us a false sense of security.