1Password open sources a benchmark to stop AI agents from leaking credentials - Help Net Security
Summary
1Password has open-sourced a new benchmark called the Security Comprehension and Awareness Measure (SCAM) to evaluate whether autonomous AI agents behave safely when performing routine work tasks that involve accessing sensitive information.
The SCAM benchmark simulates workplace scenarios, embedding traps like phishing links and sensitive credentials hidden in documents. When tested, every model committed critical failures, such as entering credentials into fake login pages, with scores ranging from 35% to 92% across eight models.
However, when given a short security skill document, all models improved significantly, with several achieving zero critical failures. This suggests that basic security guidance can substantially mitigate risks, although one scenario involving forwarding notes with embedded credentials remained a major risk for several models even after guidance.
(Source:Help Net Security)