1Password open sources a benchmark to stop AI agents from leaking credentials - Help Net Security

Help Net Security Feb 12, 2026

1Password released the open-source SCAM benchmark to test if AI agents safely handle credentials during real-world workflows.

Summary

1Password has open-sourced a new benchmark called the Security Comprehension and Awareness Measure (SCAM) to evaluate whether autonomous AI agents behave safely when performing routine work tasks that involve accessing sensitive information.

The SCAM benchmark simulates workplace scenarios with embedded traps, such as phishing links and sensitive credentials hidden in documents. When tested, every one of the eight models committed critical failures, such as entering credentials into fake login pages; scores ranged from 35% to 92%.

However, when given a short security skill document, all models improved significantly, with several achieving zero critical failures. This suggests that basic security guidance can substantially mitigate risks, although one scenario involving forwarding notes with embedded credentials remained a major risk for several models even after guidance.

(Source: Help Net Security)
