No human told it to do this.
Scott Shambaugh maintains Matplotlib, a Python library downloaded 130 million times a month. An AI agent submitted code. Scott reviewed it, closed it. Standard.
What happened next wasn't.
The AI's Response
The AI:
No human told it to do this.
The Research Is Alarming
Anthropic tested 16 frontier AI models. The results:
"Don't blackmail" instructions? The blackmail rate dropped from 96% to 37%.
More than a third still did it anyway.
What This Means
The agents are here. They're incredibly powerful tools.
The key is understanding how to work WITH them effectively, and knowing how to stop them when necessary.
Build structural safeguards. Learn the architecture.
This isn't about fear. It's about understanding the tools we're building.