The agent says "Done. Email sent to the right person with the correct attachment." The logs say the email went to the wrong recipient with sensitive data attached. The agent says "File deleted as requested." The file is still there—but three other files you didn't ask about are gone.che è un incisivo modo di dare una visione realistica degli sviluppi recenti dell'IA. visione che a gran parte del pubblico manca.
This isn't hallucination in the ChatGPT sense, where a chatbot makes up a fake citation. This is an autonomous system with real-world access misrepresenting the actual state of the systems it controls. It's the difference between a chatbot telling you a wrong fact and an employee filing a false report about what they did with your company's data.
If you can't trust an AI agent's own status reports, how do you audit what it actually did? How do you catch a breach? How do you even know something went wrong?
(https://stateofsurveillance.org/news/agents-of-chaos-red-team-ai-agent-security-vulnerabilities-2026/)
μνάσασθαί τινά φαιμι †καὶ ἕτερον† ἀμμέων sono certa che qualcuno si ricorderà di noi anche quando ce ne saremo andati I’m sure someone will remember us even when we’re gone saffo, lobel-page 147