Agentic AI in the SOC: The Governance Layer You Need Before You Let Automation Execute
The risk isn’t AI getting an answer wrong. The risk is AI taking action. Most SOC leaders I talk to are hearing the same message from the business — use AI to move faster — and to be fair, the tech is finally good enough to do more than just summarize alerts or draft an investigation. We are entering the agentic phase, where systems can decide and execute across identity, endpoint, cloud and network controls.

That is the inflection point. Your SOC stops being a detection function and becomes an execution engine. If you do not put a governance layer under that engine, you are not automating security. You are automating risk. The question every CISO will face soon is simple: what must be true before you allow AI to execute containment actions?

The Day Your ‘AI Helper’ Breaks Production

Let’s make it painfully real. A high-confidence alert fires: suspicious token activity tied to an admin identity. Your agent is integrated with your identity provider and endpoint tooling. It moves fast, disables the account, revokes sessions and quarantines two endpoints. Great outcome… right up until you realize that admin identity is actually a service account tied to a revenue-critical integration. Now your incident response has become an outage.

The executive team will not ask how advanced your AI is. They will ask why an automated system was allowed to touch a control it did not fully understand. This is the new SOC problem. Agentic actions have real-world consequences. The more integrated your tools are, the bigger the blast radius of a mistake.

Why ‘Agentic’ Is Different From ‘AI in the SOC’

We have lived with AI in security for years. It helped classify alerts, prioritize vulnerabilities and surface anomalies. It usually did not act. Agentic AI is different because it behaves like a junior analyst with keyboard access. It can pivot, correlate, decide and execute. This makes it powerful. It is also exactly why governance must come first.
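To make “keyboard access” concrete, here is a deliberately toy sketch of the difference: classic AI returns a recommendation for a human to act on, while an agentic loop maps the same judgment straight to tool calls. All tool names here are hypothetical, not any vendor’s API.

```python
def triage_only(alert: dict) -> str:
    """Classic SOC AI: classify and recommend; a human executes."""
    return f"Recommend: revoke sessions for {alert['identity']}"

def agentic(alert: dict, tools: dict) -> list[str]:
    """Agentic SOC AI: the same judgment is executed immediately."""
    actions = []
    if alert["severity"] >= 8:  # the model's decision threshold
        actions.append(tools["revoke_sessions"](alert["identity"]))
        actions.append(tools["quarantine_host"](alert["host"]))
    return actions

# Stand-in tool integrations (in reality: identity provider, EDR, firewall).
tools = {
    "revoke_sessions": lambda ident: f"revoked sessions for {ident}",
    "quarantine_host": lambda host: f"quarantined {host}",
}

alert = {"identity": "svc-billing-admin", "severity": 9, "host": "web-01"}
print(triage_only(alert))      # a suggestion a human reviews first
print(agentic(alert, tools))   # real changes, with no human in between
```

The second function is the one that needs a governance layer: nothing in it knows that `svc-billing-admin` might be a revenue-critical service account.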
Even if humans remain accountable, agents can make irreversible changes faster than humans can intervene. That means your control model has to evolve.

The Failure Modes Teams Underestimate Until They Get Burned

When AI can execute, a few predictable things show up in real environments:

- Over-containment becomes common. The agent chooses safe containment, which unintentionally breaks business-critical workflows.
- Evidence gets trampled. Automated remediation can overwrite artifacts your responders need for forensics, legal or insurance.
- Privilege quietly expands. The agent starts with narrow permissions, then accumulates more because it needs them to do its job.
- Chain reactions multiply. One automated action triggers playbooks that trigger other actions, and the SOC loses a clean narrative of what happened and why.

If this sounds familiar, it is because it is the same problem we have had with human admins for decades. The difference is speed and scale. A human makes one mistake. An agent can repeat the same mistake hundreds of times in minutes.

The Governance Layer: What Has to Exist Before Agents Execute

You do not need a 60-page AI governance program to start safely. You need a minimum control layer that answers five questions every time an agent takes action.

1. What Actions Are Allowed, and in What Context?

Treat agent permissions like privileged access for a human operator. A rule that holds up under pressure: start with actions that are reversible. Examples that are often safe to automate with tight limits:

- Revoking a session or token
- Quarantining a single endpoint with auto-expiration
- Temporarily disabling a suspicious OAuth app
- Blocking a single IP for a short window pending review

Actions that are hard to undo, like deleting identities, purging data or rewriting broad policies, should require explicit approval.

2. When Does a Human Approval Gate Kick In?

Human in the loop should not mean that every action needs approval. That defeats the point.
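Both ideas, an explicit allow-list of reversible actions and an approval gate for everything high-blast-radius, can be written down as policy-as-code. A minimal Python sketch, with illustrative action names rather than any vendor’s API:

```python
from enum import Enum

class Verdict(Enum):
    AUTO = "execute automatically"
    HUMAN = "pause for human approval"
    DENY = "not permitted for the agent"

# Reversible, narrowly scoped actions the agent may take on its own.
AUTO_ALLOWED = {
    "revoke_session",
    "quarantine_endpoint",
    "disable_oauth_app",
    "block_ip",
}

# High-blast-radius actions: possible, but only with explicit approval.
APPROVAL_REQUIRED = {
    "disable_privileged_account",
    "rotate_secret",
    "segment_network",
    "revoke_all_tokens",
}

def gate(action: str) -> Verdict:
    """Decide whether an agent action runs, waits for a human, or is refused."""
    if action in AUTO_ALLOWED:
        return Verdict.AUTO
    if action in APPROVAL_REQUIRED:
        return Verdict.HUMAN
    return Verdict.DENY  # default-deny: unknown actions never execute

print(gate("revoke_session"))              # Verdict.AUTO
print(gate("disable_privileged_account"))  # Verdict.HUMAN
print(gate("delete_identity"))             # Verdict.DENY
```

The design choice that matters is the last line: anything not explicitly listed is denied, so the agent’s permissions cannot quietly expand as new actions appear.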
Use approvals for high-blast-radius decisions, the ones that can plausibly create an outage:

- Disabling privileged accounts that touch production systems
- Rotating secrets tied to core workloads
- Network segmentation with broad scope
- Organization-wide token revocations or policy changes

If the action can take down revenue, stop and ask a human.

3. What Is the Blast Radius Limit?

Every agent needs a seatbelt. This is where you prevent a well-intended automation loop from becoming its own incident:

- Scope limits: only non-production, only tagged assets or only a defined subset of identities
- Rate limits: a maximum number of actions per hour
- Time limits: quarantines and blocks expire automatically unless extended

This is not bureaucracy. It is how you keep automation useful instead of dangerous.

4. Can You Audit the Decision and Preserve Evidence?

If you cannot explain why the agent acted, you will not trust it. If you cannot prove what changed, you will not defend it. At a minimum, every action should record:

- What triggered it and why the system believed it was severe
- What data was used at decision time, captured as an evidence snapshot
- What changed, with a clear before/after view
- Who approved it, if approval was required

When something goes wrong, this audit trail is the difference between a controlled response and “we have no idea what happened.”

5. Is Rollback Built In by Design?

Rollback is not just nice to have. It is the difference between safe automation and chaos. If an agent can isolate an endpoint, it must be able to un-isolate it cleanly. If it can disable an account, it must be able to re-enable it quickly and safely. If it can push a rule, it must be able to revert it without a manual emergency change process. If rollback is unclear or brittle, you are not ready for autonomous execution.

How Mature SOCs Phase This In Without Betting the Business

You do not jump from “AI summarizes alerts” to “AI executes containment across the stack.”
The cleanest path I have seen is a trust ladder: Suggest → Assist → Execute with guardrails.

- Suggest means the system proposes what to do and why. Humans act.
- Assist means the system gathers evidence, correlates signals, pre-stages change sets and reduces analyst friction.
- Execute with guardrails means the system performs limited, reversible actions under policy, with audit and rollback designed in.

This ladder keeps humans in control while still delivering speed. It also makes the adoption story easier to sell internally because you can say, “We are in Assist today. Execution comes only after controls are proven.”

Where to Start in the Next 30 Days

If you want to move quickly without turning this into a year-long program, pick one workflow where speed matters and actions are reversible. Identity session revocation is a strong candidate because it is high-impact, fast and usually reversible. Then do five things in order:

1. Write the allowed-action policy in plain language. Not a framework. A policy someone can read in two minutes.
2. Put blast radius limits around it, including auto-expiration, so you cannot accidentally automate a disaster.
3. Wire in audit and evidence capture so that every action is defensible.
4. Design rollback before you allow execution.
5. Run a tabletop: “What if the agent is wrong?” If you cannot recover quickly, you are not ready.

Here is the point that matters: agentic AI can help you get proactive, but only if your proactive actions are measurable and reversible, not reckless.

Bottom Line

Agentic AI will make SOCs faster. It will also make mistakes faster. The winning SOCs will not be the ones with the most automation. They will be the ones with automation that can be trusted — clear permissions, approval gates for high-impact actions, blast radius limits, auditability and rollback by design. If AI can execute, governance is not overhead. It is your safety harness.
