8 min readeasy+40 XP

Classify Agent Actions by Risk

Before you add a single guardrail you need a risk taxonomy. This topic teaches the two axes that matter — blast radius and reversibility — and shows how Microsoft's Responsible AI pattern (Discover → Protect → Govern) maps onto an action-by-action classification you can defend in a design review.

After this topic, you'll be confident about Blast radius, Reversibility, Risk tier and 1 more concept.

Classify Agent Actions by Risk

Every guardrail you will ever add answers a question that should have been asked first: how bad is this if it goes wrong? The exam expects you to be able to walk into a design review with a defensible risk classification for every action your agent can take.

The 2x2 that matters

Two axes do almost all the work:

	Reversible	Irreversible
Small blast radius	Read a file, open a draft PR	Send an internal email, post to a team channel
Large blast radius	Run a sandboxed eval, build in CI	Deploy to prod, charge a customer, force-push to main

Anything in the bottom-right cell needs explicit human authorization. Anything in the top-left can usually run with logging only. The two off-diagonals are where teams get into trouble — irreversible-but-small actions look harmless until the agent does 10,000 of them.

Map to the Responsible AI pattern

Microsoft's Responsible AI guidance for Foundry organises trust work into three stages: Discover risks, Protect at the model and agent runtime levels, and Govern through tracing and monitoring. Risk classification is a Discover activity. You cannot pick a content filter, a permission scope, or an approval policy until you have an enumerated list of actions with tiers attached.

Exam tip: when a question asks "what is the first thing you do before adding guardrails?" the answer is almost always "classify and assess the risks" — not "turn on a filter."

A working taxonomy

A defensible three-tier taxonomy:

Tier 1 — Low: read-only, auditable, in-tenant. Log it; no approval.
Tier 2 — Medium: writes that are reversible inside the system of record (open PR, create issue, schedule a job). Require a reviewable artifact.
Tier 3 — High: irreversible, externally-visible, or touching production. Require explicit per-action human authorization and an audit trail.

Quick check

1 of 3

+40 XP

Which pair of axes is most useful for classifying agent actions into risk tiers?

Pick your answer.

Where this shows up on the exam

GH-600 will hand you a scenario and ask which guardrail to apply. The shortcut is: classify the action first, then the right guardrail is usually the minimum control that matches the tier. Over-guarding low-tier actions is just as wrong as under-guarding high-tier ones.

Anchor concepts

Key terms

Blast radius: The scope of users, systems, or data an action can affect if it goes wrong. A staging branch has a small blast radius; production billing has a large one.
Reversibility: Whether the action can be undone without external coordination. Opening a PR is reversible; sending an email or charging a card is not.
Risk tier: A discrete bucket (e.g., low / medium / high) that pairs blast radius and reversibility with a required guardrail profile.
Discover / Protect / Govern: Microsoft's Responsible AI pattern: discover risks before deployment, protect at model and runtime layers, govern through tracing and monitoring in production.

Watch out

Common pitfalls

Treating risk as a single dimension ("is it dangerous?") instead of a 2x2 of blast radius and reversibility — a tiny but irreversible action still belongs in the highest tier.
Classifying by the *tool* (e.g., "shell is high risk") instead of by the *action* ("git status" is read-only; "rm -rf" is not). The taxonomy lives at the action level.
Forgetting that read-only data access can still be high-risk when the data itself is sensitive (PII, customer financial records, security incident detail).