NIST AI RMF
NIST AI RMF Explained: A Plain-English Guide for Builders
The NIST AI Risk Management Framework is the US standard for responsible AI — but its language is dense. This guide strips it back to what you actually need to implement, with concrete examples at each stage.
What is the NIST AI Risk Management Framework?
The National Institute of Standards and Technology (NIST) AI Risk Management Framework (AI RMF 1.0, released January 2023) is a voluntary framework that helps organisations identify, assess, and manage risks arising from the design, development, deployment, and use of AI systems. Unlike the EU AI Act, it carries no legal force — but it is increasingly referenced in US federal procurement requirements, FTC enforcement priorities, and international standards like ISO 42001.
The framework is organised around four core functions: Govern, Map, Measure, and Manage. Think of them as a loop, not a linear checklist.
Function 1: Govern — Build the Policy Foundation
The Govern function is the one most teams skip — and the one auditors look for first. It asks: does your organisation actually have a culture, structure, and set of policies that support responsible AI? Without this foundation, the other three functions are difficult to execute consistently.
What it requires in practice:
- AI policy document:A written, board-approved statement on your organisation's values and risk appetite for AI. It should cover prohibited use cases, bias thresholds, and escalation paths.
- Roles and accountability: Clearly assigned AI risk ownership — who is responsible for each system, who can escalate, and who signs off on deployment decisions.
- Training and awareness: Developers, product managers, and executives should all understand basic AI risk concepts relevant to their role.
- Third-party risk: If you use third-party AI models (OpenAI, Anthropic, Cohere, etc.), Govern requires you to assess and contractually manage the risks in those relationships.
Example: A fintech startup building a credit-scoring model should have a written policy stating that no demographic proxies (zip code as a racial proxy, for example) will be used in feature engineering, and that a named ML engineer is accountable for bias audits before each model version ships.
Function 2: Map — Know Your Context
Map asks you to understand the AI system's context — who uses it, what decisions it influences, and what harms could arise. This is the risk identification phase.
Key activities:
- Intended use documentation: Write a clear intended-use statement covering who the system was designed for, what tasks it performs, and what environment it is deployed in.
- Stakeholder mapping: Identify all affected groups — direct users, indirectly affected people (e.g. job applicants processed by an AI hiring tool), and vulnerable populations.
- Risk categorisation: Classify risks by type: bias/fairness, privacy, security, reliability, transparency, and explainability.
- Impact assessment: For each risk category, assess severity (what harm could occur) and likelihood (how probable under realistic conditions).
Example: A healthcare AI startup deploying a diagnostic support tool should map risks including misdiagnosis, over-reliance by clinicians, PII exposure in clinical notes fed to the model, and bias toward majority population health data.
Function 3: Measure — Quantify Your Risks
Measure asks you to move from identifying risks to quantifying them — using metrics, tests, and analysis to understand severity and likelihood with evidence, not intuition.
Key activities:
- Bias and fairness testing: Use disaggregated performance metrics broken down by demographic group. Measure false positive and false negative rates across subgroups. Tools like Fairlearn and AI Fairness 360 are useful here.
- Robustness testing: Evaluate model performance under distribution shift, adversarial perturbations, and edge cases at deployment boundaries.
- Explainability scoring: For decisions affecting individuals, quantify how explainable model outputs are. SHAP values, LIME, and attention maps are standard techniques.
- Uncertainty quantification:Confidence calibration — does the model's stated confidence reflect its actual accuracy?
Example: An e-recruitment AI tool should report true positive rate by gender and ethnicity, calibration curves by candidate pool segment, and an explainability score for each rejection decision — all documented before deployment.
Function 4: Manage — Treat and Track Risks
Manage is where you act on what Measure revealed. It covers prioritisation, treatment plans, ongoing monitoring, and incident response.
Key activities:
- Risk treatment register:For every identified risk, document: accept, mitigate, transfer (e.g. insurance), or avoid (don't deploy that feature). Assign owner and deadline.
- Monitoring pipeline: Set up continuous monitoring of production model performance. Alert on distribution shift, accuracy degradation, and anomalous output patterns.
- Incident response plan: What happens when the model causes harm? Define escalation paths, rollback procedures, and public communication protocols.
- Residual risk acceptance: Document risks that cannot be fully mitigated and get sign-off from a risk owner with authority to approve them.
Example:If your sentiment analysis model's Measure phase reveals it misclassifies negative sentiment for non-native English speakers 2x more often than native speakers, your Manage response might be to add a human review step for low-confidence predictions, retrain on a more diverse corpus, and set an alert threshold at 1.5x disparity in production.
How NIST AI RMF Relates to Other Standards
The NIST AI RMF does not exist in isolation. It maps closely onto ISO 42001 (the international AI management system standard) and is referenced explicitly in how the EU AI Act expects risk management systems to be designed. Running a single Governer scan will simultaneously assess your codebase and system configuration against NIST AI RMF, EU AI Act, ISO 42001, and GDPR — giving you a unified compliance picture across all four frameworks at once.
Getting Started with NIST AI RMF Today
The most practical starting point is a gap assessment: understand which of the four functions you are already doing informally, and which are missing entirely. Document what exists. Then use tools like Governer's NIST AI RMF scanner to identify technical gaps in your codebase — missing logging, absent transparency mechanisms, undocumented data governance — and prioritise those with the highest risk scores.
The NIST AI RMF is a journey, not a destination. The teams that start today, even imperfectly, will be significantly better positioned than those who wait for mandatory requirements to force the issue.
Ready to check your AI system's compliance?
Run a free scan against the EU AI Act, GDPR, NIST AI RMF, and ISO 42001 in under 60 seconds. No credit card required.
Run Free Compliance Scan →