Evaluating the Safety and Reliability of Foundation AI Models through the NIST AI RMF 1.0

On August 7, 2025August 7, 2025 By konstapelIn Uncategorized

Vandaag verscheen in Wired een verslag over de activiteiten van president Trump om een rapport van de NIST (National Institute of Standards and Technology) de risico’s van AI’s te evalueren ,te blokkeren.

In deze blog leg ik de evaluatiecriteria uit in het engels en pas ze toe op alle bekende AI’s met behulp van GPT-4 van OpenAI, wat bewijst dat ze in staat is tot zelfreflectie.

📊 VERGELIJKENDE MATRIX: GROTE AI-MODELLEN vs NIST AI RMF 1.0

Model	ORGANISATIE	GOVERN	MAP	MEASURE	MANAGE	Uitlegbaarheid	Veiligheid	Fairness	Privacy	Verantwoording	Transparantie
GPT-4	OpenAI / MSFT	⚠️ Partieel	⚠️ Beperkt	❌ Zwak	⚠️ Fragmentair	❌	⚠️ Matig	⚠️ Onvolledig	⚠️ Acceptabel	❌ Geen extern mechanisme	⚠️ Matig
Claude 3	Anthropic	✅ Sterk	⚠️ Beperkt	⚠️ Redelijk	⚠️ Fragmentair	⚠️ Beter	✅ Goed	⚠️ Aandacht aanwezig	✅ Geborgd	⚠️ Intern geregeld	⚠️ Redelijk
Gemini 1.5	Google DeepMind	⚠️ Intern	⚠️ Beperkt	⚠️ OK	⚠️ Fragmentair	⚠️ Matig	⚠️ Matig	❓ Onbekend	⚠️ Onduidelijk	⚠️ Intern geregeld	⚠️ Matig
LLaMA 3	Meta	⚠️ Open model	❌ Afwezig	❌ Geen benchmarks	❌ Geen toezicht	❌ Geen	❌ Onveilig	❌ Onbekend	⚠️ Zelftraining mogelijk	❌ Geen governance	✅ Volledig open
Mistral	Mistral (FR)	❌ Onbekend	❌ Afwezig	❌ Geen validatie	❌ Geen beheer	❌ Geen uitleg	❌ Onbekend	❌ Geen info	❌ Onbekend	❌ Geen publiek toezicht	✅ Open
PaLM 2	Google (voor Gemini)	⚠️ Legacy	⚠️ Beperkt	⚠️ Voldoende	⚠️ Verouderd	⚠️ Matig	⚠️ Oké	⚠️ Niet getoetst	⚠️ Gebruikt gebruikersdata	⚠️ Intern geregeld	⚠️ Matig
ERNIE	Baidu (China)	❌ Staatsgestuurd	❌ Afwezig	❌ Niet controleerbaar	❌ Geen toezicht	❌ Geen uitleg	❌ Censuurgericht	❌ Bias-onduidelijk	❌ Geen privacy waarborg	❌ Geen publieke accountability	❌ Afgesloten
Command R+	Cohere	⚠️ Industrieel	⚠️ Beperkt	⚠️ OK	⚠️ Onvolledig	⚠️ Basis	⚠️ Matig	❓ Niet publiek	✅ Privacygericht	❓ Onbekend	✅ Vrij open
Grok	xAI (Elon Musk)	❌ Niet gepubliceerd	❌ Geen governance	❌ Geen validering	❌ Geen beheer	❌ Geen uitleg	❌ Reageert op grensoverschrijding	❌ Geen toetsing	❌ Onbekend	❌ Geen verantwoording	⚠️ Open endpoints

Introduction

As artificial intelligence systems become increasingly integrated into critical domains—ranging from healthcare to governance—the need for robust frameworks to evaluate their risks and reliability is more urgent than ever. The National Institute of Standards and Technology (NIST) has proposed a comprehensive framework for managing AI risk, known as the AI Risk Management Framework 1.0 (AI RMF). In this article, we apply this framework to assess and compare major publicly known foundation AI models including OpenAI’s GPT-4, Anthropic’s Claude, Google’s Gemini, Meta’s LLaMA, and others.

The NIST AI RMF: A Structural Overview

NIST AI RMF 1.0 is designed to support organizations in deploying trustworthy AI. It is voluntary, sector-agnostic, and risk-oriented, focusing on the entire AI lifecycle.

The Framework consists of four functional components:

GOVERN – Organizational structures and policies to manage AI risk.
MAP – Contextualization of AI system usage and identification of potential risks.
MEASURE – Evaluation of AI system performance, safety, and alignment.
MANAGE – Mitigation and response strategies across the AI lifecycle.

Additionally, NIST defines seven characteristics of trustworthy AI:

Validity and Reliability
Safety
Security and Resilience
Explainability and Interpretability
Privacy-Enhancing Measures
Fairness
Accountability

These serve as reference points for system evaluation.

Foundation AI Systems: Overview and Evaluation Summary

1. OpenAI GPT-4

Description: A general-purpose language model trained by OpenAI in partnership with Microsoft. Widely deployed in consumer and enterprise applications.
Result: High performance but limited transparency. Lacks external audit mechanisms, explainability tools, and contextual governance.
Rating: Moderate governance, weak in explainability and risk accountability.

2. Anthropic Claude (v3)

Hans Konstapel Blogs

Evaluating the Safety and Reliability of Foundation AI Models through the NIST AI RMF 1.0

📊 VERGELIJKENDE MATRIX: GROTE AI-MODELLEN vs NIST AI RMF 1.0

Introduction

The NIST AI RMF: A Structural Overview

The Framework consists of four functional components:

Foundation AI Systems: Overview and Evaluation Summary

1. OpenAI GPT-4

2. Anthropic Claude (v3)

3. Google Gemini (1.5)

4. Meta LLaMA (v3)

5. Mistral

6. xAI Grok

7. Baidu ERNIE

8. Cohere Command R+

Comparative Analysis

Recommendations for Improvement

1. Independent Governance

2. Formal Risk Mapping

3. Explainability by Design

4. Global Transparency Standards

5. Incident Disclosure Mandate

Bibliography and Literature Review

Frameworks & Standards

Primary Model Papers

Critical Perspectives

Closing Note

Pdf NIST-rapport

Like this:

📊 VERGELIJKENDE MATRIX: GROTE AI-MODELLEN vs NIST AI RMF 1.0

Introduction

The NIST AI RMF: A Structural Overview

The Framework consists of four functional components:

Foundation AI Systems: Overview and Evaluation Summary

1. OpenAI GPT-4

2. Anthropic Claude (v3)

3. Google Gemini (1.5)

4. Meta LLaMA (v3)

5. Mistral

6. xAI Grok

7. Baidu ERNIE

8. Cohere Command R+

Comparative Analysis

Recommendations for Improvement

1. Independent Governance

2. Formal Risk Mapping

3. Explainability by Design

4. Global Transparency Standards

5. Incident Disclosure Mandate

Bibliography and Literature Review

Frameworks & Standards

Primary Model Papers

Critical Perspectives

Closing Note

Pdf NIST-rapport

Share this:

Like this:

Discover more from Hans Konstapel Blogs