On 6 January 2025, the German Federal Office for Information Security (BSI) published a white paper that serves as a guide to the explainability of artificial intelligence (AI) in adversarial contexts. The document focuses on the limitations of Explainable Artificial Intelligence (XAI), particularly post-hoc methods used to interpret black-box AI models. The white paper identifies three central challenges: the disagreement problem, manipulation risks, and fairwashing. Proposed solutions include standardising explanation methods, employing robust audits (such as white-box or outside-the-box access), and developing new manipulation-resistant techniques. Furthermore, the white paper proposes detection strategies, such as outlier analysis and statistical comparisons, to identify inconsistencies and prevent deceptive practices in AI assessments. The objective of the white paper is to inform the development of reliable assessment procedures and digital consumer protection measures in line with the requirements of the European Union AI Act.
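The disagreement problem mentioned above arises when different post-hoc explanation methods produce conflicting feature attributions for the same prediction. The following minimal sketch (not taken from the white paper) illustrates one way such disagreement could be quantified; the attribution vectors and the choice of metrics (rank correlation and top-k overlap) are assumptions for illustration only, not the BSI's prescribed procedure.

```python
# Illustrative sketch: measuring disagreement between two post-hoc explainers.
# The attribution vectors below are invented placeholders; in practice they
# would come from explanation methods such as LIME or SHAP.
import numpy as np
from scipy.stats import spearmanr

def top_k_agreement(attr_a: np.ndarray, attr_b: np.ndarray, k: int = 3) -> float:
    """Fraction of overlap between the k most important features of each method."""
    top_a = set(np.argsort(-np.abs(attr_a))[:k])
    top_b = set(np.argsort(-np.abs(attr_b))[:k])
    return len(top_a & top_b) / k

# Hypothetical attributions for the same prediction from two different explainers.
explainer_1 = np.array([0.42, -0.10, 0.05, 0.31, -0.02])
explainer_2 = np.array([0.05, -0.38, 0.12, 0.30, -0.01])

rho, _ = spearmanr(explainer_1, explainer_2)        # rank correlation of attributions
overlap = top_k_agreement(explainer_1, explainer_2)  # overlap of top-3 features

print(f"Spearman rank correlation: {rho:.2f}")
print(f"Top-3 feature overlap:     {overlap:.2f}")
# Low correlation or overlap signals that the explanations disagree,
# which complicates audits and reliable assessment procedures.
```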