Alignment Drift

AI system gradually deviating from intended goals

Ethics · Legal · Risk
Updated 2 May 2025 · Reviewed

Definition

Alignment drift occurs when an AI system gradually deviates from its intended goals or safety constraints over time or with continued use. For example, an AI initially trained to avoid giving legal advice starts issuing bold legal conclusions.
