Mitigating Skeleton Key, a New Type of Generative AI Jailbreak Technique
Mitigating Skeleton Key, a New Type of Generative AI Jailbreak Technique
28 June 2024
Microsoft has discovered a new type of jailbreak attack called Skeleton Key. This technique uses a multi-turn strategy to make the model ignore its guardrails, allowing it to generate forbidden content or override its decision-making rules.