BEAST Attack on AI Models Can Break LLM Guardrails in a Minute
29 February 2024
Computer scientists have developed BEAST, a fast and efficient method for generating harmful prompts that elicit undesirable responses from large language models. The attack runs on an Nvidia RTX A6000 GPU with 48 GB of memory.