notesum.ai
Published at November 21GASP: Efficient Black-Box Generation of Adversarial Suffixes for Jailbreaking LLMs
cs.AI
cs.CR
cs.CV
Released Date: November 21, 2024
Authors: Advik Raj Basani1, Xiao Zhang2
Aff.: 1BITS Pilani, Goa Campus, India; 2CISPA Helmholtz Center for Information Security, Germany

| Attack | Evaluator | TargetLLM (ASR@1 / ASR@10) | ||||
|---|---|---|---|---|---|---|
| Mistral-7b-v0.3 | Falcon-7b | Llama-3.1-8b | Llama-3-8b | Llama-2-7b | ||
| GCG | Keyword Matching | |||||
| StrongREJECT | ||||||
| GASPEval | ||||||
| AutoDAN | Keyword Matching | |||||
| StrongREJECT | ||||||
| GASPEval | ||||||
| AdvPrompter | Keyword Matching | |||||
| StrongREJECT | ||||||
| GASPEval | ||||||
| GASP | Keyword Matching | |||||
| StrongREJECT | ||||||
| GASPEval | ||||||