notesum.ai
Published at October 20Faster-GCG: Efficient Discrete Optimization Jailbreak Attacks against Aligned Large Language Models
cs.LG
cs.AI
physics.chem-ph
q-bio.BM
Released Date: October 20, 2024
Authors: Xiao Li1, Zhuhong Li2, Qiongxiu Li3, Bingze Lee1, Jinghao Cui1, Xiaolin Hu1
Aff.: 1Tsinghua University; 2Duke University; 3Aalborg University

| Models | Method | Batch Size | Iteration | ASR |
|---|---|---|---|---|
| Llama-2-7B-chat | GCG | 512 | 500 | 5% |
| Faster-GCG | 256 | 100 | 34% | |
| Faster-GCG | 512 | 500 | 49% | |
| Vicuna-13B | GCG | 512 | 500 | 79% |
| Faster-GCG | 256 | 100 | 87% | |
| Faster-GCG | 512 | 500 | 95% |