notesum.ai
Published at October 23Guide for Defense (G4D): Dynamic Guidance for Robust and Balanced Defense in Large Language Models
cs.AI
Released Date: October 23, 2024
Authors: He Cao1, Weidi Luo1, Yu Wang1, Zijing Liu1, Bing Feng1, Yuan Yao2, Yu Li1
Aff.: 1International Digital Economy Academy (IDEA); 2Hong Kong University of Science and Technology

| Model | Method | CB-RedTeam | General Attacks | ||||||||||||||||||||||||||||||||||||||||||||||
| GCG | DeepInception | SAA | PAIR | AutoDAN | |||||||||||||||||||||||||||||||||||||||||||||
| GPT-4o-mini |
|
|
|
|
|
|
|
||||||||||||||||||||||||||||||||||||||||||
| Vicuna-v1.5-13B |
|
|
|
|
|
|
|
||||||||||||||||||||||||||||||||||||||||||