Knowledge Graph Guided Evaluation of Abstention Techniques
cs.CL
cs.AI
Release Date: December 10, 2024
Authors: Kinshuk Vasisht¹, Navreet Kaur², Danish Pruthi¹
Affiliations: ¹Indian Institute of Science, Bengaluru, India; ²University of Washington, Seattle, USA

| Technique | Gemma-2 2B | Gemma-2 9B | LLaMA-3.1 8B | GPT-3.5-T | GPT-4o | Mistral 7B |
|---|---|---|---|---|---|---|
| Prompt (ZS) | 25.2 (3.0) | 90.1 (1.9) | 97.4 (0.6) | 37.4 (2.9) | 97.9 (0.7) | 97.4 (0.7) |
| Prompt (FS+CoT) | 99.4 (0.2) | 96.3 (0.9) | 97.0 (0.4) | 95.6 (0.9) | 99.7 (0.1) | 76.1 (2.5) |
| Act. Steering | 81.6 (1.1) | 79.7 (1.4) | 97.6 (0.4) | — | — | 89.7 (0.7) |
| SFT | 86.4 (1.4) | 90.4 (0.9) | 82.2 (1.5) | — | — | 87.9 (1.4) |
| SFT + DPO | 74.0 (2.5) | 49.3 (2.8) | 70.2 (2.2) | — | — | 58.7 (3.5) |