notesum.ai
Published on October 18
Teaching Models to Balance Resisting and Accepting Persuasion
cs.AR
Released Date: October 18, 2024
Authors: Elias Stengel-Eskin, Peter Hase, Mohit Bansal
Affiliation: UNC Chapel Hill

| Model | NQ1 | NQ2 | BoolQ | TruthfulQA | Avg. |
|---|---|---|---|---|---|
| Llama-3.1-70B | | | | | |
| + accept | | | | | |
| + resist | | | | | |
| + PBT | | | | | |