notesum.ai

Published at October 22

LLMScan: Causal Scan for LLM Misbehavior Detection

cs.AI
cs.DL
cs.LG

Released Date: October 22, 2024

Authors: Mengdi Zhang1, Kai Kiat Goh1, Peixin Zhang1, Jun Sun1

Aff.: 1Singapore Management University

Arxiv: https://arxiv.org/abs/2410.16638v1