notesum.ai

Published at November 26

Adaptive Deployment of Untrusted LLMs Reduces Distributed Threats

cs.CL

Released Date: November 26, 2024

Authors: Jiaxin Wen1, Vivek Hebbar2, Caleb Larson3, Aryan Bhatt2, Ansh Radhakrishnan4, Mrinank Sharma4, Henry Sleight3, Shi Feng5, He He5, Ethan Perez4, Buck Shlegeris2, Akbir Khan6

Aff.: 1Tsinghua University; 2Redwood Research; 3MATS; 4Anthropic; 5George Washington University; 6Anthropic, UCL

Arxiv: http://arxiv.org/abs/2411.17693v1