notesum.ai

Published at November 12

RedCode: Risky Code Execution and Generation Benchmark for Code Agents

cs.SE
cs.AI

Released Date: November 12, 2024

Authors: Chengquan Guo1, Xun Liu2, Chulin Xie2, Andy Zhou3, Yi Zeng4, Zinan Lin5, Dawn Song6, Bo Li1

Aff.: 1University of Chicago; 2University of Illinois Urbana-Champaign; 3Lapis Labs; 4Virginia Tech; 5Microsoft Research; 6University of California Berkeley

Arxiv: http://arxiv.org/abs/2411.07781v1