notesum.ai

Published at November 25

CS-Eval: A Comprehensive Large Language Model Benchmark for CyberSecurity

cs.CR

Released Date: November 25, 2024

Authors: Zhengmin Yu1, Jiutian Zeng2, Siyi Chen2, Wenhan Xu2, Dandan Xu2, Xiangyu Liu2, Zonghao Ying2, Nan Wang1, Yuan Zhang1, Min Yang1

Aff.: 1Fudan University; 2Alibaba Group

Arxiv: http://arxiv.org/abs/2411.16239v1