notesum.ai

Published at November 27

A Real-World Benchmark for Evaluating Fine-Grained Issue Solving Capabilities of Large Language Models

cs.SE

Released Date: November 27, 2024

Authors: Ruida Hu1, Chao Peng2, Jingyi Ren2, Bo Jiang2, Xiangxin Meng2, Qinyun Wu2, Pengfei Gao2, Xinchen Wang1, Cuiyun Gao1

Aff.: 1Haribin Institute of Technology, Shenzhen, China; 2ByteDance, China

Arxiv: http://arxiv.org/abs/2411.18019v1