notesum.ai

Published at October 18

RAG-ConfusionQA: A Benchmark for Evaluating LLMs on Confusing Questions

cs.AI
cs.CL
cs.CR

Released Date: October 18, 2024

Authors: Zhiyuan Peng1, Jinming Nian1, Alexandre Evfimievski2, Yi Fang1

Aff.: 1Santa Clara University; 2Not Specified

Arxiv: https://arxiv.org/abs/2410.14567v1