notesum.ai

Published at October 24

From Blind Solvers to Logical Thinkers: Benchmarking LLMs' Logical Integrity on Faulty Mathematical Problems

cs.CV
cs.AI

Released Date: October 24, 2024

Authors: A M Muntasir Rahman1, Junyi Ye1, Wei Yao1, Wenpeng Yin2, Guiling Wang1

Aff.: 1New Jersey Institute of Technology; 2Pennsylvania State University

Arxiv: https://arxiv.org/abs/2410.18921v1