Published on December 3

CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy

cs.CV

Release Date: December 3, 2024

Authors: Zhibo Yang¹, Jun Tang¹, Zhaohai Li¹, Pengfei Wang¹, Jianqiang Wan¹, Humen Zhong¹, Xuejing Liu¹, Mingkun Yang¹, Peng Wang¹, Yuliang Liu², Lianwen Jin³, Xiang Bai², Shuai Bai¹, Junyang Lin¹

Affiliations: ¹Alibaba Group; ²Huazhong University of Science and Technology; ³South China University of Technology

arXiv: http://arxiv.org/pdf/2412.02210v1