notesum.ai

Published on October 20

Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence

cs.MM
cs.AI
cs.CL
cs.HC
cs.IR

Release Date: October 20, 2024

Authors: Norbert Tihanyi¹, Tamas Bisztray², Richard A. Dubniczky³, Rebeka Toth², Bertalan Borsos³, Bilel Cherif⁴, Mohamed Amine Ferrag⁵, Lajos Muzsai³, Ridhi Jain¹, Ryan Marinelli², Lucas C. Cordeiro⁶, Merouane Debbah⁷

Affiliations: ¹Technology Innovation Institute, Abu Dhabi, United Arab Emirates; ²University of Oslo, Oslo, Norway; ³Eötvös Loránd University, Budapest, Hungary; ⁴Technology Innovation Institute, United Arab Emirates; ⁵University of Guelma, Guelma, Algeria; ⁶The University of Manchester, Manchester, United Kingdom; ⁷Khalifa University, Abu Dhabi, United Arab Emirates

arXiv: https://arxiv.org/abs/2410.15490v2