notesum.ai

Published at November 10

Fineweb-Edu-Ar: Machine-translated Corpus to Support Arabic Small Language Models

cs.CL
cs.AI

Released Date: November 10, 2024

Authors: Sultan Alrashed1, Dmitrii Khizbullin2, David R. Pugh

Aff.: 1Saudi Data & Artificial Intelligence Authority (SDAIA); 2King Abdullah University of Science & Technology (KAUST)

Arxiv: http://arxiv.org/abs/2411.06402v1