notesum.ai

Published on December 4

Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models

cs.LG
cs.AI
cs.CL

Release Date: December 4, 2024

Authors: Alex Havrilla¹, Andrew Dai, Laura O'Mahony, Koen Oostermeijer, Vera Zisler, Alon Albalak, Fabrizio Milo, Sharath Chandra Raparthy, Kanishk Gandhi, Baber Abbasi, Duy Phung, Maia Iyer, Dakota Mahan, Chase Blagden, Srishti Gureja, Mohammed Hamdy, Wen-Ding Li, Giovanni Paolini, Pawan Sasanka Ammanamanchi, Elliot Meyerson

Affiliation: ¹Georgia Tech

arXiv: http://arxiv.org/pdf/2412.02980v1