notesum.ai

Published on November 18

Transcending Language Boundaries: Harnessing LLMs for Low-Resource Language Translation

Categories: cs.CL, cs.AI

Release Date: November 18, 2024

Authors: Peng Shu¹, Junhao Chen¹, Zhengliang Liu¹, Hui Wang², Zihao Wu¹, Tianyang Zhong³, Yiwei Li¹, Huaqin Zhao¹, Hanqi Jiang¹, Yi Pan¹, Yifan Zhou¹, Constance Owl⁴, Xiaoming Zhai⁵, Ninghao Liu⁶, Claudio Saunt⁴, Tianming Liu⁶

Affiliations: ¹School of Computing, The University of Georgia, Athens 30602, USA; ²Second Language Acquisition and Teaching, University of Arizona, Tucson 85721, USA; ³Department of Mathematical and Statistical Sciences, University of Alberta, Edmonton, Canada; ⁴Department of History, The University of Georgia, Athens 30602, USA; ⁵Department of Mathematics, Science, and Social Studies Education, University of Georgia, Athens, GA, USA; ⁶National GENIUS Center, Athens, GA, USA

arXiv: http://arxiv.org/abs/2411.11295v1

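| Language | Model | BLEU | ROUGE-L | BERTScore Precision | BERTScore Recall | BERTScore F1 | Human Evaluation |
|---|---|---|---|---|---|---|---|
| Cherokee | Llama 3.1 405B | 0.0 |  | 0.931 | 0.927 | 0.929 | 0.0 |
| Cherokee | GPT-4o | 0.003 | 0.0 | 0.938 |  |  | 0.0 |
| Cherokee | GPT-4o + RAG | 0.115 | 0.117 | 0.962 | 0.964 | 0.963 | 0.0 |
| Tibetan | Llama 3.1 405B | 0.0 |  | 0.879 | 0.859 | 0.869 | 0.067 |
| Tibetan | GPT-4o | 0.0 |  | 0.833 | 0.851 | 0.842 | 0.147 |
| Tibetan | GPT-4o + RAG | 0.108 | 0.123 | 0.802 | 0.810 | 0.806 | 0.293 |
| Manchu | Llama 3.1 405B | 0.0 | 0.104 | 0.693 | 0.663 | 0.678 | 0.040 |
| Manchu | GPT-4o | 0.0 | 0.125 | 0.726 | 0.703 | 0.714 | 0.173 |
| Manchu | GPT-4o + RAG | 0.077 | 0.188 | 0.716 | 0.696 | 0.706 | 0.333 |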
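As a quick sanity check on the table above, the last BERTScore column is consistent with an F1 score, that is, the harmonic mean of BERTScore precision and recall. The sketch below recomputes it for the rows that list all three BERTScore values. The function names, the row layout, and the 0.0005 rounding tolerance are assumptions made for illustration; this is not the authors' evaluation code, and it does not compute BERTScore itself.

```python
def f1_from_precision_recall(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; returns 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (language, model, precision, recall, reported last BERTScore value),
# copied from the table. Rows with missing BERTScore cells are left out.
ROWS = [
    ("Cherokee", "Llama 3.1 405B", 0.931, 0.927, 0.929),
    ("Cherokee", "GPT-4o + RAG", 0.962, 0.964, 0.963),
    ("Tibetan", "Llama 3.1 405B", 0.879, 0.859, 0.869),
    ("Tibetan", "GPT-4o", 0.833, 0.851, 0.842),
    ("Tibetan", "GPT-4o + RAG", 0.802, 0.810, 0.806),
    ("Manchu", "Llama 3.1 405B", 0.693, 0.663, 0.678),
    ("Manchu", "GPT-4o", 0.726, 0.703, 0.714),
    ("Manchu", "GPT-4o + RAG", 0.716, 0.696, 0.706),
]


def check_rows(rows, tolerance=0.0005):
    """Return the rows whose recomputed F1 differs from the reported value
    by more than `tolerance` (an assumed allowance for three-decimal rounding)."""
    mismatches = []
    for language, model, precision, recall, reported in rows:
        recomputed = f1_from_precision_recall(precision, recall)
        if abs(recomputed - reported) > tolerance:
            mismatches.append((language, model, recomputed, reported))
    return mismatches


if __name__ == "__main__":
    for language, model, precision, recall, reported in ROWS:
        recomputed = f1_from_precision_recall(precision, recall)
        print(f"{language:9s} {model:15s} recomputed={recomputed:.4f} reported={reported:.3f}")
    print("mismatches:", check_rows(ROWS))
```

Because the reported precision and recall are themselves rounded averages, agreement within rounding is the most this check can show.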