notesum.ai

Published at October 21

Arithmetic Transformers Can Length-Generalize in Both Operand Length and Count

cs.CV
cs.AI

Released Date: October 21, 2024

Authors: Hanseul Cho1, Jaeyoung Cha1, Srinadh Bhojanapalli2, Chulhee Yun1

Aff.: 1Graduate School of AI, KAIST; 2Google Research

Arxiv: https://arxiv.org/abs/2410.15787v1