notesum.ai

Published on November 6

Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models

cs.CL
cs.AI
cs.LG

Release Date: November 6, 2024

Authors: Zhijian Zhuo¹, Ya Wang², Yutao Zeng², Xiaoqing Li³, Xun Zhou², Jinwen Ma¹

Affiliations: ¹School of Mathematical Sciences, Peking University; ²Seed-Foundation-Model, ByteDance; ³Capital University of Economics and Business

arXiv: http://arxiv.org/abs/2411.03884v1