notesum.ai

Published at November 26

ShowUI: One Vision-Language-Action Model for GUI Visual Agent

cs.CV
cs.AI
cs.CL
cs.HC

Released Date: November 26, 2024

Authors: Kevin Qinghong Lin1, Linjie Li2, Difei Gao1, Zhengyuan Yang2, Shiwei Wu1, Zechen Bai1, Weixian Lei, Lijuan Wang2, Mike Zheng Shou1

Aff.: 1Show Lab, National University of Singapore; 2Microsoft

Arxiv: http://arxiv.org/abs/2411.17465v1