notesum.ai

Published on November 18

Enhancing Vision-Language Model Safety through Progressive Concept-Bottleneck-Driven Alignment

cs.CV
cs.AI

Release Date: November 18, 2024

Authors: Zhendong Liu¹, Yuanbi Nie², Yingshui Tan³, Xiangyu Yue⁴, Qiushi Cui², Chongjun Wang¹, Xiaoyong Zhu³, Bo Zheng³

Affiliations: ¹Department of Computer Science and Technology, Nanjing University, Nanjing, Jiangsu Province, China; ²School of Electrical Engineering, Chongqing University, Chongqing, China; ³Alibaba Group, Hangzhou, Zhejiang Province, China; ⁴Department of Information Engineering, Multimedia Lab (MMLab), Chinese University of Hong Kong, Hong Kong, China

arXiv: http://arxiv.org/abs/2411.11543v1