notesum.ai
Published at November 29On Domain-Specific Post-Training for Multimodal Large Language Models
cs.CL
cs.CV
cs.LG
Released Date: November 29, 2024
Authors: Daixuan Cheng1, Shaohan Huang2, Ziyu Zhu3, Xintong Zhang4, Wayne Xin Zhao5, Zhongzhi Luan2, Bo Dai1, Zhenliang Zhang1
Aff.: 1State Key Laboratory of General Artificial Intelligence, BIGAI; 2Beihang University; 3State Key Laboratory of General Artificial Intelligence, BIGAI, Tsinghua University; 4State Key Laboratory of General Artificial Intelligence, BIGAI, Beijing Institute of Technology; 5Renmin University of China

| User: <Image>Describe the image. |
| Assistant: {Caption} |
| User: Answer with a precise response. {Instruction} |
| Assistant: {Precise Response} |
| User: Answer with an informative response. {Instruction} |
| Assistant: {Informative Response} |