notesum.ai
Published at December 5p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
cs.CV
cs.CL
Released Date: December 5, 2024
Authors: Jun Zhang1, Desen Meng, Ji Qi, Zhenpeng Huang, Tao Wu, Limin Wang
Aff.: 1State Key Laboratory for Novel Software Technology, Nanjing University

| Model | Params |
|
|
SEED | RWQA | MME | MMB | POPE | GQA | AI2D | AVG | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LLaVA-v1.5 | 7B | 8.38 | 100% | 66.2 | 55.6 | 1506.8 | 64.1 | 85.9 | 61.9 | 55.2 | 66.3 | ||||
| p-MoD-LLaVA-v1.5 | 7B | 4.92 | 53.8% | 66.5 | 55.7 | 1482.8 | 65.4 | 85.5 | 62.2 | 56.2 | 66.5 | ||||
| LLaVA-NeXT | 7B | 39.46 | 100% | 68.9 | 57.6 | 1519.3 | 67.5 | 87.2 | 63.5 | 64.0 | 69.2 | ||||
| p-MoD-LLaVA-NeXT | 7B | 21.94 | 53.8% | 69.0 | 57.6 | 1495.5 | 67.3 | 86.8 | 63.3 | 65.1 | 69.1 |