Test-time Correction with Human Feedback: An Online 3D Detection System via Visual Prompting
cs.CV
Release Date: December 10, 2024
Authors: Zetong Yang¹, Hanxue Zhang², Yanan Sun¹, Li Chen¹, Fei Xia³, Fatma Guney⁴, Hongyang Li¹
Affiliations: ¹Shanghai AI Lab; ²Shanghai Jiao Tong University; ³Meituan Inc.; ⁴Koc University
![[Uncaptioned image]](https://arxiv.org/html/2412.07768v1/x1.png)

| Method | Backbone | Type | mAP (%) | EDS (%) |
| --- | --- | --- | --- | --- |
| MonoDETR [67] | R101 | Monocular | 37.9 | 38.2 |
| TTC-MonoDETR | R101 | Monocular | 42.6 (+5.7) | 43.2 (+5.0) |
| MV2D [57] | R50 | Multiview | 38.9 | 38.3 |
| TTC-MV2D | R50 | Multiview | 51.0 (+12.1) | 51.0 (+12.7) |
| Sparse4Dv2 [31] | R50 | Multiview + Temporal | 38.8 | 37.6 |
| TTC-Sparse4Dv2 | R50 | Multiview + Temporal | 53.5 (+14.7) | 52.1 (+14.5) |
| BEVFormer [27] | R101 | BEV + Temporal | 36.5 | 35.8 |
| TTC-BEVFormer | R101 | BEV + Temporal | 47.8 (+11.3) | 47.2 (+11.4) |
| BEVFormerV2-t8 [62] | R50 | BEV + Temporal | 39.6 | 38.9 |
| TTC-BEVFormerV2-t8 | R50 | BEV + Temporal | 51.6 (+12.0) | 51.0 (+12.1) |
| RayDN [32] | R50 | BEV + Temporal | 39.6 | 38.7 |
| TTC-RayDN | R50 | BEV + Temporal | 51.7 (+12.1) | 50.8 (+12.1) |
| StreamPETR [54] | V2-99 | BEV + Temporal | 39.7 | 39.1 |
| TTC-StreamPETR | V2-99 | BEV + Temporal | 52.3 (+12.6) | 51.5 (+12.4) |
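
The parenthesized values in the table read as absolute gains (in percentage points) of each TTC variant over its baseline. The following is a minimal sketch for recomputing them from the listed scores. The names `RESULTS`, `absolute_gain`, and `gains` are illustrative assumptions and do not come from the paper.

```python
# (baseline, with TTC) scores copied from the table above, in percent.
# The dictionary layout and all function names are hypothetical helpers.
RESULTS = {
    "MonoDETR": {"mAP": (37.9, 42.6), "EDS": (38.2, 43.2)},
    "MV2D": {"mAP": (38.9, 51.0), "EDS": (38.3, 51.0)},
    "Sparse4Dv2": {"mAP": (38.8, 53.5), "EDS": (37.6, 52.1)},
    "BEVFormer": {"mAP": (36.5, 47.8), "EDS": (35.8, 47.2)},
    "BEVFormerV2-t8": {"mAP": (39.6, 51.6), "EDS": (38.9, 51.0)},
    "RayDN": {"mAP": (39.6, 51.7), "EDS": (38.7, 50.8)},
    "StreamPETR": {"mAP": (39.7, 52.3), "EDS": (39.1, 51.5)},
}


def absolute_gain(baseline: float, improved: float) -> float:
    """Difference in percentage points, rounded to one decimal like the table."""
    return round(improved - baseline, 1)


def gains(results: dict) -> dict:
    """Map each method to its per-metric absolute gain."""
    return {
        method: {metric: absolute_gain(*pair) for metric, pair in scores.items()}
        for method, scores in results.items()
    }


if __name__ == "__main__":
    for method, per_metric in gains(RESULTS).items():
        print(f"TTC-{method}: mAP +{per_metric['mAP']}, EDS +{per_metric['EDS']}")
```

Recomputed this way, the gains agree with the parenthesized values in the table except for the TTC-MonoDETR mAP entry, where 42.6 - 37.9 gives 4.7 rather than the listed +5.7.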