notesum.ai
Published at October 30A Monte Carlo Framework for Calibrated Uncertainty Estimation in Sequence Prediction
cs.LG
cs.AI
Released Date: October 30, 2024
Authors: Qidong Yang1, Weicheng Zhu2, Joseph Keslin3, Laure Zanna2, Tim G. J. Rudner2, Carlos Fernandez-Granda2
Aff.: 1MIT; 2New York University; 3University of Illinois Urbana-Champaign

| Scenario | Regularization | ECE | AUC | CE | BS |
|---|---|---|---|---|---|
| Seaquest | ✗ | 0.0435 0.0004 | 0.8671 0.0035 | 1.0577 0.0241 | 0.1247 0.0012 |
| time-dependent | 0.0277 0.0023 | 0.8678 0.0028 | 0.6705 0.0285 | 0.1144 0.0007 | |
| constant | 0.0365 0.0002 | 0.8625 0.0020 | 0.8173 0.0068 | 0.1177 0.0008 | |
| River Raid | ✗ | 0.0583 0.0016 | 0.6453 0.0009 | 1.2034 0.0281 | 0.1750 0.0016 |
| time-dependent | 0.0388 0.0013 | 0.6346 0.0035 | 0.8585 0.0132 | 0.1671 0.0012 | |
| constant | 0.0474 0.0004 | 0.6280 0.0020 | 1.0274 0.0158 | 0.1686 0.0005 | |
| Bank Heist | ✗ | 0.0559 0.0032 | 0.6938 0.0028 | 1.1874 0.0540 | 0.2340 0.0020 |
| time-dependent | 0.0148 0.0014 | 0.6782 0.0016 | 0.7647 0.0130 | 0.2166 0.0005 | |
| constant | 0.0399 0.0016 | 0.6928 0.0046 | 0.8894 0.0112 | 0.2211 0.0007 | |
| H.E.R.O. | ✗ | 0.0947 0.0014 | 0.6785 0.0061 | 1.1310 0.0225 | 0.1261 0.0009 |
| time-dependent | 0.0481 0.0034 | 0.7159 0.0105 | 0.6940 0.0246 | 0.1170 0.0008 | |
| constant | 0.0352 0.0001 | 0.7041 0.0102 | 0.7218 0.0391 | 0.1212 0.0011 | |
| Road Runner | ✗ | 0.0779 0.0035 | 0.6913 0.0100 | 1.1586 0.0291 | 0.1575 0.0034 |
| time-dependent | 0.0204 0.0012 | 0.6823 0.0084 | 0.5255 0.0077 | 0.1382 0.0003 | |
| constant | 0.0303 0.0027 | 0.6898 0.0140 | 0.6275 0.0250 | 0.1394 0.0010 | |
| FaceMed | ✗ | 0.1503 0.0048 | 0.7534 0.0079 | 1.6932 0.0188 | 0.3464 0.0012 |
| time-dependent | 0.0757 0.0068 | 0.7614 0.0024 | 0.9085 0.0303 | 0.3328 0.0008 | |
| constant | 0.0974 0.0045 | 0.7613 0.0028 | 1.0071 0.0499 | 0.3356 0.0030 |