notesum.ai
Published at October 30Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers
cs.SE
cs.CL
cs.CR
Released Date: October 30, 2024
Authors: Lam Nguyen Tung1, Steven Cho2, Xiaoning Du1, Neelofar Neelofar1, Valerio Terragni2, Stefano Ruberto3, Aldeida Aleti1
Aff.: 1Monash University, Australia; 2University of Auckland, New Zealand; 3JRC European Commission, Italy

| Dataset | Data Statistics | Model Under Test | Number of top words | Importance Type | ||||
| Trust | Untrust | Total | Model Type | Accuracy | ||||
| Dong’s (Dong, 2018) | movie | 311 | 47 | 358 | MLP | 0.832 | 10, 20 | importance |
| 20news | MLP | 0.939 | equivalent | |||||
| Garg et al.’s (Garg et al., 2022) | CAMS | 1,206 | 739 | 1,945 | mentalbert-base-uncased | 0.397 | 10 | importance equivalent |
| Mathew et al.’s (Mathew et al., 2021) | HateXplain | 3,002 | 304 | 3,306 | bert-base-uncased | 0.797 | 10 | importance equivalent |
| Ours | amazon_polarity | 226 | 19 | 245 | roberta-base-cased | 0.960 | 5, 10, 20 | importance different |
| ag_news | bert-base-uncased | 0.934 | ||||||
| rotten_tomatoes | distilbert-base-uncased | 0.841 | ||||||
| yahoo_answers_topics | bert-base-uncased | 0.750 | ||||||
| imdb | distilbert-base-uncased | 0.928 | ||||||
| emotion | distilbert-base-uncased | 0.926 | ||||||