bernhardt2022tmlr 2022 (1) Mélanie Bernhardt, Fabio De Sousa Ribeiro, Ben Glocker, Failure Detection in Medical Image Classification: A Reality Check and Benchmarking Testbed, Transactions on Machine Learning Research, 2022