Data-adaptive discriminative feature localization with statistically guaranteed interpretation

Ben Dai; Xiaotong Shen; Lin Yee Chen; Chunlin Li; Wei Pan

doi:10.1214/22-AOAS1705

September 2023 Data-adaptive discriminative feature localization with statistically guaranteed interpretation

Ben Dai, Xiaotong Shen, Lin Yee Chen, Chunlin Li, Wei Pan

Author Affiliations +

Ann. Appl. Stat. 17(3): 2019-2038 (September 2023). DOI: 10.1214/22-AOAS1705

Abstract

In explainable artificial intelligence, discriminative feature localization is critical to reveal a black-box model’s decision-making process from raw data to prediction. In this article we use two real datasets, the MNIST handwritten digits and MIT-BIH electrocardiogram (ECG) signals, to motivate key characteristics of discriminative features, namely, adaptiveness, predictive importance and effectiveness. Then we develop a localization framework, based on adversarial attacks, to effectively localize discriminative features. In contrast to existing heuristic methods, we also provide a statistically guaranteed interpretability of the localized features by measuring a generalized partial $R^{2}$ . We apply the proposed method to the MNIST dataset and the MIT-BIH dataset with a convolutional autoencoder. In the first, the compact image regions localized by the proposed method are visually appealing. Similarly, in the second, the identified ECG features are biologically plausible and consistent with cardiac electrophysiological principles while locating subtle anomalies in a QRS complex that may not be discernible by the naked eye. Overall, the proposed method compares favorably with state-of-the-art competitors. Accompanying this paper is a Python library dnn-locate that implements the proposed approach.

Funding Statement

We would like to acknowledge support for this project from RGC-ECS 24302422, the CUHK direct grant, NSF DMS-1712564, DMS-1721216, DMS-1952539 and NIH grants R01GM126002, R01AG069895, R01AG065636, R01AG074858, R01AG074858, U01AG073079 and RF1 AG067924.

Acknowledgments

The corresponding authors for this work are Ben Dai and Wei Pan. The authors would like to thank the referees, the Associate Editor and the Editor for the constructive feedback which greatly improved this work.

Citation

Download Citation

Ben Dai. Xiaotong Shen. Lin Yee Chen. Chunlin Li. Wei Pan. "Data-adaptive discriminative feature localization with statistically guaranteed interpretation." Ann. Appl. Stat. 17 (3) 2019 - 2038, September 2023. https://doi.org/10.1214/22-AOAS1705

Information

Received: 1 March 2022; Revised: 1 August 2022; Published: September 2023

First available in Project Euclid: 7 September 2023

MathSciNet: MR4637655

Digital Object Identifier: 10.1214/22-AOAS1705

Keywords: deep learning , discriminative features , Explainable artificial intelligence , generalized partial R2 , interpretability , Localization , regularization

ACCESS THE FULL ARTICLE

JOURNAL ARTICLE
20 PAGES

DOWNLOAD PDF + SAVE TO MY LIBRARY