Explainable Deep Learning to Profile Mitochondrial Disease Using High Dimensional Protein Expression Data

Published in 2022 IEEE International Conference on Big Data, 2022

Mitochondrial diseases are currently untreatable due to our limited understanding of their pathology. We study the expression of various mitochondrial proteins in skeletal myofibres (SM) taken from biopsies in order to discover processes involved in mitochondrial pathology using Imaging Mass Cytometry (IMC). IMC produces high dimensional multichannel (in our case 10) pseudo-images representing spatial variation in the expression of a panel of proteins within a tissue, including subcellular variation. Statistical analysis of these images requires semi-automated annotation of thousands of SM in IMC images of patient muscle biopsies. In this paper we investigate the use of deep learning (DL) on raw IMC data to analyse it without any manual pre-processing steps, statistical summaries or statistical models. For this we first t rain s tate-of-art c omputer vision DL models on all available image channels, both combined and individually. We observed better than expected accuracy for many of these models. We then apply state-of-the-art explainable techniques relevant to computer vision DL to find the basis of the predictions of these models. Some of the resulting visual explainable maps highlight features in the images that appear consistent with the latest hypotheses about mitochondrial disease progression within myofibres.

Recommended citation: A. Khan, C. Lawless, A. E. Vincent, S. Pilla, S. Ramesh and A. S. McGough, "Explainable Deep Learning to Profile Mitochondrial Disease Using High Dimensional Protein Expression Data," 2022 IEEE International Conference on Big Data (Big Data), Osaka, Japan, 2022, pp. 4375-4384, doi: 10.1109/BigData55660.2022.10020391. https://ieeexplore.ieee.org/abstract/document/10020391