DeepResBat: A residual MRI harmonization framework accounting for covariate distribution differences

Poster No:

1848 

Submission Type:

Abstract Submission 

Authors:

Lijun An1, Chen Zhang1, Naren Wulan1, Pansheng Chen1, Shaoshi Zhang1, Fang Ji1, Kwun Kei Ng1, Christopher Chen1, Juan Helen Zhou1, B. T. Thomas Yeo1

Institutions:

1National University of Singapore, Singapore, Singapore

First Author:

Lijun An  
National University of Singapore
Singapore, Singapore

Co-Author(s):

Chen Zhang  
National University of Singapore
Singapore, Singapore
Naren Wulan  
National University of Singapore
Singapore, Singapore
Pansheng Chen  
National University of Singapore
Singapore, Singapore
Shaoshi Zhang  
National University of Singapore
Singapore, Singapore
Fang Ji  
National University of Singapore
Singapore, Singapore
Eric Kwun Kei Ng  
National University of Singapore
Singapore, Singapore
Christopher Chen  
National University of Singapore
Singapore, Singapore
Juan Helen Zhou  
National University of Singapore
Singapore, Singapore
B. T. Thomas Yeo  
National University of Singapore
Singapore, Singapore

Introduction:

Mega-analysis combining MRI data from multiple sites has significantly advanced neuroimaging research (Miller et al., 2018). When pooling data across multiple sites, MRI harmonization is crucial to reduce undesired site variabilities while preserving effects of covariates of interests (e.g., age). Recently, deep learning has been utilized for addressing harmonization problems (Hu et al., 2023). However, existing deep learning harmonization approaches typically ignore the inclusion of covariates (Zuo et al., 2021; Bashyam et al., 2021). Here, we propose deep learning harmonization models accounting for covariate distribution differences. Furthermore, we demonstrate for the first time that deep learning harmonization algorithms can introduce false positives, which should lead to caution for applying deep learning in neuroimaging.

Methods:

In this study, we harmonized T1 MRI between ADNI (adni.loni.usc.edu) and AIBL (aibl.loni.csiro.au), as well as between ADNI and MACC (macc.sg) cohorts. The inputs obtained from FreeSurfer (Fischl et al., 2002; Desikan et al., 2006) are thickness and volumes of cortical ROIs, and volumes of subcortical ROIs. We performed a matching procedure to control for demographics, disease severity, and cognition impairment differences across cohorts. The matched participants served as the testset. The unmatched participants served as training and validation sets.
We proposed two deep learning models, coVAE (Figure 1B) and DeepResBat (Figure 1C), which account for covariate distribution differences across datasets. coVAE extends cVAE (Figure 1A; Moyer et al., 2020) by concatenating covariates y and site s with latent representations z. DeepResBat preserves the effects of covariates using machine learning and then eliminates unwanted site differences in covariate-free residuals through a cVAE model. For the baselines, we considered ComBat (Johnson et al., 2007), CovBat (Chen et al., 2021), and cVAE.
We considered three evaluation metrics. Firstly, an XGBoost model was applied to predict which dataset a harmonized participant came from. A higher accuracy indicates worse harmonization. Secondly, we tested whether harmonization leads to stronger association between harmonized ROIs and covariates of interests. A stronger association means better harmonization. Finally, we randomly permuted covariates and then harmonized ROIs. The permutation was repeated 1000 times, costing 240,000 GPU and 360,000 CPU hours. For each ROI, a GLM was fitted for each permutation to check association between harmonized ROI and permutated covariates, resulting in 1000 p values; then, we calculated percentage of p values below 0.05 across 1000 p values. The expected percentage across all ROIs is 5%, with a confidence interval of 3.65% to 6.35% (Eklund et al., 2016). A percentage higher than 5% underscores false positives.

Results:

Figure 2A shows that deep learning approaches significantly removed more dataset differences than linear approaches (ComBat and CovBat) for harmonizing ADNI and AIBL (Figure 2A1) and harmonizing ADNI and MACC (Figure 2A2) datasets.
Figure 2B shows association analysis results between harmonized ROIs and covariates of interests, measured by multivariate ANOVA. We found that coVAE and DeepResBat preserved more biological variabilities than cVAE without considering covariates and linear approaches.
Our permutation tests (Figure 2C) show coVAE introduced high false positives, while the DeepResBat achieved expected false positives. The results indicated deep learning harmonization approaches can introduce false positives without careful modeling.

Conclusions:

In this study, we proposed deep learning models, coVAE and ResBat, for incorporating covariates during harmonization. coVAE and ResBat outperformed cVAE and linear approaches in removing dataset differences and preserving biological variability. Furthermore, DeepResBat achieved acceptable false positives, which is a novel finding for deep learning harmonization studies.

Modeling and Analysis Methods:

Methods Development 1
Multivariate Approaches 2

Keywords:

Machine Learning
Modeling
MRI
Workflows
Other - MRI harmonization

1|2Indicates the priority used for review
Supporting Image: Lijun_OHBM_figure1.jpg
Supporting Image: Lijun_OHBM_figure2.jpg
 

Provide references using author date format

Miller, Karla L. (2016), "Multimodal population brain imaging in the UK Biobank prospective epidemiological study." Nature neuroscience 19.11 (2016): 1523-1536.

Hu, F. (2023), Image harmonization: A review of statistical and deep learning methods for removing batch effects and evaluation metrics for effective harmonization. NeuroImage, 120125.

Zuo, Lianrui. (2021), "Unsupervised MR harmonization by learning disentangled representations using information bottleneck theory." NeuroImage 243 (2021): 118569.

Bashyam, Vishnu M. (2021), "Deep Generative Medical Image Harmonization for Improving Cross‐Site Generalization in Deep Learning Predictors." Journal of Magnetic Resonance Imaging (2021).

Fischl, B. (2002), Whole brain segmentation: automated labeling of neuroanatomical structures in the human brain. Neuron, 33(3), 341-355.

Desikan, R. S. (2006), An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. Neuroimage, 31(3), 968-980.

Moyer, Daniel. (2020), "Scanner invariant representations for diffusion MRI harmonization." Magnetic resonance in medicine 84.4 (2020): 2174-2189.

Johnson, W. E. (2007), Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics, 8(1), 118-127.

Chen, A. A. (2022), Mitigating site effects in covariance for machine learning in neuroimaging data. Human brain mapping, 43(4), 1179-1195.

Eklund, A. (2016), Cluster failure: Why fMRI inferences for spatial extent have inflated false-positive rates. Proceedings of the national academy of sciences, 113(28), 7900-7905.