Poster No:
1670
Submission Type:
Abstract Submission
Authors:
Charles Ellis1, Robyn Miller1, Vince Calhoun2
Institutions:
1Center for Translational Research in Neuroimaging and Data Science (TReNDS), Atlanta, GA, 2Center for Translational Research in Neuroimaging and Data Science (TReNDS), Decatur, GA
First Author:
Charles Ellis
Center for Translational Research in Neuroimaging and Data Science (TReNDS)
Atlanta, GA
Co-Author(s):
Robyn Miller, PhD
Center for Translational Research in Neuroimaging and Data Science (TReNDS)
Atlanta, GA
Vince Calhoun
Center for Translational Research in Neuroimaging and Data Science (TReNDS)
Decatur, GA
Introduction:
While the application of deep learning methods to raw electroencephalogram (EEG) data is becoming more common, the development of reliable models can be challenging due to the small size of many EEG datasets. One potential solution is data augmentation, wherein synthetic training data is generated by duplicating and modifying the training data. While multiple data augmentation approaches have been developed for raw EEG data, their utility for the diagnosis of neuropsychiatric disorders remains underexplored. Moreover, it is unclear if existing approaches boost model performance because they increase the number of training samples or because they introduce augmentations that help the model learn better representations. In this study, we train a baseline convolutional neural network for automated major depressive disorder (MDD) diagnosis, train a second baseline model with duplicate training data (i.e., two identical copies), and compare performance to 6 models that are each trained with a different data augmentation approach that doubles the training set size (i.e., original data + augmented data).
Methods:
We used a publicly available resting-state EEG dataset (Mumtaz et al., 2017) with 5-10-minute recordings from 30 individuals with MDD and 28 healthy controls (HCs). We trained our model with 19 channels: Fp1, Fp2, F7, F3, Fz, F4, F8, T3, C3, Cz, C4, T4, T5, P3, Pz, P4, T6, O1, and O2. We downsampled the data to 200 Hz, channel-wise z-scored each recording separately, and separated the data into 25-second samples with a sliding window approach (2.5-second step size) that yielded 2,942 MDD samples and 2,950 HC samples. We trained 8 models: a baseline model with no duplicate data (M1.1), a baseline model with 2 copies of identical unmodified training data (M1.2), 6 models trained with 1 copy of unmodified training data and a copy of data augmented with Gaussian noise (M2) (Ellis et al., 2022), time reverse (M3) (Rommel et al., 2022), smooth time masking (M4) (Mohsenvand et al., 2020), Fourier surrogate (M5) (Schwabedal et al., 2019), frequency shift (M6) (Rommel et al., 2022), or channel dropout (M7) (Saeed et al., 2021) augmentation. For M1.1, we began with an architecture from (Oh et al., 2019) and optimized it with the Hyperband algorithm in Keras-Tuner. We used M1.1 as a starting point for M1.2-7 and optimized the learning rate of each model with Hyperband and the data augmentation parameters. We trained for 25 folds with subject-wise cross-validation and tested performance for each fold via accuracy, balanced accuracy, sensitivity, and specificity. We lastly compared the performance of each model with pair-wise t-tests and false discovery rate correction.
Results:
Table 1 shows our model test performance, which is fairly high. Most data augmentation approaches improved model performance by 2-2.5% compared to M1.1 but not M1.2. Also, our statistical testing did not identify any significant changes in accuracy, balanced accuracy, and sensitivity and only a couple significant changes in specificity.
Conclusions:
Our results suggest that the benefits of existing EEG data augmentation approaches may mainly result from increased training set size and not necessarily from the introduction of helpful augmentations. As such, it may be beneficial to compare new augmentation methods to a baseline model with duplicate training data to see if their augmentations are helpful. It is also possible that future studies might benefit from an "augmentation by duplication" approach in which models are simply trained on two identical copies of the training data. We hope that our findings will provide helpful guidance as the field seeks to develop more robust models on small EEG datasets. Moreover, we would also like to stress that absolute rules cannot be derived from our findings. It is possible that other architectures or datasets might yield different conclusions on the efficacy of the data augmentation approaches we examined.
Disorders of the Nervous System:
Psychiatric (eg. Depression, Anxiety, Schizophrenia)
Modeling and Analysis Methods:
Classification and Predictive Modeling 2
EEG/MEG Modeling and Analysis 1
Keywords:
Electroencephaolography (EEG)
Machine Learning
Psychiatric Disorders
Other - Data Augmentation
1|2Indicates the priority used for review
Provide references using author date format
Ellis, C.A., Sattiraju, A., Miller, R., Calhoun, V., 2022. Examining Effects of Schizophrenia on EEG with Explainable Deep Learning Models, in: 2022 IEEE 22nd International Conference on Bioinformatics and Bioengineering (BIBE). IEEE, pp. 301–304. https://doi.org/10.1109/BIBE55377.2022.00068
Mohsenvand, M.N., Maes, P., Alsentzer, E.E., Mcdermott, M.B.A., Falck, F., Sarkar, S.K., 2020. Contrastive Representation Learning for Electroencephalogram Classification. Proc. Mach. Learn. Res. 136, 238–253.
Mumtaz, W., Xia, L., Yasin, M.A.M., Ali, S.S.A., Malik, A.S., 2017. A wavelet-based technique to predict treatment outcome for Major Depressive Disorder. PLoS One 12, 1–30. https://doi.org/10.1371/journal.pone.0171409
Oh, S.L., Vicnesh, J., Ciaccio, E.J., Yuvaraj, R., Acharya, U.R., 2019. Deep convolutional neural network model for automated diagnosis of Schizophrenia using EEG signals. Appl. Sci. 9. https://doi.org/10.3390/app9142870
Rommel, C., Moreau, T., Paillard, J., Gramfort, A., Paris-saclay, U., 2022. CADDA : Class-wise Automatic Differentiable Data Augmentation for EEG Signals, in: International Conference on Learning Representations (ICLR).
Saeed, A., Grangier, D., Pietquin, O., Zeghidour, N., 2021. Learning from heterogeneous EEG signals with differentiable channel reordering, in: IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. pp. 1255–1259. https://doi.org/10.1109/ICASSP39728.2021.9413712
Schwabedal, J.T.C., Snyder, J.C., Cakmak, A., Nemati, S., Clifford, G.D., Jan, S.P., 2019. Addressing Class Imbalance in Classification Problems of Noisy Signals by using Fourier Transform Surrogates 1–8.