Preprint
Article

Radiomics Features of FDG PET Images Predict ApoE4 and may Provide Clues for Further Studies on Alzheimer’s Disease

Altmetrics

Downloads

87

Views

67

Comments

0

This version is not peer-reviewed

Submitted:

24 September 2024

Posted:

25 September 2024

You are already at the latest version

Alerts
Abstract
Objective: Fludeoxyglucose F18 positron emission tomography (FDG PET) can give early clues for diagnosing Alzheimer’s disease (AD). Our objective was to use the same scan to predict Apolipoprotein E4 (ApoE4), known to be a risk factor. A second objective was to determine the brain regions and imaging features associated with this gene allele that can potentially help elucidate the mechanisms of the disease and better comprehend the images. Method: We employed data from 112 participants from the Alzheimer’s Disease Neuroimaging Initiative (ADNI). PET and structural MRI data were co-registered with ANTx, and the Freesurfer tool delineated 95 whole brain, 19 hippocampal, and 9 amygdala regions. Using PyRadiomics we extracted 120 radiomic features from these segments. We employed a hybrid feature selection strategy. The HistGradientBoosting (HGB) classifier was evaluated using stratified five-fold cross-validation. Results: The proposed radiomics model predicted the homozygous ApoE4 genotype, with an AUC of 0.945 an accuracy of 0.875, a sensitivity of 0.914, and a specificity of 0.838. Reducing the feature set to seven key features resulted in a slightly decreased performance (AUC: 0.889). The mean values of the features showed statistically significant (p<0.05) incremental deterioration between different groups of homozygous ApoE4 carriers. Conclusion: Our findings underscore the critical role of specific brain regions and associated features in differentiating ApoE4 carriers. The hippocampus, entorhinal cortex, amygdala, thalamus, and pars orbitalis exhibited significant associations with the ApoE4 genotype. By employing SHAP analysis, we identified key features driving model predictions, enhancing interpretability, and providing insights into the underlying pathophysiology.
Keywords: 
Subject: Engineering  -   Bioengineering

1. Introduction

Chronic diseases, characterized by their long-term progressive nature and ongoing physical, psychological, and social impact, impose a significant and ever-growing financial and emotional burden on individuals, families, communities, and healthcare systems worldwide [1]. Among these chronic conditions, dementia—a progressive syndrome characterized by a decline in cognitive function that interferes with daily life—is poised to become a major social and economic challenge in the near future [2]. With new cases emerging at an alarming rate, nearly one every 3 seconds according to Alzheimer’s Disease International (ADI), the World Health Organization estimates a staggering global total of 139 million people living with dementia by 2050. Notably, Alzheimer’s disease (AD), a neurodegenerative disorder characterized by the buildup of amyloid plaques and tau tangles in the brain, is believed to be responsible for a significant portion, with estimates ranging from 60-70% of all dementia cases [3,4]. This translates to millions of individuals worldwide grappling with memory loss, impaired judgment, and a decline in daily living activities, placing a tremendous strain on caregivers and healthcare systems.
However, a glimmer of hope emerges amidst this growing challenge. Artificial intelligence (AI) is making significant strides in the healthcare field, and its potential to predict Alzheimer’s disease earlier holds immense promise for alleviating the burden of this devastating disease [5]. By enabling earlier detection and intervention, AI-powered solutions could significantly improve patient outcomes, potentially slow disease progression, and allow for better management of symptoms. This could enhance the quality of life for patients and their families and reduce the immense financial strain placed on healthcare systems globally.
FDG PET is the most sensitive modality for the early detection and investigation of AD [6]. Therefore, quantitative analysis and understanding of imaging biomarkers can give more comprehensive information on an individual patient’s condition. Radiomics allows us to obtain imaging features that the naked eye cannot detect. Therefore, by analyzing these features (imaging biomarkers) it is possible to gain a deeper insight into the disease [7]. By prioritizing explainability and transparency, radiomics helps to build trust in AI-driven decision-making, ultimately leading to better patient outcomes [8,9]. In our previous work, we showed that these biomarkers help us detect AD accurately, predict amyloid positivity, determine severity and stage, and monitor progress in time [10,11,12]. In this way, a single test can give us a mine of information and understanding of the particular characteristics on an individual level.
In addition to the well-known pathological hallmarks of AD, amyloid-β (Aβ) and neurofibrillary tangles composed of tubulin-associated unit (Tau) protein, genetics plays another critical role in the underlying disease process [13,14,15]. Since its discovery in 1993 [16], the apolipoprotein ε4 allele (ApoE4) has consistently emerged as the single most potent genetic risk factor for AD across the entire human genome [17]. The presence of the ApoE4 allele significantly elevates an individual’s risk of developing AD. Individuals with two copies of the ApoE4 allele (homozygous ApoE4) have approximately a 12-fold increased risk of developing AD compared to those with two copies of the ε3 allele [18]. Those with one copy of the ApoE4 allele (heterozygotes ApoE4) have a lower risk, and individuals without the ApoE4 gene have a much lower risk of developing Alzheimer’s [19].
The objective of this paper is to extend the characterization of AD-suspected patients by studying the association between imaging features and the ApoE4 allele. This can lead us to the following benefits: (i) To predict ApoE4 carrier status from FDG PET scan but more importantly (ii) to determine the significant brain regions and their calculated image features. This can help us not only have a better understanding of the disease but also point to new future research on a biological level for deciphering pathological mechanisms and discovering new targeted drugs.

2. Method

Our study proposes a radiomics-based approach to predict the presence of the homozygous ApoE4 genotype using FDG PET scans. Radiomics, a field extracting quantitative features from medical images, offers a rich data source for exploring biological underpinnings. Our meticulous workflow involves several key stages: data selection with labeling, co-registration for image alignment, segmentation to isolate brain regions, feature extraction to quantify image characteristics, feature selection to identify the most informative features, and finally, model building and evaluation (as detailed in Figure 1). Rigorous metrics will assess the model’s performance and generalizability to unseen data. By following these steps, we aim to develop a robust and generalizable model for homozygous ApoE4 genotype prediction using FDG PET, with detailed explanations provided in subsequent sections.

2.1. ADNI and Participants

Data used in the preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). The ADNI was launched in 2003 as a public-private partnership, led by Principal Investigator Michael W. Weiner, MD. The primary goal of ADNI has been to test whether serial magnetic resonance imaging (MRI), positron emission tomography (PET), other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of mild cognitive impairment (MCI) and early AD.
We studied 697 MCI patients with baseline FDG PET scans, selecting 567 structural MRI (sMRI) scans acquired within 90 days of their corresponding FDG PET scans. To investigate the relationship of brain metabolism and structure with the ApoE4 genotype, we focused on a subset of 387 patients who had both baseline FDG PET and sMRI scans along with ApoE4 genotype information. This group included 56 individuals with the homozygous ApoE4 genotype and 331 without (ApoE non-ε4 carriers). To create a balanced dataset for analysis, we randomly selected 56 patients with ApoE non-ε4 carriers. This resulted in a final group of 112 patients with a balanced number of individuals possessing the homozygous ApoE4 genotype and those lacking the ε4 allele entirely (refer to Figure 2 for details on participant distribution). Additional clinical characteristics of the participants can be found in Table 1.

2.2. ApoE Genotyping

ApoE genotyping was performed to assess AD risk in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) study (adni.loni.usc.edu). While the main genotyping platform for ADNI utilized Illumina BeadChips (Illumina BeadStudio 3.2 software for initial analysis, later updated to Illumina GenomeStudio v2009.1), the specific ApoE alleles (ε2, ε3, and ε4) were not included on the Human610-Quad BeadChip used. Therefore, DNA was extracted from blood samples, and specific SNPs (rs429358 and rs7412) defining the ApoE alleles were genotyped using a separate method.

2.3. FDG PET Acquisition

Our study used FDG PET scans that involved injecting participants with 185 MBq (5 mCi) of FDG and then performing a dynamic 3D scan consisting of six 5-minute frames acquired 30-60 minutes post-injection. ADNI provides four processed formats for FDG PET scans, ensuring compatibility between scanners and simplifying analysis. We used 3rd type preprocessed FDG PET scans (CO-REG, AVG, STANDARDIZED IMAGE, AND VOXEL SIZE) which each subject co-registered averaged image from their baseline FDG PET scan is then reoriented into a standard 160×160×96 voxel image grid, having 1.5 mm cubic voxels. The image grid is aligned with the subject’s anterior-posterior axis parallel to the anterior and posterior commissure (AC-PC) line. Each FDG PET scan, including the baseline and all subsequent scans, is co-registered to this baseline image. An average image is created from the AC-PC co-registered frames and intensity is normalized using a subject-specific mask [20].

2.4. MRI Acquisition

The ADNI project has collected a wide range of sMRI data across its phases (1/GO/2/3). While sMRI acquisition protocols have evolved, the initiative provides standardized datasets to address variations in sMRI scanners between sites and ensure consistency for research. ADNI 3 exclusively uses 3T scanners and offers two tiers (“Basic” and “Advanced”) to manage this variability. Our study employed standardized sMRI data, specifically T1-weighted 3D-MRI acquired with a sagittal MP-RAGE plane and a 1.2mm slice thickness. These datasets were developed to promote data analysis consistency and facilitate researcher use [21].

2.5. Co-Registration

Our study addressed the challenge of aligning information from FDG PET and sMRI scans using the Advanced Normalization Tools (ANTs), a free and open-source software toolkit. ANTs’ tool is highly effective for image registration, correcting spatial discrepancies between scans. We specifically used ANTs’ rigid registration to address basic misalignments such as translation and rotation, ensuring precise spatial matching between the FDG PET and sMRI scans[22].

2.6. Segmentation

Our study leveraged FreeSurfer’s (FS) extensive segmentation capabilities to analyze various brain regions. FS version 7.1 provided automated segmentation of the whole brain into 95 regions using the DKT-Atlas [23]. Additionally, it performed segmentation for the hippocampus subfields and amygdala nuclei. The hippocampal segmentation employed a probabilistic atlas built from high-resolution ex vivo scans and standard-resolution in vivo images. This method classified voxels within the hippocampus into 19 subfields, including CA1, dentate gyrus, and the fimbria. Similarly, the amygdala segmentation utilized an atlas derived from high-resolution ex vivo data to parcellate the amygdala into 9 subnuclei [24,25].

2.7. Feature Extraction

To address the challenge of obtaining standard features in radiomics, we utilized PyRadiomics an open-source platform that aligns with the International Biomarker Standardization Initiative (IBSI) [26]. This platform simplifies the extraction of a wide variety of features from medical images. PyRadiomics provides dedicated tools for analyzing first-order statistics, shapes, and textures using methods like gray-level co-occurrence matrices, gray-level run-length matrices, and gray-level size zone matrices. Importantly, it allows feature extraction from both individual slices (2D) and entire volumes (3D) across various imaging modalities. This standardized workflow within PyRadiomics promotes consistency and reproducibility, enabling researchers to effectively compare findings across different studies [27].

2.8. Feature Selection

Feature redundancy is a significant issue in radiomics that leads to overfitting in classification models. To address this, we implemented a comprehensive feature selection approach involving both preprocessing and advanced selection methods. Initially, we reduced the dimensionality by eliminating constant, quasi-constant, and duplicated features. Subsequently, we applied the LASSO (Least Absolute Shrinkage and Selection Operator) [28] method, an embedded feature selection technique, which utilizes the cost function:
Cos t   Function = 1 N ( y i f x i ) 2 + λ β j
where is N the number of data points, y i represents the true values, f x i denotes the predicted values based on features, β j are the feature coefficients, and λ is the regularization parameter. LASSO effectively shrinks the coefficients of irrelevant features towards zero β j 0 , thus removing them. By exploring different λ values, we controlled the level of shrinkage.
From the top features identified by LASSO, we further refined our selection using the correlation method, a filter technique. This method evaluates the Pearson correlation coefficient:
Correlation X , Y = X i X ¯ Y i Y ¯ ( X i X ¯ ) 2 ( Y i Y ¯ ) 2
where X and Y are the features, X i and Y i are individual feature values, and X ¯ and Y ¯ are the feature means. This step helps eliminate redundant information by selecting a representative feature from highly correlated pairs. Our combined approach as the hybrid method leverages LASSO’s capacity to handle a large number of features and the correlation method’s focus on linear dependencies, resulting in a robust and informative feature set for radiomics classification tasks.

2.9. Classification

In this study, we used a HistGradientBoosting (HGB) Classifier to classify individuals based on their ApoE4 status. To ensure performance and generalizability, we implemented several key improvements. We used stratified K-Fold cross-validation to prevent bias, ensuring each fold had a similar distribution of homozygous ApoE4 carriers and non-ε4 carriers. Grid search was employed for hyperparameter tuning, systematically evaluating parameter combinations to find the optimal configuration. Additionally, we applied StandardScaler to standardize feature values, enhancing training stability and potentially improving performance.

3. Results

This study employed a radiomics-based approach to develop a predictive model for identifying the ApoE4 genotype. Quantitative image features were extracted from segmented regions of interest and subjected to LASSO regression for feature selection. A rank-based strategy, as illustrated in Figure 3, was implemented to incrementally incorporate features into the model based on the absolute magnitude of their LASSO coefficients. The performance of each feature subset was assessed using stratified five-fold cross-validation.

3.1. Feature Extraction

To address the high-dimensional nature of the dataset, which included 14,760 features (95 regions of interest * 120 radiomic features for the whole brain, 19 hippocampal subfields * 120 features, and 9 amygdala subnuclei * 120 features), we implemented a multi-step feature selection process to reduce the risk of overfitting during the classification stage. The initial step involved data pre-processing to eliminate irrelevant features like constant, quasi-constant, and duplicates. This initial cleaning reduced the feature set to a more manageable size of 6,278. Following pre-processing, we employed the LASSO method for further feature selection. LASSO works by shrinking coefficient values toward zero, effectively removing features with minimal predictive power. This technique significantly reduced the feature set to a more focused set of 152 features likely to be the most informative for our classification task. The LASSO implementation involved data scaling, a technique to ensure all features are on a similar scale, and setting a high maximum number of iterations (10,000) to guarantee the model converges on a solution. Finally, we optimized the model parameters using grid search with a range of alpha values (a key parameter in LASSO) logarithmically spaced between 10⁻⁴ and 10² and 10-fold cross-validation with ROC AUC scoring to evaluate model performance.

3.2. Classification

Building upon the feature selection process, we employed a HGB classifier for classification. HGB classifier is a powerful ensemble learning method that iteratively builds decision trees, focusing on improving classification for previously misclassified instances. This approach allows the model to learn from its mistakes and refine its predictions over time. The HGB classifier utilized the selected set of 152 features by LASSO for classification. Notably, we employed a rank-based scoring method that emphasized features with higher absolute LASSO coefficients. These features were likely identified as having a stronger association with the target variable (ApoE4 genotype) during the feature selection step. This approach of feature selection and HGB classification achieved excellent results in differentiating MCI patients with homozygous ApoE4 carriers and non-ε4 carriers. The average Area Under the ROC Curve (AUC) was 0.945, with a 95% confidence interval for ROC AUC ranging from 0.868 to 0.926, indicating a very strong ability for the model to distinguish between the two patient groups (Figure 3).
Additionally, the model achieved an average accuracy of 0.875, signifying a high proportion of patients were correctly classified. Furthermore, the model demonstrated exceptional sensitivity (0.914) for detecting individuals with homozygous ApoE4, highlighting its effectiveness in identifying this specific patient group. The model also maintained good specificity (0.838) in correctly identifying MCI patents without the ApoE4 genotype. It’s important to note that achieving this level of accuracy with only 18 features, selected for their highest coefficients as shown in Figure 4, highlights the model’s efficiency and its potential for clinical applications. This suggests that a smaller set of highly informative features can be sufficient for accurate classification, leading to a more streamlined and potentially more interpretable model.

3.3. Additional Feature Refinement

Following the initial LASSO feature selection that yielded 152 features, we conducted a more granular analysis to identify the most relevant features specifically related to ApoE4. This additional step focused on the top 18 features listed in Table 2 identified by LASSO, which were presumed to have the strongest predictive power. We computed Pearson correlation coefficients to assess the linear relationships between these top features. A high correlation value (positive or negative) indicated a significant overlap in information between features. Utilizing a correlation matrix (shown in Figure 5), we removed features with high correlations to avoid redundancy. We focused on the absolute value of the correlation coefficient to identify the strength of the relationship, regardless of direction. Self-correlations were filtered out, and only the pairs with the weakest correlations were retained.
This process led to the selection of seven features from the top 18 LASSO features, categorized into three distinct groups: glszm (SmallAreaEmphasis, ZoneVariance) related to Gray Level Size Zone Matrix, shape (SurfaceVolumeRatio, Sphericity, Maximum2DDiameterColumn, SurfaceArea) related to shape characteristics, and glrlm (RunEntropy) related to Gray Level Run Length Matrix.
These seven key features were identified in specific brain regions of the left hemisphere, including the hippocampus, entorhinal cortex, amygdala, thalamus, and pars orbitalis. This approach ensured that the final set of features was informative and non-redundant, providing a solid basis for further investigating the link between ApoE4 and MCI.

3.4. Classification with Reduced Feature Set

Leveraging these seven selected features, we built a classification model using the HGB model. To ensure model reliability and generalizability, we employed Stratified five-fold cross-validation. Hyperparameter tuning with grid search was performed to identify the optimal configuration for the HGB model. This involved searching through a range of specified parameters, including learning rates of 0.01, 0.1, and 0.2, maximum tree depths of 3, 5, and 7, training iterations of 100, 200, and 300, and minimum samples per leaf node of 20, 50, and 100. The tuning aimed to maximize the model’s performance based on ROC AUC score, a key metric for evaluating binary classifiers in medical applications.
Using grid search to determine the optimal hyperparameters, the HistGradientBoosting (HGB) classifier was evaluated with five-fold stratified cross-validation. The model achieved an average performance with ROC AUC of 0.889, accuracy of 0.796, specificity of 0.768, and sensitivity of 0.821, with a 95% confidence interval between 0.868 and 0.926 as shown in Figure 6a.
The proposed model, trained on a reduced feature set of seven variables (Table 2, *), demonstrated robust and balanced classification performance for the ApoE4 genotype (p < 0.05). Notably, this model effectively discriminated between individuals with and without the homozygous ApoE4 genotype, despite the small number of input features.

4. Discussion

4.1. Findings

Our investigation presents an approach to exploring FDG PET’s potential to identify individuals with the homozygous ApoE4 genotype, a known genetic risk factor for Alzheimer’s. It also attempts to find a few brain regions and features that are associated with this genotype to open new promising ways for a better understanding of the disease mechanisms. The results underscore the importance of integrating advanced genetic analysis and neuroimaging techniques to further elucidate the complex interactions between genetics and brain structure and activity.
We analyzed FDG PET images and genetic data from 112 individuals diagnosed with various subtypes of Mild Cognitive Impairment (MCI). To account for class imbalance, we included equal numbers of participants with homozygous ApoE4 carriers and non-ε4 carriers. We employed two separate segmentation techniques to isolate the whole brain and hippo-amygdala, resulting in 123 distinct regions for analysis. We hypothesized that a radiomics approach using FDG PET imaging could be used to predict the homozygous ApoE4 genotype. To achieve this, we implemented a hybrid feature selection process. This involved dimensionality reduction using the LASSO method on preprocessed features, followed by correlation analysis to pinpoint the most informative ones. To enhance model performance and generalizability, we employed a nested framework for training and hyperparameter tuning. This approach utilized stratified five-fold cross-validation, where grid search was applied within each fold to optimize hyperparameters through cross-validation. Importantly, this approach ensured robustness against overfitting by consistently employing stratified K-fold cross-validation throughout the process [29]. Furthermore, maintaining a consistent random state across all steps fostered the reproducibility of our results [30].
The study achieved promising performance metrics—an ROC AUC of 0.89, accuracy of 0.80, specificity of 0.77, and sensitivity of 0.82—using a minimal set of seven carefully chosen features. This simplification resulted from a meticulous feature selection process, leveraging a hybrid approach that combined various techniques (preprocessing, LASSO, correlation) to identify the most informative features for predicting the homozygous ApoE4 genotype. This method pinpointed seven key features within the only five critical brain sub-regions, leading to improved interpretability and reliability. The HGB Classifier, optimized via grid search, demonstrated high performance in a 70/30 training-test split, with a 95% confidence interval for ROC AUC ranging from 0.868 to 0.926, indicating acceptable confidence and precision in the model’s performance.

4.2. Explanation of the Relationship between Regions and APOE Genotype

Our study pinpointed five brain regions—the hippocampus, entorhinal cortex, amygdala, thalamus, and pars orbitalis with four of these demonstrating a strong association with the ApoE4genotype according to recent studies. Notably, the hippocampus shows accelerated shrinkage and dysfunction in APOE4 carriers [31], which may contribute to memory decline in AD [32]. The entorhinal cortex also exhibits earlier tau protein accumulation and neurodegeneration in these individuals, potentially linked to early memory issues [33,34]. Additionally, research suggests that the amygdala in ApoE4 carriers may experience reduced volume and altered activity [35], potentially impacting emotional regulation and memory changes observed in some AD patients. Finally, the thalamus, a crucial relay center, displays structural and functional alterations in ApoE4 carriers, indicating a potential connection between this genetic risk factor and thalamic dysfunction [36]. Crucially, the features extracted from these five brain regions (highlighted with an asterisk in Table 2) exhibited a statistically significant association with the APOE4 genotype (p<0.05), providing further evidence for the involvement of these areas in the pathophysiology of AD among ApoE4 carriers.

4.3. Features Explanation

Our findings identified seven distinct features from different categories, with four being shape-based. These shape-based features in the thalamus, hippocampus, and amygdala are consistent with recent studies showing shape differences, atrophy, and shrinkage in these regions in individuals with homozygous APOE4. Therefore, these shape-based features can be effectively used to analyze and interpret the shape differences or shrinkage of these regions in relation to ApoE4, enhancing the interpretability of the final model results. Additionally, GLSZM features provide information about the distribution of gray levels within specific image regions, which can be useful in identifying abnormalities in texture or homogeneity. Furthermore, GLRLM features capture information about the spatial arrangement of pixels with similar gray levels, helping to detect patterns or irregularities in tissue texture.

4.4. Interpretability

To enhance model interpretability, we used SHAP (SHapley Additive exPlanations) values to understand feature importance and contributions to predictions. The SHAP analysis identified glszm_ZoneVariance of the entorhinal cortex as the most influential feature in differentiating ApoE4 carriers, followed by shape_SurfaceArea of parsorbitalis. While glrlm_RunEntropy of the central nucleus and shape_Maximum2DDiameterColumn of the accessory-basal nucleus also contributed to the model, their impact was less pronounced. The SHAP summary plot (Figure 6. b) revealed a consistent pattern: higher values of glszm_ZoneVariance of the entorhinal cortex were associated with increased model output, whereas higher values of shape_SurfaceArea of parsorbitalis were associated with decreased model output. This comprehensive approach, encompassing data preprocessing, feature selection, model development, evaluation, and interpretability, enhances the model’s reliability and trustworthiness.

4.5. Clinical Significance

Early Detection: FDG PET can detect changes in brain metabolism associated with AD before cognitive symptoms become apparent. This early detection can allow for earlier intervention and potentially delay disease progression. Detecting imaging features that are associated with the different hallmarks and risk factors of AD may enhance the diagnostic, characterization, monitoring, and prognostic capability of FDG PET. This comprehensive information may lead to the development of more targeted treatment approaches.

4.6. Limitations

Our study encountered several limitations, primarily related to data availability. A larger dataset would strengthen the model’s reliability and adaptability. Moreover, the sensitivity of radiomic features to image acquisition and processing techniques can affect the consistency of our findings.

4.7. Future study

To address these issues, standardized imaging procedures and advanced image preprocessing techniques are needed. Using data from multi-centers should also be performed.
Our study unveiled a novel pars orbitalis region associated with the ApoE4 genotype. This newly discovered area, akin to other ApoE-linked brain regions, presents significant opportunities for clinical and scientific investigation into the ApoE4 variant. By identifying imaging features associated with ApoE4, clinicians might be able to:
i) Improve Diagnostic Accuracy: FDG PET could become a more accurate tool for diagnosing AD, especially in the early stages. Ii) Predict Disease Progression: Imaging features might help predict the rate of cognitive decline in ApoE4 carriers. These benefits should be investigated further.

5. Conclusions

In conclusion, the APoE-4 gene allele can be accurately predicted from FDG PET scans using a radiomics approach. The involved brain regions and associated image features can shed further light on the mechanism of AD in relation to genetic factors. This is likely to enhance the interpretation of FDG PET scans for AD-suspected patients. Further research is needed to expand this understanding in terms of these regions and features.

Acknowledgments

We gratefully acknowledge the financial support provided by the Bogazici University Research Fund (BAP) for this research under project code 19774. Their support was instrumental in carrying out the experiments and analyzing the data. Data collection and sharing for this project was funded by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd. and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California.

References

  1. Grover, A.; Joshi, A. An Overview of Chronic Disease Models: A Systematic Literature Review. Glob J Health Sci. 2014, 7. [Google Scholar] [CrossRef] [PubMed]
  2. Chapman, D.P.; Williams, S.M.; Strine, T.W.; Anda, R.F.; Moore, M.J. Dementia and its implications for public health. Prev Chronic Dis. 2006, 3, A34. [Google Scholar]
  3. World Health Organization. Dementia: Key facts’. [Online]. Available online: https://www.who.int/news-room/fact-sheets/detail/dementia (accessed on 23 July 2024).
  4. ‘Dementia, vs. Alzheimer’s Disease: What Is the Difference? [Online]. Available online: https://www.alz.org/ (accessed on 23 July 2024).
  5. Battista, P.; Salvatore, C.; Berlingeri, M.; Cerasa, A.; Castiglioni, I. Artificial intelligence and neuropsychological measures: The case of Alzheimer’s disease. Neurosci Biobehav Rev. 2020, 114, 211–228. [Google Scholar] [CrossRef]
  6. Collin-Castonguay, P.; Gourdeau, D.; Potvin, O.; Duchesne, S. Synthetic FDG-PET hypometabolism sensitivity validation in AD. Alzheimer’s & Dementia 2023, 19. [Google Scholar] [CrossRef]
  7. Pereira, H.R.; Ferreira, H.A. Exploring the Potential of Radiomics Features of the Hippocampus in Alzheimer’s Disease Considering Standard versus Parallel Imaging. Alzheimer’s & Dementia 2023 2023, 19. [Google Scholar] [CrossRef]
  8. Varriano, G.; Guerriero, P.; Santone, A.; Mercaldo, F.; Brunese, L. Explainability of radiomics through formal methods. Comput Methods Programs Biomed. 2022, 220, 106824. [Google Scholar] [CrossRef] [PubMed]
  9. Rogers, W.; et al. Radiomics: from qualitative to quantitative imaging. Br J Radiol 2020, 93. [Google Scholar] [CrossRef] [PubMed]
  10. Rasi, R.; Guvenis, A. Predicting amyloid positivity from FDG-PET images using radiomics: A parsimonious model. Comput Methods Programs Biomed 2024, 247, 108098. [Google Scholar] [CrossRef]
  11. Yuksel, C.; Rasi, R.; Guvenis, A. A New Method for Diagnosing Alzheimer’s Disease and Monitoring Its Severity Using FDG-PET. In 2022 Medical Technologies Congress (TIPTEKNO); IEEE, 2022; pp. 1–4. [Google Scholar] [CrossRef]
  12. Rasi, R.; Güvenis, A. A Platform for the Radiomic Analysis of Brain FDG PET Images: Detecting Alzheimer’s Disease. In Proceedings of the Bioinformatics and Biomedical Engineering—10th International Work-Conference, IWBBIO 2023, Meloneras, Proceedings, Part I. Gran Canaria, Spain, 12–14 July 2023; Rojas, I., Valenzuela, O., Ruiz, F.R., Herrera, L.J., Ortuño, F.M., Eds.; Lecture Notes in Computer Science 1319. Springer; pp. 244–255. [Google Scholar] [CrossRef]
  13. Yi, L.X.; Zeng, L.; Wang, Q.; Tan, E.K.; Zhou, Z.D. Reelin links Apolipoprotein E4, Tau, and Amyloid-β in Alzheimer’s disease. Ageing Res Rev. 2024, 98, 102339. [Google Scholar] [CrossRef]
  14. Hampel, H.; et al. The Amyloid-β Pathway in Alzheimer’s Disease. Mol Psychiatry 2021, 26, 5481–5503. [Google Scholar] [CrossRef]
  15. Koutsodendris, N.; Nelson, M.R.; Rao, A.; Huang, Y. Apolipoprotein E and Alzheimer’s Disease: Findings, Hypotheses, and Potential Mechanisms. Annual Review of Pathology: Mechanisms of Disease 2022, 17, 73–99. [Google Scholar] [CrossRef] [PubMed]
  16. Corder, E.H.; et al. Gene Dose of Apolipoprotein E Type 4 Allele and the Risk of Alzheimer’s Disease in Late Onset Families. Science (1979) 1993, 261, 921–923. [Google Scholar] [CrossRef] [PubMed]
  17. Jackson, R.J.; Hyman, B.T.; Serrano-Pozo, A. Multifaceted roles of APOE in Alzheimer disease. Nat Rev Neurol 2024. [CrossRef] [PubMed]
  18. Chen, Y.; Strickland, M.R.; Soranno, A.; Holtzman, D.M. Apolipoprotein E: Structural Insights and Links to Alzheimer Disease Pathogenesis. Neuron 2021, 109, 205–221. [Google Scholar] [CrossRef] [PubMed]
  19. Genin, E.; et al. APOE and Alzheimer disease: a major gene with semi-dominant inheritance. Mol Psychiatry 2011, 16, 903–907. [Google Scholar] [CrossRef]
  20. Jagust, W.J.; et al. The Alzheimer’s Disease Neuroimaging Initiative 2 PET Core: 2015. Alzheimer’s & Dementia 2015, 11, 757–771. [Google Scholar] [CrossRef]
  21. Wyman, B.T.; et al. Standardization of analysis sets for reporting results from ADNI MRI data. Alzheimer’s & Dementia 2013, 9, 332–337. [Google Scholar] [CrossRef]
  22. Avants, B.; Epstein, C.; Grossman, M.; Gee, J. Symmetric diffeomorphic image registration with cross-correlation: Evaluating automated labeling of elderly and neurodegenerative brain. Med Image Anal 2008, 12, 26–41. [Google Scholar] [CrossRef]
  23. Fischl, B. FreeSurfer. Neuroimage 2012, 62, 774–781. [Google Scholar] [CrossRef]
  24. Kahhale, I.; Buser, N.J.; Madan, C.R.; Hanson, J.L. Quantifying numerical and spatial reliability of hippocampal and amygdala subdivisions in FreeSurfer. Brain Inform. 2023, 10, 9. [Google Scholar] [CrossRef]
  25. Sämann, P.G.; et al. <scp>FreeSurfer</scp> -based segmentation of hippocampal subfields: A review of methods and applications, with a novel quality control procedure for <scp>ENIGMA</scp> studies and other collaborative efforts. Hum Brain Mapp 2022, 43, 207–233. [Google Scholar] [CrossRef] [PubMed]
  26. Hatt, M.; Vallieres, M.; Visvikis, D.; Zwanenburg, A. IBSI: an international community radiomics standardization initiative. Soc Nuclear Med. 2018.
  27. van Griethuysen, J.J.M.; et al. Computational Radiomics System to Decode the Radiographic Phenotype. Cancer Res. 2017, 77, e104–e107. [Google Scholar] [CrossRef]
  28. Tibshirani, R. Regression Shrinkage and Selection Via the Lasso. J R Stat Soc Series B Stat Methodol 1996, 58, 267–288. [Google Scholar] [CrossRef]
  29. Wieczorek, J.; Guerin, C.; McMahon, T. K -fold cross-validation for complex sample surveys. Stat 2022, 11. [Google Scholar] [CrossRef]
  30. Impagliazzo, R.; Lei, R.; Pitassi, T.; Sorrell, J. Reproducibility in learning. In Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing; ACM: New York, NY, USA, 2022; pp. 818–831. [Google Scholar] [CrossRef]
  31. O’Dwyer, L.; et al. Reduced Hippocampal Volume in Healthy Young ApoE4 Carriers: An MRI Study. PLoS ONE 2012, 7, e48895. [Google Scholar] [CrossRef]
  32. Pievani, M.; Galluzzi, S.; Thompson, P.M.; Rasser, P.E.; Bonetti, M.; Frisoni, G.B. APOE4 is associated with greater atrophy of the hippocampal formation in Alzheimer’s disease. Neuroimage 2011, 55, 909–919. [Google Scholar] [CrossRef]
  33. Miranda, A.M.; et al. Effects of APOE4 allelic dosage on lipidomic signatures in the entorhinal cortex of aged mice. Transl Psychiatry 2022, 12, 129. [Google Scholar] [CrossRef]
  34. La Joie, R.; et al. Association of APOE4 and Clinical Variability in Alzheimer Disease With the Pattern of Tau- and Amyloid-PET. Neurology 2021, 96. [Google Scholar] [CrossRef]
  35. Young, C.B.; et al. APOE effects on regional tau in preclinical Alzheimer’s disease. Mol Neurodegener 2023, 18, 1. [Google Scholar] [CrossRef]
  36. Novellino, F.; López, M.E.; Vaccaro, M.G.; Miguel, Y.; Delgado, M.L.; Maestu, F. Association Between Hippocampus, Thalamus, and Caudate in Mild Cognitive Impairment APOEε4 Carriers: A Structural Covariance MRI Study. Front Neurol 2019, 10. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Workflow of the proposed model. Data labeling, Image pre-processing through co-registration, Segmentation of key brain regions (Whole brain and Hippo-Amygdala), feature extraction and selection, and, model building with hyperparameter tuning and evaluation.
Figure 1. Workflow of the proposed model. Data labeling, Image pre-processing through co-registration, Segmentation of key brain regions (Whole brain and Hippo-Amygdala), feature extraction and selection, and, model building with hyperparameter tuning and evaluation.
Preprints 119197 g001
Figure 2. Process and Criteria for Participant Selection and Inclusion in the Study.
Figure 2. Process and Criteria for Participant Selection and Inclusion in the Study.
Preprints 119197 g002
Figure 3. Rank-based performance of a classification model using features selected by LASSO. The plot illustrates the evolution of four key metrics (ROC AUC, Accuracy, Specificity, and Sensitivity) as the number of features increases, determined by the LASSO feature selection method. The optimal number of features is marked, corresponding to the point with the highest ROC AUC value.
Figure 3. Rank-based performance of a classification model using features selected by LASSO. The plot illustrates the evolution of four key metrics (ROC AUC, Accuracy, Specificity, and Sensitivity) as the number of features increases, determined by the LASSO feature selection method. The optimal number of features is marked, corresponding to the point with the highest ROC AUC value.
Preprints 119197 g003
Figure 4. Coefficient values of the top 18 LASSO-selected features. Features selected for further analysis based on correlation analysis are highlighted in green.
Figure 4. Coefficient values of the top 18 LASSO-selected features. Features selected for further analysis based on correlation analysis are highlighted in green.
Preprints 119197 g004
Figure 5. Feature correlation matrix for the top 18 LASSO coefficients, with labels including feature name, Freesurfer identifier (FS ID), and brain region (hippocampus/amygdala (HA) subregion or whole brain (WB)).
Figure 5. Feature correlation matrix for the top 18 LASSO coefficients, with labels including feature name, Freesurfer identifier (FS ID), and brain region (hippocampus/amygdala (HA) subregion or whole brain (WB)).
Preprints 119197 g005
Figure 6. Model Performance and Feature Importance (a.) ROC curve with a 95% confidence interval demonstrating the model’s ability to discriminate between classes. (b.) SHAP summary plot visualizing the impact of individual features on model predictions.
Figure 6. Model Performance and Feature Importance (a.) ROC curve with a 95% confidence interval demonstrating the model’s ability to discriminate between classes. (b.) SHAP summary plot visualizing the impact of individual features on model predictions.
Preprints 119197 g006
Table 1. Participant Clinical Characteristics.
Table 1. Participant Clinical Characteristics.
Group No. Case Age CDRSB MMSE ADAS13 RAVLT Forgetting
(Male/ Female) (mean ± SD) (mean ± SD) (mean ± SD) (mean ± SD) (mean ± SD)
non-ε4 carrier 33 / 23 71.34 ± 7.6 1.61 ± 1.14 27.64 ± 1.87 15.49 ± 7.03 4.29 ± 4.33
Two ε4 carrier 31 / 25 69.35 ± 6.66 1.5 ± 0.87 27.46 ± 2.08 16.65 ± 6.17 5.39 ± 2.5
Total 64 / 48 70.35 ± 7.2 1.55 ± 1.01 27.55 ± 1.97 16.07 ± 6.61 4.84 ± 3.57
Table 2. Top 18 features selected by LASSO. Features marked with asterisks (*) were selected for further analysis after a refinement step based on correlation analysis.
Table 2. Top 18 features selected by LASSO. Features marked with asterisks (*) were selected for further analysis after a refinement step based on correlation analysis.
FS ID Region (left/right hand) Feature homozygous ApoE4 carriers non-ε4 carriers P_Value
mean SD min max mean SD min max
238 CA1-body (Hippocampus) (lh) glszm_SmallAreaEmphasis* 0.3000 1.2963 -0.3682 3.6735 -0.3000 0.4157 -0.3681 2.6631 0.0016
2017 paracentral (rh) glrlm_RunVariance -0.1289 0.9366 -2.3246 2.3972 0.1289 1.0608 -2.0647 3.1278 0.1756
1006 entorhinal (lh) glszm_ZoneVariance* 0.2701 1.1090 -0.6391 2.7400 -0.2701 0.8111 -0.6391 3.4895 0.0040
1011 lateraloccipital (lh) shape_Flatness 0.1819 1.0437 -1.8295 3.0743 -0.1819 0.9379 -2.7474 2.1383 0.0549
10 Thalamus (lh) shape_SurfaceVolumeRatio* 0.2874 1.0451 -1.8291 2.9666 -0.2874 0.8809 -2.8339 2.2949 0.0021
245 molecular_layer_HP-head (Hippocampus) (lh) shape_Flatness -0.2439 0.7830 -1.8315 2.1238 0.2439 1.1411 -1.7182 5.4304 0.0097
53 Hippocampus (rh) firstorder_90Percentile 0.2082 0.9508 -2.2811 2.8583 -0.2082 1.0218 -2.1932 2.3466 0.0276
244 GC-ML-DG-body (Hippocampus) (lh) glrlm_RunEntropy 0.1656 1.0613 -2.0553 2.6429 -0.1656 0.9242 -2.1335 1.5815 0.0811
235 subiculum-head (Hippocampus) (lh) shape_Flatness -0.1804 0.9948 -2.0349 2.6071 0.1804 0.9902 -1.8604 2.5275 0.0571
226 Hippocampal_tail (Hippocampus) (lh) glszm_SizeZoneNonUniformity 0.1602 1.2777 -0.3627 6.7370 -0.1602 0.5930 -0.3627 1.6094 0.0926
5 Inferior Lateral Ventricle (lh) gldm_DependenceVariance 0.2497 0.8670 -3.1855 1.8362 -0.2497 1.0760 -4.4154 1.2877 0.0080
10 Thalamus (lh) firstorder_Energy -0.1668 0.9803 -1.8968 2.1634 0.1668 1.0093 -3.4538 2.1225 0.0789
7005 Central-nucleus (Amygdala) (lh) glrlm_RunEntropy* 0.3171 0.9355 -1.8487 2.4870 -0.3171 0.9780 -2.3047 1.4931 0.0007
226 Hippocampal_tail (Hippocampus) (lh) shape_Sphericity* 0.1124 1.0234 -2.0888 1.7519 -0.1124 0.9815 -2.4489 1.6912 0.2379
7008 Accessory-Basal-nucleus
(Amygdala) (lh)
shape_Maximum2DDiameter
Column*
0.2723 1.0032 -1.8863 2.2889 -0.2723 0.9375 -1.8863 2.2889 0.0037
2029 superiorparietal (rh) firstorder_Minimum 0.1898 0.9722 -1.7638 2.2018 -0.1898 1.0089 -1.9986 2.3255 0.0451
1019 parsorbitalis (lh) shape_SurfaceArea* -0.2675 0.9132 -2.3284 1.4640 0.2675 1.0280 -2.1684 2.6100 0.0044
46 Cerebellar White Matter (rh) glszm_SizeZoneNonUniformityNormalized -0.2310 0.8696 -1.4481 1.6651 0.2310 1.0824 -1.4481 1.6651 0.0144
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2024 MDPI (Basel, Switzerland) unless otherwise stated