1. Introduction
According to the Diagnostic and Statistical Manual of Mental Disorders 5 (DSM-5), autism spectrum disorder (ASD) is defined as a childhood-onset neurodevelopmental disorder with two essential features: persistent deficits in social communication and interactions, and restricted, repetitive patterns of behavior, interests, or activities [
1]. Although the prevalence of ASD in Taiwan was estimated to be 1% in 2017 [
2], the figure has markedly increased in the past decade. Recent data showed that around 1 in 36 persons (2.8%) aged 8 years have been diagnosed with ASD come from 11 communities in the Autism and Developmental Disabilities Monitoring Network in the United States published in 2023 CDC’s Morbidity and Mortality Weekly Report [
3], and that males are more frequently affected than females, with a male-to-female ratio of approximately 4.3:1 [
4,
5]. The heritability of ASD is estimated to be >90%, suggesting a strong genetic component [
6]. Recently, large-scale gene discovery efforts have shown that ASD is not a simple Mendelian disorder [
7,
8,
9]. Conversely, the genetic underpinnings of ASD are heterogeneous and complex, involving multiple genes, gene–gene and gene–environment interactions [
10]. The genetic risk factors for ASD identified so far range from common variants conferring a small clinical effect to rare mutations with a high clinical outcome [
11]. Recently, the application of large-scale exome sequencing has led to the identification of hundreds of novel ASD-associated genes enriched in common genetic signaling pathways such as synaptic development, plasticity, and signaling [
12]. The high heterogeneity of ASD may account for the varied clinical presentations of patients with ASD. Despite these advances, there are likely more ASD-associated genes to be discovered.
Microarray-based gene expression profiling allows simultaneous measurement of hundreds to thousands of gene transcripts, useful for large-scale gene discovery [
13]. Comparative gene expression profiling analysis of lymphoblastoid cell lines (LCL) identified shared pathways among different forms of autism [
14]. Similarly, altered pathways in neural development and steroid biogenesis were detected in a total gene expression profiling analysis of LCL in autistic patients and their unaffected siblings [
15]. Previously, our group conducted comparative gene expression profiling of LCL in ASD patients and control subjects to identify differentially expressed genes associated with ASD [
16]. 187 differentially expressed probe sets, including 131 transcripts (50 up-regulated and 81 down-regulated), were detected between cases and controls. We verified one of the selected differentially expressed genes, FOXP1, using real-time quantitative polymerase chain reaction (RT-qPCR) in a sample of ASD patients and control subjects. The average expression level of FOXP1 in ASD was significantly higher than that of the controls [
16].
Neuregulin 2 (NRG2) emerged as a notable differentially expressed gene during the comparative total gene expression profiling analysis in our previous study [
16]. NRG2 belongs to the NRG gene family, including NRG1–NRG4. NRGs serve as cell–cell ligands for receptor tyrosine kinases belonging to the ErbB family, which comprises four homologous type I receptor tyrosine kinases known as EGFR (epidermal growth factor receptor), ErbB1, ErbB2, ErbB3, and ErbB4 [
17,
18]. The neuregulin–ErbB signaling network is involved in several processes in both the developing and adult brain [
19]. NRGs specifically play a role in synaptic plasticity, promoting neuronal migration and differentiation, regulating the expression of neurotransmitter receptors, and influencing glial proliferation, survival, and differentiation [
19,
20]. Through previous genetic and functional analytic studies, several variants of NRG1 and its neuronal receptor ErbB4 were found to be associated with schizophrenia and its endophenotypes [
21,
22,
23]. Moreover, studies in human induced pluripotent stem cells from affected subjects demonstrated that the NRG–ErbB pathway is related to psychiatric disorders [
24,
25,
26].
NRG2, encoding a novel member of the NRG family, induces the growth and differentiation of epithelial, neuronal, glial, and other cell types through interaction with ErbB receptors while sharing a similar genomic structure with NRG1 [
27]. Like NRG1, NRG2 can bind ErbB4 and ErbB3 but has no active kinase domain [
21]. NRG2 is expressed in the embryonic heart, lung, bladder and the developing nervous system [
28,
29] but confined to the cerebellum (granule cells and Purkinje cells), dentate gyrus (granule cells), and olfactory bulb (granule cells) in the adult brain [
28,
29,
30]. Recently, Vullhorst et al. reported that NRG2, but not NRG1, is a major functional ErbB4 ligand in the postnatal brain controlling N-methyl-D-aspartate (NMDA) receptor function in cortical interneurons associated with cognitive deficits in psychiatric disorders [
31]. Recently, Yan et al. found higher dopamine levels in the dorsal striatum but lower levels in the medial prefrontal cortex (mPFC) in NRG2 knockout mice (KO). The pattern of dopamine expression was similar to schizophrenia, and animals showed behavioral abnormalities relevant to psychiatric disorders, such as impaired social behavior and cognitive function [
32]. Further, a genome-wide association analysis of disturbances in the electroencephalography early gamma-frequency band between schizophrenia and control subjects found significant differences in several markers of the NRG2 and KALRN genes involved in neuronal development and the NRG–ErbB signaling pathway [
33]. Furthermore, several studies have shown that NRG1, NRG2, or genes of the NRG–ErbB network contribute to the etiology of psychiatric disorders [
34,
35,
36]. Nevertheless, the function of NRG2 is not well known, and its association with complex neurodevelopmental disorders like ASD remains unclear.
NRG2 (Gene ID 9542) is located at chromosome 5q31.2, an ASD candidate gene region, as shown by genetic linkage studies [
37,
38]. In addition, several case reports indicated that cases with ASD have de novo translocations of chromosome 5q and chromosomes 1q, 4q, and 18q [
39,
40,
41]. Furthermore, Phillippi et al. reported a strong and consistent association between two Single-Nucleotide Polymorphisms (SNPs) within the paired-like homeodomain transcription factor 1 (PITX1) on chromosome 5q31 and autism [
42].
Based on the above, we suggest that NRG2 might be a susceptibility gene for ASD. To test this hypothesis, we first used RT-qPCR to assess NRG2 expression in a sample of 20 patients with ASD and 20 controls to confirm the microarray gene expression analysis results. Next, we conducted systemic sequencing of all exons of NRG2 in another independent sample of ASD patients. We compared them with super-controls of the Taiwan Biobank using a case-control association study to determine the existence of mutations or SNPs in NRG2 associated with ASD.
3. Discussion
In our study, we found a significantly lower average NRG2 expression in ASD patients (3.23 ± 2.80) than in controls (9.27 ± 4.78, p < 0.001;
Figure 1). Previously, several linkage analysis or association studies pointed to a genome region encompassing the NRG2 locus to be associated with psychiatric disorders [
43,
44,
45]. In addition, many studies have pointed out that 5q containing the NRG2 gene may also be a candidate region for ASD [
37,
38,
39,
40,
41,
42]. Recently, Yan L. et al. reported that NRG2-KO mice presented a pattern of dopamine that was similar to schizophrenia and also showed various behavioral abnormalities relevant to psychiatric disorders, such as impaired social behavior and cognitive function in several behavioral tests [
32]. Our findings align with prior research indicating that NRG2 could be a susceptibility gene for ASD, and the reduced NRG2 expression or its loss may be associated with the development of ASD.
Furthermore, the indel GCCCGGC variant corresponded to amino acids 661–663 of NRG2, which are part of the EGF-like domain region at the C-terminal of the NRG2 precursor protein as predicted in silico (UniProt). Previous studies have shown that the EGF-like domain regulates the biological activity of NRG2 [
19,
28], as it is the region that binds to receptors and can regulate processes such as cell proliferation, differentiation, and survival. The function of the protein C-terminal domain includes regulation of protein stability or function or interactions with other proteins, such as binding domains or regions mediating protein–protein interactions. Recently, Czarnek and Bereta reported the proteolytic degradation mechanism of NRG2 protein; the C-terminal part contains a site for γ-secretase. They are being cleaved by ADAM10 or BACE2 for degradation. The interaction domains in the C-terminal region of NRG2 lie at 712–713 amino acids. This domain interacts with other neurotrophic factors; these interactions might help regulate physiological and pathological processes such as neural development and neurodegeneration [
46].
Our study found that the indel GCCCGGC variant at 661–663 amino acids also in the C-terminal region of NRG2 was significantly more common in ASD patients than in controls. Consequently, we suggested the indel SNP might affect the function of NRG2 in the NRG–ErbB signaling pathway and be associated with ASD.
This study still has some limitations. First, we identified two intronic SNPs (rs889022 and rs182642591) in nearly 300 ASD Taiwanese patients, but we did not find a significant association with ASD in our autistic samples. Population-based case-control association studies of complex conditions, like ASD with clinical and genetic heterogeneity, need a large sample size to identify risk genes with small effects. In this study, we recruited 349 ASD subjects excluding about 50 cases without DNA with limited power (0.32) to detect small effects in our post-hoc analysis. Therefore, further research should involve a larger patient population with this disorder. Second, despite the significantly lower NRG2 expression level in ASD patients than in 20 controls, the regulatory mechanism underlying its potential influence on ASD remains unknown. We could not address causality in this study. Future studies should focus on the regulatory mechanism governing the effects of lower NRG2 expression on ASD. Finally, the indel GCCCGGC variant located in the EGF-like domain at the C-terminal region of NRG2 was significantly higher than the control group (p < 0.0001). However, how this indel SNP mediates protein–protein interaction in the NRG–ErbB pathway remains unknown. Consequently, functional analysis of NRG2 and the influence of this indel SNP on its function might help us understand the physiological and pathological mechanism underlying its contribution to ASD.