4. Discussion
The screening of susceptible and resistant hybrids in this study was conducted through two infection tests on candidate eucalypt hybrids using three
Calonectria pathogens. Based on the results of the two infection tests, the resistant genotype EC338 and the susceptible genotype EC333 were screened [
20]. Although susceptibility tests had shown differences in the susceptibility index of eucalypt hybrids to different
Calonectria pathogens, the relative value of the susceptibility index of the same eucalypt hybrid after being infected twice by three pathogens was basically the same. This basically eliminates the randomness of the susceptibility test results, indicating that the difference in susceptibility of the tested hybrid was due to its inherent genetic differences, that was, the disease resistance was strongly controlled by genetics. Although heterosis is generally controlled by multiple genes and the mechanisms are very complex, the genetic basis for the formation of heterosis varies among different plants, traits, and populations with different genetic backgrounds [
33,
34]. However, by tracking the parents of disease resistant hybrids and conducting scientific parent combination, it is highly possible to achieve the goal of improvement of disease resistant eucalypt varieties.
Studies had shown that the expression or transgenosis of Carbohydrate binding Modules (CBMs) could change cell wall properties and regulate plant growth and development. Such as Keadtidumrongkul et al. found that CBM2a promoted the growth of Arabidopsis thaliana, and the growth promotion of
E. camaldulensis was reflected in the increase of plant height, xylem area, xylem fibers, and conduit cells, there was no significant growth-promoting effect on tobacco [
35]. But there are also studies indicating that this is not always the case. Such as the overexpression of certain CBMs did not result in enhanced growth of
N.tabacum [
36]. And in present study, the enrichment analysis based on GO annotation revealed that the differentially expressed gene pathway between the disease-resistant genotype EC338 and the susceptible genotype EC333 was associated with carbohydrate binding (GO: 0030246), indicating that CBMs might also have a correlation with leaf blight resistance in eucalypt.
In this study, GSEA enrichment based on GO annotation revealed a differential pathway of terpene synthetase (TERPENESYNTHASEACTIVITY) in the disease resistant hybrid EC338 compared to its parent
E. pellita P9060. Terpenes are critical components of plant defense against herbivores, insects, pathogenic fungi, and bacteria [
37,
38]. In many forest tree species, terpenes are vital chemical, and physical defense compounds [
39,
40]. Conifer terpenes are a primary component of a durable, quantitative defense mechanism against stem boring insects and fungal pathogens [
41]. Constitutive and induced terpenes are critical for resistance [
42,
43,
44]. This finding might indicate a certain correlation between terpene synthase and resistance to eucalypt leaf blight.
Furthermore, two differential pathways between EC338 and its parent P9060 were found, namely, the carbon-oxygen cleavage activity on phosphates (CARBON-OXYGEN-LYASE-ACTIVITY_ACTING-ON-HOSPHATES) and carbon-oxygen lyase activity (CARBON-OXYGEN-LYASE-ACTIVITY). A study shown that Cold-resistant DEGs-DEPs had abundant hydrolase activities that could act on glycosyl bonds, carbon-oxygen lyase activity, and iron binding [
45]. The MCRA proteins from carbon-oxygen lyase family could be classified as fad-containing double bond hydratases and play a role in the virulence of
Streptococcus pyogenes M49 [
46]. Based on the results of current and other study, it could be found that the above pathways were indeed related to the stress resistance mechanism of plants.
The current study, GSEA enrichment based on KEGG annotation revealed differential pathways in sesquiterpene and triterpenoid biosynthesis (SESQUITERPENOID-AND-TRITERPENOID-BIOSYNTHESIS) between the resistant hybrid EC338 and the susceptible hybrid EC333. Studies had shown that this pathway was related to the stress resistance mechanism of plants. Such as the study on the molecular response mechanism to drought stress in
Atractylodes chinensis had revealed 15 single gene encoding enzymes in the biosynthesis pathways of sesquiterpenes or triterpenes [
47]. And a finding done by Singh et al. suggested a relationship between sesquiterpenes, phenolics, flavonoids, and the antioxidant capacity of
Ferula plants [
48]. Moreover, a study had proposed that the synthesis of sesquiterpenes, triterpenes, flavonoids, and phenylpropanoids might play a crucial role in the response to mechanical damage in
Aquilaria sinensis [
49].
GSEA enrichment based on KEGG annotations also revealed a differential pathway for phenylpropanoid biosynthesis (PHENYLPROPANOID_BIOSYNTHESIS) in the disease-resistant hybrid EC338 compared to its parent. Gan et al. determined that phenylpropanoid biosynthesis might play an important role in salt tolerance in mulberry at proteomic level [
50]. Additionally, a research indicated that
Pichia galeiformis enhances resistance to diseases and reduces disease incidence and lesion diameters in
chalcone synthase by promoting the activation of genes involved in phenylpropanoid biosynthesis [
51].
KEGG enrichment revealed that there is a differential plant pathogen interaction pathway (PLANT_PATHOGEN_INTERACTION) between the resistant hybrid EC338 and the susceptible hybrid EC333. Under biotic stress, plants trigger double immunity against pathogens through plant-pathogen interaction mechanisms, i.e. pathogen-associated molecular pattern-triggered immunity (PTI) and effector-triggered immunity (ETI) [
52]. Recent research indicates that PTI and ETI work together to enhance plant immunity against pathogens, thereby enhancing the overall defense system of plants [
53,
54]. Nonetheless, pathogens have developed sophisticated mechanisms to circumvent plant defenses, leading to successful infection by exploiting host susceptibility [
55,
56]. Whether it involved plant autoimmunity or pathogen evasion, immunity was intricately linked to the plant-pathogen interaction pathway.
In addition, exploring the causes of plant disease resistance heterosis can also evaluate with phenotypic characteristics such as leaf surface wax, cuticle layer, morphological structure, and chlorophyll content. Recent study conducted by Bonora et al. reveals that resistance mechanism of
Corymbia citriodora to different pathogens might be related to leaf phenotypic traits, such as the deposition of polyphenols and tannins on the upper/lower epidermis, and the proportion of monoterpenes, steroids, monounsaturated hydrocarbons, and long-chain hydrocarbons [
57]. And, increasing the chlorophyll content in cotton can promote photosynthesis and enhance the plant's ability to resist diseases [
58]. So, in the subsequent research of this study, in addition to verifying the identified functional genes, attempts could also be made to study the differences in leaf phenotype characteristics between disease resistant and susceptible genotypes, and analyze the reasons for their disease resistance differences from multiple perspectives.
The susceptibility test in the early stage of this study was conducted under controlled conditions on seedlings aged 6 months, using the pathogen of
Calonectria to artificially infect and select susceptible hybrids. However, in the 1.5-year field experiment, there was no significant difference in susceptibility between the two hybrids. This might be related to seedling age and environment. As seedling age increases, the temporal expression of genes causes protein structure modification or isomerism, or different proteins interact to resist adverse environments [
59].Yang et al. (2022) illustrated, through comparative transcriptomic and proteomic analysis that
P. notoginseng mediated resistance to pathogens by regulating the lignin biosynthesis pathway [
60]. Therefore, the protein products most likely associated with the gene pathway associated with eucalypt leaf blight discovered in this study can be isolated, and substrate specificity and kinetic parameter analysis could be performed to further explore the formation mechanism of leaf blight resistance.
Figure 1.
Testing the distribution of gene expression levels in eucalypt genotypes (A: Boxplot: A boxplot of gene expression levels, where the horizontal coordinate is the name of the sample (group), and the vertical coordinate is log2(FPKM+1). B: Density: A density plot of gene expression levels, where the horizontal coordinate is log2(FPKM+1), and the vertical coordinate is the density of The gene. C: Violin plot of gene expression levels, where the horizontal coordinate is the name of the sample (group), and the vertical coordinate is log2(FPKM+1). The width of each violin represents the number of genes at that expression level.).
Figure 1.
Testing the distribution of gene expression levels in eucalypt genotypes (A: Boxplot: A boxplot of gene expression levels, where the horizontal coordinate is the name of the sample (group), and the vertical coordinate is log2(FPKM+1). B: Density: A density plot of gene expression levels, where the horizontal coordinate is log2(FPKM+1), and the vertical coordinate is the density of The gene. C: Violin plot of gene expression levels, where the horizontal coordinate is the name of the sample (group), and the vertical coordinate is log2(FPKM+1). The width of each violin represents the number of genes at that expression level.).
Figure 2.
Correlation heat map of gene expression among various eucalypt genotypes (the horizontal and vertical coordinates are the square of the correlation coefficient between each genotype).
Figure 2.
Correlation heat map of gene expression among various eucalypt genotypes (the horizontal and vertical coordinates are the square of the correlation coefficient between each genotype).
Figure 3.
Principal component analysis plot of gene expression for each eucalypt genotype (A is 2D, B is 3D).
Figure 3.
Principal component analysis plot of gene expression for each eucalypt genotype (A is 2D, B is 3D).
Figure 4.
Gene co-expression Venn diagram in eucalypt genotypes tested.
Figure 4.
Gene co-expression Venn diagram in eucalypt genotypes tested.
Figure 5.
Differential gene Venn diagram in eucalypt genotypes tested.
Figure 5.
Differential gene Venn diagram in eucalypt genotypes tested.
Figure 6.
Heatmap of differentially expressed gene clustering (Horizontal coordinates are sample names, vertical coordinates are values of differentially expressed genes after FPKM normalization. The redder the color, the higher the expression level. The greener, the lower the expression level ).
Figure 6.
Heatmap of differentially expressed gene clustering (Horizontal coordinates are sample names, vertical coordinates are values of differentially expressed genes after FPKM normalization. The redder the color, the higher the expression level. The greener, the lower the expression level ).
Figure 10.
GSEA enrichment analysis using KEGG annotations (here the legends same as
Figure 9.).
Figure 10.
GSEA enrichment analysis using KEGG annotations (here the legends same as
Figure 9.).
Figure 11.
SE type of Alternative Splicing.
Figure 11.
SE type of Alternative Splicing.
Figure 12.
SNP variant loci region statistics (The horizontal coordinates represent the samples. A: The histogram of each sample variant loci categorized based on function (SNP_function). B: The histogram of that by impact (INDEL_impact). C: The histogram of that by region (INDEL_region). D: The histogram of that by region (SNP_region).).
Figure 12.
SNP variant loci region statistics (The horizontal coordinates represent the samples. A: The histogram of each sample variant loci categorized based on function (SNP_function). B: The histogram of that by impact (INDEL_impact). C: The histogram of that by region (INDEL_region). D: The histogram of that by region (SNP_region).).
Figure 13.
Module division in WGCNA analysis (A: Soft threshold plot; in the left plot, the horizontal coordinate is the soft threshold and the vertical coordinate is the correlation between connectivity k and p(k). The right plot is the soft threshold versus the average connectivity of the network. B: Module hierarchical clustering tree; the branches are gene modules and the leaves are genes.Merged colors are the modules with dissimilarity coefficients less than 0.25 merged with colors to represent the different modules.C: Inter-module correlation heatmap; the top part of the plot is the clustering based on the module's eigenvalues, and the values of the vertical coordinates correspond to the similarity between different modules. The upper part is the clustering based on the module eigenvalues, the value of the vertical coordinate is the similarity between different modules. The lower part is the clustering heatmap between different modules, each row and column in the graph represents one module. The darker the color (redder) in the box, the stronger the correlation; the lighter the color of the box, the weaker the correlation.).
Figure 13.
Module division in WGCNA analysis (A: Soft threshold plot; in the left plot, the horizontal coordinate is the soft threshold and the vertical coordinate is the correlation between connectivity k and p(k). The right plot is the soft threshold versus the average connectivity of the network. B: Module hierarchical clustering tree; the branches are gene modules and the leaves are genes.Merged colors are the modules with dissimilarity coefficients less than 0.25 merged with colors to represent the different modules.C: Inter-module correlation heatmap; the top part of the plot is the clustering based on the module's eigenvalues, and the values of the vertical coordinates correspond to the similarity between different modules. The upper part is the clustering based on the module eigenvalues, the value of the vertical coordinate is the similarity between different modules. The lower part is the clustering heatmap between different modules, each row and column in the graph represents one module. The darker the color (redder) in the box, the stronger the correlation; the lighter the color of the box, the weaker the correlation.).
Figure 14.
Heatmap of genotype and inter-module correlations in eucalypt genotypes (The horizontal axis, known as the abscissa, denotes the sample, while the vertical axis, referred to as the ordinate, represents the module. Each cell in the grid contains a numerical value indicating the correlation between the module and the sample. A value closer to 1 signifies a more robust positive correlation between the module and the sample, whereas a value nearing -1 indicates a stronger negative correlation. The degree of negative correlation strength is proportional to the value. Additionally, the significance level, denoted by the value in parentheses, reflects the strength of significance, with smaller values indicating higher significance.).
Figure 14.
Heatmap of genotype and inter-module correlations in eucalypt genotypes (The horizontal axis, known as the abscissa, denotes the sample, while the vertical axis, referred to as the ordinate, represents the module. Each cell in the grid contains a numerical value indicating the correlation between the module and the sample. A value closer to 1 signifies a more robust positive correlation between the module and the sample, whereas a value nearing -1 indicates a stronger negative correlation. The degree of negative correlation strength is proportional to the value. Additionally, the significance level, denoted by the value in parentheses, reflects the strength of significance, with smaller values indicating higher significance.).
Table 1.
Eucalypt leaf blight susceptible and resistant genotypes and their parents.
Table 1.
Eucalypt leaf blight susceptible and resistant genotypes and their parents.
Hybrids |
Susceptibility |
Female |
Species |
Male |
Species |
EC338 |
resistant |
W1767 |
E. Wetarensis |
P9060 |
E. pellita |
EC333 |
susceptible |
H1522 |
E. urophylla × E. pellita |
Unknown |
E. urophylla |
Table 2.
GO and KEGG enrichment analysis of four pairs of differential genes.
Table 2.
GO and KEGG enrichment analysis of four pairs of differential genes.
Comparisons |
GO |
KEGG |
EC338 VS EC333 |
Carbohydrate binding, Pattern binding, Polysaccharide binding (−log10(padj)>6) |
Phenylpropanoid biosynthesis, Flavonoid biosynthesis, Plant-pathogen interaction, Sesquiterpenoid and triterpenoid biosynthesis (−log10(padj)>2.5) |
EC338 VS P9060 |
Terpene synthase activity, Carbon-oxygen lyase activity, acting on phosphates, Carbon−oxygen lyase activity (−log10(padj)>10) |
Phenylpropanoid biosynthesis, Sesquiterpenoid and triterpenoid biosynthesis (−log10(padj)>5) |
EC338 VS W1767 |
DNA binding transcription factor activity, Terpene synthase activity, Carbon-oxygen lyase activity, acting on phosphates, Iron ion binding, Tanscription regulator activity (−log10(padj)>6) |
Phenylpropanoid biosynthesis, Plant hormone signal transduction, Sesquiterpenoid and triterpenoid biosynthesis, Plant-pathogen interaction, Biosynthesis of various plant secondary …, Flavonoid biosynthesis (−log10(padj)>4) |
EC333 VS H1522 |
Carbohydrate binding, Terpene synthase activity, Carbon-oxygen lyase activity, acting on phosphates, Carbon-oxygen lyase activity (−log10(padj)>4) |
Phenylpropanoid biosynthesis (−log10(padj)>7.5) |
Table 3.
Significance test for differential gene analysis in GSEA enrichment using GO-annotated.
Table 3.
Significance test for differential gene analysis in GSEA enrichment using GO-annotated.
Differential genotypes |
Genotypes |
Name |
Size |
ES |
NES |
NOM p-val |
FDR q-val |
FWER p-val |
Rank at max |
Leading edge |
EC338 VS EC333 |
EC333 |
Carbohydrate binding (GO:0030246) |
286 |
-0.519 |
-1.580 |
0 |
0.171 |
0.75 |
5450 |
tags=40%, list=17%, signal=48% |
EC338 VS P9060 |
EC338 |
Terpene synthase activity (GO:0010333) |
86 |
0.469 |
1.952 |
0 |
0.201 |
0.188 |
2191 |
tags=36%, list=7%, signal=39% |
EC338 |
Carbon-oxygen lyase activity, acting on phosphates (GO:0016838) |
89 |
0.462 |
1.939 |
0 |
0.149 |
0.188 |
2191 |
tags=35%, list=7%, signal=37% |
EC338 |
Carbon-oxygen lyase activity (GO:0016835) |
112 |
0.429 |
2.024 |
0 |
0.098 |
0.047 |
3561 |
tags=35%, list=11%, signal=39% |
Table 4.
Significance test for differential gene analysis in GSEA enrichment using KEGG-annotated.
Table 4.
Significance test for differential gene analysis in GSEA enrichment using KEGG-annotated.
Differential genotypes |
Genotypes |
Name |
Size |
ES |
NES |
NOM p-val |
FDR q-val |
FWER p-val |
Rank at max |
Leading edge |
EC338 VS EC333 |
EC338 |
Sesquiterpenoid and triterpenoid biosynthesis (EGR00909) |
65 |
0.610 |
1.714 |
0 |
0.066 |
0.093 |
3071 |
tags=35%, list=10%, signal=39% |
EC338 VS W1767 |
EC338 |
Phenylpropanoid biosynthesis (EGR00940) |
224 |
0.369 |
1.900 |
0 |
0.048 |
0.048 |
3594 |
tags=27%, list=11%, signal=30% |
W1767 |
Plant-pathogen interaction (EGR04626) |
281 |
-0.455 |
-1.501 |
0 |
0.178 |
0.599 |
4752 |
tags=37%, list=15%, signal=43% |
EC338 |
Flavonoid biosynthesis (EGR00941) |
89 |
0.521 |
1.455 |
0 |
0.103 |
0.760 |
4985 |
tags=42%, list=15%, signal=49% |