1. Introduction
Legumes (family
Fabaceae) are a specific group of plants capable of forming two types of mutualistic symbioses: root nodules (RN) with nitrogen-fixing bacteria (rhizobia) and arbuscular mycorrhiza (AM) with the fungi of the order
Glomeromycota [
1,
2,
3]. These symbioses are beneficial for both the host plant and the microsymbionts, as well as for the environment. Indeed, RN and AM improve mineral nutrition by supplying plants with macroelements – nitrogen and phosphorus, respectively – thus increasing the plants’ fitness, yield and stress resilience. In turn, rhizobia and AM fungi receive carbon compounds produced by the plant, mainly in the form of succinate and malate for rhizobia and in the form of carbohydrates and fatty acids for AM fungi. Lastly, the symbioses improve the soil structure and fertility: the amount of N
2 fixation by legumes is estimated to be up to 300 kg N ha−1 yr−1, depending on the legume species and the method of assessment [
4,
5,
6], and AM stabilizes soil macro-aggregation and thus protects the soil from erosion [
7]. These advantages associated with a symbiotic lifestyle make legumes ideal crops for use in the modern approach of sustainable agriculture [
8].
The molecular mechanisms underlying the formation and functioning of RN and AM symbioses have been well studied, especially in the model legumes such as
Medicago truncatula Gaertn. (barrel medik),
Lotus japonicus (Regel.) K. Larsen and
Glycine max (L.) Merr. (soybean) [
1,
9]. There is, however, one aspect of symbiosis where our understanding is still lacking: symbiotic efficiency and its genetic control. Indeed, strains of rhizobia differ in their nitrogen fixation capabilities [
10,
11]; moreover, some rhizobial strains are incompatible with particular varieties of legume plants due to the features of both plant and microbe genotypes. Similarly, some isolates of AM fungi have less beneficial effect on plants than the others (the genetic basis of this trait is unknown), and plant species (and, possibly, even plants within the same species) differ in their mycorrhizal dependency (which is determined as the degree of plant growth change associated with arbuscular mycorrhizal colonization) [
12]. One can generalize that the plant’s genome apparently affects the effectiveness of the formed symbioses, but the genetic determinants responsible for this trait are still to be elucidated.
Pea (
Pisum sativum L.) is an important legume crop worldwide (FAOSTAT, 2021) and a promising candidate for use in sustainable agriculture. During the last decade, significant advances in the area of genetics and genomics of pea have been made, including the development of useful modern tools such as reference genomes, pan-genomes, transcriptomic atlases and several sets of molecular markers for marker-assisted breeding (reviewed in [
13]). Currently, pea breeding is mostly aimed at improvement of pathogen resistance and stress tolerance, with less attention to symbiosis-related traits [
14,
15]. However, several years ago, it was proposed to consider the increase in seed biomass due to the complex inoculation with rhizobia and AM fungi as a trait for legume breeding [
16]. This integral trait was named Efficiency of Interactions with Beneficial Soil Microorganisms (EIBSM), or symbiotic responsivity [
8,
16]. In accordance with this proposal, pea genotypes with high symbiotic responsivity have been identified in the germplasm collection of the N.I. Vavilov All-Russian Institute of Plant Genetic Resources (VIR) [
17,
18] and subsequently involved in breeding programs. As a result of backcrossing of cultivar ‘Classic’ (a donor of plant architectonics) on cultivar ‘Vendevil’ (=k-8274 in the VIR collection, a donor of symbiotic responsivity), the pea breeding line with increased EIBSM named ‘Triumph’ has been created [
19]. Symbiotic responsivity of the breeding line ‘Triumph’ has been successfully proven in three-year field trials under inoculation with nodule bacteria and AM fungi [
20].
Since the EIBSM trait could be of high interest for pea breeders, we aimed to characterize the molecular genetic basis of this trait in order to pave the way for further molecular breeding programs in pea. To this end, we sequenced, analyzed and compared the genomes and transcriptomes of ‘Triumph’ and its parental cultivars, ‘Classic’ and ‘Vendevil’. Our findings demonstrate that: (i) ‘Triumph’ inherited 22.5% of its genome from ‘Vendevil’, including the genes annotated as related to AM and nodule formation, (ii) under the combined inoculation with nodule bacteria and AM fungi ‘Triumph’ and ‘Vendevil’ similarly show up-regulation of genes involved in nodules and AM development, which are related to solute transport, hormone regulation and flavonoid biosynthesis, and (iii) the expression of one of such genes, namely, the gene encoding a germin-like protein, which we named PsGLP2, is associated with differences in its promoter region, that makes it a promising marker of the symbiotic responsivity.
3. Discussion
The mutualistic symbiosis with rhizobia has long been known to have a beneficial effect on legumes, allowing them to grow in nitrogen deficiency and increasing their fitness and yield. Since there are many crops among legumes, the ability to form an effective symbiosis with bacteria should be considered as a valuable trait for breeding [
35]. In turn, since arbuscular mycorrhizal fungi are present in virtually all cultivated soils, the ability to form an effective AM can also be a subject for legume crops’ breeding. In order to simplify and formalize the breeding process, we previously proposed a quantitative metric for the effectiveness of symbiosis (EIBSM, for Effectiveness of Interaction with Beneficial Soil Microorganisms), or, in other words, the responsivity of the plant to the symbiotic microorganisms [
8,
16]. An example of the practical application of this metric is the pea breeding line ‘Triumph’, which demonstrates increased responsivity to inoculation with nodule bacteria and AM fungi.
The breeding line ‘Triumph’ was created as a result of five backcrosses of the cv. ‘Vendevil’ (=k-8274 in the VIR pea genetic collection) with cv. ‘Classic’ (Netherlands) followed by four generations of selfing with simultaneous selection for high yield under inoculation with nodule bacteria and AM fungi [
19]. The idea of this breeding program was to combine high EIBSM with agriculturally important traits such as semi-leafless phenotype (inherent to ‘Classic’ due to the mutation in the
afila (
af) gene) and high seed productivity [
36]. The resulting breeding line ‘Triumph’ has proved its symbiotic responsivity in three-year field trials [
20], but no further characterization of this line was performed, except for the study of Kuzmicheva et al. (2014), in which ‘Triumph’ was shown to excrete high amounts of pyruvic and succinic acids in the root exudates, similar to its parental cultivars ‘Vendevil’, while ‘Classic’ excreted low amounts of those organic acids [
37].
In the present work, a similar experimental design was used, i.e., ‘Triumph’ was compared to its parental cultivars ‘Vendevil’ and ‘Classic’ under simultaneous inoculation with nodule bacteria and AM fungi. In our inoculation experiment ‘Triumph’ and its parental cultivar ‘Vendevil’, the donor of EIBSM, responded to inoculation more readily than the second parental cultivar ‘Classic’, the donor of plant architectonic, even despite the fact of rhizobial contamination in control pots. Interestingly, ‘Classic’ demonstrated a tendency to form fewer roots with more nodules than ‘Triumph’ and ‘Vendevil’, although this difference was not statistically significant. This fact, however, coincides with the observed down-regulation of several nodule-specific genes such as those encoding leghemoglobin and lectins in the roots of ‘Triumph’ and ‘Vendevil’ as compared to ‘Classic’ (
Supplementary Table S3). Further experiments are required to establish the connection between the regulation of the nodule number from the plant side and the symbiotic effectiveness in pea.
The results of our analysis point towards the important role of plant hormones in determining the effectiveness of symbiosis with nodule bacteria and AM fungi. Among the similarly regulated genes in roots of ‘Triumph’ and ‘Vendevil’ we found genes presumably involved in the biosynthesis pathways of ABA, cytokinins, gibberellins and brassinosteroids. Unfortunately, the annotation data currently available in the databases does not allow us to draw a clear conclusion about differences in biological processes; for example, according to BLAST, the
evm.TU.scaffold_258.357 gene upregulated in ‘Triumph’ and ‘Classic’ (Тable 3) is encoding a flavonol synthase, but its closest homolog in
M. truncatula is annotated as 2-oxoglutarate-dependent dioxygenase, an enzyme participating in gibberellin synthesis [
38,
39,
40]. Lange & Lange, 2020, say that such cases are not uncommon, and homologs of GA 7-oxidase (GA7ox) are often called flavonol synthases [
39]. Thus, at the moment we cannot be certain that the gibberellins affect the EIBSM. However, gibberellins are known to block the infection during RN symbiosis while stimulating the growth and development of the existing nodules [
41]; indeed, in our experiment, ‘Classic’ formed many small nodules, while in ‘Triumph’ and ‘Vendeville’ nodules were less numerous. It is also worth mentioning that succinic acid, which is abundant in root exudates of ‘Triumph’ and ‘Vendevil’ [
37], is a by-product of the reaction catalyzed by 2-oxoglutarate-dependent dioxygenase (gibberellin 2-beta-dioxygenase) which might be encoded by
evm.TU.scaffold_258.357 (KEGG REACTION: R03008) [
42]. Metabolome profiling may help elucidate the possible roles of biologically active molecules in plant control over the effectiveness of the formed symbioses.
Moreover, there was a veritable increase in expression of genes involved in flavonoid biosynthesis in roots of ‘Triumph’ and ‘Vendevil’, which may suggest the more active interaction with microsymbionts, as flavonoids have been shown to accumulate in roots during the early stages of both nodulation and mycorrhization, playing role in molecular dialogue between plant and microorganisms and enabling symbiotic specificity. Our previous work suggested the high expression level of flavonoid biosynthesis genes in roots to be a transcriptional biomarker of pea cultivars with high EIBSM [
43]; the results of the present study corroborate that idea (intriguingly, neither ‘Vendevil’ nor ‘Triumph’ were included in the set of pea cultivars used by Afonin et al. in [
43]). It is known that domestication resulted in a reduction in secondary metabolites content in several legumes [
44]; possibly, the content and diversity of flavonoids in roots (and root exudates) of pea may be connected with the symbiotic responsivity. Similarly, the ‘mycorrhizal dependency’, i.e., the benefit for a plant from arbuscular mycorrhizal colonization is lower in cultivated plant species than in wild species [
45]. In this regard, it seems relevant to study the root flavonoid content in wild and cultivated pea varieties paying attention to the symbiotic properties of these varieties.
Our approach combining genomic and transcriptomic sequencing allowed us to reveal biomarkers of the EIBSM, i.e., the genes that increase expression in response to inoculation in ‘Triumph’ and ‘Vendevil’, as opposed to ‘Classic’. Homologs of some genes from this group were described as mycorrhiza-specific signatures in
M. truncatula, thus the increased expression of these genes indicates the proper response to inoculation with AM fungi. We did not find a clear response to nodule bacteria, though, probably due to contamination of the control samples. Among the signatures associated with mycorrhization, genes with the assigned function in the biosynthesis of brassinosteroids were detected; it is known that these phytohormones affect both AM and RN symbioses, playing opposite roles to gibberellins in their regulation [
41,
46,
47]. Perhaps, the fine tuning of the balance between the development of the two symbioses represents the molecular genetic base of the EIBSM in pea.
However, transcription markers are difficult to apply in breeding programs, while DNA-based markers are considered more suitable for the widespread use in breeding. Among the genes whose expression patterns distinguished ‘Triumph’ and ‘Vendevil’ from ‘Classic’ we were able to find candidate genes with differences in the sequence of the coding or promoter part. One of them encodes the 12-oxophytodienoate reductase 1 protein (the homolog of
M. truncatula MTR_5g006740), which participates in the biosynthesis of the precursor of jasmonic acid [
34,
48]. The other, the most promising marker of EIBSM, is the gene we named
PsGLP2, which carries the 10-bp insertion in the promoter region in ‘Vendevil’ and ‘Triumph’ and is up-regulated in response to inoculation, unlike that of ‘Classic’. The ortholog of
PsGLP2 in
M. truncatula is
Medtr2g031270 encoding the germin-like protein (GLP) with putative function of the rhicadhesin receptor. The germin-like proteins are part of the biochemically diverse cupin superfamily that has a conserved tertiary structure with limited similarity in primary sequence [
49,
50]. GLPs participate in various development processes in plants, usually exhibiting enzymatic activity of oxalate oxidase and superoxide dismutase [
50,
51]. It has been shown that these proteins are involved in interaction with microorganisms, in particular, the establishment of symbiosis with rhizobia and AM fungi [
51,
52]. In this regard, the rhicadhesin receptor plays an important role in the early stages of root nodule symbiosis, as it enables the initial attachment of the bacteria to the root surface via the Ca
2+-dependent rhicadhesin protein; without this attachment, further stages of symbiosis are impossible [
51,
53]. However, this is at odds with our data on nodulation, according to which ‘Classic’ formed more nodules than ‘Triumph’ or ‘Vendevil’, so
PsGLP2 probably plays another role, perhaps, associated with mycorrhiza; at least three different AM-induced GLPs seem to be involved in AM symbiosis, as shown in several studies on
M. truncatula [
49,
50,
52], and the closest homolog from
M. truncatula expresses during AM formation (see
Supplementary Figure S9). Also worth mentioning, despite the fact that the primary sequence is of little relevance for GLPs,
PsGLP2 is quite similar to
PsGER2a and
PsGER2b, both expressed in pea roots, though with unknown function [
51].
Regardless of the possible function of the PsGLP2, this gene contains the cis-regulator of its expression (located upstream the ORF) and therefore can be easily converted into the DNA marker suitable for marker-assisted breeding. The expression of other transcriptional biomarkers identified in the present study, apparently, is regulated by trans-regulators (i.e., the sequences located far from the ORF, which may either encode transcription factor(s) or be enhancer/silencer motifs). Probably, such is the case of the genes involved in flavonoid biosynthesis. Further work in this direction may be focused on the search for transcription factors co-expressed with the biosignatures of EIBSM, on the identification of the features in promoter regions of the differentially expressed genes, and on the test of these DNA markers on the set of pea lines differing in the symbiotic responsivity.
In general, the direct search for genetic determinants inherited by ‘Triumph’ from ‘Vendevil’ brought only limited success, since ‘Triumph’ appeared to carry an unexpectedly large portion of the ‘Vendevil’ genome (as much as one-fourth of the genes). We anticipated that after 5 backcrosses the portion of the ‘Vendevil’ genome should decrease down to about 3% (i.e., ½ ^ 5), but apparently the genetic material used for crosses was not linear, which caused high heterogeneity in the progeny. As a result, more than 20% of the ‘Triumph’ genome came from ‘Vendevil’. At the same time, we detected the allele-specific expression only for the genes expressed in the underground parts, i.e., where the symbioses form and function, which indicates that the breeding for the symbiotic responsivity was specific and effective. Among the genes with an allele-specific expression in roots, the promising gene PsGLP2 was identified, which will be tested in future work for its feasibility and usability as the marker of the EIBSM trait in pea.
Author Contributions
Conceptualization, V.A.Z. and I.A.T.; methodology, E.A.Z., A.I.Z., D.A.R., O.A.K., G.A.A., O.Y.S.; validation, E.A.Z., A.I.Z.; formal analysis, E.A.Z., A.S.S., A.I.Z., D.O.K., V.A.R., M.S.K., O.A.K., V.A.Z.; investigation, E.A.Z., A.S.S., A.I.Z., D.O.K., V.A.R., M.S.K., D.A.R., O.A.K., G.A.A., O.Y.S., I.A.T., V.A.Z.; resources, V.A.Z., I.A.T.; data curation, E.A.Z., A.I.Z., D.O.K.; writing—original draft preparation, E.A.Z., A.S.S., V.A.R., V.A.Z.; writing—review and editing, A.S.S., D.A.R., V.A.Z.; visualization, D.O.K., A.S.S.; supervision, V.A.Z.; project administration, V.A.Z.; funding acquisition, I.A.T., V.A.Z. All authors have read and agreed to the published version of the manuscript.