1. Introduction
Root nodule bacteria (rhizobia), N
2-fixing symbionts of leguminous plants, represent a convenient model for the evolutionary genetics of symbiotic organisms. As in other bacteria, rhizobia genomes are composed of conservative core and variable accessory parts [
1]. Core genes are responsible for housekeeping functions (basic metabolism, cell development and reproduction, template processes) while the accessory genes encode for various adaptive functions including symbiotic interactions with the leguminous plants. Rhizobia genomes are subjected to the multilevel evolution based on modifications of: (i) symbiotically specialized (
sym) genes resulted in formation of polytypic species composed of host-specific biovars; (ii) core genes resulted in formation of cryptic (twin) species [
2].
Previously we studied the genome diversification in
Rhizobium leguminosarum, a polytypic species which includes two biovars:
viciae (symbionts of legumes from tribe
Fabeae, genera
Lathyrus,
Lens,
Pisum,
Vavilovia,
Vicia) and
trifolii (symbionts of genus
Trifolium from the tribe Trifolieae) [
3]. Cross-inoculation between these biovars is limited and results in the non-N
2-fixing nodules, which are usually underdeveloped and morphologically abnormal [
4]. In
R. leguminosarum, genomes are composed of circular chromosomes and several plasmids, one of them (pSym) having size 200-500 kb harbors the majority of
sym genes. They include
nod genes encoding for synthesis of nodulation-inducing lipo-chito-oligosaccharidic Nod factors, and
nif/fix genes encoding for the nitrogenase synthesis and operation. We suggested that in
R. leguminosarum, evolution of
sym genes is implemented under impacts of the host-induced natural selection [
3,
5] while mechanisms for the core gene evolution remain obscure.
In the presented paper we compare the evolutionary dynamics of
sym and core genes in goats’ rue rhizobia (
Neorhizobium galegae), a polytypic species differentiated into host-specific biovars
orientalis and
officinalis – N
2-fixing symbionts of
Galega orientalis and
G. officinalis, respectively. In contrast to
R. leguminosarum,
N. galegae biovars cross-inoculate their hosts readily resulting in the morphologically normal although non-N
2-fixing nodules. Majority of
N. galegae sym genes are located on chromids having sizes over 1600 kb. These circular replicons have a plasmid type
repABC system combined with the core genes which are typically located on bacterial chromosomes, including tRNA and rRNA encoding genes [
6].
Previously we demonstrated [
7] that populations of
N. galegae bv.
orientalis collected at the North Caucasian region are more polymorphic for
sym and core genes than
N. galegae bv.
officinalis populations. This difference apparently reflects the diversity of respective host plant species which is sufficiently higher in
G. orientalis than in
G. officinalis. Difference between two
N. galegae biovars for
nif/fix genes was much more pronounced than for
nod genes since the host specificity of compared biovars pertains N
2 fixation, not nodulation activity [
7].
The presented paper demonstrates that in N. galegae, impacts of driving/purifying selection (dN/dS) on gene evolution differ sufficiently in the core and sym genes and are biovar-specific. Strict phylogenetic congruence of core and sym genes was revealed in N. galegae, reflecting the location of sym genes on non-transmissible chromids. In spite of this congruence, mechanisms of evolution are different in core and sym genes, as it was demonstrated by analysis of correlations between p-distance and dN/dS values. These correlations as well as analysis of core gene ontology groups allow us to suppose that in bv. officinalis evolution is more dependent on adaptation to endosymbiotic niches than in bv. orientalis.
Analyses of the presented data completes some important gaps in our knowledge on the tradeoff between polymorphism of genes and impacts of natural selection on their evolution. Specifically, we suggest that in ecologically versatile organisms, such as N. galegae bv. orientalis, driving selection enhances the nucleotide gene polymorphism since the newly emerged alleles may coexist stabile with the preexisted ones. However, in ecologically restricted organisms such as N. galegae bv. officinalis, driving selection does not influence or even decreases the gene polymorphism since new alleles supported by this selection replace (push away) the preexisted alleles from the genepool.
3. Discussion
Root nodule bacteria (rhizobia) represent the genetically best studied group of symbiotic microorganisms. Being highly effective producers of N compounds for terrestrial ecosystems, these bacteria are characterized by exclusive ecological and agronomic importance. Moreover, rhizobia are used as a model to address a range of general evolutionary problems including the genomic mechanisms of micro- and macroevolution [
8], and the tradeoff between genetic polymorphism and natural selection [
10]. As it was demonstrated previously, purifying (stabilizing) selection usually results in a decreased population/gene polymorphism [
11] while disruptive and negative frequency-dependent selection result in an extended (balanced) polymorphism [
12,
13]. The data on influence of driving (directed) selection on the genetic polymorphism are contradictory: it may be increased [
14,
15,
16,
17], conserved or even decreased [
18,
19,
20] by this selection.
Application of rhizobia as a model to study impacts of natural selection on genetic diversity is based on the complicated genomic structures in which the core parts encoding for housekeeping functions differ in their natural histories from the accessory parts including the symbiotically specialized (sym) genes. Convenient models for analyzing the rhizobia genome dynamics are represented by polytypic species Neorhizobium galegae composed of two host-specific biovars – bv. orientalis and bv. officinalis (symbionts of Galega orientalis and G. officinalis), and Rhizobium leguminosarum, composed of bv. viciae (symbionts of plants from tribe Fabeae, genera Lathyrus, Lens, Pisum, Vavilovia, Vicia), bv. trifolii (symbionts of genus Trifolium from tribe Trifolieae), bv. phaseoli (symbionts of Phaseolus vulgaris from tribe Phaseoleae).
Previously we demonstrated that in
R. leguminosarum,
sym and core genes differ greatly in nucleotide polymorphism (p-distance) and are phylogenetically non-congruent, suggesting that evolution of these genes is independent [
3]. This independency may be resulted from intensive horizontal gene transfer in rhizobia populations wherein two gene groups recombine randomly due to location of
sym genes on mobile plasmids [
1].
In order to reveal impacts of natural selection on the rhizobia gene polymorphism we used the set of
N. galegae strains originated from North Caucasus region. As we demonstrated earlier [
7], diversity of nucleotide sequences (measured as p-distance) in
N. galegae is higher for core genes than for
sym genes and is biovar-dependent: bv.
orientalis is more polymorphic than bv.
officinalis for both gene groups (
Table 1). This difference may be due to contrasting levels of diversity in the respective host plant species. Specifically, North Caucasus is the longstanding center of origin for
G. orientalis while colonization of this area by
G. officinalis is more recent [
21]. Previously we quantified diversity of two
Galega species in North Caucasus using the nucleotide polymorphism analysis in a range of genes followed by genomic fingerprinting and confirmed the morphological data suggesting a higher
G. orientalis diversity with respect to
G. officinalis [
22,
23]. An important source of genetic diversity in
N. galegae may be represented by translocations of the Insertion Sequences (IS) which are more abandoned in bv.
orientalis than in bv.
officinalis [
24].
In this paper, we demonstrated that in
N. galegae, core and
sym genes are phylogenetically congruent (
Figure 1) apparently due to their restricted recombination based on location of
sym genes on non-mobile chromids. Nevertheless, some evolutionary important parameters of diversity are different in these genes: tradeoff between nucleotide polymorphism and evolutionary impacts of natural selection depend on the gene group (core or
sym) and on the
N. galegae biovar (
orientalis or
officinalis) (
Table 1). Analyses of the total gene pools (
Table 2) as well as of Gene Ontology Groups (GOGs) (
Table 3,
Figure 3), suggest that driving selection pressures result in an increased polymorphism of core genes in bv.
orientalis, not in bv.
officinalis. We suggest that in bv.
orientalis, maintenance of novel core gene alleles by driving selection (dN/dS > 1) may be combined with preservation of preexisted alleles due to a broad ecological amplitude of this biovar, therefore, its genetic polymorphism is elevated. However, in bv.
officinalis, the newly emerged gene alleles possibly substitute the preexisted ones (due to a restricted ecological amplitude of this biovar), therefore, gene polymorphism in this biovar is not changed.
In accordance to contrasting ecological affinities of the
Galega species, a range of differences between their symbionts were revealed: (i) low polymorphic GOGs are affiliated with N metabolism (apparently responsible for symbiotic adaptations) in bv.
officinalis and with the synthesis of surface polysaccharides (responsible for adaptations to edaphic stresses) in bv.
orientalis (
Tables S1–S3 in Supplement); (ii)
sym genes evolve under purifying selection (dN/dS < 1) impacts in bv.
officinalis while a neutral evolution (dN/dS ≈ 1) was revealed for these genes in bv.
orientalis; (iii) evolution of core genes occurs mostly under impacts of driving selection in bv.
orientalis while this evolution is neutral in bv.
officinalis (
Table 1,
Table 2 and
Table 3,
Figure 3). From these data, we can suppose that operation of
sym genes is most critical for bv.
officinalis because at North Caucasian region this biovar persists under unfavorable soil conditions and should survive mostly due to colonization of endosymbiotic niches. However, bv.
orientalis persists under more favorable conditions, as compared to bv.
officinalis, therefore adaptations to edaphic factors dependent on core genes are highly important for bv.
orientalis.
Interestingly,
N. galegae differs in its evolutionary dynamics from the previously studied [
3]
R. leguminosarum species:
sym and core genes
in N. galegae are more similar in their diversity parameters than in
R. leguminosarum. This difference between two polytypic species may be due to a more restricted recombination of
sym and core genes in
N. galegae with respect to
R. leguminosarum (
Table 4). Comparative analysis of these species contributes sufficiently to understanding of tradeoff between microevolution, speciation and macroevolution and between genetic polymorphism and natural selection.
Specifically, we demonstrate that in both rhizobia species,
N. galegae and
R. leguminosarum core genes responsible for speciation and macroevolution differ greatly in their natural histories from
sym genes responsible for microevolution. Different genetic mechanisms were proposed for micro- and macro-evolution by J. Philiptschenko (1927) [
25] who was the first to define these processes and correlated them with changes of the eukaryotic nuclear and cytoplasmic genes, respectively. Later, R. Goldschmidt (1949) [
26,
27] suggested that microevolution is based on the minor adaptive changes (“micro-mutations”) which can not initiate the speciation and macroevolutionary processes dependent on “macro-mutations” (responsible for generation of “hopeful monsters”). The similar approach was proposed in the punctuated evolution concept [
28,
29], which may be apparently used to address the symbiosis evolution since hosting of symbiotic microbes by eukaryotic organisms represents the rapid evolutionary bursts in contrast to gradual evolution suggested by the models gradualist evolution based on natural selection [
2,
30].
The other important issue of the rhizobia evolutionary genetics pertains the tradeoff between driving selection and gene polymorphism which may be increased by this selection in an ecologically versatile organism (such as N. galegae bv. orientalis) allowing a broad allelic diversity in the analyzed genes. However, in an ecologically restricted organism (such as N. galegae bv. officinalis), gene polymorphism is not changed or is even decreased under impacts of driving selection since co-existence of different gene alleles is presumable blocked. An extended bioinformatics analysis is required to analyze a relationship between adaptive potentials of organisms and impacts of natural selection on their polymorphism expressed in the diverse rhizobia species and in other symbiotic organisms.
Author Contributions
Conceptualization, P.N. and A.E.; methodology, A.E., K.E. and S.H.; software, K.E. and S.H.; validation, P.N. and S.H.; formal analysis, P.N. A.N. and S.H.; investigation, A.E., K.E. and S.H.; resources A.T., K.O., O.O. and D.N.; data curation, K.E. and S.H.; writing—original draft preparation, P.N., A.E., K.O. and S.H.; writing—review and editing, P.N., A.E., K.O. and S.H.; visualization, A.E. and S.H.; supervision, A,E.; project administration, P.N. and A.E.; funding acquisition, P.N. and A.E. All authors have read and agreed to the published version of the manuscript.