1. Introduction
Poplars are a widespread genus of woody plants with pronounced economic value. They have long been used as a source of pulp and wood, and the International Poplar Commission (IPC) was established in 1947 to promote the sustainable management of fast-growing trees by its 38 member countries [
1]. At the same time, due to their fast growth, poplars can be used to combat climate change as they store atmospheric carbon in their wood [
2,
3]. Poplars have traditionally been used for landscaping rural and urban areas, including in Russia. This is partly due to their ability to withstand heavy soil pollution, which allows poplar to be used for phytoremediation, cleaning both urban and industrial soils [
4,
5]. Finally, poplars are used by scientists as model objects among trees - for example,
Populus trichocarpa became the first tree the genome of which was sequenced in 2006 [
6]. Due to all these reasons, the geographical distribution of selected valuable poplar species has become wider, through introduction in selected regions.
White poplar (
Populus alba) is a well-known member of the genus with a native range from Central and Southern Europe to Xinjiang and the Western Himalayas (
https://powo.science.kew.org/taxon/urn:lsid:ipni.org:names:776573-1, accessed 13 October, 2024), and is currently present on five continents due to its biological characteristics that allow it to be grown as a source of cellulose and lignin or for landscaping in urban areas. Poplars are dioecious plants, much less common than monoecious plants. The ARABIDOPSIS RESPONSE REGULATOR 17 (
ARR17) gene, located in the sex-determining region (SDR), is the main regulator of sex, which ensures female phenotype development [
7], while male development is regulated by suppression of this gene expression [
8]. White poplar is an exception in the sense that males do not have genetic mechanisms to suppress
ARR17 expression, but instead have a deletion of this gene [
9], making it possible to use genome editing tools to change the sex of a tree by inserting or deleting the gene, which could allow the cultivation of trees of the preferred sex to, for example, reduce economic losses by reducing the number of trees of the preferred sex.
The complex history of white poplar range formation, including the reduction of its habitat area during the glacial period and the subsequent development of new territories, has led to the formation of a large intraspecific genetic diversity, which arose both because of range disjunction and the need to adapt to the conditions of very diverse regions in terms of climate and biota. Populations from different regions are characterized by very contrasting indicators of growth rate, winter hardiness, frost resistance, drought tolerance, salt tolerance, as well as resistance to pests and pathogens, primarily those that cause stem rot. In addition, populations of different origins differ in their ability to vegetative reproduction and the set of symbiotic microorganisms, which is very important in connection with the increasing use of biotechnology methods in the cultivation of these tree species. The current intraspecific diversity has both great practical applications in breeding work and can be used to reconstruct the history of white poplar dispersal after the retreat of glaciers, to identify primary and secondary ranges of the species, the history of origin and dispersal of individual forms, and to solve problems of intraspecific systematics.
In this study, we present the first insights into the formation of the modern P. alba region during the postglacial era, based on the genomes of 36 white poplar individuals collected in Russia and Kazakhstan, and using available genomic data of white poplars from Italy, Hungary, and China. Our findings shed light on the evolutionary history of this species in the East European Plain, the Caucasus, and Western Siberia. The data may also be employed to model the range dynamics of other species of the genus in Eurasia and North America, as poplars exhibit analogous strategies in the development of their populations.
2. Results
To understand the process of modern white poplar range formation, we used a genomic approach. To do so, we collected genome data from plants originating from a range of countries and geographic locations and then evaluated clusters of genetically similar samples based on SNP profiling. First, we mapped the reads to the reference genome of white poplar and then performed SNP calling as described in the Materials and Methods section. For the most samples (73 of 87) more than 85% reads were successfully mapped to the reference sequence. About 23 million SNPs (at least in one sample) were identified. Based on the profiles of per-sample SNP occurrence, we performed PCA analysis (
Figure 1). The phylogenetic tree corresponding to this graph is shown in
Supplementary Figure S1.
Based on this PCA graph, we identified three clusters. First, samples collected in the Caucasus (in the vicinity of Sochi, Pyatigorsk, and in Dagestan) were most strongly separated from all other points and formed a separate cluster. Chinese (growing in Xinjiang and the Irtysh River valley) and Siberian trees were also grouped separately. All other plants formed a large third cluster. We will now note that the Italian specimens are furthest away in it, as well as plants from Beijing (which were imported). And 6 poplars, 4 of which were pyramidal (these are all pyramidal poplars in our study), formed a separate subcluster inside this third cluster.
Having studied the PCA plot, we hypothesized that there were probably 3 ancestral populations according to the 3 identified clusters. Based on these genotypes, the entire modern diversity of white poplars could have been formed. To test this hypothesis, we used the NGSAdmix program as written in the Materials and Methods section, the results of which are presented in
Figure 2.
We also used NGSAdmix with the number of genotypes k = 2, 4, 5, and 6, and the results are shown in
Supplementary Figures 1, 2, 3, and 4, respectively. However, while at k = 3 the split corresponds to geography (Europe, Caucasus, and Chinese regions), at k = 4 Chinese poplars split into two genotypes, and at k = 5 a cluster appears in the Southern Federal District, where it is unlikely that a refugium existed during glaciation due to the harsh climatic conditions. Thus, we hypothesize that there were three major refugia in terms of the global population of
P. alba.
We interpreted the results of this analysis as follows. A1 is the most probable ancestor of poplars in China, largely in Western Siberia, and has also contributed to the gene pool of plants in the vicinity of Uryupinsk. A2 is a key ancestor of trees in the Transcaucasia and Caucasus, and its participation in the artificial settlement of CFD territories and the creation of pyramidal forms is also notable. A3 is an ancestor of poplars from Italy, a key member of the gene pool of the central and lower Volga region, secondary area of central part of Russia, as well as contributing to populations in south of Western Siberia and the Caucasus and to the creation of pyramidal forms, known as the variation pyramidata sovietica. We reveal our theory of white poplar distribution in more detail in the discussion section.
3. Discussion
The family Salicaceae separated about 128 million years ago. It includes 54 genera and about 1,400 species, with most genera represented by a small number of species and distributed in Southeast Asia, the likely center of origin of the family [
10]. The most evolutionary successful genera are
Salix and
Populus. Their common ancestor about 60-65 million years ago underwent Salicoid whole genome duplication, which affected approximately 92% of the genome and resulted in more than 8000 pairs of paralogous genes [
6]. This is probably why the genera
Salix and
Populus were able to spread almost all over the Northern Hemisphere, especially in the boreal regions, and now number around 450 and 50 species, respectively [
11,
12].
The genus
Populus is divided into four sections: Abaso, Turanga, Populus and ATL, the latter including representatives of the traditional sections Aigeiros, Tacamahaca and Leucoides. Abaso is the most primitive section, including
P. mexicana, common in Mexico. Turanga includes
P. euphratica, known to grow in deserts in the Middle East and named after the Euphrates River. Populus and ATL are the most evolutionary advanced groups. Populus includes the aspens:
P. tremula,
P. tremuloides, as well as
P. alba,
P. tomentosa,
P. qiongdaoensis, and some other poplars. The ATL group includes
P. trichocarpa,
P. balsamifera,
P. deltoides,
P. nigra,
P. lasiocarpa, and others [
13]. Interestingly, poplars, like willows, are dicotyledonous plants, and some representatives of the Populus section, including
P. alba, have a ZW system of sex determination, while most poplars are characterized by an XY system [
14,
15]. Moreover, only 2 species are tropical in the genus
Populus, namely
P. qiongdaoensis and
P. ilicifolia, distributed in Hainan and Africa (Kenya and Tanzania), respectively [
15].
P. trichocarpa and
P. balsamifera are distributed in temperate forests of North America.
P. trichocarpa is spread along the west coast of the United States and Canada, while
P. balsamifera occurs somewhat northward, from Alaska to Labrador; however, there is a significant degree of hybridization between these closely related species in the habitat overlap zone [
16,
17,
18,
19].
P. tomentosa is a Chinese endemic [
20]. The range of
P. alba extends from Western Europe to Altai and Xinjiang, bounded in the north by mixed forests and taiga, where the climate is too cold for it, and in the south by steppes, semi-deserts and deserts, where it is too dry, as can be seen in
Figure 3.
P. alba did not spread across the Altai, probably due to the harsh sharply continental climate of Eastern Siberia and competition from other species [
21,
22,
23,
24].
We did not find articles describing a possible scenario for the formation of the modern range and population structure of
P. alba, there was only a research devoted to Central and Southern Europe. It was shown that the two main refugia in this region were in Italy and Romania. At the same time,
P. tremula survived in more northerly and harsh conditions, near the ice shield of the glaciated Alps. When the glaciers retreated, the ranges of both species expanded, and in the contact zone they began to hybridize actively, forming the natural hybrid
P. x canescens [
25]. White poplars in China, growing in the Irtysh River basin, have also been studied. Interestingly, the heterozygosity of populations there is lower than in Italy and Hungary [
26]. In addition, in this region, as in Europe,
P. alba also hybridized significantly with
P. tremula. This process, in turn, was significantly limited by plastid-nuclear incompatibility [
27].
We will also consider analogous research for two other species, namely P. cathayana in China and P. balsamifera in Canada and USA, just for example.
A recent study of
P. cathayana revealed 4 genetically distinct populations, named by the authors for growing in the Southwest (NW), Northwest (SW), and North China (NC), and the Taihang Mountains (TH). Splitting of the ancestral population began 1649 thousand years ago (kya), with one branch dividing 1430 kya into TH and NW and the other branch splitting 987 kya into NC and SW. The distribution and adaptation of
P. cathayana has been linked to climate changes [
28].
Another interesting evolutionary story concerns the postglacial distribution of
P. balsamifera in North America after the last glacial maximum, which occurred about 18,000 years ago. The authors identified three demes, namely the Central, Northern, and Eastern ones. The Central deme is the most genetically diverse, occupies the largest area, and is probably directly related to the ancestral population in the refugium. The Northern deme occupies Alaska, and the Eastern deme inhabits Quebec and Labrador [
18]. In another paper, it was shown that the boundary between the Eastern and Central demes is clearer than between the Central and Northern demes (there referred as Western), which can probably be explained by the earlier separation of the Eastern deme from the general evolutionary branch [
19].
Quaternary glaciation appears to have been a significant force in shaping the present-day range of
P. alba. It began 2.58 million years ago and has continued to modern times, with periods of glaciation alternating with much warmer periods of interglaciation [
29,
30]. For example, the last glaciation ended 11,700 years ago, followed by the Holocene, which is an interglacial period [
31,
32], and the next glaciation is projected to begin in 50,000 years [
33]. During the maximum glaciation glaciers were in the northern part of Europe (most of Great Britain, the territory of modern Germany, Poland, Belarus, northern Ukraine and part of Russia), also the centers of glaciation were mountain systems: the Alps and the Caucasus. But in Eastern Siberia glaciation also occurred but covered quite a little percentage of all surface - probably, the dry continental climate did not allow the formation of a glacier [
34,
35,
36].
Based on our findings and all the above information, as well as on the data on the current distribution of white poplar we formulated a hypothesis of how the formation of its modern range occurred, which is shown in
Figure 3, and the numbers in this section of the text coincide with the numbers in the figure for ease of perception.
In the late Pliocene, about 2.5 million years ago, a single population of white poplar probably existed across vast territories of Eurasia. However, after a series of glaciations in the Pleistocene, this tree became extinct in a significant part of its historical range, surviving, among others, in three regions on the territory of Russia and neighboring countries. These regions, those we consider as refugia, were never covered by glaciers and correspond to putative ancestors A1, A2, and A3. The first (5) is the Altai Region and mountains of Central Asia, located in present-day Russia, Kazakhstan, China (Xinjiang), Kyrgyzstan, Uzbekistan, Tajikistan and Afghanistan, which matches A1. The second (4) is Transcaucasia (since the Caucasus Mountains themselves were covered by glaciers at that time and the southern coast of the Caspian Sea on the territory of modern Azerbaijan, Armenia, eastern Turkey and northern Iran, it conforms to ancestor A2. The third region (3) was in the north of Africa and in the south of Europe, on the Iberian, Apennine and Balkan peninsulas, on some large islands, including Sicily, Sardinia and Crete, and on peninsula Asia Minor, settling on the territory of modern Spain, France, Italy, Croatia, Greece, Turkey and some other countries, and corresponded to ancestor A3. We named these regions Altai-Middle Asian refugium, Transcaucasian refugium, and South European refugium, respectively.
When glaciers began to retreat, white poplars started to spread outside these refugia. It was the third, Southern European refugium that became the most important for the formation of poplars in the European part of Russia. Migrations (6) from it in the northeastern direction led to the settlement of a significant part of the East European Plain, including the vicinities of Nizhny Novgorod and Uryupinsk. At the same time, steppes (10) in the south of modern Ukraine and the Pre-Caucasus, as well as semi-deserts and deserts (11) in the territory of modern Russia, Kazakhstan and other countries became a natural obstacle to the spread of poplar in certain directions. It penetrated the respective regions only along river valleys (12), such as the Don (12a), Volga (12b), and Ural (12c). From the European part of Russia, poplar migrated further eastward to Western Siberia, making a certain contribution to the gene pool of this region, as can be seen from the results of genomic analysis of samples collected near Novosibirsk in
Figure 3. In parallel with this migration, plants from the Black Sea coastline of Turkey reached the Caucasus, when the glaciers retreated from here. This event had an impact on the local genofond, as some of the Caucasian samples are also carriers of European genes.
The main contribution to the gene pool of modern populations of white poplar in the Caucasus comes from the Transcaucasian refugium. When the glacier in the Caucasus receded, trees from Transcaucasia penetrated northward (7), but did not spread further due to steppe and semi-desert zones north of the Caucasus Mountains.
The Altai-Middle Asian refugium may have expanded somewhat (8) after the end of glaciation; the poplars living there are the ancestors of modern Chinese plants and, to a large extent, the poplars of Western Siberia. There was also their migration westward, at least to the East European Plain, since all plants collected in the vicinity of Uryupinsk are carriers of Altai genes.
We separately note that although the plants we collected in the Central Federal District (Moscow and Moscow Region, Obninsk, Tula, etc.) are genetically close to other European populations and also have a certain percentage of Caucasian genes, in these regions, poplars appeared as a result of introduction in the nineteenth and twentieth centuries (9), so this part of their habitat cannot be called natural.
All pyramidal poplars, two of which were collected by us in Kazakhstan and two in CFD, are artificially bred based on European and Caucasian genotypes and are genetically relatively close to each other.
Therefore, we consider Southern Europe, Transcaucasia and the Altai-Middle Asian system as potential refugia where poplar could have survived during glaciations. During glacial retreat, poplar spread to regions suitable for it in terms of climatic conditions. This probably happened repeatedly. Our study is the first hypothesis about the formation of the modern range of white poplar. Further research in this direction involving more data is required.