1. Introduction
Lupinus mutabilis Sweet (Fabaceae), also known as “tarwi”, “tarhui” or “chocho” is an Andean legume cultivated in Andean region in South America [
1]. Tarwi has had importance in diet since pre-hispanic time [
2]. It has important nutritionally compounts, mainly due to its protein values that varies from 32% to 51.6% (rich in globulins, 43–45% and albumins, 8–9%) [
3], high oil content (13–24%), crude fiber (6.2–11%), minerals such as iron, magnesium and phosphorus, and bioactive compounds such as isoflavones and phenols, with proven antioxidant capacity [
4]. These nutritional levels present in tarwi are even better than soybeans [
5]. Due these qualities, it constitutes an alternative crop to reduce malnutrition, and is considered as the emerging protein crop for Europe and temperate climate zones [
6]. Likewise, its adaptation to altitudes of 3100 to 3,850 m.a.s.l., temperate climate, and influence of the length of the day, makes it susceptible to low temperatures (−2 °C) in the initial stages, it requires around 350 to 800 mm of rain, and can grow for 240 to 300 days [
7]. Also, for its cultivation on marginal lands, under drought stress, mostly without tilling the land and implementing agronomic practices, it is considered a resilient species against the impact of climate change. Likewise, it possesses the ability to fix atmospheric nitrogen and mobilization of phosphorus that promotes agroecological production [
8].
On the other hand, tarwi presents key domestication characteristics, including indehiscent pods and permeable seeds with tegument, which represents a locally important crop in several Andean areas [
9].
L. piurensis is indicated as the probable wild progenitor of
L. mutabilis. Atchison et al. [
2] generated nextRAD sequence data for 212 accessions of Andean
Lupinus, representing 63 species and resolved relationships between species that diverged over time and sheds light on the origins of domestication. It has been reported that the tarwi ecotypes from northern Peru are bushy and decumbent in growth and generally have a prominent stem; while the ecotypes of the central and southern zone have herbaceous and bushy growth with semi-erect growth and non-prominent stem. Also the vegetative period varies from 180 to 270 days [
8]. Camillo et al. [
10] evaluated accessions from 22 populations of 16
Lupinus species, and showed that
L. mutabilis and
L. semperflorens, among other 13 species, presented 2
n = 48 chromosomes; while in
L. bandelierae presented 2
n = 36 chromosomes. They also suggested that cytlogically Andean lupines are more closely related to North American species than those of South America [
10]. The introduction of
L. mutabilis in the Mediterranean area shows a wide intraspecific genetic variability in collections, which allows the establishment of conservation and improvement programs [
6]. Guilengue et al. [
6] also evaluated the associations between genome size and morphological characters using Spearman's correlation analysis for 23 accessions, finding that no individual morphological trait presented strong correlations with genome size [
6].
Currently,
L. mutabilis remains a poorly studied crop in the field of genetics. Chirinos-Arias et al. [
11], indicated that the inter-accession genetic modification in
L. mutabilis, according to the accessions and ISSR markers evaluated, is considerable. They reported that
L. mutabilis is an autogamous plant with a considerable degree of allogamy. Ruiz-Gil et al. [
12] carried out a morphometric analysis using the flower standards of
L. mutabilis,
L. piurensis and the population that presents intermediate characteristics. The analysis of canonical discriminants with data from a morphometric analysis showed the existence of three different groups: (1)
L. mutabilis, (2)
L. piurensis and (3) the population with intermediate characteristics. Allo and autopolyploidization events, along with other chromosomal rearrangements, during the evolution of this species could have led to duplication/or triplication of genome regions, as reported in the Old World species
L. angustifolius [
13].
Molecular data increased the understanding of plant systematics at various taxonomic levels [
14]. The genetic similarity between genotypes can be assessed with DNA markers [
15], also it can help to select accessions for establishing a core collection. Genotyping by sequencing (GBS) is one such sequence identification method variants using next generation sequencing technology, producing powerful and cost-effective genotyping process [
16]. In addition, its genotyping can be easily replicated [
17]. Its application has been reported in different crops such as dry bean [
18], potato [
19], reed canarygrass [
20], lentils [
21], maize [
22], barley [
23], rice [
24], soybean [
25], switchgrass [
26], and wheat [
27] Thus, the objective of this study were to characterize a collection of tarwi germplasm currently maintained by the Grain Legumes and Oilseeds Research Program of the National Agrarian University - La Molina (UNALM for its acronym in Spanish) to gain a better understanding of the genetic diversity and population structure of this legume by employing a NGS technique.
4. Discussion
Molecular markers represent an important component in the plant breeding area and are widely used today for multiple purposes. These markers are employed to deepen the knowledge of the diversity and population structure in plant genetic resources that help plant breeders to develop new and improved cultivars with favorable characteristics for farmers [
47,
48]. Knowledge of the genetic structure and diversity of germplasm collections is an important foundation for crop improvement [
49]. Single nucleotide polymorphisms have gained popularity due to their abundance in the genomes and their amenability for high-throughput detection formats and platforms [
50]. To date, limited studies were conducted with molecular markers to determine the genetic diversity of tarwi and other crops in Peru. We here for the first time employed genome-wide SNPs to infer the genetic diversity and population structure of Peruvian germplasm of tarwi.
Genetic diversity indices of lupin, based on SNPs, is high among the nine populations sampled across the Peruvian Andes, which is concordant for individuals that are landraces, as reported for other landraces of rye [
51], pea [
52], maize [
53], rice [
54,
55], squash [
56], bean [
57], wheat [
58]. The wealth and abundance among tarwi landraces can be explained due to their adaptation to local environments and diversity of grower´s choice [
49]. Unfortunately, genetic diversity indices and population structure for
L. mutabilis have not been reported in detailed. On the other hand, these were inferred for other
Lupinus species. Raman et al. [
59] used simple sequence repeat (SSR) and DArT molecular markers and 94 Ethiopian accessions of white lupin (
L. albus). They reported that those accessions represent a unique genepool with high level of genetic diversity. Similarly, with 11 SRAP primer pair combinations, El-Harty et al. [
60] reported high genetic diversity for Egyptian while lupin genotypes. In addition, Atnaf et al. [
61] used 15 SSR and 212 Ethiopian white lupin landraces, indicating that this germplasm possessed high genetic diversity. Their gene diversity (i.e. expected heterozigosity) (0.31) is close to the average value obtained in this study (0.427). A very similar gene diversity index was reported by Ji et al. [
62] (0.476) for narrow-leafed lupin (
L. angustifolius) using 76 SSR markers. On the contrary, Skorupski et al. [
63] indicated that average heterozigosity of
L. nootkatensis is 0.03. This reduced value may be explained by the isolation of this species in Iceland. Genetic studies on tarwi are scarce. Chirinos- Arias et al. [
11] analyzed the genetic variability of 30 accessions of tarwi from the Andean Peruvian region with inter simple sequence repeat (ISSR) markers, indicating a broad genetic diversity among them. In a more recent study [
6], a total of 23 tarwi accessions with six ISSR markers were employed and revealed important levels of diversity, but this is not related to phenotypic diversity, reflecting the recent domestication of tarwi.
Assessing population structure provides insights into the genetic diversity of the species under study and facilitates association mapping studies [
64]. STRUCTURE analysis revealed that 89 samples of tarwi from the Peruvian Andes clustered in two well-defined groups associated with their geographic zone (center + south and north). Similar result were provided by PCoA. This clustering pattern meets our expectations as individuals from these two geographic zones differ on their morphology. Tarwi landraces from the center and south of Peru tend to be more compact with reduced branching and present early plant maturity. On the other hand, in northern Peru, tarwi landraces are more vigorous and possess more branching with late maturity. However, farmers cultivate tarwi under two forms of conditioning of the land i) zero tillage on fallow land. They make a hole to deposit the seeds in most of the localities in the northern regions; ii) in the north-center (Ancash, Huánuco and Huanvelica), farmers use the “yunta” for soil preparation in fallow land [
65] and iii) in southern Peru, farmers have started preparing the soil by conventional tillage. That is, soil conditioning and the use of local varieties in each region influence the period of the crop cycle, which is late when soil movement is involved [
66,
67]. Morevoer, these differences may be explained by the latitude where these tarwi landraces are cultivated. Latitude affects plant growth significantly [
68]. The higher the latitude, the shorter the growing season of these landraces with a more reduced size, as revealed in
Arabidopsis thaliana [
68] and
Ambrosia artemisiifolia [
69]. In addition, this clustering pattern may be due to the common process of exchanging tarwi seeds by growers living in close geographic areas like the center and south of Peru versus farthest places (northern area). Other lupin species such as
L. albus [
61],
L. angustifolius [
62] were also grouped into two populations.
A low degree of differentiation was exhibited among the two populations of tarwi, demonstrating that they share genetic material through high levels of breeding. Lupins are generally considered self-pollinating species [
70], therefore, they tend to homozygosity. However, the negative F
is for both populations indicated an excess of heterozygotes for
L. mutabilis, demonstrating that tarwi depends also on cross-pollination. These results are in agreement with Caliari et al. [
1] who indicated that outcrossing rates of
L. mutabilis varied between 16.6% and 58.8%. Consequently, this crop should be treated as a cross-pollinated plant in breeding programs. Similarly,
L. albus [
71],
L. nootkatnsis [
63] and
L. angustifolius [
62] depend also on cross-pollination. In the Andean Peruvian zone it is very common to observe populations of cultivated tarwi coexisting with its wild relative,
L. piurensis [
12], which could favor interbreeding. According to AMOVA, the greatest variation exist within accessions of tarwi (92.41%), which is explained by the sexual propagation of this species. In addition, low genetic variation between tarwi populations may be due to gene flow caused by the exchange of seeds, as depicted for
L. angustifolius [
62]. Similarly, Atnaf et al. [
61] indicated that 92% of allelic variability was attributed to individuals within populations of
L. albus.
The growing demand of novel sustainable protein sources (legumes, insects, others) [
72] can be supplied by lupins, which are protein-rich legume crop but they are still limited for human consumption due to the presence of alkaloids [
73]. Currently, lupin breeders only deal with a reduced part of the gene pool of this species, employing mainly low-alkaloid individuals to develop new cultivars [
74]. In Peru, this study represents an initial step for breeding and conservation of this importand legume also known as “lost crop of the Incas”. However, further research is needed. For instance, NGS techniques should be employed to develop molecular tools for this crop, considering that its introduction in other continents will provide new source of proteins and biomass, while contributing to the improvement of poor soils [
75]. Our next steps is to employ genome editing techniques for functional genomics and improvement of this neglected crop, aiming to alleviate poverty in rural communities of the Andean Peruvian region.
Author Contributions
Conceptualization, A.H.-J., C.I.A, C.L.S and S.G.; methodology, A.H.-J., C.I.A, C.L.S and P. R.-G.; software C.I.A and C.L.S; validation, A.H.-J., C.I.A and C.L.S; formal analysis, A.H.-J., C.I.A and C.L.S; investigation, A.H.-J., C.I.A, C.L.S, D.S and P.I resources, A.H-J., W.S., P.I. and C.I.A.; data curation, A.H-J, C.L.S. and C.I.A.; writing—original draft preparation, A.H-J, C.I.A, C.L.S and D.S.; writing—review and editing, A.H.-J., C.I.A., C.L.S. and W.S.; visualization, A.H.-J., C.I.A and C.L.S.; supervision A.H.-J. and C.I.A.; project administration, A.H.-J.; funding acquisition, A.H.-J.