4.1. Genetic Variability
The high number of samples included in this study has uncovered a pattern of genetic variability which had not been appreciated in previous studies. The mean heterozygosity per locus (H) is higher in Atlantic populations, and lower in Mediterranean populations, especially in the Eastern Mediterranean region (EMED). However, the mean number of alleles per locus (Na) is higher in the Mediterranean populations, and increased to the East. This contrasting patterns of H and Na are due to the presence of several alleles in the populations from the Mediterranean, especially in the EMED, that are not present in the Atlantic populations, and increase Na in that region. Lower heterozygosity associated to high number of alleles in low frequency is the expected outcome of mutation and genetic drift when a population experiences population size contractions followed by population expansions. The Pleistocene glacial-interglacial provide a framework to explain these observations. If most of the alleles that are restricted to the Eastern Mediterranean were in very low frequencies, a contraction followed by expansion associated to Pleistocene glacial-intergalcial cycles would be a possible explanation for the observed pattern. However, several alleles restricted to the EMED appear in relatively high frequencies (
Ech-3, up to 0.35;
Ech-4 up to 0.28;
TBP-3 up to 0.38), and therefore, other processes are need in order to account for the results. It is possible that the populations in EMED are remnants from an ancient population with a large effective size (Ne), and glacial-interglacial cycles decreased H, but many alleles still remain. Cordero et al. [
8] dated the origin of alleles restricted to the WEMED and EMED at the loci Ech and
TBP. Alleles Ech-D,
TBP-A1 and
TBP-C (corresponding to Ech-4,
TBP-1 and
TBP-3 in this study), gave best estimates for their divergence from the phylogenetically closest alleles of 206, 273 and 416 thousand years before present (KYBP). Allele Ech-A2 (which was a subgroup of sequences of the allele Ech-A (Ech-2 in this study) was present in the whole Mediterranean, and diverged at 424 KYBP. Finally, Ech-C (Ech-3 in this study), which is also widespread in the Mediterranean, diverged at 1078 KYBP. The divergence of the Mediterranean alleles therefore seems to have occurred earlier than the divergence of the alleles that are exclusive to the eastern Mediterranean alleles. The most probable date for this divergence seems to be the late Pleistocene period, but much earlier than the last glacial maximum (20 KYBP). Therefore, it is probable that, between less than 500 KYBP and the last glacial maximum, at two separated times, the Eastern Mediterranean harbored an isolated population of clams that lost part of their genetic variability, but did not lose many alleles. The oldest alleles would have expanded to the whole Mediterranean, while the more recent alleles have been restricted to the EMED. Another factor that could have had a role is natural selection, which could have favored one or more EMED-restricted alleles due to their linkage disequilibrium with other variants in the coding regions of the genes or in nearby genetic locations.
4.3. Geographic Genetic Differentiation
The results of this study agree with previous studies using intron-RFLP and microsatellite markers in showing a subdivision of the species in an Atlantic and two Mediterranean groups, corresponding to the Western and Eastern basins[
6,
12]. The genetic differentiation found between the Atlantic and the Mediterranean populations is consistent with many other studies showing the same result in other species using several types of genetic markers (enzyme polymorphisms, mitochondrial DNA, microsatellites, SNP), and have been related to the Pleistocene glacial-interglacial cycles and present-day restriction to gene flow at some points ([
8,
9]).
The inclusion in this study of new samples, and their analysis together with those from previous studies, has allowed to discover new features of the genetic structure of the clam populations living the Mediterranean Sea. In the Eastern Mediterranean, Cordero et al. [
6] examined populations from the northern part of the basin only, i.e. the Adriatic and the Aegean seas. They used the initial letters of both seas to create an acronym (AEGAD) to refer to that group of populations. The inclusion of a sample of clams from Egypt in this study has allowed to prove that the southern part of the Eastern Mediterranean basin shares the genetic characteristics of the northern part, and therefore it is possible to speak of an Eastern Mediterranean subpopulation or race (EMED). Further research could help to decide if the EMED populations represents a subspecies of
R. decussatus.
Cordero et al. [
6] also observed that the Tunisian population WM5 (Sfax), which geographically is located in the Eastern Mediterranean basin, was genetically more similar to Western Mediterranean populations than to the remaining Eastern Mediterranean populations. Gharbi et al. [
13] examined the genetics of twelve Tunisian populations with sequences of the mitochondrial gene COI and the nuclear ITS, and found no important differences in haplotype frequencies between the populations from the northern coast of the country, which geographically belong in the Western Mediterranean basin, and the populations from eastern coast, which belong in the Eastern basin. In this study, two of the population samples analyzed by Gharbi et al. have been included, and, together with the population studied by Cordero et al., it can be clearly seen that these populations fall on the Western Mediterranean group. Studies have reported that populations of marine species from the eastern coast of Tunisia can be genetically more similar to western Mediterranean populations in some cases, or to the eastern Mediterranean in others. The reasons for this variability are not well known. A combination of factors might be acting, including the east to west flow of the northern African current favoring dispersal past the Siculo-Tunisian Strait, and the effect of endogenous barriers to gene flow[
6,
17]. Specific studies, including populations beyond the Tunisian borders to the east, should be carried out in order to understand the mechanisms that explain this situation.
Some aspects of the allelic distributions in the Mediterranean could be due to historical and/or environmental factors, or to anthropogenic causes. For example, the presence in very low frequency of allele TBP-4 in Tunisia (WM4) was interpreted as a private allele by Cordero et al., but in this study it appeared in slightly higher frequency in a population near the Suez Canal (Timsah Lake) in Egypt, some 2500 km away. This finding points to the presence of this allele in low frequency along the northern African coasts, but is also compatible with a spread of this allele from Egypt to Tunisia through ballast waters along one of the busiest maritime routes in the world. This seems also the most plausible explanation for the presence of typical eastern Mediterranean alleles Ech-5 and TBP-3 in the Thau lagoon, near the port of Sète in south east France (WM3 population).
The inclusion of data from a sample from SW France reported by Saavedra et al. [
15], along with the addition of two new samples from Tunisia, have allowed to confirm the relative homogeneity of the populations of the southern part of the western Mediterranean (WMED), and to show a slightly higher differentiation of the northern populations in that basin, here represented by samples from the Ebro Delta in NE Spain and from the mentioned Thau lagoon in SW France. These populations formed a separate branch in the NJ tree and showed a higher frequency of cluster 1 and a lower frequency of cluster 2 in the Bayesian analysis of genetic structure. Differentiation of populations from the north and south of the WMED has been shown in other marine species living in that region, and it has been related with the presence of the Balearic oceanographic front [
20]. It would be interesting to study samples from other French and Italian populations, as well as from the western Mediterranean Islands, in order to better characterize the genetic structure of the WMED.
This study has increased considerably the sampling along the Atlantic coasts of the Iberian Peninsula, and has found out interesting novelties. Previously, Arias-Pérez et al. had found that the populations from the SW of the Iberian Peninsula were more similar to the populations of the Bay of Biscay than to other populations on the Atlantic façade of the Iberian Peninsula, which are geographically closer [
12]. Here we have shown that these two regions are no so similar, as they rendered significant pairwise-F
ST values and showed different frequencies of clusters in the Bayesian analysis of genetic structure (especially of clusters 2 and 6 for K=6).
Cruz et al. [
14] described a set of two populations in central Portugal (Obidos lagoon and Aveiro ) that showed moderate differention with respect to their neighbor Portuguese and Spanish populations. In this study, we have presented a more detailed data set of that part of the Iberian Peninsula. We have confirmed the differentiation of the same two populations from central Portugal (AT14 and AT15) with respect to their sampled neighbor populations, not only the Spanish ones, but also one Portuguese population located in northern Portugal (AT13, from Viana do Castelo). Surprisingly, AT13 is neither more similar to its closest neighbor in the northwest of Spain (AT12, from Redondela in the Ría de Vigo estuary), but rather to the populations of the Bay of Biscay. Moreover, the clams from Obidos and Aveiro have the highest similarity with those from A Coruña (AT7), a more northern population ca. 500 km apart, which is also different from their closest neighbor populations AT6 and AT8. Finally, between A Coruña and Viana do Castelo, there is a group of populations which appear closer to one another in the NJ tree and have in common the sharing of STRUCTURE cluster 4 in higher frequency than any other populations included in this study. These populations belong in the set of estuaries known as Rias Baixas, but also include the northern neighbor population AT8, which is part of the Rias Medias together with AT7 and AT6.
In summary, there is a genetic mosaic along the NW corner of the Iberian Peninsula that had not been described before. This mosaic is partially related to the geographic framework of the different sets of rias, and require specific explanations. The differentiation between populations of species such snails, algae and littoral plants from the NW of the Iberian Peninsula has been explained as the result of the Finisterre oceanographic Front [
30,
31,
32]. This front is formed in summer off the coasts of Cape Finisterre due to the encounter of eastern north Atlantic central waters (ENACW) of subtropical and subpolar origins (
Figure 6)[
33,
34]. The front could affect the transport of the grooved carpet-shell clam larvae, which reproduces in summer months in that region [
35,
36]. Larval transport disruption could result in limited gene flow, which could account for the differentiation between the AT8-AT12 group from the AT6-AT7 and Bay of Biscay populations. But his would leave unexplained the similarity of A Coruña (AT7) in the Rias Medias with Aveiro (AT14) and Obidos (AT15) in central Portugal, and the similarity of Viana, in northern Portugal (AT13) with the populations in the Bay of Biscay. A possible explanation for this inconsistency is just a higher isolation and lower effective size in these populations, which would result in that distant populations would look alike just by chance. Another possible explanation is the effect of river plumes, which have been proposed in some marine species as causes of genetic differentiation of populations at both sides of the plume [
37,
38]. In this case, the plumes of rivers Minho and Douro (
Figure 1b), which show the highest discharge rates among the Iberian rivers flowing to the Atlantic, could act as barriers to gene flow. The differentiation of AT13 with respect to the populations from Rias Baixas and to AT14 and AT55 would fit this model.
However, since intronic markers are located in protein coding regions (genes), which may be sensitive to natural selection, non-neutral (selective or adaptive) explanations cannot be excluded. One adaptive explanation for the differentiation of Rias Baixas is the upwelling of cold and nutrient rich ENACW water inside these estuaries. The upwelling is favored by N component winds in spring and summer. Upwelling episodes are more frequent in the Rias Baixas than in the more northern estuaries, and induce important differences between the two groups of estuaries in temperature, primary production and phytoplankton communities that may affect both larval and adult clams [
39]. Another plausible adaptive explanation is that AT7, AT13 and AT14 locations may be characterized by a more oceanic than estuarine environment, and the differences with respect to their neighbor locations are due to the adaptation to this type of environment.
Finally, anthropogenic causes cannot be left aside, as the concerned populations in the NW Iberian Peninsula are among the most commercially exploited in Europe. The exchange of juvenile clams between locations for fishery management purposes, and the introduction of clam spat obtained in hatcheries could also result in the observed pattern of differentiation. The similarity of the Viana (AT13) population with those of the Bay of Biscay would fit this human intervention, as hatcheries in N Spain and Atlantic France are usual suppliers of seed for the fishermen guilds.