1. Introduction
The Los Angeles River has the potential to influence systems beyond its boundaries such as estuarine environments at its outlet to the Pacific Ocean. In 2020 the County of Los Angeles Gross Domestic Product was
$6.5 billion [
1] and population was over 10 million [
2]. Well-documented issues with contamination such as heavy metals, excess nutrients, coliform bacteria, and cyanide [
3] have resulted from industrialization and high population. The LA River is also a habitat for bacteria, fungi, fish, plants, and invertebrates that are all sensitive to pollution. More recently, efforts have focused on protection and recognition of the river as a natural ecosystem, and part of that effort has been assessing the impacts of urbanization on the LA River ecosystems through eDNA sampling [
4].
There have been few studies which have aimed to characterize the biome of the LA River, however interest in characterizing microbial communities in this biome has increased in recent years [
5,
6]. The diversity of life, including fungi, bacteria, plants, fish, and invertebrates is indicative of ecosystem health. The presence or absence of certain “indicator” species reflect health and the presence of oxygen, or degradation and pollution [
7,
8,
9,
93]. By investigating microbial community composition and identifying relative species abundance, ecosystem health can be compared among different locations subject to different pollutant profiles.
By investigating microbial community composition and identifying relative species abundance, ecosystem health can be compared among different locations subject to different pollutant profiles. The L.A. River presents a unique opportunity to measure the impact of various types of urban pollution and infrastructure on microbial communities. The river runs through rural, suburban and urban areas and the impact of population density can be assessed. In addition, the Los Angeles River was highly modified to facilitate flood control [
10]. The city could not have grown to be such a metropolitan center if it was not for the engineering of the LA River [
11,
12]. The difficulty was that although the climate was dry for most of the year, when the rains did come there was often precipitation greater than 2 inches per hour, which led to flooding that could be catastrophic. Another crucial question documented by Wenger et al refers to the relationship between urbanization and the structure and function of microbial communities, which has not been well-studied [
13]. The question of how microbial communities may differ from one another in different land use areas, and how urbanization may affect the proportions of different classes of microbes, remains vital. The importance of this type of investigation was also underscored in Antwis et al’s perspective on the most important areas of inquiry in microbial ecology [
14]. In terms of urbanization, the modification or toxification of the environment may have influenced which organisms were present.
Ecological habitat was believed to have been diminished due to most of the LA River bottom being impervious concrete [
15]. According to Wenger et al, inquiry into the characteristics of piped or concrete paved tributaries as they influence biogeochemical processes represents one of the most important questions in urban stream ecology [
13]. Presence of a concrete river bottom has been known to influence the oxygen content of freshwater and this factor is expected to be one of the key factors which would influence communities in the concrete bottom condition. Nevertheless, if River organisms such as oxygenic autotrophs generate oxygen to the aboveground environment, it would help to offset such a concern by performing a beneficial function.
Since bacteria play a huge role in breakdown of wastes, nitrogen cycling, plant growth promotion, and pathogenicity, differences due to a bottom-up effect warranted a closer look. Lack of oxygen in the underwater environment was expected to be one of the key factors which would influence communities in the concrete bottom condition. Furthermore, concrete paved rivers contribute to the urban heat island effect, which involves increased light intensity and heat [
13]. Urban rivers generally have a cooling effect on the metropolis by virtue of the water that flows from them and the green spaces they support [
16].
In the absence of rain, the LA River has been fed by water from three water reclamation plants. Ackerman et al found in 2003 that there were higher ratios of ammonia to nitrate near the water reclamation plants [
10]. The benefits of using reclaimed water are obvious in terms of ecosystem services as a river fed by recycled water would be expected to provide more habitat than a dry river bed. The year-round supply of water has the potential to support wildlife and vegetation. The water sources have been shown to increase the NO
3- concentration near the treatment plant sources, but it also would be expected to dilute the concentration of other pollutants such as hydrocarbons from households and industry pollutants such as heavy metals. The proximity to a water reclamation plant could influence the diversity of bacterial sequences recovered from different sampling locations. A potential abundance of nitrate from water treatment plants was considered to be a priority at sites nearby to Glendale [
10], however the flow of water to wildlife through the river conduit would be expected to promote the diversity and abundance of organisms. On balance, the river would otherwise be a dry ravine during most of the year due to the Mediterranean climate, if it were not for the releases from the water treatment plants.
In this study we sought to investigate the impact of urbanization, presence of concrete river bottom, and nutrient pollution on microbial communities along the L.A. River. This was done by meta-barcoding and community analysis of environmental DNA (eDNA). Organisms that perform beneficial functions in the LA River Ecosystem were identified and quantified among samples taken along the river [
17]. This study focused on both eukaryotic and prokaryotic microbes including archaea, bacteria, cyanobacteria, fungi, plants, and eukaryotic algae. Differential abundance of these organism types was measured and analyzed computationally. This work contributes to a better understanding of the microbial ecology of the LA River ecosystem and helps identify urbanization impacts on microbial communities.
3. Results
The Unweighted Unifrac distance method coupled with PERMANOVA, visualized by PrinCoA was the most sensitive for detection of differences between groups based on sampling site, habitat or depth. The Chi squared standardized heatmap was not sensitive. PCA alone was not sensitive, although the factor loadings were useful for revealing the few important taxa that differed between samples. PAM coupled with PCA was more useful for identifying highly similar groups of samples, and elucidating community structure. PCA with PAM gave a better visualization than the hierarchical clustering methods for this sample size, although overall the PAM and UPGMA results were very similar.
Table 3 shows the medians and ranges for taxon abundance and sequences per sample. The FITS marker had a median number of sequences per sample of 18,157.
Table 3 displays the Summary Statistics resulting from the NJ (Neighbor Joining) and UPGMA (Unweighted Pair Group Method with Arithmetic Mean) Tree analyses in R phyloseq. As shown in
Table 3, the Branch Length means were similar but the variance is higher for Neighbor Joining, with respect to the FITS marker. The higher variance for Neighbor Joining would be expected.
Table 3.
Abundance of ASVs and Assigned Sequences per sample across the LA River sites.
Table 3.
Abundance of ASVs and Assigned Sequences per sample across the LA River sites.
LA RIVER |
Taxon Abundance |
Assigned Seqs/ Sample |
Marker |
Min |
Med |
Max |
Min |
Med |
Max |
FITS |
17 |
211 |
183,729 |
369 |
18,157 |
40,447 |
16S |
29 |
181 |
109,927 |
1 |
15,178 |
44,190 |
18S |
30 |
168 |
299,045 |
386 |
24,799 |
56,966 |
COI |
30 |
208 |
153,574 |
14 |
18,555 |
41,257 |
12S |
30 |
713 |
31,898 |
0 |
953 |
30,699 |
PITS |
0 |
265 |
238,793 |
133 |
9,642 |
24,730 |
Depicted in Figure S3 is the PCA for the fungal ITS sequences that were recovered from the LA River sediment samples. The first two principal components capture about 37% of the variation in the data. Fungi samples separate high on PC 2 based on abundance of
Penicillium, which may be important to the decomposition of leaf litter along the river, and
Cladosporium sequences, which produce the antibiotic and antimalarial metabolite Cladosporin [
28]. Low on PC 2, the separation is based on abundances of
Desmodesmus armatus and
Desmodesmus sp. variants of algae, especially in Maywood, Glendale Narrows, Glendale, and Elysian Valley. These genera have been known to break down radioactive materials.
Figure
Other results from the DESeq2 analysis showed that in frequently submerged river condition samples, there was a trend toward a differential abundance of fungi and a decrease in the abundance of bacteria, when compared with submerged samples. In frequently submerged sediment samples, Capniodales sp. were differentially abundant (p<1*10^-13), as well as Penicillium sp. (p<0.0005). Notably, Tricladium angulatum (p<1.5*10^-46), Monocillium tenue (p<2.5*10^-39), Acremonium nepalense (p<5*10^-30), and Peziza badia (p=9.5*10^-15) also had differentially higher abundance in frequently submerged samples.
As shown in
Table 3, the 16S assay had a strong median number of sequences per sample at 15,178. This shows that the sample had good sequencing depth. As shown in
Table 4, the Branch Length means are similar but the variance is about 50,000 units higher for Neighbor Joining, with respect to the 16S marker. In terms of the number of clusters reflected by the rooted and unrooted trees, both trees point to k=5 for the number of clusters in terms of bacteria.
Table 4.
Summary statistics from the Neighbor Joining and UPGMA Trees for each marker. The trees were generated from the Euclidean Distance Matrix. Tree topological distances have been provided in the far-right column.
Table 4.
Summary statistics from the Neighbor Joining and UPGMA Trees for each marker. The trees were generated from the Euclidean Distance Matrix. Tree topological distances have been provided in the far-right column.
LA RIVER |
Branch Length NJ |
Branch Length UPGMA |
NJ vs. UPGMA |
Marker |
Mean |
Variance |
Mean |
Variance |
Tree Distance |
FITS |
1,657 |
5,419,114 |
1,585 |
4,124,851 |
8,195 |
16S |
620 |
460,349 |
609 |
417,224 |
2,473 |
18S |
2,018 |
5,534,355 |
1,978 |
4,278,736 |
10,919 |
COI |
2,312 |
8,746,132 |
2,114 |
6,010,691 |
9,697 |
12S |
634 |
4,710,694 |
1,585 |
4,124,851 |
12,130 |
PITS |
1,457 |
6,728,373 |
1,351 |
4,241,554 |
8,516 |
Figure S5 shows the PCA for the bacterial 16S DNA sequences that were recovered from the LA River sediment samples. The first two principal components capture about 42% of the variation in the data. Bacteria DNA samples separate by numerous important taxa factor loadings such as abundance of Erythrobacteracea, Proteobacteria, and Oscillatoriales cyanboacterium.
Among others, samples from Maywood and Glendale scored low on PC 2, in the direction of high cyanobacteria abundance.
Figure 3 shows the PCA plot for the 16S samples color coded by the best PAM clustering. The best PAM clustering in this case was k=4 with the highest average silhouette width. The samples in the second cluster, colored red, are from Glendale Narrows. The third cluster, colored green, is mostly made up of sediment samples from Maywood and Glendale.
Figure 3.
PCA for Bacterial identified sequences from the 16S marker by sample, color coded by the best PAM clustering. Note that there is evidence of overdispersion, in particularly high on PC1.
Figure 3.
PCA for Bacterial identified sequences from the 16S marker by sample, color coded by the best PAM clustering. Note that there is evidence of overdispersion, in particularly high on PC1.
Among the bacteria with differentially higher abundance of 16S sequences in Glendale Narrows, Cyanobacteria microcystis (p<1.5*10^-7) and Oscillatoriales cyanobacterium (p<3*10^-14). Verrucomicrobia were also differentially more abundant in Glendale Narrows (p<4*10^-23). On the other hand, the alphaproteobacteria Devosia from Rhizobiales had differentially higher counts of sequences in samples from Verdugo Wash.
Table 5. The results of the differential abundance analysis for Glendale vs. Verdugo Wash. Positive log fold change results represent sequences that were differentially abundant at the Glendale site. Negative log fold changes represent sequences that were differentially abundant at the Verdugo Wash site.
Table 6.
Positive log fold change results represent sequences that were differentially abundant at the soft bottom sites. Negative log fold changes represent sequences that were differentially abundant at the concrete sites.
Table 6.
Positive log fold change results represent sequences that were differentially abundant at the soft bottom sites. Negative log fold changes represent sequences that were differentially abundant at the concrete sites.
log2FoldChange |
padj |
Taxon |
Notes |
22.09927 |
3.71E-23 |
Prosthecobacter sp. |
possible pathogen, anaerobic, tubulin like genes, low nutrient environments |
34.31956 |
1.53E-41 |
Dechloromonas sp. |
may oxidize benzene |
-22.258 |
5.73E-05 |
Devosia sp. |
Nitrogen fixer |
-25.3115 |
1.67E-05 |
Bacillus sp. |
many beneficial species |
23.78784 |
1.22E-06 |
Chromatiaceae (unclassified) |
purple sulfur bacteria, use sulfide to fix carbon and generate oxygen |
-30.519 |
0.009416 |
Sandaracinobacter sp. |
metabolism of sulfide to cysteine (or from serine) |
25.68591 |
0.000938 |
Chloroflexaceae (unclassified) |
green non-sulfur bacteria, many heat-loving anoxygenic photoheterotrophs [29, 30] |
-22.3636 |
0.00014 |
endosymbiont of Ridgeia piscesae
|
Gammaproteobacterium, symbiont of a tubeworm |
-6.85917 |
4.08E-06 |
anaerobic bacterium MO-CFX2 Chloroflexi
|
17.1087 |
4.15E-08 |
Rhodocyclales (unclassified) |
nitrogen fixing or nitrogen reducing |
33.82601 |
2.58E-14 |
Phormidium setchellianum |
Potential cause of gastroenteritis, concentrates caused neuro- and hepato-toxicity in mice [31] |
20.18264 |
0.000268 |
Cytophaga xylanolytica |
xylan degrading, does well in sulfogenic and methanogenic environments, anaerobic and gliding |
-23.4117 |
0.002659 |
Synechococcus sp. |
Photolysis of sulfide or water, produces neurotoxins [32] |
11.0032 |
0.000123 |
Scenedesmaceae (unclassified) |
Green algae, may degrade radioactive materials |
8.245038 |
0.000199 |
Flavobacterium sp. |
Often associated with plant resistance to pathogens |
7.271474 |
0.005122 |
Oscillatoriales cyanobacterium HF1 |
Cyanobacterium which may cause illness or death in humans and animals |
10.11933 |
0.001645 |
Tetradesmus obliquus |
Produces valuable saturated and unsaturated esters, extract has anticancer and antimicrobial effects [33, 34] |
28.7773 |
1.03E-07 |
Microcystis sp. |
Cyanobacterium which is toxic to humans [35] |
28.91261 |
5.24E-05 |
Rhodocyclaceae bacterium enrichment culture clone Y62 |
nitrogen fixing or nitrogen reducing |
log2FoldChange |
padj |
Taxon |
Notes |
-25.207183 |
3.06E-23 |
Oscillatoriales cyanobacterium YACCYB599 |
Cyanobacteria which may cause illness or death in humans and animals |
-24.66764915 |
4.55E-23 |
Chroococcus subviolaceus |
Freshwater or high salinity environments, Cyanobacteria which can survive with low O2 [36] |
-24.50212313 |
4.55E-23 |
Haliea sp. |
Marine gamma proteobacterium which tolerates up to12% salinity [37, 38] |
24.49667323 |
3.81E-31 |
Halomonas sp. |
chloride and saline tolerance |
24.12963073 |
1.43E-27 |
Marmoricola sp. |
Denitrifying bacteria [39] |
10.00393321 |
8.21E-09 |
alpha proteobacterium LS7-MT |
Methanol oxidizer, lives in high temperatures [40] |
9.188395232 |
2.37E-18 |
Nitrosarchaeum koreense |
Aerobic ammonia-oxidizing archaea [41] |
-8.382519826 |
0.001244 |
Microcystaceae (unclassified) |
Common Eutrophic Bloomer, toxin-producing Cyanobacterium |
7.849119335 |
3.12E-07 |
Acidobacterium sp. SCGC AAA007-P13 |
Potential saprobe |
-7.732408042 |
4.32E-08 |
Oscillatoriales cyanobacterium IRH12 |
Cyanobacterium which may cause illness or death in humans and animals |
-7.389766623 |
0.000539 |
Roseisolibacter agri |
Grows in low oxygen environments [42] |
-7.310779292 |
1.03E-07 |
Pleurocapsa concharum |
Ostracod-dependent Cyanobacterium [43] |
7.242636088 |
5.51E-07 |
Devosia sp. |
Nitrogen-fixing bacteria |
6.970043209 |
0.001616 |
Nitrospira sp. enrichment culture clone LD3 |
Nitrifying bacteria nitrite oxidizing bacteria |
6.533527317 |
1.83E-13 |
gamma proteobacterium SCGC AAA007-P21 |
Uncultivated bacterioplankton |
6.503508981 |
0.001529 |
alpha proteobacterium Schreyahn_AOB_Aster_Kultur_5 |
Cultured alphaproteobacterium |
-6.479686479 |
0.000178 |
Chlamydomonadales (unclassified) |
Green algae [44] |
-6.382235759 |
0.000425 |
Chloronema giganteum |
Photoautotrophic, anoxygenic green non-sulfur bacteria [91] |
-6.230017507 |
0.002384 |
Chamaesiphon sp. |
Widely distributed Cyanobacterium [45] |
6.02052523 |
0.007591 |
Altererythrobacter sp. |
Alkaline or salt tolerant aerobic phototroph, anoxygenic [46, 47, 48] |
5.990283542 |
0.000524 |
Mycobacteriaceae (unclassified) |
Potential human and animal pathogens |
5.737312813 |
2.78E-06 |
Acidobacteriaceae (unclassified) |
Likely saprobe of plant organic matter |
-5.72085055 |
0.009826 |
Candidatus Viridilinea mediisalina
|
Anaerobic phototroph, salt-tolerant and prefers alkaline environments [49] |
-5.56037325 |
2.59E-05 |
Veillonellaceae bacterium 6-15 |
bacterial vaginosis |
-5.548460876 |
0.000699 |
Phormidium setchellianum |
Cyanobacterium with possible antitumor agents, neuro and hepatotoxicity |
-5.531306605 |
0.003193 |
Calothrix sp. UAM 374 |
Cyanobacterium which grows on plants and hard substrates [89] |
5.344610141 |
0.0001 |
Candidatus Nitrosocosmicus sp.
|
Aerobic ammonia-oxidizing archaea |
-5.019693824 |
0.003193 |
Treponema stenostreptum |
syphilis relative |
-4.952937198 |
0.001067 |
Leptolyngbyaceae (unclassified) |
Thermophilic and potentially iron-loving Cyanobacterium [50] |
-4.934291389 |
0.000964 |
Holophagaceae (unclassified) |
Anaerobic dweller of freshwater sediments [51] |
4.926495832 |
0.009823 |
unidentified eubacterium RB01 (Verrucomicrobia) |
-4.711954167 |
0.002384 |
Xanthomonadaceae bacterium |
Potential phytopathogens |
-4.711366069 |
0.005914 |
Leptolyngbya geysericola |
Alkaline tolerant non-heteroctic Cyanobacterium, produces calcite on microplastics [52] |
4.50039412 |
4.71E-06 |
Caldilineales bacterium |
Thermophilic and anaerobic [53] |
-4.35065315 |
0.009823 |
Fusibacter sp. enrichment culture |
Thiosulfate reducing, potentially halotolerant |
-4.16646108 |
0.002439 |
Desulfomicrobium sp. |
oxidizes sulfide and arsenate in the presence of CO2 and acetate [54], reduces nitrate to ammonium [55] |
-3.874861377 |
0.005914 |
Oscillochloridaceae (unclassified) |
anoxygenic phototrophic bacteria [29, 56] |
-3.695598612 |
0.009826 |
Pleurocapsales (unclassified) |
Cyanobacterium from calcareous environments |
3.602101991 |
0.002384 |
Vicinamibacter silvestris |
Polyphosphate accumulating organisms |
2.378738101 |
0.004923 |
Firmicutes (unclassified) |
High abundance in suburban rivers, negatively correlated with ammonia concentration |
2.253024076 |
0.008829 |
Stenotrophobacter terrae |
opportunistic pathogen |
2.126473277 |
0.00044 |
Vicinamibacteraceae (unclassified) |
Degrades chitin [57] |
2.033767588 |
0.003193 |
Actinobacteria (unclassified) |
Many denitrifying bacteria [58, 59] |
The soft-bottom river condition was associated with a differentially higher abundance of Alphaproteobacteria and a decreased abundance of Cyanobacteria Pleurocaps (p<1*10^-6) and Phormidium (p<0.0007), Oscillatoria (p<3*10^-23), and Chroococci (p<5*10^-23) when contrasted with concrete sites. Notably, Devosia was more abundant in soft bottom (p<6*10^-7) whereas Desulfomicrobium (p<0.003) was more abundant under concrete conditions. On the other hand, Verrucomicrobia and Haliaceae family Proteobacteria were differentially abundant in the soft bottom condition (p<5*10^-23, p=0.01 respectively).
Most of the bacteria that were differentially expressed in the concrete sites were Cyanobacteria and autotrophs. There was also a trend toward a differentially high abundance of DNA sequences from potential human and plant pathogens, including the potential plant pathogen Xanthomonas, Clostridia, and bacteria related to the agents that cause reproductive infections. Nevertheless, the soft bottom sites also had differentially high abundances of Norcardiaceae and Verrucomicrobia, which are also potential pathogens. For the concrete sites, there was a less clear picture of the nitrogen cycle when considering the bacteria alone. There was a clear picture of the nitrogen cycle for the soft bottom sites, as well as a candidate species for phosphate accumulation.
The highest number of assigned sequences per sample was for the 18S marker, as shown in
Table 2. This suggests that the highest overall sequencing depth was for the 18S assay. As shown in
Table 4, the Branch Length means were both near 2,000 but the variance is about 125,000 units higher for Neighbor Joining, with respect to the 18S marker. For both tree topologies, k=4 is apparent for the number of clusters in terms of 18S sequences identified by the assay.
In Figure S6, the PCA for the 18S DNA sequences that were recovered from the LA River sediment samples is shown. The first two principal components capture about 46% of the variation in the data. The PCA by sample for 18S validates the FITS results, because the samples scored low on PC 2 based on factor loadings for
Desmodesmus and other
Scenedesmaceae taxa of algae. Further, samples scored high on PC 2 based on
Podocopida and
Cypridida high relative sequence abundance.
Podocopida is a crustacean that has freshwater and brine-dwelling groups [
60]. The
Cyprididae are a group of freshwater Ostracods [
61]. Figure S7 shows the 18S PCA color coded by the best PAM clustering, which was k=5, with the highest average silhouette width. The red samples in cluster 2 were all from Glendale. Cluster 5, in light blue, corresponds to the Long Beach sediment samples. Considering the spatial heterogeneity displayed by the samples, there is a sense that the genetic material is funneling into Long Beach, reflecting the physical landscape. The fourth cluster, in dark blue, is comprised of Sepulveda Dam, Tujunga Wash, and Arroyo Seco.
The observed alpha diversity for fungi sequences based on the 18S marker is shown in
Figure 5. Los Angeles River Proportions of Ascomycota to Basidiomycota were compared to Freshwater and River habitats worldwide. The equality of these proportions were tested on a Chi square distribution. The results showed that the proportion of Ascomycota vs. Basidiomycota in the LA River differed significantly from freshwater and river environments worldwide, based on published 18S data [
25].
Figure 5.
The observed alpha diversity of plant species is depicted in boxplots. This figure answers the question: Which site had the highest number of plant species detected overall? Note that the highest Observed Alpha Diversity tended to be in Glendale, Glendale Narrows, and Long Beach. Again, there is evidence of overdispersion, especially for the Bull Creek, Glendale, and Long Beach samples.
Figure 5.
The observed alpha diversity of plant species is depicted in boxplots. This figure answers the question: Which site had the highest number of plant species detected overall? Note that the highest Observed Alpha Diversity tended to be in Glendale, Glendale Narrows, and Long Beach. Again, there is evidence of overdispersion, especially for the Bull Creek, Glendale, and Long Beach samples.
Figure 3.
The boxplot of Observed alpha diversity shows that the species richness for Ascomycota is the highest for Arroyo Seco, Bull Creek, Compton Creek, and Maywood.
Figure 3.
The boxplot of Observed alpha diversity shows that the species richness for Ascomycota is the highest for Arroyo Seco, Bull Creek, Compton Creek, and Maywood.
The data that was used for this portion of the analysis was publicly available [
25] amplicon sequence variants tables, also known as ASVs or OTUs. OTU stands for operational taxonomic unit. Essentially, these tables have counts of sequences that were identified from organisms in the environment. The goal is to compare the proportions of different Divisions of fungi in the LA River to other environments.
The results of the Chi square test for equality of proportions shows that the proportion of Ascomycota to Basidiomycota for the LA River is not equal to the proportion of Ascomycota to Basidiomycota for Freshwater Habitats (p<0.0005) nor River Habitats (p<10^-11) described by Lepère’s analysis of worldwide freshwater data. In terms of the River Habitats, the proportion of Ascomycota to Basidiomycota is 21.5%-39.2% higher in the LA River. Furthermore, for the freshwater habitat comparison, the proportion of Ascomycota to Basidiomycota is between 7.3%-25.74% higher for the LA River, based on the 95% confidence intervals. When comparing the mosaic plots in Figure S9 and
Figure 4, the gap between the proportions of Ascomycota to Basidiomycota appears smaller for the LA River compared to Freshwater Habitats in Lepère et al’s study [
25], compared with river environments.
Figure 4.
The mosaic plot shows that there is a difference in the proportion of ascomycetes to basidiomycetes in the LA River compared to River Habitats worldwide [
25]. This gap was larger than the gap shown in Figure S9 for freshwater habitats.
Figure 4.
The mosaic plot shows that there is a difference in the proportion of ascomycetes to basidiomycetes in the LA River compared to River Habitats worldwide [
25]. This gap was larger than the gap shown in Figure S9 for freshwater habitats.
The alpha diversity analysis for ascomycetes is plotted in
Figure 3. The mosaic plot shows that the sites that had the most Ascomycota species were detected at Arroyo Seco, Bull Creek, Compton Creek, and Maywood. Maywood had much variability; two points were outliers with high counts >25, whereas most values were near zero. It is also interesting to note that more than 50 taxa of Ascomycota were identified only to the Family level, and some of these may represent heretofore uncharacterized ascomycetes. Based on these results, an interesting junction of the LA River to investigate ascomycete sequences to a higher depth, would be Arroyo Seco and Maywood, which were geographically connected.
The plot of alpha diversity for all fungi, given in
Figure 3, shows which sites had the most different types of fungi in any Division. Overall, there were 132 taxa of fungi identified. Arroyo Seco, Bull Creek, Compton Creek, Maywood, and Verdugo Wash accumulated the most taxa. An interesting aspect of this, is that out of the 132 taxa of fungi, over 30% were ascomycetes identified only to the family level.
The COI marker performed well in terms of median sequences per sample, which was 18,555. As shown in
Table 3, the Branch Length mean is about 200 units longer for NJ? and the variance is about 275,000 units higher for Neighbor Joining, with respect to the COI marker. For both tree topologies, k=3 is apparent for the number of clusters in terms of COI sequences identified. This seems to reflect that the animal diversity detected by the assay has less breadth than the biodiversity captured by 16S or FITS in this instance.
In Figure S10, the PCA for the COI DNA sequences that were recovered from the LA River sediment samples is shown. The first two principal components capture about 33% of the variation in the data. The COI assay captured a picture of lower diversity for the sequences. Samples score low on PC 2 based on relative abundance of
Dicrotendipes species, non-biting bloodworms [
62] . Also low on PC 2 were samples with high relative abundance of
Eucypris virens, a cyprididine ostracod [
63].
The PCA plot for the COI samples color coded by the best PAM clustering is shown in Figure S11. The best PAM clustering in this case was k=3, which exhibited the highest average silhouette width. For the COI sequences, 73 of the samples fall into the first cluster shown in black, ranging from Bowtie Parcel to Verdugo Wash. The second cluster, in red, is comprised of Glendale and Sepulveda sediment samples. The third cluster, shown in green, is made up of only 2 samples from Tujunga Wash and Glendale. This supports the observation that samples were similar for this marker.
The abundance of sequences per taxon was lower than the other markers assayed for 12S, at only 31,898, maximum. Furthermore, the median number of sequences per sample was 953. As shown in
Table 3, the Branch Length means differ for NJ and UPGMA. The UPGMA mean branch length is 1585 whereas the NJ branch length is about 600. The variance is higher for Neighbor Joining, for the 12S marker, consistent with the other markers. For the NJ tree topology, k=2 appears to be the number of clusters, whereas for UPGMA, k=3 is apparent for the number of clusters in terms of 12S sequences identified.
In Figure S12, the PCA for the 12S DNA sequences that were recovered from the LA River sediment samples is given. The first two principal components capture about 63% of the variation in the data. Samples appeared similar in this assay, except for the sample which is high on PC 2 from Elysian Valley that contained a high relative abundance of salmon sequences, which appeared to be an error. In that case, since the taxon is too rare among samples it could be excluded from the analysis because it might be an error, or unlikely to be relevant to many individuals in the population. Figure S13 shows the PCA plot for the 12S samples color coded by the best PAM clustering, which was k=5, with the highest average silhouette width. 79 out of 90 samples fall into the first cluster, shown in black. The second cluster is mostly made up of Sepulveda Dam sediment DNA samples. The first and third clusters were similar to one another. The fifth cluster, in light blue, is made up of a single sample from Long Beach.
The median number of assigned sequences per sample was relatively low for the plant ITS assay at 9,642, although it was not the lowest of all markers. Nevertheless, the number of sequences per taxa had a high maximum at 238,793. As shown in
Table 3, the Branch Length means were similar for NJ and UPGMA, and the variance is about 250,000 units higher for Neighbor Joining, with respect to the PITS marker. For both tree topologies, k=4 is reflected for the number of clusters in terms of plant sequences identified.
In Figure S14, it is possible to view the PCA for the plant DNA sequences that were recovered from the LA River sediment samples. The first two principal components capture about 34% of the variation in the data. One of the samples from Elysian Valley separates out on PC4 due to a high abundance of
Paspalum distictum sequences. This is a knotgrass found in most of the Southern US and Pacific Northwest, where it is native but can become weedy [
64]. It plays a role in wetland restoration since it tolerates waterlogged and saline environments, as well as providing food for deer [
64]. Samples from Arroyo Seco separate out high on PC 3 based on differential abundance of
Alnus rhombifolia sequences. Interestingly, most of the Alnus sequences came from a Tujunga Wash sample. White alders are native to streamside habitats in the US [
65]. Alders have been shown to be key to nitrogen cycling in riparian environments since they form an association with
Frankia bacteria. For that reason, they are better at colonizing disturbed habitats [
65].
Various samples separate out on PC1 due to abundance of reads that were assigned only to the phylum level. The main factor that separates samples on PC 2 is the abundance of willow species, especially in Bull Creek, Bowtie Parcel, and Arroyo Seco. Most of the Salix sequences came from two samples from Arroyo Seco.
Figure 6 shows the PCA for the plant sequences, color coded by the best PAM clustering. The best PAM clustering for the FITS markers was k=4. The model with four clusters had the highest average silhouette width. The second cluster, shown in red, is composed of Arroyo Seco and Bull Creek. The third cluster consists of sediment samples from Compton Creek, Sepulveda Dam, and Glendale adjacent sites. The fourth cluster, in blue, is made up of Arroyo Seco samples. The first cluster is made up of a mixture of all other samples, which were similar to one another, shown in black.
Figure 6.
PCA for identified plant sequences from the PITS marker by sample is presented, color coded by the best PAM clustering.
Figure 6.
PCA for identified plant sequences from the PITS marker by sample is presented, color coded by the best PAM clustering.
4. Discussion
This study has investigated the associations between microorganisms and environmental conditions including soft-bottom versus concrete, degree of urbanization, and proximity to a water treatment plant. The physical distance between samples appears to be mirrored by the genetic distance, based on the evidence from PCA with PAM clustering for the 18S markers. Matsuoka et al found similar results along a river network in Japan in 2019, where they found that fungal DNA assemblages had a spatial structure and samples which were closer to one another tended to be more similar. Overall, our results agree with numerous studies of urban, eutrophic, and brackish freshwater bodies since Proteobacteria, Bacteroidetes, Firmicutes, Cyanobacteria, Chloroflexi, Actinobacteria, Acidobacteria were all well-represented [
66,
67,
68,
69]. The elevated presence of
Verrucomicrobia and
Gammaproteobacteria aligned more with the brackish metagenome [
69]. The ostracods detected in high abundance are not known indicator species for heavy metal contamination [
70].
In Glendale Narrows, downstream from water reclamation plants, there were differential abundance of cyanobacteria and algae. According to Garcia et al, the greatest social costs associated with irrigating with reclaimed water are the costs to recreation and the risks to human health due to the potential for the presence of hazardous substances [
71]. Eutrophication can lead to hypoxic conditions; since hypoxia can be fatal to fish, this may partly explain the low 12S diversity. However, at Glendale Narrows, indicator species for both low nutrient environments and ammonia-abundance were also present. A potential explanation for this is the high abundance of plant species at Glendale Narrows, which assimilate nitrogen. Microbes with nutrient cycling capabilities such as Nitrogen reduction or Nitrogen fixation have been known to be associated with plant growth promotion, or may be associated with toxicity. Nevertheless, our results do not agree with Francis et al 2012, where plant species diversity was expected to decrease in urban environments compared to rural environments [
72].
Eukaryotic microbes in the rootzone such as
Basidiomycota and
Ascomycota may help plants with phosphorus solubilization, but may be pathogenic to plants or humans. Organisms such as these fungi which promote phosphorus mineralization have received less attention over the years [
73], although they play important roles in nutrient cycling. Fungi such as
Pleurotus have been shown to mycoremediate contamination with
E. coli [
74]. The results indicate that LA River biome is rich with Ascomycota beyond the expected proportion for freshwater bodies, including rivers.
Penicillium sp. are known to bioaccumulate arsenic and cadmium, and are thus mycoremediators of metals [
75].
Nitrogen cycling was explained by differential abundance of ammonia oxidizing archaea, the complete ammonia oxidizers
Nitrospira sp., nitrate reducing bacteria
Marmoricola sp., and nitrogen fixing bacteria
Devosia sp. were differentially abundant at soft-bottom sites (p adj < 0.002). The proposed nitrogen cycle for the soft bottom condition is shown Fgiure 7 Ammonia oxidizing archaea were represented by more species. This result partly disagrees with Cai et al’s findings [
76] since ammonia oxidizing archaea were more represented. However, some
Nitrospira bacteria are complete ammonia oxidizers, so they may be equally important. Interestingly, the results from a recent study indicated that nitrogen pollution in river sediments also contributed to bacteria community shifts [
67]. In contrast, differential abundance of several
Cyanobacteria and other anoxygenic phototrophs was associated with the concrete bottom sites, which suggested the accumulation of excess nitrogen.
Desulfomicrobium may play a part in nitrate reduction in concrete environments, but conserve more nutrition [
55], and is sulfate-dependent [
54]. Since denitrification generally requires substrate that is made under aerobic conditions [
77], it makes sense that denitrifying bacteria were not as abundant in the concrete environments. Clostridia are indicator species for fecal contamination and sewage [
78]. In regards to the reproductive pathogens, as Hervé et al noted street gutters are important in dispersal of putative pathogens from anthropogenic waste [
79] and bioremediating species.
The diversity of cyanobacterial species observed indicated health within the cyanobacteria community. As Stal noted in 2007,
Cyanobacteria have involvement in two essential biogeochemical processes on Earth, since they capture both CO
2 and N
2 [
80].
Cyanobacteria have been known to colonize hostile environments [
80] and to produce toxins that bring health risks to the public such as liver damage, eye irritation, vomiting, and death [
81]. However, only 1-2 species of algae were highly represented, which is not an indicator of health for the ecosystem. In Wang et al’s freshwater study, elevated cyanobacteria were associated with bacterioplankton, whereas algae were associated with zooplankton [
82]. Heterogeneity and diversity of algae is tied to ecosystem services [
83]. According to the Southern California Coastal Water Research Project,
Cladophora algae supports the habitat of wading shorebirds [
84]. Treating the underlying anaerobic conditions could promote algal and fish diversity.
The soft bottom sites tended to be represented by differential abundance of aerobes, whereas the concrete-associated species tended to be alkaliphilic, saliniphilic, calciphilic, sulfate dependent, and anaerobic. The presence of halophiles is a good indicator of salinity problems. Differential abundance of
Proteobacteria was associated with soft bottom sites, and there was an apparent balance in the abundance of organisms responsible for nitrogen cycling.
Figure 7.
Proposed Nitrogen Cycle for the LA River Soft bottom condition.
Figure 7.
Proposed Nitrogen Cycle for the LA River Soft bottom condition.
In recent years, the city of Los Angeles has been reluctant to move toward a soft bottom channel restoration, since it would necessitate widening of the channel, which would potentially affect landowners and other infrastructure. Furthermore, although some activists have favored riparian plantings, this also has the potential to slow the flow of water. As the River was channelized in order to decrease flooding risk and efficiently carry away water, the introduction of a vegetative buffer would likely require a widening of the river, and possibly the river’s overall footprint. As Levi et al pointed out, channel restoration benefits appear to be smaller when spread across a larger area [
85]. Therefore, this type of effort may be most impactful when applied to the urban stretches that would benefit most from the intervention.
Based on the Plant Diversity Analysis, it was indicated that Maywood had high sequence abundances of weeds such as Datura, Atriplex, Oxalis, and Chenopodium, as well as high abundance of toxic Cyanobacteria based on the factor analysis, Maywood could benefit from the planting of perennial foliage that can also remediate air pollution [
86]. According to Liu et al, air pollutants including particulate matter, nitrous oxide, and carbon monoxide also influence microbial and fungal communities [
87]. Indications tended to suggest that sonicating devices at Maywood and Glendale Narrows for the control of
Cyanobacteria should be considered
, as well as perennial vegetative buffers in Maywood to combat noxious
Datura plant species and toxic
Cyanobacteria blooms. Interestingly, Maywood samples had differentially abundant
Tetradesmus sp., including T
. obliquus, which is a phosphorus accumulator and produces valuable lipids for biodiesel [
33].
T. obliquus may also be used for animal feed; it is known to be rich in amino acids, including the essential amino acid leucine, with a low bioaccumulation of metals [
34].
A surprising result is that some sites along the LA River were more diverse with plant life than rural Arroyo Seco, especially Bowtie Parcel, Glendale, Long Beach, and Maywood, based on observed alpha-diversity. This is most likely due to landscape plantings of exotic species near Glendale, coastal species at Long beach, and a diverse panel of weed sequences that were identified at Maywood. Plants prevent erosion and create habitat for birds, mammals, invertebrates, amphibians, and reptiles. Plants also help balance nitrogen cycling and can provide a buffer by absorbing some of the nutrients involved in eutrophication.