1. Introduction
1.1. Fungal Metabarcoding
A significant hurdle in fungal ecology is that most species elude visual detection. This affects especially community ecological studies, where the aim is to include all community members. While second generation sequencing propelled fungal ecology by enabling large-scale metabarcoding, the taxonomic resolution using the dominant Illumina platform is constrained by short amplicon lengths at maximum 2×300 bp. This hampers the exploitation of the fungal barcode region, the Internal Transcribed Spacer (ITS), which ranges between 250-1500 base-pairs (bp) [1–3]. Third-generation sequencing technologies, such as Nanopore (Oxford Nanopore Technologies Inc.) and PacBio SMRT (Pacific BioSciences Inc.), have overcome read length limitations, enabling the targeting of the full-length ITS region. While Nanopore initially lagged in quality, advancements in chemistry (V14) and algorithms (transformer model-based architecture, dorado ≥v0.7) are closing this gap. Raw read accuracy now exceeds 99% correct base call probability [4]. This accuracy surpasses the interspecific distances typically observed for the ITS region [5], potentially making Nanopore a viable tool in fungal metabarcoding.
Despite these advancements, studies on the application of Nanopore sequencing in fungal metabarcoding remain limited, with conclusions varying from unsuitable to feasible for simple communities, depending on the version of the technology used [6–11]. Broadly speaking, two approaches have thus far been used for processing Nanopore metabarcoding data. The first, standard metabarcoding, approach aligns raw reads against a reference database, retaining reads that can be confidently mapped [7,9,10]. Most often some minimum Phred score is set. This approach is dependent on the comprehensiveness and accuracy of the reference database and shifts the quality issue to defining what constitutes a confidently mapped sequence. The second approach is more sophisticated and improves raw sequence quality by drafting consensus sequences from clusters [6,9,11], as implemented in tools like Decona, ONTrack2, and CONCOMPRA [12–14]. Notably, the most widely adopted approach in metabarcoding—generating operational taxonomic units (OTUs) de novo from raw data— has been considered unfeasible with Nanopore data as high error rates significantly impact clustering. This standard approach involves setting a minimum Phred score —often Q28 as implemented in FASTQC [15]— and clustering the filtered reads at a predefined similarity threshold. At the time of writing, enforcing a Q28 threshold results in a near-total reduction of usable data. Like consensus sequence generation, OTU clustering bypasses the need for a reference database, offering an advantage for taxonomic units absent from databases. As outlined above, setting a minimum Phred score is standard practice in all approaches. Therefore, establishing a robust minimal quality threshold for each approach is crucial for the reliable recovery of taxonomic units, which underpins all subsequent analyses. However, guidelines for determining an appropriate threshold remain sparse, complicating the analysis of Nanopore data and raising the question of what read quality suffices in a metabarcoding context.
In this paper, we aim to provide guidelines for quality filtering of Nanopore metabarcoding data using the standard metabarcoding reference-based and OTU-clustering approaches. We detail our experiences with analyzing Nanopore data from a mock community and share our bioinformatics pipeline eNano. Building on these insights, we apply our methods to an often-overlooked microsite—the community within decaying bark of European beech (Fagus sylvatica) and validate our results through morphotypification of EcM mantles.
1.2. Case-Study: Deadwood Bark Ectomycorrhizae
A specific aspect of habitat creation by deadwood is the case where large logs in intermediate to advanced decay stages act as a tree regeneration site (
Figure 1). In mycorrhizal research, these logs are of particular interest because the excavation of root systems can confirm the establishment of ectomycorrhizal fungi (EcM) through morphotypification.
While EcM are abundant in soil and readily mycorrhize roots after germination [16], they are initially absent or scarce in deadwood. Their prevalence increases as the wood decays and becomes more penetrable [17–21]. Eventually, the fungal community in deadwood transitions to resemble that of soil, completing a full ecological transformation [20]. In contrast to fruitbody based inventories, metabarcoding techniques have detected EcM surprisingly early in the decay process. For example, Rajala et al. (2011) found metabolically active EcM in slightly decayed Picea abies logs, suggesting a continuum of tree regeneration in old natural forests. Several reasons for colonization have been put forward, including nutrient mining [22–24], elevating fruiting positions, and associating with sapling roots in decaying logs [21,25].
with EcM is believed to be crucial for survival, especially in environments where nutrients are hard to access such as recent deadwood [26]. For example, in North-America Tsuga heterophylla regenerating on deadwood is known to associate with a wide range of EcM species [16,27,28]. In Europe, Tedersoo et al. [25,29] studied sapling EcM in mixed Estonian forests and found the most frequent species on roots in deadwood also to be common in soil, consistent with metabarcoding results.
Over the past decade, we have occasionally observed seedlings of European beech growing on relatively fresh deadwood. These seedlings primarily root in the decaying inner bark—comprising the secondary phloem and nutrient-rich cambium—rather than in the denser wood underneath (
Figure 1). This layer, often overlooked except in studies on bark beetles (e.g., [30]), offers a potentially favorable environment for EcM colonization due to its higher moisture content, higher nutrient availability, less compact nature and faster decay rate [31–34]. Such conditions could facilitate the penetration of sapling roots and EcM, either originating from soil or from propagules. However, there is limited literature available to contextualize our observations, making it an intriguing case-study for this underexplored EcM niche.
4. Discussion
4.1. Minimum Quality Evaluation
We evaluated Nanopore sequencing for fungal metabarcoding using the full-length ITS region. We defined acceptable read quality as the point where taxonomic unit recovery stabilized, indicated by consistent recovery rates of 98% for OTUs (OTU-approach) or zOTUs aggregated into SH (SH-approach). While we acknowledge that annotation accuracy might improve with even fewer sequence errors, we believe such changes would be minimal and unlikely to impact biodiversity patterns in community ecology studies.
Beta-diversity comparisons of OTU tables from Q24 and higher quality reads showed minimal differences, supporting the robustness of this threshold (
Figure S2). Furthermore, the difference in OTU numbers with and without singleton exclusion remained stable within this range, indicating consistent clustering. As illustrated in
Figure 2b, the number of recovered 98% OTUs declines rapidly when sequencing accuracy exceeds the clustering threshold (Q17 for 98% OTUs) and stabilizes at Q24-25, where errors minimally impact clustering.
The SH-approach bypasses Phred-based filtering by prioritizing read classifiability over average base call quality. Any zOTU that could be confidently mapped to an SH was deemed of sufficient quality, irrespective of its Phred score. Surprisingly, the proportion of classified reads is only partly correlated to Phred score, with strong increase at the low end of quality, up to Q16, especially in more complex datasets like our bark community (
Figure S3). For higher Phred scores, classifiability is independent on average Phred score. Therefore, beyond Q16, average Phred score does not reliably predict classifiability.
Our experimental results demonstrate that both the SH- and OTU-approach are viable and complementary due to their distinct quality filtering methods The SH-approach avoids signal loss from Phred score filtering and does not blur biological signal by pre-defining clustering thresholds, using mapping confidence as the quality filter instead. Because SH recovery remains stable regardless of Phred score, also when singleton zOTUs are included, the SH-approach eliminates the need for Phred score filtering and singleton zOTU exclusion
Our results show that raw Nanopore data, when appropriately filtered, can support a traditional metabarcoding approach without needing to generate consensus sequences or use complex protocols like Unique Molecular Identifiers [54,55] or Rolling Circle Amplification [56]. We recommend filtering at Q25 for 98% OTUs and to re-evaluate for other clustering thresholds or marker genes. While this currently still loses much data, it effectively identifies larger OTUs, likely key drivers of ecological patterns. Moreover, this threshold can be maintained with expected improvements in Nanopore data quality [57].
While the presented methods of quality filtering are effective, there remains an unoccupied space for more sophisticated techniques. Projecting the zOTU trend (
Figure 2a) suggests that Phred scores of >Q30 are necessary for zOTUs to serve as a reliable analysis unit. If such a quality threshold is reached, it opens up the possibility to use Amplicon Sequence Variants (ASVs) with DADA2-like error correction techniques, which assumes the data contains a non-trivial fraction of error-free reads [58].
4.2. Mock Community
Both approaches on the mock community yielded similar results in terms of identified taxa. The OTU-approach’ shortcomings of both low data retention and a priori clustering are obvious, with Lactifluus russulisporus filtered out due to low abundance (no presence at ≥Q25) and Russula nigrifacta and R. ustulata being merged to a single taxon (R. ustulata) as they differ less than the clustering threshold.
In contrast, the SH-approach, demonstrated a higher resolution with 46% of zOTUs being confidently mapped to SH, accounting for 75% of total reads (
Figure S3). Classification rates increased rapidly up to a Phred score of Q16, confirming that even modest-quality reads can be reliably classified.
The taxon recovery graph (
Figure 3) reveals that nearly all taxa, except for
Lactifluus russulisporus) from our mock community were effectively recovered at Phred scores up to Q27. We attribute the low recovery of
L. russulisporus to poor DNA extract quality and PCR bias [59]. Despite expecting some false positives, our analysis found no misidentifications, demonstrating the robustness of our classification thresholds.
Our study demonstrated the effectiveness of the SH-approach, rather than the OTU-approach, in accurately identifying closely related taxa. This is shown by the consistent detection of all species from the included included Lactifluus and Russula groups across various Phred scores. Notably, even at lower read qualities (as low as Q14), the closely related species R. ustulata and R. nigrifacta were reliably identified (using our reference sequence for R. ustulata). Although it is impossible to discern whether each read was correctly assigned to either R. ustulata or R. nigrifacta, we did not pick up any false positive. For example no zOTUs matched the reference sequence for the closely related R. ambusta. Similar precision was observed for other challenging taxa, such as Cortinarius alboadustus. In addition to the mock taxa, we detected a small number of contaminants (0.07% of reads in the mock), likely originating from co-extraction with the voucher material.
We recommend the routine use of mock communities as positive controls. Their use offers a low-effort strategy to evaluate both qualitative and quantitative aspects of sequencing runs. A more sophisticated mock should include taxa that are not present in the sample’s locality to ensure that index-switching and cross-contamination can be attributed to the control [1]. An interesting extension of this concept are non-biological synthetic spike-in controls, like SynMock, which mimic biological diversity without the risk of overlapping with natural taxa [60].
4.3. Inner Bark Community
Both approaches classified less than a quarter of the bark dataset reads. While the SH-approach still manages to use more than double the amount of reads of the OTU-approach, it is obvious that in complex and natural communities, many reads cannot be matched to a database confidently. Even with high-quality reads (Q28), only 36% of reads (31% of zOTUs,
Figure S3) were identifiable at the SH level.
At the phylum level, both approaches produced broadly similar outcomes (
Figure 4). However, the OTU-approach detected a notably higher proportion of Mortierellomycota, largely due to two large OTUs. One of these OTUs is annotated as
Linnemannia amoeboidea (SH0777285.10FU), with 4099 reads, although it also appears in the SH-approach, albeit in lower abundance (1586 reads). The other notable OTU is identified as
Mortierella sp., represented by 3058 reads. In contrast, the SH-approach yielded a greater recovery of Basidiomycota and Rozellomycota. The subtle nature of these differences suggest both approaches adequately capture diversity, with the main distinction being the higher number of unclassified phylum-level units in the OTU-approach, reflecting its lack of a priori filtering for database taxa. Unless stated otherwise, the species-level results discussed hereafter pertain to the SH-approach.
In our bark samples, across decay stages, we found a co-dominating mix of typical soil (Aspergillus fumigatus, Leuconeurospora, Sporothrix) and wood inhabiting fungi (Subulicystidium longisporum, Pleurothecium recurvatum, Pluteus podospileus, Helicogloea insularis, Postia tephroleuca and Basidiodendron caesiocinereum). This mix reflects our characterization of the substrate as ‘soil-like wood’. A significant number of reads were assignable to taxa capable of growing as yeasts (Saccharomycetales) and known mycoparasites (Tremella), alongside some less clearly defined taxa such as the enigmatic Rozellomycetous ‘GS05’ and a Hyaloriaceae SH. Interestingly, the latter is also present in the OTU-approach, and a manual BLASTn search suggests these sequences likely originate from Myxarium podlachicum, a common jelly fungus in the reserve. Interestingly, members of our lab have described a mycoparasite inhabiting this species, Slooffia micra [61], which, despite targeted efforts, has not been found in the sampled area. S. micra was incorporated in the latest version of UNITE [62]) as SH0867261.10FU and was recovered in relatively high abundance (213 and 64 reads in SH- and OTU-approach respectively). This surprising result hints that this species has part of its life cycle outside of its host basidiomes (fruitbodies were excluded from samples). Indeed, Slooffia spp. have been isolated as yeasts from soils, litter and insect faeces [61]. Nevertheless, at the time of writing it had not been observed in eDNA samples on UNITE. The abundant recovery of Helicogloea insularis was also striking, as this taxon is only known from two localities in Estonia and Norway where it was collected from strongly rotten elm logs [63]. If we expected to pick up any Helicogloea species at all, it was H. sebaceae, as we know it to be present in the sampling area. A false positive seems unlikely as H. sebaceae was properly retrieved in our mock community. Possible explanations include significant intraspecific ITS variation or incomplete lineage sorting where H. sebaceae might retain an ITS sequence similar to H. insularis. Evidently, it is also possible that the species is truly occurring as it concerns an understudied and often overlooked group. Additionally, several species typically associated with beech deadwood such as Simocybe centunculus, Coprinellus micaceus and eight species of Pluteus were found. Strangely, Mycena species, expected to be diverse and abundant, were conspicuously absent from our bark dataset. This is particularly striking given that Mycena galericulata and M. haematopus are among the most prevalent fungi on deadwood in the reserve [64,65]. The findings outlined in this paragraph highlight the added value of solid field knowledge when interpreting eDNA data.
We observed significant changes in species richness and composition corresponding to bark decay. Species richness (SR) and Shannon diversity metrics varied between early and late decay stages, but never with intermediate decay. This pattern suggests that early and late decay stages represent opposite ends of a decay continuum characterized by a gradual species gain and turnover. In the early decay stage, which exhibited the lowest species richness (106 ± 21 SH), we anticipated a high number of reads from pyrenomycetous endophytes, such as
Jackrogersiella cohaerens and
Eutypa spinosa. However, these were found only in relatively low abundance and showed no correlation with the bark decay stage. We hypothesize that their signals have been diluted over time, as even our samples from the earliest decay stage were collected from logs already in the second phase of decay. As the bark decays, it becomes more soil-like, as demonstrated by in
Figure 5. However, it does not fully converge to resemble the soil community. We attribute this to the presence of colonization barriers and greater environmental fluctuations, such as variations in temperature.
Bark decay stage best explains variation in beta diversity, accounting for 15.6% of the variation, while a model incorporating both bark and log decay stage explains approximately 23% of the variation. This suggests that the decay gradient is a primary determinant of community composition and aligns with patterns in coarse woody debris, where fungal communities vary significantly between decay stages due to shifting abiotic conditions attracting different specialized species [21,65,66]. We observed high community turnover, with intra-log communities exhibiting greater beta-diversity than those across different logs (
Figure 5). This suggests that our sampling was insufficient to capture bark community at the log level, and we hypothesize that bark volume is an important factor for community richness provided an adequate surface is sampled. Given that several strategies to maintain species richness in wood and soil have been proposed and demonstrated [67], we expect priority effects, along with stochastic and dispersal processes, to play crucial roles in the assembly of these communities
Both fidelity and indicator analyses underscore the utility of the OTU- and SH-approaches in pinpointing critical taxa, with all but one SH-level taxon being shared at a significance level of p≤0.01. Simultaneously, these analyses also emphasize the importance of incorporating OTUs at all taxonomic levels, as OTU-only indicators were detected at each stage of bark decay (early: Tulasnella; Mid: Hydnodontaceae, Rhinocladiella, and Helotiales; Late: Rozellomycota, Trichoderma, and Hypocreales).
The identification of decay stage-specific indicators, further underpins the successional nature of bark fungal communities. In early bark decay, only wood saprotrophs are recovered, all but Tulasnella are Sordariomycetes.
In intermediate bark decay most taxa - excl. Ganoderma adspersum and a Hydnodontaceae OTU that BLASTn matches Subulicystidium spp., are difficult to interpret. Rhinocladiella (Herpotrichiellaceae, ‘black yeasts’) and Mortierella are common in organic soils [68–71]. Serendipita species (e.g., S. vermifera) can occur free-living on wood [72], but are best known as root endophytes and mycoparasites. Chaetosphaeria decastyla is a common saprobe on decaying wood of many hosts, incl. Beech [73].
In late bark DS, most annotations are above the species level, complicating interpretation. Only SH1450888.10FU is annotated at the species level (Ilyonectria mors-panacis), and while this SH seems to be geographically widespread, its exact taxonomic annotation should be cautiously interpreted as evidenced by the ten Ilyonectria species names associated with sequences incorporated in the SH. Nevertheless, we know from surveys that many Nectria(-like) species occur in the sampling area. Likewise, the identified Trichoderma OTU (Hypocreales) BLASTn matches to T. viride, for which we have often observed both ana- and teleomorph stages on Beech deadwood. However, caution is warranted as the ITS region shows little variation in Trichoderma, complicating identification beyond the genus level [74].
Ectomycorrhizal Colonization of Beech Deadwood
In the morphotyping analysis, around half of the investigated saplings growing on logs were colonized by EcM fungi. EcM coverage, species diversity and proportion of colonized saplings were significantly lower compared to saplings growing in soil. Only two species could be identified forming associations with saplings growing on deadwood: Laccaria amethystina and Tomentella sublilacina. Tomentella is multistage, highly competitive ECM genus in mature forests [75]. T. sublilacina is a corticoid species forming fruiting bodies on wood and was also identified by Tedersoo et al. [25,76] on saplings growing on decaying logs of Betula pendula, Picea abies and Nothofagus cunninghamii. Baldrian et al. [17] metabarcoded course woody debris in a Beech dominated forest and found the species to be the second most common EcM at 0,45% of reads. Interestingly, only one sapling was colonized with both L. amethystina and T. sublilacina, which hints at competitive exclusion. Indeed T. sublilacina has been found to be capable of competitive exclusion of e.g., Tylospora fibrillosa [25] and Lactarius spp. [26].
Laccaria amethystina is considered a late successional species In Europe [77], and can form basidiomes on decaying wood of Beech [65], Norway spruce [78] and Silver fir [79]. This species produces small, short-lived genets dedicated to the production of meiospores [80]. Moreover, it prefers to associate with beech saplings within our study area [81]. Supporting this, Grebenc et al. [82] found a positive association between the occurrence of L. amethystina and the number of Beech seedlings and Hortal et al. [83] demonstrated that beech roots can be simultaneously colonized by multiple L. amethystina genets. These characteristics underscore its classification as an R-strategist, adept at spore production and dispersal, which explains its propensity to rapidly colonize new microsites within its late successional habitat.
Despite the relatively low abundance of mycorrhizal reads, several mycorrhizal SH were detected. Ectomycorrhizal (EcM) species accounted for 1.18% of the total reads, while endomycorrhizal species made up a slightly higher proportion at 1.4%, as illustrated in
Figure 6. We visually confirmed the presence of arbuscular mycorrhizal fungi by staining
Acer saplings growing on certain Beech logs (results not shown, staining method according to Vierheilig et al. [84]). Supporting our observations, Baldrian et al. [17] found EcM fungi in similarly low abundances across early and intermediate log decay stages of Beech, only becoming more abundant in late log DS. Our analysis revealed no significant correlation between the presence of logs with and without regeneration. Suggesting that, while specific mycorrhizal taxa are effectively colonizing decaying bark, their distribution is independent of sapling presence. The absence of correlation indicates that other environmental or biological factors have a more pronounced impact on shaping these mycorrhizal communities. Our metabarcoding results offered limited insights into the presence of ectomycorrhizal (EcM) fungi in deadwood. In contrast, morphotyping is per definition more detailed as it also demonstrates successful establishment. Like the results of our morphotyping analysis, EcM read abundance and diversity are higher in soil than in bark, with EcM diversity in bark being a subset of that in soil. Thus, both methods indicate that soil is the more beneficial habitat for mycorrhization.
Across our analyses, L. amethystina and T. sublilacina are identified as the key EcM in bark. In terms of EcM establishment, two critical factors can be identified: the source of the inoculum (via propagules or mycelium emerging from the soil) and the timing of arrival (before or after seedlings arrive). Mycelia in the soil may respond to the presence of deadwood by initiating growth into it. Initially, wood is resistant to penetration, but inner bark, with its softer consistency and quicker degradation, may serve as an early colonization pathway for soil-based EcM. Another possibility is that mycelium grows out of the soil regardless, as some fungi, such as Tomentella spp., are known to use wood as a substrate for fruiting. Alternatively, EcM can establish within deadwood bark through propagules, deposited either from the air or vectored by arthropods. The latter route benefits from direct deposition into the decaying substrate, a process well-supported by studies demonstrating the effectiveness of arthropods as fungal vectors [85–87].
While EcM are able to colonize deadwood, their viability in the absence of seedlings remains uncertain. Without seedlings, deadwood could act as a refuge from intense competition in the soil among well-established mycelia linked within the EcM network, especially among mature beeches. Such EcM may utilize delignified carbohydrates as a carbon source while awaiting their symbiotic partners [88,89], although such capabilities are now questioned [23]. This process, involving the formation of common mycelial networks prior to the arrival of saplings, could explain the enhanced sapling establishment sometimes observed in these specific forest microhabitats [25]. Alternatively, successful EcM establishment might require germination in the presence of roots, raising questions about whether the EcM reads detected in logs without seedlings are from spores, mycelia doomed to perish, or necromass. Our data present mixed signals. First, there is no evidence that EcM accumulation correlates with bark decay stage (DS), suggesting either that establishment is difficult after early bark DS, or more likely, that EcM do not accumulate in time. Second, the absence of a signal of Beech regeneration on mycorrhizal reads indicates that they occur independently of their partners. This dual pattern supports our belief that colonization can happen in the absence of seedlings, but forming symbiosis is crucial for the continued survival of mycorrhizae in decaying bark. Mechanistic experiments (e.g., Fukasawa and Kitabatake [90]), or sampling forests without natural regeneration would enhance our understanding.
Database
By prioritizing the classifiability of our data within the SH framework, our study demonstrates that sequences, even those with moderate average quality, can be accurately classified. In some instances, our data aligned more closely with UNITE’s reference sequences for a given SH than with our own Sanger-generated references (e.g., for Cortinarius alboadustus). User-defined references can also be included if existing SHs are inadequate, as demonstrated in our analysis of mock and bark samples. For example, we added references for mock and ectomycorrhizal taxa that could not be confidently assigned to a SH. Unfortunately, incorporating such references also hampers cross-study communication of taxa. In that regards, the nascent SH-matching tool [91] deserves further attention as it make possible the public assignment of new SH, but we did not extensively test it.
We assumed EcM in our sampling area to be well databased, and indeed many reference sequences are available for e.g., Laccaria amethystina. Nevertheless, we encountered difficulties in mapping zOTUs of Laccaria and Tomentella to SH, as well as issues with the interpretation of some taxon names, such as Illyonectria mors-panis. These problems may stem from the proliferation of sequence data and inconsistent sequence annotations, which complicate the precise definition and annotation of SH. Addressing these challenges will require manual curation, a daunting task at the scale of UNITE.
We recommend leveraging reference-based approaches, such as the SH-approach, in well-characterized communities, i.e., in regions or substrates that have been extensively sampled. A significant benefit of this approach is superior data retention and aggregation at the SH level, facilitating communication and comparisons of taxonomic units across analyses and studies and enhancing reproducibility.
Nanopore Metabarcoding
We found Nanopore to be a suitable platform for metabarcoding, although currently only a fraction of the generated data meets the quality required for broad-scale adoption. Despite these limitations, the platform offers unique advantages beyond the elimination of traditional read length constraints, making it particularly advantageous in the field of mycology, which heavily involves citizen scientists [92,93]. The low initial capital investment and the presence of an active user community contribute to its accessibility, which makes it a practical option for both amateurs and smaller research groups. Moreover, the ability to conduct in-house sequencing eliminates the need for outsourcing, thereby providing autonomy over workflows and data, empowering especially citizen scientists.
The Nanopore metabarcoding field is experiencing rapid growth, with a variety of tools and analysis strategies being developed. We built eNano to be a simple to understand tool, with straightforward integration of standard metabarcoding steps. Therefore, it is not designed to outperform emerging tools but rather to serve as a valuable baseline for comparison. The simple installation process (a single binary file) and the clarity of the procedures make eNano also suitable for educational purposes.
Considering the dynamic nature of this field, we recommend that researchers also explore other strategies like implementing Unique Molecular Identifiers [54,55] or tools such as EMU [94] and PIMENTA [95]. Noteworthy alternatives such as Decona [12] and CONCOMPRA [14] do not require a reference databases to generate taxonomic units, and have demonstrated to be viable alternatives [11,14,96]. Regardless of the approach, benchmarking tools should ideally include a comparison to a properly quality-filtered OTU table as a baseline.
Moreover, targeting longer marker regions such as the full rDNA operon shows promise for improved identification and phylogenetic placement [97,98], though it requires optimization of long-range PCR and handling increased chimera formation. In our dataset, 9.2% of reads were chimeric, consistent with other reports of elevated chimera rates compared to short-read sequencing [14,99–102]. While our Vsearch-based chimera detection is partially effective, some chimeras likely remain undetected. Even in our SH-approach we found several per chance assigned to an SH, which we flagged as chimeric in UNITE. Artificial Intelligence could be particularly useful in the detection of chimeric sequences, which, although readily identifiable by humans in multiple sequence alignments, often elude conventional bioinformatic detection methods.
As the field continues to evolve, engaging with a variety of tools will be crucial for maximizing the effectiveness and accuracy of Nanopore metabarcoding analyses.
5. Conclusions
This study evaluates the effectiveness of Nanopore sequencing for fungal metabarcoding using the full-length ITS region using both a mock and natural community. We demonstrate that both the reference-based SH-approach (mapping reads to a database) and the OTU-approach (de novo clustering) are viable options. The SH-approach ensures stable taxonomic unit recovery, and is only minimally impacted by low Phred scores (<Q16) when strict mapping criteria are applied. The OTU-approach is feasible but requires stricter quality filtering, which currently results in significant data loss. However, it is capable of identifying taxonomic units that are not represented in existing databases. We recommend using reads with a quality score of ≥ Q25 when constructing 98% OTUs.
Our analysis of fungal communities in decaying bark emphasizes the decay gradient as a primary determinant of community composition. Ectomycorrhizal fungi are found to be less diverse and abundant in bark compared to soil, indicating that soil provides a more favorable environment for mycorrhization. Despite this, Laccaria amethystina and Tomentella sublilacina are identified as key species colonizing decaying bark and capable of successful establishment. These findings suggest that decaying bark can serve as a microhabitat for specific mycorrhizal species, providing a potential explanation for the presence of mycorrhizae observed in early decay stages in other metabarcoding studies on deadwood.
We introduce the eNano tool, which offers a simple, baseline approach to Nanopore metabarcoding, suitable for broad adoption. Overall, Nanopore offers a promising platform for fungal metabarcoding, with the potential to yield valuable ecological insights and engage a wider audience of researchers and citizen scientists. As read quality continues to improve, the use of Nanopore data in a more classical metabarcoding fashion, whether through reference databases or by constructing OTUs, is likely to take hold.
Figure 1.
Schematic representation of Fagus sylvatica deadwood microsite for regeneration. Seedlings root in the soft decaying outer layer, the inner bark. Ectomycorrhizal fungi may also proliferate in this substrate, helping seedlings grow.
Figure 1.
Schematic representation of Fagus sylvatica deadwood microsite for regeneration. Seedlings root in the soft decaying outer layer, the inner bark. Ectomycorrhizal fungi may also proliferate in this substrate, helping seedlings grow.
Figure 2.
Taxonomic unit recovery using all reads and excluding singleton units (zOTUS, 98% OTUs or SHs), Phred-specific datasets subsampled to amount of reads at Q28. Each datapoint represent 4245 reads for the mock dataset and 42560 for the bark dataset. Lines represent a loess smoother to visualize trends. (a) intermediate zOTU step in SH-approach; (b) OTU-approach (c) SH-approach (d) 98% OTUs merged into SH was excluded a priori (gray-shaded), shown for completeness. Graphs standardized by maximum recovered units per analysis per dataset. Nmax-zOTU-bark = 39382; Nmax-zOTU-mock=3818; Nmax-98%OTU-bark=39348; Nmax-98%OTU-bark=3722; Nmax-SH-bark=546; Nmax-SH-mock=20.
Figure 2.
Taxonomic unit recovery using all reads and excluding singleton units (zOTUS, 98% OTUs or SHs), Phred-specific datasets subsampled to amount of reads at Q28. Each datapoint represent 4245 reads for the mock dataset and 42560 for the bark dataset. Lines represent a loess smoother to visualize trends. (a) intermediate zOTU step in SH-approach; (b) OTU-approach (c) SH-approach (d) 98% OTUs merged into SH was excluded a priori (gray-shaded), shown for completeness. Graphs standardized by maximum recovered units per analysis per dataset. Nmax-zOTU-bark = 39382; Nmax-zOTU-mock=3818; Nmax-98%OTU-bark=39348; Nmax-98%OTU-bark=3722; Nmax-SH-bark=546; Nmax-SH-mock=20.
Figure 3.
Mock community recovered at different Phred scores using SH-approach. *Reference sequences added to database.
Figure 3.
Mock community recovered at different Phred scores using SH-approach. *Reference sequences added to database.
Figure 4.
Taxonomic composition across approaches. Right: Composition per Bark decay stage for the SH-approach.
Figure 4.
Taxonomic composition across approaches. Right: Composition per Bark decay stage for the SH-approach.
Figure 5.
Ordination of both approaches on Aitchison distances. Samples originating from the same logs are connected by the same line types. (a) NMDS of SH-approach; (b) NMDS of OTU-approach; (c) PCoA of SH-approach; (d) PCoA of OTU-approach.
Figure 5.
Ordination of both approaches on Aitchison distances. Samples originating from the same logs are connected by the same line types. (a) NMDS of SH-approach; (b) NMDS of OTU-approach; (c) PCoA of SH-approach; (d) PCoA of OTU-approach.
Figure 6.
Relative read abundance of EcM species in bark according to presence of regeneration. Bars indicate samples from the same log. Circles indicate logs where EcM where found by morphotyping. Large asterisks under bars indicate samples not used in morphotyping. Sample names from left to right: ZF031, ZF302, ZF311, ZF319_1, ZF319_2, ZF319_3, ZF320, ZF321_1, ZF321_2, ZF321_3, ZF322, ZF323, ZF324, ZF325, ZF326, ZF327, ZF’681’, ZF317, ZF328, ZF401. *dummy SH codes.
Figure 6.
Relative read abundance of EcM species in bark according to presence of regeneration. Bars indicate samples from the same log. Circles indicate logs where EcM where found by morphotyping. Large asterisks under bars indicate samples not used in morphotyping. Sample names from left to right: ZF031, ZF302, ZF311, ZF319_1, ZF319_2, ZF319_3, ZF320, ZF321_1, ZF321_2, ZF321_3, ZF322, ZF323, ZF324, ZF325, ZF326, ZF327, ZF’681’, ZF317, ZF328, ZF401. *dummy SH codes.
Table 1.
Decay stages of inner bark layer in European beech (Fagus sylvatica) deadwood. Note that Bark decay stage (DS) may differ within the same log.
Table 1.
Decay stages of inner bark layer in European beech (Fagus sylvatica) deadwood. Note that Bark decay stage (DS) may differ within the same log.
Bark decay stage |
Description |
Early |
Layer (incl. inner bark) difficult to penetrate with a metal spoon (force needed), almost all fibers intact, dark yellow in color. Corresponds to log decay stage (DS) 1 or early 2. |
Intermediate |
Layer crumbles and can be scooped up with a metal spoon. Crumbles with minimal force between fingers, some fibers intact, light brown in color. Corresponds to a late log DS 2 or early 3. |
Late |
Layer is almost fully decayed, penetrable without force, it easily disintegrates between fingertips – resembles soil, no fibers intact, (dark) brown in color. Corresponds to a late log DS 3 or 4. |
Table 2.
High-fidelity taxa for the OTU- and SH-approaches that met a significance threshold of p<0.01. Fidelity scores are those from SH-approach if taxon is present in both approaches.
Table 2.
High-fidelity taxa for the OTU- and SH-approaches that met a significance threshold of p<0.01. Fidelity scores are those from SH-approach if taxon is present in both approaches.
DS Bark |
lowest taxon name |
fidelity |
p |
SH | OTU |
Early |
Chaetosphaeriaceae (SH0980871.10FU) |
0.798 |
0,001 |
● |
● |
Tulasnella (OTU 1097) |
0.728 |
0,003 |
|
● |
Sordariales (SH0840221.10FU) |
0.779 |
0,008 |
● |
● |
Pleurothecium recurvatum (SH0926211.10FU)* |
0.950 |
0.007 |
● |
● |
Mid |
Ganoderma adspersum (SH0762773.10FU) |
0.839 |
0,001 |
● |
|
Hydnodontaceae (OTU 448) |
0.729 |
0,004 |
|
● |
Herpotrichiellaceae (SH0970954.10FU) |
0.751 |
0,005 |
● |
● |
Rhinocladiella (OTU 75 | SH0970950.10FU) |
0.697 |
0,005 |
|
● |
Helotiales (OTU 474) |
0.729 |
0,006 |
|
● |
Mortierella (SH0960682.10FU) |
0.745 |
0,008 |
● |
● |
Serendipita (SH0743656.10FU) |
0.730 |
0,008 |
● |
● |
Chaetosphaeria decastyla (SH0980872.10FU) |
0.655 |
0,010 |
● |
|
Late |
Hyaloscyphaceae (SH0973189.10FU) |
0.707 |
0,006 |
● |
● |
Rozellomycota (SH0910702.10FU) |
0.707 |
0,006 |
● |
|
Ilyonectria mors-panacis (SH1450888.10FU) |
0.707 |
0,007 |
● |
|
Rozellomycota (OTU 1883) |
0.711 |
0,007 |
|
● |
Pezizomycotina (SH0755218.10FU) |
0.808 |
0,008 |
● |
● |
Trichoderma (OTU 4045) |
0.707 |
0,008 |
|
● |
Hypocreales (OTU 2917) |
0.707 |
0,009 |
|
● |
Leotiomycetes (SH0948543.10FU) |
0.707 |
0,010 |
● |
● |