1. Introduction
According to the different virus hosts, viruses can be roughly classified into plant viruses, animal viruses, and bacterial viruses.
Iflavirus, belonging to small RNA viruses, is an animal virus composed of single-stranded, sense, non-segmented RNA sequence [
1]. According to the Iflaviridae chapter of the ICTV online report, there are 16 recognized species within the genus
Iflavirus. The virus particles of this genus are roughly spherical, with a diameter of about 22-30 nm.
Iflavirus has the basic characteristics of small RNA viruses: the 5’ end of the RNA strand is connected to the viral protein genome (VPg), and the internal ribosome entry site (IRES)a is contained in the 5’ non-coding region, which mediates the initiation of RNA translation [
2,
3]. Therefore,
Iflavirus can initiate RNA translation independent of the 5’ cap structure [
4].
Iflavirus has a single open reading frame (ORF), which can be directly translated into a polyprotein, which then post-translationally hydrolyzed into several structural and nonstructural proteins (functional proteins), i.e., structural proteins and functional proteins are cleaved and separated after translation before implement their biological functions. The 3’ non-coding region of
Iflavirus contains a typical poly-A (adenine) tail [
2,
5]. According to the related reports, a variety of
Iflavirus viruses have been found, such as deformed wing virus [
6], infectious flacheric virus [
7],
Perina nuda picornalike virus [
8],
Ectropis obliqua picornalike virus [
9],
Varrora destructor virus-1 virus [
10] and
sacbrood virus(SBV) [
11].
At present, most known hosts of iflaviruses are arthropods, especially members of Hemiptera and Lepidoptera [
12,
13]. Infested hosts exhibit abnormal growth and development and have a high mortality rate [
14]. However, not all iflaviruses cause observable pathology, and many appear to be transmitted vertically. The gregariousness of insects promotes the spread of viruses in their community [
15]. Once the
Iflavirus enters the host cell, the infection process is very rapid, and the offsprings of the virus replicate in large numbers within a few hours, and when
Iflavirus infects the host, it will quickly infect the brain, intestine, and gland tissues of the host [
16]. Iflaviruses pose a significant threat to certain essential environmental insects. It has been reported that Iflavirus infection of
Spodoptera exigua (
S. exigua) can increase host's susceptibility to nucleopolyhedrovirus infection, which could increase the larval mortality of
S. exigua [
17]. Infected by Iflavirus, the weight of
S. exigua larvae is reduced and the sex ratio is unbalanced.
Helicoverpa armigera is one of the most important agricultural pests of cotton and other important economic crops in the world.
Iflavirus is mainly distributed in the fat bodies of larvae and adults of the infected
H. armigera, which is not conducive to the growth and reproduction of this notorious pest [
18]. In some cases, Iflavirus infection can be fatal to the host.
sacbrood virus (SBV) mainly exists in the fat cells of the larvae of honeybees, impairing the metabolic function of honeybees and leading the failure of bee pupation, and ultimately causing the death of the hosts [
19,
20].
The Homoptera insect,
Pyrops candelaria, is strange and novel for its long nose and brightly colorful wings.
P. candelaria is also called Longan lanternfly, mainly damaging longan, litchi, olive, orange, mulberry, mango, and pomelo crops by sucking the juice from the trunks of these fruit trees [
21,
22]. Longan lanternflies prefer warm temperatures and high moisture, so they are distributed in the fruit-bearing forests in the low-altitude regions of the tropic and subtropic area, and they are found mainly in China (including Hainan, Fujian, Guangdong, Guangxi, and Hong Kong), Thailand, Vietnam, Malaysia, Cambodia, Sikkim, India, and Indonesia [
21,
22]. The development of longan lanternflies experiences three stages: egg, nymph, and adult, and their host preference are broad in the nymph stage, but become narrow to a few plant species in the adult stage, and the invasion of longan lanternflies serious endangers the culture and harvest of pomelo and longan trees [
22]. The nymph of
P.candelaria is good at bouncing, and the adult is good at bouncing and flying [
23].
P.candelaria mainly endangers longan trees, when
P. candelaria is prevalent, the dense population of the insects causes serious damage to longan trees. Adults and nymphs suck plant juice by inserting their needle-like mouthparts into the cortex of trunks and branches. After being sucked by longan lanternflies, small black spots would gradually appear in the cortex [
24]. When the number of
P. candelaria is too large the harm caused by them might become serious, which can weaken the longan trees, stunt their growth, make branches dry, and result in fruit drop or poor fruit quality, and the excreta of this insect can even cause soot disease of longan tree [
24]. And it’s found that, in a forest mixed planting of longan, litchi, olive and mango, the population of this insect is usually especially large [
25].
Longan lanternflies have co-evolved with various fruit trees, microbial pathogens, and other environmental insects through long-term interaction, and may play an important role in the transmission of some microbial pathogens of fruit trees or from other agricultural important insects. With the advancement of sequencing technology, microbial research has also entered the era of omics, in which to screen and identify possible microbial pathogens is one of the most important parts. The study found a new virus in the course of screening microbial pathogens carried by P. candelaria through high-throughput RNA sequencing, and the virus was further characterized in order to lay a solid foundation for revealing the possible interaction of the virus and its hosts, and providing new insights to reveal the role of P. candelaria as a potential vectot for microbial pathogens.
2. Materials and Methods
2.1. Insect Sample Collection
P. candelaria samples were collected from the longan forest in Fujian Academy of Agricultural Sciences (FRIFAAS) and Minhou County as insect samples in this study.
2.2. RNA Sequencing (RNA-seq) and Data Assembly
In this study, the collected
P. candelaria samples were frozen and transported to BGI (Beijing Genomics Institute) for RNA extraction and RNA-seq. In this study, samples were collected for deep RNA-seq. After the possible contaminated DNA was degraded by DNaseI, ribosomal RNA was removed from the total RNA with the kit of Ribo-Zero
TM, and all mRNA was retained to the greatest extent. NEB Fragmentation Buffer was mixed with the greatest extent of retained mRNA to cut the RNA into 250 to 300 bp short fragments. Then, the first strand of cDNA was synthesized with random hexamers as primers, and the second strand of cDNA was further synthesized by adding buffer, dNTPs, RNase H, and DNA polymerase I at Thermomixer, and purified by AMPure XP beads. A-Tailing Mix was added to repair the end of double-stranded cDNA and add A base at the end of 3’end of it. Then, the linker mix was added to connect the linker with cDNA, the joint product was purified and recovered with magnetic beads. The library was established by amplifying recovered cDNA in a PCR mix, and the concentration of the library was quantified by Qubit2.0 Fluorometer, and the library was diluted to 1.5 ng/µL. At the same time, the quality of the library was assayed by Agilent 2100 Bioanalyzer. The qualified library is denatured into a single chain by adding NaOH, and the denatured and diluted library was added into FlowCell, hybridized with the linker in the FlowCell, completed bridge PCR amplification in cBot, The sequencing platform was illumina novaseq6000 to generate 15.48 GB 150 bp paired-end reads In quality control, the software fastqc (version 0.11.8) was chosen to check the raw clean reads from RNA-seq; and then the quality of clean reads was checked by software fastp (version 0.20.1) by which low-quality sequence reads were removed again. Following quality control, the clean reads were assembled
de novo by Trinity (version 2.15.1) with default assembly parameters and blasted in the database of virus (
https://ftp.ncbi.nlm.nih.gov/refseq/release/viral/).
2.3. Rapid Amplification of cDNA Ends
RNA was extracted from P. candelaria samples according to the methods described in Materials and Methods 2.2, and cDNA was obtained by reverse transcription. The specific primers were used to identify whether the sample carried PyCaV virus, and the cDNA carrying the virus was stored at −20°C. Race double-end specific primers and nested primers were designed according to the sequence obtained by the second RNA-seq carried out in this study. PCR amplification was performed using cDNA as a template using 5'-RACE end-specific primers and reverse transcription primers, and the PCR products were detected by agarose gel electrophoresis to ensure specific amplification. The PCR product was purified and the purified cDNA chain was used as a template for subsequent experiments. The adaptor was ligated to the 5' end of the cDNA chain. After the ligation reaction was completed, the adaptor-specific primers and internal primers were used for nested PCR amplification. The first round of PCR amplification was performed using internal primers and adapter-specific primers. The first round of PCR product was used as a template, and the second round of PCR amplification was performed using nested primers. PCR products were detected by agarose gel electrophoresis to ensure specific amplification. The PCR product was transformed into E. coli Competent Cell JM109 vector, and the positive clones were selected for sequencing to obtain the 5' and 3' end information of the cDNA sequence. The 3'RACE amplification primers of the target gene were designed, and the obtained cDNA was used as a template for 3'RACE PCR amplification. The PCR product was purified and recovered by DNA, and the 3-terminal full-length sequence was obtained by DNA sequencing.
2.4. Bioinformatics Analysis
The conserved domain of the amino acid sequence was predicted by were predicted with the NCBI Conserved Domain Database (CDD,
https://www. ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi). Phylogenetic analysis was performed based on the aligned conserved RdRp domain sequence from PyCaV and other members in family Iflaviridae, two members in
Picornavirales were selected as outgroup. MAFFT (v7.310) [
26] was used to perform multiple sequence alignment with default parameters, while the best substitution model was estimated by Prottest3 (version 3.4.2) [
27]. A maximum likelihood phylogenetic tree was constructed using PhyML (version 3.2.0) with 1000 bootstrap replicates with the best estimated model [
28].
2.5. RNA Extraction
In this study, the collected
P. candelaria samples were subjected to independent RNA extraction after alcohol disinfection.The preparation of RNA was following the protocol previously used by Toni et.al. with minor modifications [
29]. The insect samples were first disinfected with sodium hypochlorite and ground into powder by liquid nitrogen. After 1 mL
Trizol reagent mixed with the powder for 30 min at 4°C, 200 μL pre-cooling dichloromethane was added, and mixed thoroughly for 2 min, then centrifuge at 4°C, 12000 r/min for 20 min. A portion of 450 μL supernatant was transferred into the new sterile RNase EP tube and mixed with another 200 μL dichloromethane, mixed repeatedly for 2 min, then centrifuge at 4°C at 12000 r/min for 10 min. Following 350 μL supernatant transferred into a new sterile RNase EP tube, the same amount of pre-cooling isopropanol was added and mixed, and kept under -20°C for 20 min, then centrifuged under 4°C at 12000 r/min for 20 min. The pellets were washed with 1 mL of pre-cooled 75% ethanol (DEPC water) twice, air dried and finally dissolved in 20 μL DEPC water. The quality of prepared RNA samples was monitored by electrophoresis and with Nanodrop 2000, and the qualified RNA samples (the ratio of OD260/OD280 reaches about 2.0) were stored at -80°C for further use.
2.6.cDNA Preparation
The cDNA was obtained by reverse transcription of total RNA using Revert Aid First Strand cDNA Synthesis Kit (Quan, Beijing). 5 μg RNA, 1 μL Oligo [
30] 18 Prime, 2 μL RNase-free water supplement were mixed, and put in a 65°C water bath for 5 min, then transferred immediately into an ice bath for 2 min. After the ice bath, 10 μL R-Mix, 1 μL E-Mix, and 1 μL gDNA-Remover were added to the system and kept at 42°C for 30 min, then heat-shocked at 85°C for 5 s to inactivate the activity of the reverse transcriptase. Finally, the prepared cDNA was stored at -20°C.
2.7. DNA Purification
The PyCaV virus fragment was amplified by PCR with specific primers, and the DNA was purified and recovered, and the recovered DNA fragment was sequenced. After electrophoresis, Gene Gel Extraction Kit (Omega, Shanghai) gel recovery kit was used for the recovery of DNA products. The amplified DNA fragment was mixed with a 10×DNA loading buffer, and analyzed by electrophoresis with a 1% agarose gel at 200 V (180 mA) in TBE buffer for 20 min. After electrophoresis, the target band was cut out and transferred into a sterile EP tube and mixed with the same volume of Binding buffer, then the gel was melted in a 65°C water bath. The melted mixture was transferred to an adsorption column from the gel recovery kit, centrifuged at 10000 r/min for 1 min, and the filtrate was discarded. Following 300 μL Binding buffer was added to the adsorption column, and centrifuged at 12000 r/min for 1 min, 700 μL SPW wash buffer was added to the adsorption column, centrifuged at 14000 r/min for 1 min. The adsorption column was centrifuged at 15000 r/min for 2 more min, followed by adding 30 μL Elution buffer to the adsorption column, standing at room temperature for 2 min, then centrifuged at 15000 r/min for 2 min, and the DNA recovery product was collected. Finally, the concentration of DNA recovery product was measured by Nanodrop 2000 and stored at -20°C for further use.
2.8. Virus Verification
In this study, the following methods were used to identify whether the P. candelaria samples carried PyCaV virus. After surface disinfected by sodium hypochlorite, the longan lanternfly samples were ground into a powder with liquid nitrogen and lysed at 4℃ for 30 min with 1 mL Trizol reagent. Followed by added with 500 µL chloroform, and standing for 3 min, then, the samples were centrifuged at 12000 rpm for 1 min. After the supernatant was mixed with the same volume of isopropanol and kept still at -20℃ for 30 min, the mixture was centrifuged at 12000 rpm for 1 min. The supernatant was discarded, and the pellet was thoroughly washed with 20% ethanol prepared with DEPC treated water. After further centrifuged at 12000 rpm for 1 min, let the pellet stand for 10 min to air dry. Finally, the prepared total RNA was dissolved in 20 µL DEPC pre-treated water, and kept at -80℃ for further use. The total RNA was reverse transcribed into cDNA with the kit of TransScript®One-Step gDNA Removal and cDNA Synthesis SuperMix (Transgen Biotech, Beijing). The cDNA was used as a template and amplified with primers designed according to the sequence information obtained from RNA-seq, and the amplification results were finally verified by agarose gel electrophoresis, and sequenced by Fuzhou Sunya Biotechnology Co., Ltd.
2.9. Statistics Analysis and Model Construction
The samples of longan lanternflies were collected and classified according to the location and time, and they were collected from two locations: Fruit Research Institute of Fujian Academy of Agricultural Science (FRIFAAS) and Minhou County of Fuzhou City. The total sample number and number of samples carrying viruses at each time were counted, and the positive rate was calculated (Number of samples carrying viruses/The total sample number). The histogram was established by importing statistical data into the software GraphPad Prism 9.0, and a T-test was selected for significance analysis (*P < 0.05). The simulated model was constructed with Origin 2019 and choosing the method of the scatter plot, and selecting related curve trends according to the distribution pattern of obtained point. If Nonlinear Curve was chosen, and the Logistic model was selected for further fitting.
4. Discussion
According to the classification of ICTV, family Iflaviridae contains only one genus
Iflavirus and several unassigned members [
1]. It has been reported that most known
Iflavirus viruses are isolated from arthropods, and the main infected host of the virus is insects.
Iflavirus infection has different effects on hosts. Some hosts show no obvious symptoms after infection, but some hosts suffer abnormal development after the infection with
Iflavirus, for example: silkworms, bees and many other invertebrates infected by corresponding
Iflavirus virus can lead to fatal effect [
1,
31,
32]. Honeybee (
Apis mellifera), one of the common hosts of
Iflavirus infection [
33], are indispensable pollinators in ecosystems. [
34]. However, in recent years, reports on the decline in honeybee pollination have been common, posing a serious threat to biodiversity and global food security [
35,
36,
37]. McMahon D.P. et al. reported that
deformed wing virus (DWV), one of the
Iflavirus virus, can infect honeybees through
Varroa and
Tropilaelaps, resulting in a dramatic decrease in the number of honeybees [
38]. When DWV infects honeybees, it can spread vertically within the bee population and increase the prevalence of the virus among honeybees [
38]. In addition,
Iflavirus can co-infect a host with multiple viruses and affect the growth and development of the host [
39]. Two
Iflavirus viruses were found in armyworm
Spodoptera exigua and named as
Spodoptera exigua iflavirus 1 (SeIV-1) [
40] and
Spodoptera exigua iflavirus 2 (SeIV-2) [
41]. Experiments showed that
S. exigua larvae infected with SeIV-1 and SeIV-2 increased the susceptibility of
S. exigua to
S. exigua multiple nucleopolyhedrovirus (SeMNPV) [
17].
S. exigua infected with SeIV-1 and SeIV-2 viruses leads to a decrease in the weight gain of its larvae, and an imbalance in its sex ratio [
42,
43]. Thereby,
Iflavirus has serious harm to some arthropods, especially insects. However, according to current reports, the species number of
Iflavirus found is relatively small [
43]. Therefore, finding potential members of
Iflavirus and finding potential hosts of
Iflavirus are of great research significance for biosafety.
Dimocarpus longan is an important subtropical fruit tree of family Sapindaceae, commonly known as longan in Asia [
44]. According to published reports, longan fruits have a variety of functions, and their secondary metabolites have antioxidant, anti-cancer and other immunomodulatory activities [
45,
46,
47]. In addition, the flowering period of longan is in spring and summer, which is an excellent nectar source plant. Pattamayutanon et al.found that honey derived from longan flowers contains more volatile organic compounds than all other flower sources [
48].Therefore, longan is an important fruit crop. However, there are many pathogens that endanger the growth of longan, resulting in a serious reduction in its yield and quality.
Trialeurodes vaporariorum, P. candelaria Deudorix dpijarbas Moore, and
Cornegenapsylla sinica are the main pests that harm the growth and development of longan [
49]. Tran et al. reported that the infection of
C. sinica post a devastating effect on the growth of longan by spreading
longan pathogen witches broom virus (LgWB) [
50]. Therefore, it is very important to explore the potential microbial pathogens carry by main longan pests and their transmission mechanism. The
P. candelaria is widely distributed in China, India, and many other countries in Southeast Asia and harms a variety of fruit trees [
22]. It is also the main pest of longan. However, there are still few reports on the mechanism by which
P. candelaria endanger longan as a microbial pathogen vehicle, herein, it is urgent to reveal the diversity of the microbial pathogens that this insect vector carries. In this study, a new virus PyCaV was identified in
P. candelaria for the first time by RNA-seq. This is the first
Iflavirus virus found in
P. candelaria, which provided a new clue for the study of the pathogenicity of
Iflavirus viral and cast new insight to the hazard of
P. candelaria as a pathogenic vector of virus. However, the potential biological effect of PyCaV carrying by
P. candelaria to longan or other insects remains to be further explored.
Through RNA-seq, based on the classification information, viral contig from 57 families and 4 unclassified groups were detected in
P. candelaria (
Figure 1A), which broadened our understanding of the virus that can be carried by
P. candelaria. Apart from PyCaV, a near full length contig showed limited sequence identified with a member in Phasmaviridae, suggesting the high possibility of another virus carried by
P. candelaria. Through RACE experiments and the construction of related phylogenetic trees, the similarity between PyCaV and
Nilaparvata lugens honeydew virus 3 (NLHV3) was 67.67%, and it was distributed in the same cluster on the phylogenetic tree (
Figure 2B). Based on the species demarcation criteria from ICTV, our results supported that PyCaV is a new member of family Iflaviridae carried by
P. candelaria. Since
Iflavirus only infects arthropods, especially insects, and as the largest group of animal, insect accounts for two-thirds of animal species [
51]. In addition, insects occupy most of the ecological niches. More than 80% of flowering plants in the world are considered to be dependent on insect pollination, and about three-quarters of crops are also dependent on insect pollination [
52,
53]. Insects also become an important part of modern terrestrial ecosystems by regulating the carbon cycle in ecosystems [
54,
55]. Therefore, it is particularly important to further reveal the relationship between
Iflavirus and insect hosts.
At present, new viruses account for almost 50% of emerging infectious diseases, which seriously threaten agricultural production, national biosafety, and even human survival [
56]. Therefore, this study further explored the distribution of PyCaV virus in the insect. It is revealed that the PyCaV virus is distributed in every part of the
P. candelaria, including the head, chest, and abdomen, which means that
P. candelaria is, at least, one of the most important carriers of the PyCaV virus (
Figure 3D, Group 1). The detection of the body section distributing of this virus reflected that more PyCaV virus accumulated in the chest of
P. candelaria (
Figure 3D, Group 1). The chest of the insect mainly including salivary glands and muscles, and when detecting the tissues inside the insect, it was found that the PyCaV virus mainly accumulated inside piercing-sucking mouthparts and viscera (mainly salivary glands) in the chest (
Figure 3D, Group 2). The accumulation of the PyCaV virus in the piercing-sucking mouthparts and chest viscera inferred that the feeding system of the insect might play a critical role in the keeping and transmission of the virus. The salivary gland is one of the main tissues in psyllid (
Diaphorina citri) that carries the DcFLV (
Diaphorina citri flavi-like virus) [
57]. Through the piercing-sucking mouthpart, the virions of PLRV (Potato leafroll virus) enter green peach aphid (
Myzus persicae) and reach the salivary glands through the circulation in the hemolymph, and the virions are released into the saliva to contaminate other fruit plants [
58]. And in the small brown planthopper (
Laodelphax striatellus), a plasma membrane-associated protein, importin α, is utilized by the RSV (rice stripe virus) to enter the insect cell, especially the cells of salivary glands [
59].
P. candelaria, psyllid, aphid, and the small brown planthopper all belong to Homoptera insects, so our results inferred that the PyCaV virus recovered in this study might follow the route of piercing-sucking mouthpart, the gut, the hemolymph, the salivary gland, and finally to be released into virus sensitive plants through saliva. But this speculation needs to be supported by further determining if it has any plant hosts.
In the study, in all 123
P. candelaria, 33 were found carrying the PyCaV virus, and the virus detection rate (VDR) is about 26%. Compared to the 6.93% detection rate of BtAstV (Bat-transmitted astroviruses), 1.79% detection rate of BtCoV (Coronaviruses), and 0.67% of BtCalV (novel caliciviruses) from bats [
60], the VDR of the PyCaV virus is relatively high, especially in the autumn and spring in FRIFAAS, and the spring in Minhou County (
Figure 4B), which means it is urgent to study its possible pathogenicity before it causes possible harm. To the different sampling regions, the VDRs varied significantly, too. The VDR is about 18% in FRIFAAS and 35% in Minhou County, which reflected the remarkable distribution difference of the PyCaV virus, which might reflect that the location or distance or orther factors which can hinder the migration and spread of the insect carrier might influence the transmission of this virus (Figures 3B and 4A,
Supplementary Table S4). In spring, the VDR is about 46%; in summer, the VDR is about 15%; in autumn, it is about 24%; and in winter, it is about 23%. The VDR showed obvious seasonal variation, in which the VDR reaches its peak in the spring, touches its lowest valley in the summer, and rises in the autumn and winter (
Figure 4B,
Supplementary Table S4), and the VDR showed obvious regional variation (
Figure 4A). Given more longan lanternflies carry the PyCaV virus in the spring, it is speculated that this season may be a high incidence period of PyCaV virus infection from this insects to other possible hosts.
In this study, the novel PyCaV virus carried by P. candelaria were found for the first time, its distribution in the pests and the changing trends of its carrying rates by P. candelaria in different time and space was revealed, which leads to an urgent need to explore whether this virus is harmful to longan tree or its pollinators. Totally, this study laid a new foundation for studying the role of P. candelaria as an insect vector in pathogen transmission, and expanded our understanding of Iflavirus virus.
Author Contributions
Conceptualization, Z.Z. and J.X.; methodology, Z.Z.,H.L. and W.S..; software, J.X., R.M., and R.L.; validation,H.L, D.M.,C.Y.; formal analysis, H.L.,C.Y.; investigation, Z.Z., H.L., J.X., L.H., Y.Y., D.D and J.X.; resources, Z.Z., S.W., J.J. and R.L.; data curation, Z.Z.,Y.Y.; writing-original draft preparation, Z.Z., J.X., and H.L.; writing-review and editing, Z.Z.,J.X. and H.L.; visualization, Z.Z. and J.X.; supervision, Z.Z., and S.W.; project administration, Z.Z. and S.W.; funding acquisition, Z.Z. All authors have read and agreed to the published version of the manuscript.