1. Introduction
Human norovirus (HuNoV) is a major causative agent of acute gastroenteritis in humans of all ages [
1,
2]. Previous epidemiological data suggest that HuNoV may be associated with 30-60% of patients with gastroenteritis [
3,
4,
5]. Moreover, this agent has caused large outbreaks of food poisoning worldwide [
6,
7]. However, effective vaccines and antiviral agents are not available at present [
7]. Therefore, this agent may be a public health concern [
8].
The HuNoV genome is a single-stranded plus-sense RNA with approximately 7.5 kb of nucleotide sequence [
9]. The genome contains three open reading frames (ORFs): ORF1, ORF2, and ORF3 [
9]. ORF1 encodes six nonstructural proteins designated as nonstructural proteins (NS) 1/2–7 [
9]. Of these, the NS7 region encodes the RNA-dependent RNA polymerase (RdRp) protein, while ORF2 and ORF3 encode structural proteins, such as viral protein (VP) 1 and VP2, respectively [
7,
9]. The VP1 protein acts as an antigen and also shows large antigenic variations [
6], although it is not exactly known.
Previous genetic and molecular epidemiological studies have suggested that the HuNoV genome shows large genetic divergence [
8]. Currently, HuNoV is classified into two genogroups, genogroup I (GI) and genogroup II (GII) [
10]. Furthermore, the GI and GII HuNoVs are classified into many genotypes. Such genotypes are based on the
RdRp coding region (
RdRp region) and
VP1 gene sequences [
10]. To date, 60
RdRp (P-types) and 49
VP1 genotypes have been confirmed [
10]. Moreover, recombination occurs relatively frequently between ORF1 and ORF2 [
11,
12], resulting in many chimeric viruses acting as recombinants [
12]. Based on this evidence, HuNoV can represent both the
RdRp region genotype and
VP1 genotypes, such as GII.P6 (
RdRp genotype)-GII.6 (
VP1 genotype) [
13]. However, the role of these chimeric viruses remains unclear.
Molecular epidemiological data on HuNoV infections in humans suggest that certain GI and GII genotypes are prevalent [
14]. These reports also show that GII HuNoV is more dominant than GI HuNoV [
14]. Of these, some GII genotypes corresponding to
VP1 genotypes, such as GII.2, GII.3, GII.4, GII.6, and GII.17, are prevalent types [
15]. However, these epidemiological data may not explain the reasons for the HuNoV epidemics.
Recently, authentic bioinformatic technologies have been used in population genetics, including the study of the evolution of various viruses [
16]. Indeed, these methods may allow us to estimate the phylogeny, genome population, and antigenicity using three-dimensional antigen structures. Information that reflects viral evolution may contribute to a better response to these questions. To date, we have studied the molecular evolution of chimeric HuNoVs, such as GII.P17-GII.17, GII.P2-GII.2, and GII.P16-GII.2 [
17,
18,
19]. However, such studies have not been performed on other GII genotypes to better understand GII HuNoV. Therefore, in this study, we performed a comprehensive molecular analysis of globally collected HuNoV GII.P6-GII.6 and GII.P7-GII.6 strains.
4. Discussion
To better understand the evolution of HuNoV GII.6 strains with different
RdRp types (P6 and P7), we analyzed both the
RdRp region and
VP1 gene using various authentic bioinformatics technologies. First, a time-scaled phylogenetic tree showed that a common ancestor of the P6- and P7-type
RdRp region emerged approximately 50 years ago and uniquely evolved and formed clusters. A common ancestor of P6- and P7-type GII.6
VP1 gene emerged approximately 110 years ago and formed clusters. The dominant type for both the
RdRp region and
VP1 gene was P7-type (
Figure 1A,B). Secondly, the evolutionary rates of both the P6-type
RdRp region and
VP1 gene were faster than those of the P7-type
RdRp region and
VP1 gene (
Table 1). Next, the phylogenetic distances of the P7-type
RdRp region and
VP1 gene were wider than those of the P6-type
RdRp region and
VP1 gene. Furthermore, phylodynamic data showed that the
RdRp region and
VP1 gene population sizes fluctuated after 2000 (
Figure 3A–3F). Some positive selection sites in the VP1 proteins were estimated, and these were located in the antigenicity-related P2 domain. Among these, the positive selection sites in the P7-type VP1 protein overlapped with the conformational epitopes (
Figure 5A–D and
Table S4). These data imply that the GII.6
VP1 gene and VP1 protein uniquely evolved because of recombination between the P6- and P7-type
RdRp regions in the HuNoV GII.P6-GII.6 and GII.P7-GII.6 genomes.
A previous report regarding the evolutionary analyses of the
RdRp region of various HuNoV genotypes showed that the P6- and P7-type
RdRp region diverged from a common ancestor of other
RdRp genotypes, including P18, P15, and P20 [
35]. This report also estimated that the divergence year of the P6- and P7-types of the
RdRp region was in the 1960s [
35]. This may be compatible with the present data (December 1966). Moreover, the topologies of the previous time-scaled evolutionary tree and our tree were similar [
35]. Although this and other reports did not show the evolutionary rates of each
RdRp genotype, the evolutionary rates of various
RdRp genotypes were estimated as 2.52 × 10
−3 s/s/y to 3.12 × 10
−3 s/s/y. The present data are also compatible with the data from a previous report [
35]. These results suggested that the P6- and P7-type
RdRp regions are genetically related. Next, the HuNoV
RdRp region/RdRp protein may have affected the evolution of the
VP1 gene/VP1 protein [
18,
19]. As shown in
Figure 1B, the phylogeny of the
VP1 gene in GII.P6-GII.6 and GII.P7-GII.6 was clearly divided and evolved uniquely. Previous reports have also suggested that recombination between the HuNoV genome ORF1, incorporating the
RdRp region and ORF2, incorporating
VP1 gene, affects
VP1 gene/VP1 protein evolution [
18,
19]. For example, during the 2016/17 season, recombination between different lineages of the P16-type
RdRp region in the GII.P16-GII.2 strains occurred and the recombinant caused large outbreaks of acute gastroenteritis in various countries [
36,
37,
38]. Moreover, the GII.4 genotype caused a gastroenteritis pandemic between 2006 and 2012 [
39,
40]. Outbreaks may also be associated with recombination between ORF1 and ORF2 in GII.4 strains [
41]. Based on previous and the present results, the prevalence of GII.P7-GII.2 strains was due to the recombination of P6- and P7-type
RdRp regions. Moreover, the evolutionary rates of the
VP1 gene combined with P6- and P7-type
RdRp regions were estimated as 5.063 × 10
−3 s/s/y and 3.022 × 10
−3 s/s/y, respectively. Previous data estimated the mean rates of various GII.2 genotype strains (GII.1 to GII.22) as 3.21 × 10
−3 to 4.30 × 10
−3 s/s/y [
42]. Thus, these values and the present data may be similar [
43]. Taken together, these findings provide information on the evolutionary history of these viral strains and suggest that recombination events may have played a pivotal role in their evolution.
Next, we estimated the genetic divergence of the P6- and P7-type RdRp regions and P6- and P7-type VP1 genes in the present strains. First, a larger divergence of P7-type RdRp regions and P7-type VP1 genes was estimated compared to that of the P6-type VP1 gene. In the present study, the number of P6-type strains was relatively small (15 strains), although statistical analyses were performed. Further studies regarding this may be needed after a greater number of strains with the same genotype are registered.
We also analyzed the phylodynamics of the RdRp region and VP1 gene. The results showed that the genome population size of GII.P6-GII.6 increased around 2000–2003, while the genome population size of GII.P7-GII.6 increased after 2005. A previous molecular epidemiological study suggested that GII.6 had a biphasic prevalence between 2000 and 2005 and 2007 and 2010. Thus, the present phylodynamic data may reflect the prevalence of GII.P6-GII.6 and GII.P7-GII.6.
Moreover, to evaluate the functional and evolutionary characteristics of the P6- and P7-type RdRp proteins, we constructed 3D dimeric RdRp proteins and mapped them with amino acid substitutions (
Figure 4). Several amino acid substitutions were also identified. Previous reports have suggested that some amino acid substitutions are associated with replication efficacy [
8,
41]. For example, the efficacy of HuNoV genome replication is increased by amino acid substitutions (291Thr or 291Val) in various RdRp proteins [
43]. However, no substitution in the active sites were found in the P6- and P7-type RdRp proteins.
We also constructed P6- and P7-type 3D trimeric VP1 proteins (
Figure 5). Previous reports have shown that the P2 domain may act not only as a host cell-binding site, but also as a major part of the HuNoV antigen [
44,
45]. Therefore, amino acid substitutions in this domain may be associated with infectivity and antigenicity [
44,
45]. Moreover, positively selected sites may function as escape mutations in the host [
46]. In the present study, some conformational epitopes were identified in both RdRp-type VP1 proteins. Some of these were located in the P2 domain. Positively selected sites were also identified. Moreover, the amino acid positions of the conformational epitopes and positive selections between the P6- and P7-type VP1 proteins were distinct. These results implied that the antigenicity of P6- and P7-type VP1 proteins is distinct, although we did not examine this
in vitro, and both VP1 proteins may receive selective pressure from host defense systems (i.e., host immunity) [
45,
47].
Finally, we evaluated negative selection sites for the RdRp and VP1 proteins (
Table 3 and
Table S3). Many negative selection sites in P7-type RdRp (205 sites) and VP1 proteins (274 sites) were estimated, while P6-type RdRp and VP1 proteins were small. In general, negative selection sites play a role in preventing the deterioration of viral protein function [
46]. Thus, the present negative selection data may indicate the maintenance of RdRp and VP1 protein function. Furthermore, a small number of negative selection sites in P6-type RdRp and VP1 proteins were estimated. This may be because of the relatively small number of strains used in this study (15 strains).
Author Contributions
Conceptualization, H.K., T.T. and K.K.; methodology, T.T., R.K., T.S. (Tatsuya Shirai), M.S., T.S. (Toshiyuki Sugai), K.H., F.M., K.O., Y.H. and M.K.; writing—original draft preparation, T.K., Y.S., K.K. and H.K.; review and editing, K.M., K.I., H.I., A.R., K.F., K.K. and H.K.; visualization, T.T., R.K. and Y.M.; funding acquisition, H.K. All authors have read and agreed to the published version of the manuscript.