A peer-reviewed article of this preprint also exists.
Abstract
Phylogenetic trees of coronaviruses are difficult to interpret because they undergo frequent ge-nomic recombination. Here, we propose a new method, named coloured genomic bootstrap (CGB) barcodes, to highlight the polyphyletic origins of human sarbecoviruses and understand their host and geographic ori-gins. The results indicate that SARS-CoV and SARS-CoV-2 contain genomic regions of mixed an-cestry originating from horseshoe bat (Rhinolophus) viruses. First, different regions of SARS-CoV share exclusive ancestry with five Rhinolophus viruses from Southwest China (RfYNLF/31C: 17.9%; RpF46: 3.3%; RspSC2018: 2.0%; Rpe3: 1.3%; RaLYRa11: 1.0%) and 97% of its genome can be related to bat viruses from Yunnan (China), supporting its emergence in Rhinolophus species of this province. Second, different regions of SARS-Cov-2 share exclusive ancestry with eight Rhi-nolophus viruses from Yunnan (RpYN06: 5.8%; RaTG13: 4.8%; RmYN02: 3.8%), Laos (RpBA-NAL103: 3.3%; RmarBANAL236: 1.7%; RmBANAL52: 1.0%; RmBANAL247: 0.7%), and Cam-bodia (RshSTT200: 2.3%), and 98% of its genome can be related to bat viruses from northern Laos and Yunnan, supporting its emergence in Rhinolophus species of this region. Although CGB barcodes are very useful to retrace the origins of human sarbecoviruses, further investigations are needed to better apprehend the diversity of coronaviruses in bats from Cambo-dia, Laos, Myanmar, Thailand and Vietnam.
Keywords:
Subject:
Biology and Life Sciences - Virology
Preprints on COVID-19 and SARS-CoV-2
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Alerts
Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.