The ability to predict emerging variants of SARS-CoV-2 would be of enormous value, as it would enable proactive design of vaccines in advance of such emergence. We estimated diversity of each site on a multiple sequence alignment (MSA) of the Spike (S) proteins from close relatives of SARS-CoV-2 that infected bat and pangolin before the pandemic. Then we compared the locations of high diversity sites in this MSA and those of mutations found in multiple emerging lineages of human-infecting SARS-CoV-2. This comparison revealed a significant correspondence, which suggests that a limited number of sites in this protein are repeatedly substituted in different lineages of this group of viruses. It follows, therefore, that the sites of future emerging mutations in SARS-CoV-2 can be predicted by analyzing their relatives (outgroups) that have infected non-human hosts. We discuss a possible evolutionary basis for these substitutions and provide a list of frequently substituted sites that potentially include future emerging variants in SARS-CoV-2. In order to improve our ability to predict emerging variants of SARS-CoV-2, Katoh and Standley analysed the molecular evolution of the Spike protein. They found a significant correspondence in the location of mutations between recently emerging SARS-CoV-2 variants and their relatives that infected bat and pangolin before the pandemic, which could suggest that the sites of future mutations could be predicted by analyzing their relatives that have infected non-human hosts.
【저자키워드】 molecular evolution, Virus-host interactions, 【초록키워드】 viruses, Evolution, SARS-CoV-2, Vaccine, pandemic, Mutation, variant, SARS-CoV-2 variant, Protein, Lineage, molecular, predict, Multiple sequence alignment, hosts, Substitution, variants of SARS-CoV-2, IMPROVE, predicted, include, analysed, the Spike, 【제목키워드】 SARS-CoV-2 variant, infecting, Host, recorded,