Abstract
The surface glycoprotein (S protein) of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) was used to develop coronavirus disease 2019 (COVID-19) vaccines. However, SARS-CoV-2, especially the S protein, has undergone rapid evolution and mutation, which has remained to be determined. Here, we analyzed and compared the early (12 237) and the current (more than 10 million) SARS-CoV-2 strains to identify the mutation features and geographical distribution of the S gene and S protein. Results showed that in the early strains, most of the loci were with relative low mutation frequency except S: 23403 (4486 strains), while in the current strains, there was a surge in the mutation strains and frequency, with S: 23403 constantly being the highest one, but tremendously increased to approximately 1050 times. Furthermore, D614 (S: 23403) was one of the most highly frequent mutations in the S protein of Omicron as of March 2022, and most of the mutant strains were still from the United States, and the United Kingdom. Further analysis demonstrated that in the receptor-binding domain, most of the loci with low mutation frequency in the early strains, while S: 22995 was nowadays the most prevalent loci with 3 122 491 strains in the current strains. Overall, we compare the mutation features of the S region in SARS-CoV-2 strains between the early and the current stains, providing insight into further studies in concert with emerging SARS-CoV-2 variants for COVID-19 vaccines.
Keywords: ACE2-binding region; COVID-19; SARS-CoV-2; genome mutations; surface glycoprotein.
【저자키워드】 COVID-19, SARS-CoV-2, Surface glycoprotein., genome mutations, ACE2-binding region, 【초록키워드】 coronavirus disease, Evolution, coronavirus, Mutation, Vaccines, S protein, Genome, SARS-CoV-2 variant, omicron, Surface glycoprotein, COVID-19 vaccines, United Kingdom, Strains, S gene, Frequency, Analysis, mutant strain, strain, geographical distribution, acute respiratory syndrome, loci, SARS-CoV-2 strain, The United States, feature, prevalent, Result, highest, analyzed, identify, was used, develop, remained, demonstrated, the receptor-binding domain, the S protein, 【제목키워드】 Mutation, Surface glycoprotein, S gene, Analysis, strain, geographical distribution, feature,