Background: Coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome-related coronavirus-2 (SARS-CoV-2), has become a pandemic. However, the origins and global transmission pattern of SARS-CoV-2 remain largely unknown. We aimed to characterize the origin and transmission of SARS-CoV-2 based on its evolutionary dynamics.

Methods: Using full-length SARS-CoV-2 sequences with complete geographic, demographic, and temporal information, deposited worldwide in the GISAID database between 26 December 2019 and 30 November 2020, we constructed a transmission tree with the R package “outbreaker” to depict the evolutionary process. The affinity of mutated receptor-binding regions of the spike protein for angiotensin-converting enzyme 2 (ACE2) was predicted using mCSM-PPI2 software. Viral infectivity and antigenicity were tested in ACE2-transfected HEK293T cells by pseudovirus transfection and neutralizing antibody tests.

Results: From 26 December 2019 to 8 March 2020, the early stage of the COVID-19 pandemic, SARS-CoV-2 strains identified worldwide fell mainly into three clusters: a Europe-based cluster, which included two USA-based sub-clusters; an Asia-based cluster, which included isolates from China, Japan, the USA, Singapore, Australia, Malaysia, and Italy; and a USA-based cluster. The strains identified in the USA formed four independent clades, whereas those identified in China formed one clade. After 8 March 2020, the clusters of SARS-CoV-2 strains tended to become independent and “pure” within each of the major countries. Twenty-two of 60 mutations in the receptor-binding domain of the spike protein were predicted to increase the binding affinity of SARS-CoV-2 for ACE2. Of all predicted mutants, E484K was the most frequent, with 86 585 sequences worldwide, followed by S477N with 55 442 sequences. In more than ten countries, the frequencies of isolates carrying E484K and S477N increased significantly. The V367F and N354D mutations increased the infectivity of SARS-CoV-2 pseudoviruses (P < 0.001). SARS-CoV-2 with V367F was more sensitive to the S1-targeting neutralizing antibody than its wild-type counterpart (P < 0.001).

Conclusions: SARS-CoV-2 strains might have originated in several countries simultaneously under certain evolutionary pressures. Travel restrictions might have caused location-specific clustering of SARS-CoV-2. The evolution of SARS-CoV-2 appears to facilitate its transmission by altering its affinity for ACE2 or by enabling immune evasion.

Supplementary Information: The online version contains supplementary material available at 10.1186/s40249-021-00895-4.
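The per-mutation sequence counts and per-country frequencies reported in the Results (for example E484K and S477N) rest on a simple tallying step. The abstract does not describe how that step was implemented, so the following is only a minimal sketch under stated assumptions: sequences are already translated and aligned to a reference of equal length, the function names `call_substitutions` and `tally_by_country` are hypothetical, and the ten-residue fragment and country labels are toy data, not GISAID records. The fragment is assumed to span spike residues 477 to 486.

```python
from collections import Counter, defaultdict


def call_substitutions(reference, sequence, offset=1):
    """List substitutions (e.g. 'E484K') between two aligned protein strings.

    `offset` is the residue number of the first character of `reference`.
    """
    if len(reference) != len(sequence):
        raise ValueError("sequences must be aligned to the same length")
    substitutions = []
    for index, (ref_aa, alt_aa) in enumerate(zip(reference, sequence)):
        # Skip identical residues, alignment gaps, and ambiguous residues.
        if ref_aa == alt_aa or ref_aa in "-X" or alt_aa in "-X":
            continue
        substitutions.append(f"{ref_aa}{index + offset}{alt_aa}")
    return substitutions


def tally_by_country(reference, records, offset=1):
    """Count each substitution per country from (country, sequence) pairs."""
    counts = defaultdict(Counter)
    for country, sequence in records:
        counts[country].update(call_substitutions(reference, sequence, offset))
    return counts


# Toy data (hypothetical): a short reference fragment and three mutated copies.
REFERENCE_FRAGMENT = "STPCNGVEGF"
FRAGMENT_START = 477
TOY_RECORDS = [
    ("CountryA", "STPCNGVKGF"),
    ("CountryA", "NTPCNGVEGF"),
    ("CountryB", "NTPCNGVKGF"),
]

if __name__ == "__main__":
    tallies = tally_by_country(REFERENCE_FRAGMENT, TOY_RECORDS, FRAGMENT_START)
    for country, counter in sorted(tallies.items()):
        print(country, dict(counter))
```

Grouping the same tallies by collection date as well as by country would give the frequency trends mentioned in the Results; the statistical test used to call an increase significant is not stated in the abstract and is therefore not sketched here.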
Author keywords: COVID-19, SARS-CoV-2, Transmission, evolutionary dynamics

Abstract keywords: coronavirus disease, neutralizing antibody, Coronavirus disease 2019, ACE2, pandemic, Mutation, COVID-19 pandemic, Italy, angiotensin-converting enzyme 2, Malaysia, binding affinity, Spike protein, China, Severe acute respiratory syndrome, immune evasion, Viral, Receptor-binding domain, viral infectivity, pseudovirus, Clustering, Cluster, Japan, clade, evolutionary dynamics, SARS-CoV-2 evolution, E484K, mutants, Severe acute respiratory syndrome-related coronavirus, SARS-CoV-2 pseudovirus, antigenicity, Antibody test, Singapore, respiratory, USA, information, early stage, affinity, Angiotensin-converting enzyme, Coronavirus-2, Frequency, GISAID database, angiotensin, isolates, Severe acute respiratory syndrome-related coronavirus-2, followed by, pseudoviruses, acute respiratory syndrome, supplementary material, enzyme, sequence, SARS-CoV-2 strains, SARS-CoV-2 strain, S477N, full-length, transmission of SARS-CoV-2, HEK293T cells, R package, SARS-CoV-2 pseudoviruses, wild-type counterpart, isolate, country, independent, V367F, Result, tested, predicted, caused, significantly, composed, facilitate, appear, the spike protein, the receptor-binding domain, mutated, the binding affinity, HEK293T cell

Title keywords: characterization, viral mutation, the spike protein