Background Since early February 2021, the causative agent of COVID-19, SARS-CoV-2, has infected over 104 million people with more than 2 million deaths according to official reports. The key to understanding the biology and virus-host interactions of SARS-CoV-2 requires the knowledge of mutation and evolution of this virus at both inter- and intra-host levels. However, despite quite a few polymorphic sites identified among SARS-CoV-2 populations, intra-host variant spectra and their evolutionary dynamics remain mostly unknown. Methods Using high-throughput sequencing of metatranscriptomic and hybrid captured libraries, we characterized consensus genomes and intra-host single nucleotide variations (iSNVs) of serial samples collected from eight patients with COVID-19. The distribution of iSNVs along the SARS-CoV-2 genome was analyzed and co-occurring iSNVs among COVID-19 patients were identified. We also compared the evolutionary dynamics of SARS-CoV-2 population in the respiratory tract (RT) and gastrointestinal tract (GIT). Results The 32 consensus genomes revealed the co-existence of different genotypes within the same patient. We further identified 40 intra-host single nucleotide variants (iSNVs). Most (30/40) iSNVs presented in a single patient, while ten iSNVs were found in at least two patients or identical to consensus variants. Comparing allele frequencies of the iSNVs revealed a clear genetic differentiation between intra-host populations from the respiratory tract (RT) and gastrointestinal tract (GIT), mostly driven by bottleneck events during intra-host migrations. Compared to RT populations, the GIT populations showed a better maintenance and rapid development of viral genetic diversity following the suspected intra-host bottlenecks. Conclusions Our findings here illustrate the intra-host bottlenecks and evolutionary dynamics of SARS-CoV-2 in different anatomic sites and may provide new insights to understand the virus-host interactions of coronaviruses and other RNA viruses. Supplementary Information The online version contains supplementary material available at 10.1186/s13073-021-00847-5.
【저자키워드】 COVID-19, SARS-CoV-2, Intra-host, Variation, Dynamics, 【초록키워드】 Evolution, Mutation, knowledge, Sequencing, Genetic, iSNV, allele frequency, virus, variants, Population, Viral, SARS-CoV-2 genome, RNA viruses, Virus-host interactions, Patient, Genotype, death, High-throughput sequencing, Virus-host interaction, evolutionary dynamics, genetic diversity, respiratory tract, respiratory, distribution, iSNVs, gastrointestinal tract, Single nucleotide variant, nucleotide, Intra-host variant, COVID-19 patient, causative agent, Consensus, supplementary material, bottleneck events, bottlenecks, single nucleotide variation, MOST, populations, event, consensus genome, Result, analyzed, collected, eight, characterized, coronavirus, driven by, anatomic site, bottleneck, metatranscriptomic, patients with COVID-19, polymorphic site, the SARS-CoV-2 genome, 【제목키워드】 Population, COVID-19 patient,