Abstract
A high-quality dataset of 3289 complete SARS-CoV-2 genomes collected in Europe and European Economic Area (EAA) in the early phase of the first wave of the pandemic was analyzed. Among all single nucleotide mutations, 41 had a frequency ≥ 1%, and the phylogenetic analysis showed at least 6 clusters with a specific mutational profile. These clusters were differentially distributed in the EU/EEA, showing a statistically significant association with the geographic origin. The analysis highlighted that the mutations C 14408 T and C 14805 T played an important role in clusters selection and further virus spread. Moreover, the molecular analysis suggests that the SARS-CoV-2 strain responsible for the first Italian confirmed COVID-19 case was already circulating outside the country.
Keywords: COVID-19; Cluster analysis; SARS-CoV-2; SNVs analysis.
【저자키워드】 COVID-19, SARS-CoV-2, cluster analysis, SNVs analysis., 【초록키워드】 Europe, pandemic, Mutation, mutations, Phylogenetic analysis, SARS-CoV-2 genome, Cluster, dataset, cluster analysis, First wave, SNV, single nucleotide, molecular analysis, association, nucleotide, Frequency, Analysis, SARS-CoV-2 genomes, virus spread, early phase, circulating, SARS-CoV-2 strain, confirmed COVID-19 case, Complete, European, country, Italian, responsible, analyzed, collected, statistically significant, Area, the SARS-CoV-2, 【제목키워드】 pandemic, virus, Cluster, identification, element,