Spatiotemporal bias in genome sampling can severely confound discrete trait phylogeographic inference. This has impeded our ability to accurately track the spread of SARS-CoV-2, the virus responsible for the COVID-19 pandemic, despite the availability of unprecedented numbers of SARS-CoV-2 genomes. Here, we present an approach to integrate individual travel history data in Bayesian phylogeographic inference and apply it to the early spread of SARS-CoV-2. We demonstrate that including travel history data yields i) more realistic hypotheses of virus spread and ii) higher posterior predictive accuracy compared to including only sampling location. We further explore methods to ameliorate the impact of sampling bias by augmenting the phylogeographic analysis with lineages from undersampled locations. Our reconstructions reinforce specific transmission hypotheses suggested by the inclusion of travel history data, but also suggest alternative routes of virus migration that are plausible within the epidemiological context but are not apparent with current sampling efforts. Spatiotemporal sampling gaps in existing pathogen genomic data limits their use in understanding epidemiological patterns. Here, the authors apply a phylogeographic approach with SARS-CoV-2 genomes to accurately reproduce pathogen spread by accounting for spatial biases and travel history of the individual.
【저자키워드】 SARS-CoV-2, phylogenetics, Ecological epidemiology, Phylogenomics, 【초록키워드】 Bayesian, COVID-19 pandemic, Genome, Transmission, virus, Spread, Migration, pathogen, Accuracy, SARS-CoV-2 genome, Travel, Lineage, epidemiological, Predictive, SARS-CoV-2 genomes, virus spread, genomic data, phylogeographic analysis, Inclusion, approach, limit, alternative route, responsible, suggested, hypothese, 【제목키워드】 Bayesian, Travel,