Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has a complex strategy for the transcription of viral subgenomic mRNAs (sgmRNAs), which are targets for nucleic acid diagnostics. Each of these sgmRNAs has a unique 5′ sequence, the leader-transcriptional regulatory sequence gene junction (leader-TRS junction), that can be identified using sequencing. High-resolution sequencing has been used to investigate the biology of SARS-CoV-2 and the host response in cell culture and animal models and from clinical samples. LeTRS, a bioinformatics tool, was developed to identify leader-TRS junctions and can be used as a proxy to quantify sgmRNAs for understanding virus biology. LeTRS is readily adaptable for other coronaviruses such as Middle East respiratory syndrome coronavirus or a future newly discovered coronavirus. LeTRS was tested on published data sets and novel clinical samples from patients and longitudinal samples from animal models with coronavirus disease 2019. LeTRS identified known leader-TRS junctions and putative novel sgmRNAs that were common across different mammalian species. This may be indicative of an evolutionary mechanism where plasticity in transcription generates novel open reading frames, which can then be subject to selection pressure. The data indicated multiphasic abundance of sgmRNAs in two different animal models. This recapitulates the relative sgmRNA abundance observed in cells at early points in infection but not at late points. This pattern is reflected in some human nasopharyngeal samples and therefore has implications for transmission models and nucleic acid-based diagnostics. LeTRS provides a quantitative measure of sgmRNA abundance from sequencing data. This can be used to assess the biology of SARS-CoV-2 (or other coronaviruses) in clinical and nonclinical samples, especially to evaluate different variants and medical countermeasures that may influence viral RNA synthesis.
Keywords: COVID-19; RNA modification; SARS-CoV-2; coronavirus; direct RNA sequencing; nanopore; sgmRNA; subgenomic mRNA; transcriptional regulatory sequences.