The genome of coronaviruses, including SARS-CoV-2, encodes for two proteases, a papain like (PL^{pro} ) protease and the so-called main protease (M^{pro} ), a chymotrypsin-like cysteine protease, also named 3CL^{pro} or non-structural protein 5 (nsp5). M^{pro} is activated by autoproteolysis and is the main protease responsible for cutting the viral polyprotein into functional units. Aside from this, it is described that M^{pro} proteases are also capable of processing host proteins, including those involved in the host innate immune response. To identify substrates of the three main proteases from SARS-CoV, SARS-CoV-2, and hCoV-NL63 coronviruses, an LC-MS based N-terminomics in vitro analysis is performed using recombinantly expressed proteases and lung epithelial and endothelial cell lysates as substrate pools. For SARS-CoV-2 M^{pro} , 445 cleavage events from more than 300 proteins are identified, while 151 and 331 M^{pro} derived cleavage events are identified for SARS-CoV and hCoV-NL63, respectively. These data enable to better understand the cleavage site specificity of the viral proteases and will help to identify novel substrates in vivo. All data are available via ProteomeXchange with identifier PXD021406.
【저자키워드】 COVID19, LC-MS, isobaric labeling, protease substrates, terminomics.,