Abstract
Background
COVID-19 progresses slowly and negatively affects many people. However, mild to moderate symptoms develop in most infected people, who recover without hospitalization. Therefore, the development of early diagnosis and treatment strategies is essential. One of these methods is proteomic technology based on the blood protein profiling technique. This study aims to classify three COVID-19 positive patient groups (mild, severe, and critical) and a control group based on the blood protein profiling using deep learning (DL), random forest (RF), and gradient boosted trees (GBTs).
Methods
The dataset consists of 93 samples (60 COVID-19 patients, 33 control), and 370 variables obtained from an open-source website. The current dataset contains age, gender, and 368 protein, used to predict the relationship between disease severity and proteins using DL and machine learning approaches (RF, GBTs). An evolutionary algorithm tunes hyperparameters of the models and the predictions are assessed through accuracy, sensitivity, specificity, precision, F1 score, classification error, and kappa performance metrics.
Results
The accuracy of RF (96.21%) was higher as compared to DL (94.73%). However, the ensemble classifier GBTs produced the highest accuracy (96.98%). TGB1BP2 in the cardiovascular II panel and MILR1 in the inflammation panel were the two most important proteins associated with disease severity.
Conclusions
The proposed model (GBTs) achieved the best prediction of disease severity based on the proteins compared to the other algorithms. The results point out that changes in blood proteins associated with the severity of COVID-19 may be used in monitoring and early diagnosis/treatment of the disease.
【저자키워드】 COVID-19, deep learning, artificial intelligence, Random forest, Gradient Boosted Trees, 【초록키워드】 Inflammation, Hospitalization, disease severity, Gender, Symptom, Protein, early diagnosis, sensitivity, specificity, Accuracy, severity of COVID-19, Algorithm, Mild, age, dataset, Critical, moderate, predict, proteomic, COVID-19 patients, treatment strategy, Precision, Classifier, Algorithms, Performance metrics, blood protein, control group, website, random, positive, variable, Affect, Result, produced, highest, develop, the disease, patient group, changes in, machine learning approach, MILR1, 【제목키워드】 prediction, artificial, Profiling, Result,