Eczacılık Fakültesi / Faculty of Pharmacy
Permanent URI for this collectionhttps://hdl.handle.net/11727/5700
Browse
Item Detection of COVID-19 by Machine Learning Using Routine Laboratory Tests(2021) Cubukcu, Hikmet Can; Topcu, Deniz Ilhan; Bayraktar, Nilufer; Gulsen, Murat; Sari, Nuran; Arslan, Ayse Hande; 0000-0002-1219-6368; 0000-0002-7886-3688; 34791032; E-3717-2019; Y-8758-2018Objectives The present study aimed to develop a clinical decision support tool to assist coronavirus disease 2019 (COVID-19) diagnoses with machine learning (ML) models using routine laboratory test results. Methods We developed ML models using laboratory data (n = 1,391) composed of six clinical chemistry (CC) results, 14 CBC parameter results, and results of a severe acute respiratory syndrome coronavirus 2 real-time reverse transcription-polymerase chain reaction as a gold standard method. Four ML algorithms, including random forest (RF), gradient boosting (XGBoost), support vector machine (SVM), and logistic regression, were used to build eight ML models using CBC and a combination of CC and CBC parameters. Performance evaluation was conducted on the test data set and external validation data set from Brazil. Results The accuracy values of all models ranged from 74% to 91%. The RF model trained from CC and CBC analytes showed the best performance on the present study's data set (accuracy, 85.3%; sensitivity, 79.6%; specificity, 91.2%). The RF model trained from only CBC parameters detected COVID-19 cases with 82.8% accuracy. The best performance on the external validation data set belonged to the SVM model trained from CC and CBC parameters (accuracy, 91.18%; sensitivity, 100%; specificity, 84.21%). Conclusions ML models presented in this study can be used as clinical decision support tools to contribute to physicians' clinical judgment for COVID-19 diagnoses.Item Estimation of Low-Density Lipoprotein Cholesterol Concentration Using Machine Learning(2021) Cubukcu, Hikmet Can; Topcu, Deniz İlhan; 0000-0002-1219-6368; 34635916; E-3717-2019Objective Low-density lipoprotein cholesterol (LDL-C) can be estimated using the Friedewald and Martin-Hopkins formulas. We developed LDL-C prediction models using multiple machine learning methods and investigated the validity of the new models along with the former formulas. Methods Laboratory data (n = 59,415) on measured LDL-C, high-density lipoprotein cholesterol, triglycerides (TG), and total cholesterol were partitioned into training and test data sets. Linear regression, gradient-boosted trees, and artificial neural network (ANN) models were formed based on the training data. Paired-group comparisons were performed using a t-test and the Wilcoxon signed-rank test. We considered P values .2 to be statistically significant. Results For TG >= 177 mg/dL, the Friedewald formula underestimated and the Martin-Hopkins formula overestimated the LDL-C (P <.001), which was more significant for LDL-C <70 mg/dL. The linear regression, gradient-boosted trees, and ANN models outperformed the aforementioned formulas for TG >= 177 mg/dL and LDL-C <70 mg/dL based on a comparison with a homogeneous assay (P >.001 vs. P <.001) and classification accuracy. Conclusion Linear regression, gradient-boosted trees, and ANN models offer more accurate alternatives to the aforementioned formulas, especially for TG 177 to 399 mg/dL and LDL-C <70 mg/dL.