{"title":"利用探索性和预测性分析方法研究 COVID-19 对肝癌的影响","authors":"Victor Chang, Rameshwari Mukeshkumar Patel, Meghana Ashok Ganatra, Qianwen Ariel Xu","doi":"10.1016/j.health.2024.100309","DOIUrl":null,"url":null,"abstract":"<div><p>This study presents the influence of COVID-19 and the pandemic on individuals diagnosed with hepatocellular carcinoma and intrahepatic cholangiocarcinoma, the two most common types of primary liver cancer. The study compares the effects before and after the pandemic on these patients. Additionally, it endeavors to predict the likelihood of survival for liver cancer patients. Our research will employ various methodologies to investigate this. Exploratory data analysis techniques are utilized, including univariate analysis, correlation analysis, bivariate analysis, chi-square testing, and T-sample testing. For predictive analytics, machine learning algorithms such as Logistic Regression, Decision Trees, Classification And Regression Tree (CART), Artificial Neural Networks (ANN), K-Nearest Neighbors (KNN), and Support Vector Machines (SVMs) will be applied. For our outputs, Logistic Regression and SVMs emerged as top-performing algorithms, boasting a remarkable accuracy rate of 93%. The study reveals that COVID-19 affected all age groups similarly. However, a gender-based difference was observed, indicating that males faced a higher risk of both cancer and mortality. Furthermore, the study found that variables such as year, month, bleeding, cirrhosis, and previously known cirrhosis did not significantly influence patient survival.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100309"},"PeriodicalIF":0.0000,"publicationDate":"2024-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S277244252400011X/pdfft?md5=29ce6fe47398e94a0c12f7634a551b42&pid=1-s2.0-S277244252400011X-main.pdf","citationCount":"0","resultStr":"{\"title\":\"An investigation of the COVID-19 impact on liver cancer using exploratory and predictive analytics\",\"authors\":\"Victor Chang, Rameshwari Mukeshkumar Patel, Meghana Ashok Ganatra, Qianwen Ariel Xu\",\"doi\":\"10.1016/j.health.2024.100309\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>This study presents the influence of COVID-19 and the pandemic on individuals diagnosed with hepatocellular carcinoma and intrahepatic cholangiocarcinoma, the two most common types of primary liver cancer. The study compares the effects before and after the pandemic on these patients. Additionally, it endeavors to predict the likelihood of survival for liver cancer patients. Our research will employ various methodologies to investigate this. Exploratory data analysis techniques are utilized, including univariate analysis, correlation analysis, bivariate analysis, chi-square testing, and T-sample testing. For predictive analytics, machine learning algorithms such as Logistic Regression, Decision Trees, Classification And Regression Tree (CART), Artificial Neural Networks (ANN), K-Nearest Neighbors (KNN), and Support Vector Machines (SVMs) will be applied. For our outputs, Logistic Regression and SVMs emerged as top-performing algorithms, boasting a remarkable accuracy rate of 93%. The study reveals that COVID-19 affected all age groups similarly. However, a gender-based difference was observed, indicating that males faced a higher risk of both cancer and mortality. Furthermore, the study found that variables such as year, month, bleeding, cirrhosis, and previously known cirrhosis did not significantly influence patient survival.</p></div>\",\"PeriodicalId\":73222,\"journal\":{\"name\":\"Healthcare analytics (New York, N.Y.)\",\"volume\":\"5 \",\"pages\":\"Article 100309\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-02-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S277244252400011X/pdfft?md5=29ce6fe47398e94a0c12f7634a551b42&pid=1-s2.0-S277244252400011X-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Healthcare analytics (New York, N.Y.)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S277244252400011X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Healthcare analytics (New York, N.Y.)","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S277244252400011X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
本研究介绍了 COVID-19 和大流行对肝细胞癌和肝内胆管癌(两种最常见的原发性肝癌)患者的影响。研究比较了大流行前后对这些患者的影响。此外,研究还将努力预测肝癌患者的生存可能性。我们的研究将采用多种方法对此进行调查。我们将采用探索性数据分析技术,包括单变量分析法、相关分析法、双变量分析法、卡方检验法和 T 样本检验法。在预测分析方面,将采用逻辑回归、决策树、分类回归树(CART)、人工神经网络(ANN)、K-近邻(KNN)和支持向量机(SVM)等机器学习算法。在我们的结果中,逻辑回归和 SVM 是表现最好的算法,准确率高达 93%。研究显示,COVID-19 对所有年龄组的影响相似。不过,我们观察到了性别差异,这表明男性患癌症和死亡的风险都更高。此外,研究还发现,年、月、出血、肝硬化和先前已知的肝硬化等变量对患者的存活率没有显著影响。
An investigation of the COVID-19 impact on liver cancer using exploratory and predictive analytics
This study presents the influence of COVID-19 and the pandemic on individuals diagnosed with hepatocellular carcinoma and intrahepatic cholangiocarcinoma, the two most common types of primary liver cancer. The study compares the effects before and after the pandemic on these patients. Additionally, it endeavors to predict the likelihood of survival for liver cancer patients. Our research will employ various methodologies to investigate this. Exploratory data analysis techniques are utilized, including univariate analysis, correlation analysis, bivariate analysis, chi-square testing, and T-sample testing. For predictive analytics, machine learning algorithms such as Logistic Regression, Decision Trees, Classification And Regression Tree (CART), Artificial Neural Networks (ANN), K-Nearest Neighbors (KNN), and Support Vector Machines (SVMs) will be applied. For our outputs, Logistic Regression and SVMs emerged as top-performing algorithms, boasting a remarkable accuracy rate of 93%. The study reveals that COVID-19 affected all age groups similarly. However, a gender-based difference was observed, indicating that males faced a higher risk of both cancer and mortality. Furthermore, the study found that variables such as year, month, bleeding, cirrhosis, and previously known cirrhosis did not significantly influence patient survival.