{"title":"Improving colorectal polyp classification based on physical examination data — A ensemble learning approach","authors":"Chong Li, Xiaolei Xie, Jinlin Li, N. Kong","doi":"10.1109/COASE.2017.8256102","DOIUrl":null,"url":null,"abstract":"Colorectal cancer is a common type of cancer. Due to the alarming incidence and mortality rate, it has received increasing attention on early detection and treatment. Colorectal polyps form and grow at initial stages of most colorectal cancer cases. Due to rather stringent medical resource availability and low screening compliance rate, it is more desirable in China than industrialized countries to characterize the relations between the detection of colorectal polyps and various potential determinants, including basic health information, comorbidities, and lifestyle conditions. Subsequently, one can better predict polyp onset for each individual. In this paper, we present a data-driven modeling study to improve binary classification of colorectal polyp occurrence. We apply several machine-learning methods, particularly random forests, on physical examination data of 849 Chinese people, to build the classifiers. Our results suggest improved prediction performance with a random forest model. Our results also show that subject's negative mood score, rarely recorded in previous studies, is highly correlated with colorectal polyp occurrence.","PeriodicalId":445441,"journal":{"name":"2017 13th IEEE Conference on Automation Science and Engineering (CASE)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 13th IEEE Conference on Automation Science and Engineering (CASE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COASE.2017.8256102","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Colorectal cancer is a common type of cancer. Due to the alarming incidence and mortality rate, it has received increasing attention on early detection and treatment. Colorectal polyps form and grow at initial stages of most colorectal cancer cases. Due to rather stringent medical resource availability and low screening compliance rate, it is more desirable in China than industrialized countries to characterize the relations between the detection of colorectal polyps and various potential determinants, including basic health information, comorbidities, and lifestyle conditions. Subsequently, one can better predict polyp onset for each individual. In this paper, we present a data-driven modeling study to improve binary classification of colorectal polyp occurrence. We apply several machine-learning methods, particularly random forests, on physical examination data of 849 Chinese people, to build the classifiers. Our results suggest improved prediction performance with a random forest model. Our results also show that subject's negative mood score, rarely recorded in previous studies, is highly correlated with colorectal polyp occurrence.