Abel KahsayGebreslassie, YaecobGirmayGezahegn, Misgina Tsighe Hagos, AchimIbenthal, Pooja
{"title":"Automated Gastrointestinal Disease Recognition for Endoscopic Images","authors":"Abel KahsayGebreslassie, YaecobGirmayGezahegn, Misgina Tsighe Hagos, AchimIbenthal, Pooja","doi":"10.1109/ICCCIS48478.2019.8974458","DOIUrl":null,"url":null,"abstract":"The human Gastrointestinal (GI) tract can be affected by different diseases and endoscopy has been seen to perform well for diagnosing GI tract problems. Accurate identification of underlying problems in GI tract endoscopic images is important as it affects decision-making on treatment and follow-up. In developing countries trained endoscopic experts are small in number and expensive. Even though medical recognition is a promising field of application for Artificial Intelligence (AI) publicly available datasets for such tasks are small in number. Kvasir dataset is one of the publicly available medical datasets. It consists of gastrointestinal endoscopic images that belong to eight different classes. We have automated recognition of GI tract landmarks and diseases, for classes that are available in Kvasir, with the use of Convolutional Neural Networks (CNNs). CNNs are widely used for visual recognition due to their ability to capture local features and their computational efficiency compared to fully connected networks. We have fine-tuned a residual model based on ResNet50 and a dense model based on DenseNet121 on Kvasir dataset. The models’ performance on a test set that consists of 75 images from each class is 86.9% for dense model and 87.8% for residual model. We have also built a user interface for users to select images and get recognition results. The interface built can serve as a decision support system for classifying GI tract endoscopic images. It can also further be extended for recognition in videos by feeding the video input as a sequence of images.","PeriodicalId":436154,"journal":{"name":"2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCIS48478.2019.8974458","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
The human Gastrointestinal (GI) tract can be affected by different diseases and endoscopy has been seen to perform well for diagnosing GI tract problems. Accurate identification of underlying problems in GI tract endoscopic images is important as it affects decision-making on treatment and follow-up. In developing countries trained endoscopic experts are small in number and expensive. Even though medical recognition is a promising field of application for Artificial Intelligence (AI) publicly available datasets for such tasks are small in number. Kvasir dataset is one of the publicly available medical datasets. It consists of gastrointestinal endoscopic images that belong to eight different classes. We have automated recognition of GI tract landmarks and diseases, for classes that are available in Kvasir, with the use of Convolutional Neural Networks (CNNs). CNNs are widely used for visual recognition due to their ability to capture local features and their computational efficiency compared to fully connected networks. We have fine-tuned a residual model based on ResNet50 and a dense model based on DenseNet121 on Kvasir dataset. The models’ performance on a test set that consists of 75 images from each class is 86.9% for dense model and 87.8% for residual model. We have also built a user interface for users to select images and get recognition results. The interface built can serve as a decision support system for classifying GI tract endoscopic images. It can also further be extended for recognition in videos by feeding the video input as a sequence of images.