{"title":"Study of classification of breast lesions using texture GLCM features obtained from the raw ultrasound signal","authors":"Mariusz Nieniewski, L. Chmielewski","doi":"10.5566/ias.2113","DOIUrl":null,"url":null,"abstract":"Most of the methods of classification of breast lesions in ultrasound (US) images have been tested on B-mode images from the commercial equipment. The new possibility of further analysis of this problem showed up with the availability of a public database containing original raw radio frequency (RF) signals. In particular, it appeared that the original texture might contain diagnostic information which could be modified in the typical image processing and which is more difficult to perceive than the details of lesion shape/contour. In this paper a detailed analysis of the lesion texture is conducted by means of the decision trees and logistic regression. The decision trees turned out useful mainly for selecting texture features to be employed in the stepwise logistic regression. The RF signals database of 200 breast lesions was used for testing the performance of the benign vs malignant lesion classifier. The Gray Level Cooccurrence Matrix (GLCM) was calculated with the vertical/horizontal offset of up to five pixels. For each of these matrices six features were calculated resulting in a total of 210 features. Using these features a sufficient number of decision trees were generated to calculate pseudo-Receiver Operating Characteristics (ROCs). The outcome of this process is a collection of generated trees for which the employed features are known. These features were then used for generating generalized linear model by means of stepwise logistic regression. The analyzed regression models included the coefficients of up-to-the second degree terms. The texture features were further completed by a single shape feature, that is tumor circularity. The automatic procedure for finding the exact mask of a lesion is also provided for the conditions when the acoustic shadowing makes it impossible to obtain the entire contour reliably and a half-contour has to be used. The selected logistic regression models gave ROCs with the Area Under Curve (AUC) of up to 0.83 and the 95 % confidence region (0.63 0.96). Analyzing classification results one comes to the conclusion that the tumor circularity, which is the most informative among shape/contour features, is not essential against the background of textural features. The reported study shows that a relatively straightforward procedure can be employed to obtain benign vs malignant classifier comparable with that originally used for the database of the raw RF signals and based on the more complicated segmentation of the parameter maps of homodyned K distribution.","PeriodicalId":49062,"journal":{"name":"Image Analysis & Stereology","volume":"42 1","pages":"129-145"},"PeriodicalIF":0.8000,"publicationDate":"2020-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Image Analysis & Stereology","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.5566/ias.2113","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY","Score":null,"Total":0}
引用次数: 1
Abstract
Most of the methods of classification of breast lesions in ultrasound (US) images have been tested on B-mode images from the commercial equipment. The new possibility of further analysis of this problem showed up with the availability of a public database containing original raw radio frequency (RF) signals. In particular, it appeared that the original texture might contain diagnostic information which could be modified in the typical image processing and which is more difficult to perceive than the details of lesion shape/contour. In this paper a detailed analysis of the lesion texture is conducted by means of the decision trees and logistic regression. The decision trees turned out useful mainly for selecting texture features to be employed in the stepwise logistic regression. The RF signals database of 200 breast lesions was used for testing the performance of the benign vs malignant lesion classifier. The Gray Level Cooccurrence Matrix (GLCM) was calculated with the vertical/horizontal offset of up to five pixels. For each of these matrices six features were calculated resulting in a total of 210 features. Using these features a sufficient number of decision trees were generated to calculate pseudo-Receiver Operating Characteristics (ROCs). The outcome of this process is a collection of generated trees for which the employed features are known. These features were then used for generating generalized linear model by means of stepwise logistic regression. The analyzed regression models included the coefficients of up-to-the second degree terms. The texture features were further completed by a single shape feature, that is tumor circularity. The automatic procedure for finding the exact mask of a lesion is also provided for the conditions when the acoustic shadowing makes it impossible to obtain the entire contour reliably and a half-contour has to be used. The selected logistic regression models gave ROCs with the Area Under Curve (AUC) of up to 0.83 and the 95 % confidence region (0.63 0.96). Analyzing classification results one comes to the conclusion that the tumor circularity, which is the most informative among shape/contour features, is not essential against the background of textural features. The reported study shows that a relatively straightforward procedure can be employed to obtain benign vs malignant classifier comparable with that originally used for the database of the raw RF signals and based on the more complicated segmentation of the parameter maps of homodyned K distribution.
期刊介绍:
Image Analysis and Stereology is the official journal of the International Society for Stereology & Image Analysis. It promotes the exchange of scientific, technical, organizational and other information on the quantitative analysis of data having a geometrical structure, including stereology, differential geometry, image analysis, image processing, mathematical morphology, stochastic geometry, statistics, pattern recognition, and related topics. The fields of application are not restricted and range from biomedicine, materials sciences and physics to geology and geography.