The appearance of whole slide biopsy images is greatly affected by various factors such as laboratory procedures or the choice of digital slide scanners. The resulting variations in image styles within and across batches of histological images represent one of the major obstacles to the development of generalizable machine learning algorithms. To overcome this challenge, a lot of research has focused on stain normalization and stain augmentation techniques. While such approaches provide effective strategies to reduce stain variation or increase stain invariance, respectively, they typically involve only limited modelling or sampling of the underlying stain style distribution. Tools for a streamlined sampling of different aspects of such a distribution, which would be crucial e.g. for explicitly evaluating machine learning robustness across or with respect to major stain styles, remain largely missing. Here, we present the StainStyleSampler, a toolkit for (i) the exploration and modelling of stain style variations, and (ii) the automated sampling of images or styles capturing the core components of this variation. The tool enables the extraction of various colour features and deconvolved stain components, visualization of such features directly or after dimensionality reduction, modelling of style distributions using binning, clustering, and density mapping, and automated sampling of the most representative reference images. We believe that this software will equip pathologists and computer-scientists with a more versatile set of tools that can substantially aid in both the exploration and sampling of stain variation across whole slide images.
扫码关注我们
求助内容:
应助结果提醒方式:
