首页 > 最新文献

Journal of Pathology Informatics最新文献

英文 中文
Automated detection and delineation of lymph nodes in haematoxylin & eosin stained digitised slides 在苏木精和伊红染色的数字化玻片上自动检测和描绘淋巴结
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2023.100192
Manon Beuque , Derek R. Magee , Avishek Chatterjee , Henry C. Woodruff , Ruth E. Langley , William Allum , Matthew G. Nankivell , David Cunningham , Philippe Lambin , Heike I. Grabsch

Treatment of patients with oesophageal and gastric cancer (OeGC) is guided by disease stage, patient performance status and preferences. Lymph node (LN) status is one of the strongest prognostic factors for OeGC patients. However, survival varies between patients with the same disease stage and LN status. We recently showed that LN size from patients with OeGC might also have prognostic value, thus making delineations of LNs essential for size estimation and the extraction of other imaging biomarkers.

We hypothesized that a machine learning workflow is able to: (1) find digital H&E stained slides containing LNs, (2) create a scoring system providing degrees of certainty for the results, and (3) delineate LNs in those images.

To train and validate the pipeline, we used 1695 H&E slides from the OE02 trial. The dataset was divided into training (80%) and validation (20%). The model was tested on an external dataset of 826 H&E slides from the OE05 trial. U-Net architecture was used to generate prediction maps from which predefined features were extracted. These features were subsequently used to train an XGBoost model to determine if a region truly contained a LN. With our innovative method, the balanced accuracies of the LN detection were 0.93 on the validation dataset (0.83 on the test dataset) compared to 0.81 (0.81) on the validation (test) datasets when using the standard method of thresholding U-Net predictions to arrive at a binary mask. Our method allowed for the creation of an “uncertain” category, and partly limited false-positive predictions on the external dataset. The mean Dice score was 0.73 (0.60) per-image and 0.66 (0.48) per-LN for the validation (test) datasets.

Our pipeline detects images with LNs more accurately than conventional methods, and high-throughput delineation of LNs can facilitate future LN content analyses of large datasets.

食管癌和胃癌(OeGC)患者的治疗是根据疾病分期、患者的身体状况和偏好来指导的。淋巴结(LN)状态是OeGC患者最重要的预后因素之一。然而,同一疾病分期和LN状态的患者的生存率不同。我们最近发现,OeGC患者的淋巴结大小也可能具有预后价值,因此对淋巴结的描绘对于大小估计和其他成像生物标志物的提取至关重要。我们假设机器学习工作流程能够:(1)找到包含ln的数字H&E染色玻片,(2)创建一个评分系统,为结果提供确定性程度,(3)在这些图像中描绘ln。为了训练和验证管道,我们使用了来自OE02试验的1695张H&E幻灯片。数据集分为训练(80%)和验证(20%)。该模型在OE05试验的826张H&E幻灯片的外部数据集上进行了测试。采用U-Net体系结构生成预测图,从中提取预定义特征。这些特征随后被用于训练XGBoost模型,以确定一个区域是否真正包含LN。使用我们的创新方法,当使用阈值U-Net预测的标准方法来获得二进制掩码时,LN检测的平衡精度在验证数据集上为0.93(在测试数据集上为0.83),而在验证(测试)数据集中为0.81(0.81)。我们的方法允许创建“不确定”类别,并在一定程度上限制了外部数据集上的假阳性预测。对于验证(测试)数据集,平均Dice评分为每张图像0.73(0.60)和每张ln 0.66(0.48)。我们的管道比传统方法更准确地检测具有LN的图像,并且对LN的高通量描绘可以促进未来大型数据集的LN内容分析。
{"title":"Automated detection and delineation of lymph nodes in haematoxylin & eosin stained digitised slides","authors":"Manon Beuque ,&nbsp;Derek R. Magee ,&nbsp;Avishek Chatterjee ,&nbsp;Henry C. Woodruff ,&nbsp;Ruth E. Langley ,&nbsp;William Allum ,&nbsp;Matthew G. Nankivell ,&nbsp;David Cunningham ,&nbsp;Philippe Lambin ,&nbsp;Heike I. Grabsch","doi":"10.1016/j.jpi.2023.100192","DOIUrl":"10.1016/j.jpi.2023.100192","url":null,"abstract":"<div><p>Treatment of patients with oesophageal and gastric cancer (OeGC) is guided by disease stage, patient performance status and preferences. Lymph node (LN) status is one of the strongest prognostic factors for OeGC patients. However, survival varies between patients with the same disease stage and LN status. We recently showed that LN size from patients with OeGC might also have prognostic value, thus making delineations of LNs essential for size estimation and the extraction of other imaging biomarkers.</p><p>We hypothesized that a machine learning workflow is able to: (1) find digital H&amp;E stained slides containing LNs, (2) create a scoring system providing degrees of certainty for the results, and (3) delineate LNs in those images.</p><p>To train and validate the pipeline, we used 1695 H&amp;E slides from the OE02 trial. The dataset was divided into training (80%) and validation (20%). The model was tested on an external dataset of 826 H&amp;E slides from the OE05 trial. U-Net architecture was used to generate prediction maps from which predefined features were extracted. These features were subsequently used to train an XGBoost model to determine if a region truly contained a LN. With our innovative method, the balanced accuracies of the LN detection were 0.93 on the validation dataset (0.83 on the test dataset) compared to 0.81 (0.81) on the validation (test) datasets when using the standard method of thresholding U-Net predictions to arrive at a binary mask. Our method allowed for the creation of an “uncertain” category, and partly limited false-positive predictions on the external dataset. The mean Dice score was 0.73 (0.60) per-image and 0.66 (0.48) per-LN for the validation (test) datasets.</p><p>Our pipeline detects images with LNs more accurately than conventional methods, and high-throughput delineation of LNs can facilitate future LN content analyses of large datasets.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/87/43/main.PMC9932489.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9162066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transformer-based personalized attention mechanism for medical images with clinical records 基于变压器的临床医学图像个性化注意机制
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2022.100185
Yusuke Takagi , Noriaki Hashimoto , Hiroki Masuda , Hiroaki Miyoshi , Koichi Ohshima , Hidekata Hontani , Ichiro Takeuchi

In medical image diagnosis, identifying the attention region, i.e., the region of interest for which the diagnosis is made, is an important task. Various methods have been developed to automatically identify target regions from given medical images. However, in actual medical practice, the diagnosis is made based on both the images and various clinical records. Consequently, pathologists examine medical images with prior knowledge of the patients and the attention regions may change depending on the clinical records. In this study, we propose a method, called the Personalized Attention Mechanism (PersAM) method, by which the attention regions in medical images according to the clinical records. The primary idea underlying the PersAM method is the encoding of the relationships between medical images and clinical records using a variant of the Transformer architecture. To demonstrate the effectiveness of the PersAM method, we applied it to a large-scale digital pathology problem involving identifying the subtypes of 842 malignant lymphoma patients based on their gigapixel whole-slide images and clinical records.

在医学图像诊断中,识别注意区域,即对诊断感兴趣的区域,是一项重要的任务。已经开发了各种方法来从给定的医学图像中自动识别目标区域。然而,在实际的医疗实践中,诊断是基于图像和各种临床记录。因此,病理学家检查医学图像与患者的先验知识和注意力区域可能会改变取决于临床记录。在本研究中,我们提出了一种个性化注意机制(PersAM)方法,该方法根据临床记录对医学图像中的注意区域进行划分。PersAM方法的基本思想是使用Transformer体系结构的变体对医学图像和临床记录之间的关系进行编码。为了证明PersAM方法的有效性,我们将其应用于一个大规模的数字病理问题,该问题涉及根据842名恶性淋巴瘤患者的千兆像素全幻灯片图像和临床记录识别其亚型。
{"title":"Transformer-based personalized attention mechanism for medical images with clinical records","authors":"Yusuke Takagi ,&nbsp;Noriaki Hashimoto ,&nbsp;Hiroki Masuda ,&nbsp;Hiroaki Miyoshi ,&nbsp;Koichi Ohshima ,&nbsp;Hidekata Hontani ,&nbsp;Ichiro Takeuchi","doi":"10.1016/j.jpi.2022.100185","DOIUrl":"10.1016/j.jpi.2022.100185","url":null,"abstract":"<div><p>In medical image diagnosis, identifying the attention region, i.e., the region of interest for which the diagnosis is made, is an important task. Various methods have been developed to automatically identify target regions from given medical images. However, in actual medical practice, the diagnosis is made based on both the images and various clinical records. Consequently, pathologists examine medical images with prior knowledge of the patients and the attention regions may change depending on the clinical records. In this study, we propose a method, called the <em>Personalized Attention Mechanism (PersAM)</em> method, by which the attention regions in medical images according to the clinical records. The primary idea underlying the PersAM method is the encoding of the relationships between medical images and clinical records using a variant of the Transformer architecture. To demonstrate the effectiveness of the PersAM method, we applied it to a large-scale digital pathology problem involving identifying the subtypes of 842 malignant lymphoma patients based on their gigapixel whole-slide images and clinical records.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/80/66/main.PMC9860154.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9183255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Differentiation of urothelial carcinoma in histopathology images using deep learning and visualization 利用深度学习和可视化技术鉴别尿路上皮癌的组织病理学图像
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2022.100155
Aniruddha Mundhada , Sandhya Sundaram , Ramakrishnan Swaminathan , Lawrence D' Cruze , Satyavratan Govindarajan , Navaneethakrishna Makaram

Artificial Intelligence is a tool poised to transform healthcare, with use in diagnostics and therapeutics. The widespread use of digital pathology has been due to the advent of whole slide imaging. Cheaper storage for digital images, along with unprecedented progress in artificial intelligence, have paved the synergy of these two fields. This has pushed the limits of traditional diagnosis using light microscopy, from a more subjective to a more objective method of looking at cases, incorporating grading too. The grading of histopathological images of urothelial carcinoma of the urinary bladder is important with direct implications for surgical management and prognosis.

In this study, the aim is to classify urothelial carcinoma into low and high grade based on the WHO 2016 classification. The hematoxylin and eosin-stained transurethral resection of bladder tumor (TURBT) samples of both low and high grade non-invasive papillary urothelial carcinoma were digitally scanned. Patches were extracted from these whole slide images to feed into a deep learning (Convolution Neural Network: CNN) model. Patches were segregated if they had tumor tissue and only included for model training if a threshold of 90% of tumor tissue per patch was seen. Various parameters of the deep learning model, known as hyperparameters, were optimized to get the best accuracy for grading or classification into low- and high-grade urothelial carcinoma. The model was robust with an overall accuracy of 90% after hyperparameter tuning. Visualization in the form of a class activation map using Grad-CAM was done. This indicates that such a model can be used as a companion diagnostic tool for grading of urothelial carcinoma. The probable causes of this accuracy are summarized along with the limitations of this study and future work possible.

人工智能是一种有望改变医疗保健的工具,可用于诊断和治疗。数字病理学的广泛应用是由于全切片成像的出现。更便宜的数字图像存储,以及人工智能领域前所未有的进步,为这两个领域的协同作用铺平了道路。这突破了传统光学显微镜诊断的极限,从一种更主观的观察病例的方法变成了一种更客观的方法,并结合了分级。膀胱尿路上皮癌的组织病理学图像分级对手术治疗和预后具有直接意义。本研究的目的是根据WHO 2016年的分级标准将尿路上皮癌分为低分级和高分级。对低级别和高级别无创乳头状尿路上皮癌经尿道膀胱肿瘤切除术(turt)标本进行数字扫描。从这些完整的幻灯片图像中提取斑块,并将其输入深度学习(卷积神经网络:CNN)模型。如果贴片有肿瘤组织,则将其分离,只有当每个贴片的肿瘤组织的阈值为90%时,才将其纳入模型训练。深度学习模型的各种参数(称为超参数)经过优化,以获得低级别和高级别尿路上皮癌分级或分类的最佳准确性。经过超参数调整后,该模型具有良好的鲁棒性,总体精度达到90%。使用Grad-CAM以类激活图的形式进行了可视化。这表明该模型可作为尿路上皮癌分级的辅助诊断工具。本文总结了这种准确性的可能原因以及本研究的局限性和未来可能的工作。
{"title":"Differentiation of urothelial carcinoma in histopathology images using deep learning and visualization","authors":"Aniruddha Mundhada ,&nbsp;Sandhya Sundaram ,&nbsp;Ramakrishnan Swaminathan ,&nbsp;Lawrence D' Cruze ,&nbsp;Satyavratan Govindarajan ,&nbsp;Navaneethakrishna Makaram","doi":"10.1016/j.jpi.2022.100155","DOIUrl":"10.1016/j.jpi.2022.100155","url":null,"abstract":"<div><p>Artificial Intelligence is a tool poised to transform healthcare, with use in diagnostics and therapeutics. The widespread use of digital pathology has been due to the advent of whole slide imaging. Cheaper storage for digital images, along with unprecedented progress in artificial intelligence, have paved the synergy of these two fields. This has pushed the limits of traditional diagnosis using light microscopy, from a more subjective to a more objective method of looking at cases, incorporating grading too. The grading of histopathological images of urothelial carcinoma of the urinary bladder is important with direct implications for surgical management and prognosis.</p><p>In this study, the aim is to classify urothelial carcinoma into low and high grade based on the WHO 2016 classification. The hematoxylin and eosin-stained transurethral resection of bladder tumor (TURBT) samples of both low and high grade non-invasive papillary urothelial carcinoma were digitally scanned. Patches were extracted from these whole slide images to feed into a deep learning (Convolution Neural Network: CNN) model. Patches were segregated if they had tumor tissue and only included for model training if a threshold of 90% of tumor tissue per patch was seen. Various parameters of the deep learning model, known as hyperparameters, were optimized to get the best accuracy for grading or classification into low- and high-grade urothelial carcinoma. The model was robust with an overall accuracy of 90% after hyperparameter tuning. Visualization in the form of a class activation map using Grad-CAM was done. This indicates that such a model can be used as a companion diagnostic tool for grading of urothelial carcinoma. The probable causes of this accuracy are summarized along with the limitations of this study and future work possible.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/6c/f9/main.PMC9747506.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10355027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Removing non-nuclei information from histopathological images: A preprocessing step towards improving nuclei segmentation methods 从组织病理学图像中去除非细胞核信息:改进细胞核分割方法的预处理步骤。
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2023.100315
Ricardo Moncayo , Anne L. Martel , Eduardo Romero

Disease interpretation by computer-aided diagnosis systems in digital pathology depends on reliable detection and segmentation of nuclei in hematoxylin and eosin (HE) images. These 2 tasks are challenging since appearance of both cell nuclei and background structures are very variable. This paper presents a method to improve nuclei detection and segmentation in HE images by removing tiles that only contain background information. The method divides each image into smaller patches and uses their projection to the noiselet space to capture different spatial features from non-nuclei background and nuclei structures. The noiselet features are clustered by a K-means algorithm and the resultant partition, defined by the cluster centroids, is herein named the noiselet code-book. A part of an image, a tile, is divided into patches and represented by the histogram of occurrences of the projected patches in the noiselet code-book. Finally, with these histograms, a classifier learns to differentiate between nuclei and non-nuclei tiles. By applying a conventional watershed-marked method to detect and segment nuclei, evaluation consisted in comparing pure watershed method against denoising-plus-watershed in an open database with 8 different types of tissues. The averaged F-score of nuclei detection improved from 0.830 to 0.86 and the dice score after segmentation increased from 0.701 to 0.723.

数字病理学中计算机辅助诊断系统的疾病解释依赖于苏木精和伊红(HE)图像中细胞核的可靠检测和分割。这两项任务具有挑战性,因为细胞核和背景结构的外观都是非常可变的。本文提出了一种通过去除仅包含背景信息的瓦片来改进HE图像中细胞核检测和分割的方法。该方法将每个图像划分为更小的块,并使用它们对小噪声空间的投影来捕捉非核背景和核结构的不同空间特征。通过K-means算法对noiselet特征进行聚类,由聚类质心定义的结果分区在本文中被命名为noiselet代码簿。图像的一部分,即瓦片,被划分为块,并由noiselet代码本中投影块的出现直方图表示。最后,通过这些直方图,分类器学会区分细胞核和非细胞核瓦片。通过应用传统的分水岭标记方法来检测和分割细胞核,评估包括在具有8种不同类型组织的开放数据库中比较纯分水岭方法和去噪加分水岭方法。细胞核检测的平均F分数从0.830提高到0.86,分割后的骰子分数从0.701提高到0.723。
{"title":"Removing non-nuclei information from histopathological images: A preprocessing step towards improving nuclei segmentation methods","authors":"Ricardo Moncayo ,&nbsp;Anne L. Martel ,&nbsp;Eduardo Romero","doi":"10.1016/j.jpi.2023.100315","DOIUrl":"10.1016/j.jpi.2023.100315","url":null,"abstract":"<div><p>Disease interpretation by computer-aided diagnosis systems in digital pathology depends on reliable detection and segmentation of nuclei in hematoxylin and eosin (HE) images. These 2 tasks are challenging since appearance of both cell nuclei and background structures are very variable. This paper presents a method to improve nuclei detection and segmentation in HE images by removing tiles that only contain background information. The method divides each image into smaller patches and uses their projection to the noiselet space to capture different spatial features from non-nuclei background and nuclei structures. The noiselet features are clustered by a <em>K</em>-means algorithm and the resultant partition, defined by the cluster centroids, is herein named the noiselet code-book. A part of an image, a tile, is divided into patches and represented by the histogram of occurrences of the projected patches in the noiselet code-book. Finally, with these histograms, a classifier learns to differentiate between nuclei and non-nuclei tiles. By applying a conventional watershed-marked method to detect and segment nuclei, evaluation consisted in comparing pure watershed method against denoising-plus-watershed in an open database with 8 different types of tissues. The averaged F-score of nuclei detection improved from 0.830 to 0.86 and the dice score after segmentation increased from 0.701 to 0.723.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10550762/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41166304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HistoPerm: A permutation-based view generation approach for improving histopathologic feature representation learning HistoPerm:一种基于排列的视图生成方法,用于改善组织病理特征表示学习
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2023.100320
Joseph DiPalma , Lorenzo Torresani , Saeed Hassanpour

Deep learning has been effective for histology image analysis in digital pathology. However, many current deep learning approaches require large, strongly- or weakly labeled images and regions of interest, which can be time-consuming and resource-intensive to obtain. To address this challenge, we present HistoPerm, a view generation method for representation learning using joint embedding architectures that enhances representation learning for histology images. HistoPerm permutes augmented views of patches extracted from whole-slide histology images to improve classification performance. We evaluated the effectiveness of HistoPerm on 2 histology image datasets for Celiac disease and Renal Cell Carcinoma, using 3 widely used joint embedding architecture-based representation learning methods: BYOL, SimCLR, and VICReg. Our results show that HistoPerm consistently improves patch- and slide-level classification performance in terms of accuracy, F1-score, and AUC. Specifically, for patch-level classification accuracy on the Celiac disease dataset, HistoPerm boosts BYOL and VICReg by 8% and SimCLR by 3%. On the Renal Cell Carcinoma dataset, patch-level classification accuracy is increased by 2% for BYOL and VICReg, and by 1% for SimCLR. In addition, on the Celiac disease dataset, models with HistoPerm outperform the fully supervised baseline model by 6%, 5%, and 2% for BYOL, SimCLR, and VICReg, respectively. For the Renal Cell Carcinoma dataset, HistoPerm lowers the classification accuracy gap for the models up to 10% relative to the fully supervised baseline. These findings suggest that HistoPerm can be a valuable tool for improving representation learning of histopathology features when access to labeled data is limited and can lead to whole-slide classification results that are comparable to or superior to fully supervised methods.

深度学习对于数字病理学中的组织学图像分析是有效的。然而,当前的许多深度学习方法需要大的、强或弱标记的图像和感兴趣区域,这可能是耗时和资源密集型的。为了应对这一挑战,我们提出了HistoPerm,这是一种使用联合嵌入架构进行表示学习的视图生成方法,可以增强组织学图像的表示学习。HistoPerm排列增强了从整张幻灯片组织学图像中提取的补丁的视图,以提高分类性能。我们使用3种广泛使用的基于联合嵌入架构的表示学习方法:BYOL、SimCLR和VICReg,在2个组织学图像数据集上评估了HistoPerm对腹腔疾病和肾细胞癌的有效性。我们的结果表明,HistoPerm在准确性、F1分数和AUC方面持续提高了贴片和载玻片级别的分类性能。具体而言,对于Celiac疾病数据集的补丁级别分类准确性,HistoPerm将BYOL和VICReg提高了8%,将SimCLR提高了3%。在肾细胞癌数据集上,BYOL和VICReg的补丁级别分类准确率提高了2%,SimCLR的补丁级别的分类准确率增加了1%。此外,在腹腔疾病数据集上,BYOL、SimCLR和VICReg的HistoPerm模型分别比完全监督的基线模型好6%、5%和2%。对于肾细胞癌数据集,相对于完全监督的基线,HistoPerm将模型的分类准确率差距降低了10%。这些发现表明,当对标记数据的访问受到限制时,HistoPerm可以成为改进组织病理学特征表示学习的一个有价值的工具,并且可以导致与完全监督方法相当或优于完全监督方法的全玻片分类结果。
{"title":"HistoPerm: A permutation-based view generation approach for improving histopathologic feature representation learning","authors":"Joseph DiPalma ,&nbsp;Lorenzo Torresani ,&nbsp;Saeed Hassanpour","doi":"10.1016/j.jpi.2023.100320","DOIUrl":"10.1016/j.jpi.2023.100320","url":null,"abstract":"<div><p>Deep learning has been effective for histology image analysis in digital pathology. However, many current deep learning approaches require large, strongly- or weakly labeled images and regions of interest, which can be time-consuming and resource-intensive to obtain. To address this challenge, we present HistoPerm, a view generation method for representation learning using joint embedding architectures that enhances representation learning for histology images. HistoPerm permutes augmented views of patches extracted from whole-slide histology images to improve classification performance. We evaluated the effectiveness of HistoPerm on 2 histology image datasets for Celiac disease and Renal Cell Carcinoma, using 3 widely used joint embedding architecture-based representation learning methods: BYOL, SimCLR, and VICReg. Our results show that HistoPerm consistently improves patch- and slide-level classification performance in terms of accuracy, F1-score, and AUC. Specifically, for patch-level classification accuracy on the Celiac disease dataset, HistoPerm boosts BYOL and VICReg by 8% and SimCLR by 3%. On the Renal Cell Carcinoma dataset, patch-level classification accuracy is increased by 2% for BYOL and VICReg, and by 1% for SimCLR. In addition, on the Celiac disease dataset, models with HistoPerm outperform the fully supervised baseline model by 6%, 5%, and 2% for BYOL, SimCLR, and VICReg, respectively. For the Renal Cell Carcinoma dataset, HistoPerm lowers the classification accuracy gap for the models up to 10% relative to the fully supervised baseline. These findings suggest that HistoPerm can be a valuable tool for improving representation learning of histopathology features when access to labeled data is limited and can lead to whole-slide classification results that are comparable to or superior to fully supervised methods.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10339175/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9826170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Impact of a switch to immediate release on the patient viewing of diagnostic test results in an online portal at an academic medical center 切换到立即发布对患者在学术医疗中心的在线门户中查看诊断测试结果的影响
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2023.100323
Kelly E. Wood , Hanh T. Pham , Knute D. Carter , Kenneth G. Nepple , James M. Blum , Matthew D. Krasowski

Patient portals allow patients to access their personal health information. The 21st Century Cures Act in the United States sought to eliminate ‘information blocking’, requiring timely release upon request of electronic health information including diagnostic test results. Some health systems, including the one in the present study, chose a systematic switch to immediate release of all or nearly all diagnostic test results to patient portals as part of compliance with the Cures Act. Our primary objective was to study changes in the time to view test results by patients before and after implementation of Cures Act-related changes. This retrospective pre-post study included data from two 10-month time periods before and after implementation of Cures Act-related changes at an academic medical center. The study included all patients (adult and pediatric) with diagnostic testing (laboratory and imaging) performed in the outpatient, inpatient, or emergency department settings. Between February 9, 2020 and December 9, 2021, there was a total of 3 809 397 diagnostic tests from 204 605 unique patients (3 320 423 tests for adult patients; 488 974 for pediatric patients). Overall, 56.5% (115 627) of patients were female, 84.1% (172 048) white, and 96.5% (197 517) preferred English as primary language. The odds of viewing test results within 1 and 30 days after portal release increased monthly throughout both time periods before and after the Cures Act for all patients. The rate of increase was significantly higher after implementation only in the subgroup of tests belonging to adult patients with active MyChart accounts. Immediate release shifted a higher proportion of result/report release to weekends (3.2% pre-Cures vs 15.3% post-Cures), although patient viewing patterns by day of week and time of day were similar before and after immediate release changes. The switch to immediate release of diagnostic test results to the patient portal resulted in a higher fraction of results viewed within 1 day across outpatient, inpatient, and emergency department settings.

患者门户允许患者访问他们的个人健康信息。美国的《21世纪治愈法案》试图消除"信息封锁",要求应要求及时公布包括诊断测试结果在内的电子健康信息。一些卫生系统,包括本研究中的卫生系统,选择了一种系统的转变,即立即向患者门户网站发布所有或几乎所有的诊断测试结果,作为遵守《治愈法案》的一部分。我们的主要目标是研究在实施《治愈法案》相关变更之前和之后患者查看检测结果的时间变化。这项回顾性的前后研究包括了一家学术医疗中心实施《治愈法案》相关改革前后两个10个月的数据。该研究包括所有在门诊、住院或急诊科进行诊断测试(实验室和成像)的患者(成人和儿童)。2020年2月9日至2021年12月9日,共进行了3次 809 397次诊断检测,来自204 605例独特患者(3 320 423例成人患者;488 小儿患者974)。总体而言,56.5%(115 627)的患者为女性,84.1%(172 048)的患者为白人,96.5%(197 517)的患者以英语为主要语言。在《治愈法案》之前和之后的所有患者中,在门户释放后1天和30天内查看测试结果的几率每月都在增加。只有在属于MyChart活跃账户的成年患者的测试亚组中,实施后的增长率才显着提高。即时释放将更高比例的结果/报告转移到周末(治疗前3.2% vs治疗后15.3%),尽管患者按星期和一天的时间查看模式在立即释放变化之前和之后相似。切换到立即向患者门户发布诊断测试结果,在门诊、住院和急诊科设置中,1天内查看的结果比例更高。
{"title":"Impact of a switch to immediate release on the patient viewing of diagnostic test results in an online portal at an academic medical center","authors":"Kelly E. Wood ,&nbsp;Hanh T. Pham ,&nbsp;Knute D. Carter ,&nbsp;Kenneth G. Nepple ,&nbsp;James M. Blum ,&nbsp;Matthew D. Krasowski","doi":"10.1016/j.jpi.2023.100323","DOIUrl":"10.1016/j.jpi.2023.100323","url":null,"abstract":"<div><p>Patient portals allow patients to access their personal health information. The 21st Century Cures Act in the United States sought to eliminate ‘information blocking’, requiring timely release upon request of electronic health information including diagnostic test results. Some health systems, including the one in the present study, chose a systematic switch to immediate release of all or nearly all diagnostic test results to patient portals as part of compliance with the Cures Act. Our primary objective was to study changes in the time to view test results by patients before and after implementation of Cures Act-related changes. This retrospective pre-post study included data from two 10-month time periods before and after implementation of Cures Act-related changes at an academic medical center. The study included all patients (adult and pediatric) with diagnostic testing (laboratory and imaging) performed in the outpatient, inpatient, or emergency department settings. Between February 9, 2020 and December 9, 2021, there was a total of 3 809 397 diagnostic tests from 204 605 unique patients (3 320 423 tests for adult patients; 488 974 for pediatric patients). Overall, 56.5% (115 627) of patients were female, 84.1% (172 048) white, and 96.5% (197 517) preferred English as primary language. The odds of viewing test results within 1 and 30 days after portal release increased monthly throughout both time periods before and after the Cures Act for all patients. The rate of increase was significantly higher after implementation only in the subgroup of tests belonging to adult patients with active MyChart accounts. Immediate release shifted a higher proportion of result/report release to weekends (3.2% pre-Cures vs 15.3% post-Cures), although patient viewing patterns by day of week and time of day were similar before and after immediate release changes. The switch to immediate release of diagnostic test results to the patient portal resulted in a higher fraction of results viewed within 1 day across outpatient, inpatient, and emergency department settings.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/bd/cc/main.PMC10384271.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9912062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Comparative Pathology Workbench: Interactive visual analytics for biomedical data 比较病理学工作台:生物医学数据的交互式可视化分析
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2023.100328
Michael N. Wicks , Michael Glinka , Bill Hill , Derek Houghton , Mehran Sharghi , Ingrid Ferreira , David Adams , Shahida Din , Irene Papatheodorou , Kathryn Kirkwood , Michael Cheeseman , Albert Burger , Richard A. Baldock , Mark J. Arends

Pathologists need to compare histopathological images of normal and diseased tissues between different samples, cases, and species. We have designed an interactive system, termed Comparative Pathology Workbench (CPW), which allows direct and dynamic comparison of images at a variety of magnifications, selected regions of interest, as well as the results of image analysis or other data analyses such as scRNA-seq. This allows pathologists to indicate key diagnostic features, with a mechanism to allow discussion threads amongst expert groups of pathologists and other disciplines. The data and associated discussions can be accessed online from anywhere in the world. The Comparative Pathology Workbench (CPW) is a web-browser-based visual analytics platform providing shared access to an interactive “spreadsheet” style presentation of image and associated analysis data. The CPW provides a grid layout of rows and columns so that images that correspond to matching data can be organised in the form of an image-enabled “spreadsheet”. An individual workbench can be shared with other users with read-only or full edit access as required. In addition, each workbench element or the whole bench itself has an associated discussion thread to allow collaborative analysis and consensual interpretation of the data.

The CPW is a Django-based web-application that hosts the workbench data, manages users, and user-preferences. All image data are hosted by other resource applications such as OMERO or the Digital Slide Archive. Further resources can be added as required. The discussion threads are managed using WordPress and include additional graphical and image data. The CPW has been developed to allow integration of image analysis outputs from systems such as QuPath or ImageJ. All software is open-source and available from a GitHub repository.

病理学家需要比较不同样本、病例和物种之间正常和病变组织的组织病理学图像。我们设计了一个交互式系统,称为比较病理学工作台(CPW),它允许在各种放大倍数下对图像进行直接和动态比较,选择感兴趣的区域,以及图像分析或其他数据分析(如scRNA-seq)的结果。这使病理学家能够指出关键的诊断特征,并通过一种机制允许病理学家和其他学科的专家组之间的讨论线程。数据和相关讨论可以从世界任何地方在线访问。比较病理学工作台(CPW)是一个基于web浏览器的可视化分析平台,提供对交互式“电子表格”风格的图像和相关分析数据的共享访问。CPW提供了行和列的网格布局,以便与匹配数据相对应的图像可以以启用图像的“电子表格”的形式进行组织。可以根据需要与具有只读或完全编辑访问权限的其他用户共享单个工作台。此外,每个工作台元素或整个工作台本身都有一个相关的讨论线程,以允许对数据进行协作分析和一致解释。CPW是一个基于django的web应用程序,它托管工作台数据,管理用户和用户首选项。所有图像数据都由其他资源应用程序托管,如OMERO或数字幻灯片存档。可以根据需要添加更多的资源。讨论线程使用WordPress管理,并包含额外的图形和图像数据。开发CPW是为了集成来自QuPath或ImageJ等系统的图像分析输出。所有软件都是开源的,可以从GitHub存储库中获得。
{"title":"The Comparative Pathology Workbench: Interactive visual analytics for biomedical data","authors":"Michael N. Wicks ,&nbsp;Michael Glinka ,&nbsp;Bill Hill ,&nbsp;Derek Houghton ,&nbsp;Mehran Sharghi ,&nbsp;Ingrid Ferreira ,&nbsp;David Adams ,&nbsp;Shahida Din ,&nbsp;Irene Papatheodorou ,&nbsp;Kathryn Kirkwood ,&nbsp;Michael Cheeseman ,&nbsp;Albert Burger ,&nbsp;Richard A. Baldock ,&nbsp;Mark J. Arends","doi":"10.1016/j.jpi.2023.100328","DOIUrl":"10.1016/j.jpi.2023.100328","url":null,"abstract":"<div><p>Pathologists need to compare histopathological images of normal and diseased tissues between different samples, cases, and species. We have designed an interactive system, termed Comparative Pathology Workbench (CPW), which allows direct and dynamic comparison of images at a variety of magnifications, selected regions of interest, as well as the results of image analysis or other data analyses such as scRNA-seq. This allows pathologists to indicate key diagnostic features, with a mechanism to allow discussion threads amongst expert groups of pathologists and other disciplines. The data and associated discussions can be accessed online from anywhere in the world. The Comparative Pathology Workbench (CPW) is a web-browser-based visual analytics platform providing shared access to an interactive “spreadsheet” style presentation of image and associated analysis data. The CPW provides a grid layout of rows and columns so that images that correspond to matching data can be organised in the form of an image-enabled “spreadsheet”. An individual workbench can be shared with other users with read-only or full edit access as required. In addition, each workbench element or the whole bench itself has an associated discussion thread to allow collaborative analysis and consensual interpretation of the data.</p><p>The CPW is a Django-based web-application that hosts the workbench data, manages users, and user-preferences. All image data are hosted by other resource applications such as OMERO or the Digital Slide Archive. Further resources can be added as required. The discussion threads are managed using WordPress and include additional graphical and image data. The CPW has been developed to allow integration of image analysis outputs from systems such as QuPath or ImageJ. All software is open-source and available from a GitHub repository.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491844/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10220036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigation of semi- and self-supervised learning methods in the histopathological domain 组织病理学领域半监督和自我监督学习方法的研究
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2023.100305
Benjamin Voigt , Oliver Fischer , Bruno Schilling , Christian Krumnow , Christian Herta

Training models with semi- or self-supervised learning methods is one way to reduce annotation effort since they rely on unlabeled or sparsely labeled datasets. Such approaches are particularly promising for domains with a time-consuming annotation process requiring specialized expertise and where high-quality labeled machine learning datasets are scarce, like in computational pathology. Even though some of these methods have been used in the histopathological domain, there is, so far, no comprehensive study comparing different approaches. Therefore, this work compares feature extractors models trained with state-of-the-art semi- or self-supervised learning methods PAWS, SimCLR, and SimSiam within a unified framework. We show that such models, across different architectures and network configurations, have a positive performance impact on histopathological classification tasks, even in low data regimes. Moreover, our observations suggest that features learned from a particular dataset, i.e., tissue type, are only in-domain transferable to a certain extent. Finally, we share our experience using each method in computational pathology and provide recommendations for its use.

使用半监督或自监督学习方法训练模型是减少标注工作量的一种方法,因为它们依赖于未标记或稀疏标记的数据集。这种方法特别适用于需要专业知识的耗时标注过程和高质量标记机器学习数据集稀缺的领域,如计算病理学。尽管其中一些方法已被用于组织病理学领域,但到目前为止,还没有对不同方法进行比较的全面研究。因此,这项工作比较了在统一框架内使用最先进的半监督或自监督学习方法PAWS、SimCLR和SimSiam训练的特征提取器模型。我们表明,这种模型,跨越不同的架构和网络配置,对组织病理学分类任务具有积极的性能影响,即使在低数据体制下也是如此。此外,我们的观察表明,从特定数据集中学习的特征,即组织类型,只能在一定程度上在域内转移。最后,我们分享了我们在计算病理学中使用每种方法的经验,并提出了使用建议。
{"title":"Investigation of semi- and self-supervised learning methods in the histopathological domain","authors":"Benjamin Voigt ,&nbsp;Oliver Fischer ,&nbsp;Bruno Schilling ,&nbsp;Christian Krumnow ,&nbsp;Christian Herta","doi":"10.1016/j.jpi.2023.100305","DOIUrl":"10.1016/j.jpi.2023.100305","url":null,"abstract":"<div><p>Training models with semi- or self-supervised learning methods is one way to reduce annotation effort since they rely on unlabeled or sparsely labeled datasets. Such approaches are particularly promising for domains with a time-consuming annotation process requiring specialized expertise and where high-quality labeled machine learning datasets are scarce, like in computational pathology. Even though some of these methods have been used in the histopathological domain, there is, so far, no comprehensive study comparing different approaches. Therefore, this work compares feature extractors models trained with state-of-the-art semi- or self-supervised learning methods PAWS, SimCLR, and SimSiam within a unified framework. We show that such models, across different architectures and network configurations, have a positive performance impact on histopathological classification tasks, even in low data regimes. Moreover, our observations suggest that features learned from a particular dataset, i.e., tissue type, are only in-domain transferable to a certain extent. Finally, we share our experience using each method in computational pathology and provide recommendations for its use.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10070179/pdf/main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9259196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Digital pathology implementation in a private laboratory: The CEDAP experience 数字病理学在私人实验室的实施:CEDAP的经验
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2022.100180
Inês Ferreira , Carlos Sachica Montenegro , Daniel Coelho , Maria Pereira , Sara da Mata , Sofia Carvalho , Ana Catarina Araújo , Carlos Abrantes , José Mário Ruivo , Helena Garcia , Rui Caetano Oliveira

Introduction

The transition to digital pathology has been carried out by several laboratories across the globe, with some cases described in Portugal. In this article, we describe the transition to digital pathology in a high-volume private laboratory, considering the main challenges and opportunities.

Material and methods

Our process started in 2020, with laboratory workflow adaptation and we are currently using a high-capacity scanner (Aperio GT450DX) to digitize slides at 20×. The visualization system, Aperio eSlide Manager WebViewer, is integrated into the Laboratory System. The validation process followed the Royal College of Pathologists Guidelines.

Results

Regarding validation, the first phase detected an error rate of 6.8%, mostly due to digitization errors. Phase optimization and collaboration with technical services led to improvements in this process. In the second validation phase, most of the slides had the desired quality for evaluation, with only an error rate of 0.6%, corrected with a new scan. The interpathologist correlation had a total agreement rate of 96.87% and 3.13% partial agreement.

Conclusion

The implementation and validation of digital pathology was a success, being ready for prime time. The total integration of all laboratory systems and the acquisition of new equipment will maximize their use, especially with the application of artificial intelligence algorithms.

向数字化病理学的过渡已经在全球的几个实验室进行,在葡萄牙描述了一些病例。在这篇文章中,我们描述了过渡到数字病理学在一个大容量的私人实验室,考虑到主要的挑战和机遇。材料和方法sour过程始于2020年,适应实验室工作流程,我们目前正在使用高容量扫描仪(Aperio GT450DX)以20倍的速度对幻灯片进行数字化。可视化系统Aperio eSlide Manager WebViewer集成到实验室系统中。验证过程遵循皇家病理学家学院指南。结果在验证方面,第一阶段的检出错误率为6.8%,主要是数字化错误。阶段优化和与技术服务的协作导致了这一过程的改进。在第二个验证阶段,大多数载玻片具有所需的评估质量,只有0.6%的错误率,通过新的扫描进行纠正。病理间相关性总符合率为96.87%,部分符合率为3.13%。结论数字化病理的实施和验证是成功的,已准备好进入黄金时期。所有实验室系统的全面集成和新设备的采购将最大限度地利用它们,特别是人工智能算法的应用。
{"title":"Digital pathology implementation in a private laboratory: The CEDAP experience","authors":"Inês Ferreira ,&nbsp;Carlos Sachica Montenegro ,&nbsp;Daniel Coelho ,&nbsp;Maria Pereira ,&nbsp;Sara da Mata ,&nbsp;Sofia Carvalho ,&nbsp;Ana Catarina Araújo ,&nbsp;Carlos Abrantes ,&nbsp;José Mário Ruivo ,&nbsp;Helena Garcia ,&nbsp;Rui Caetano Oliveira","doi":"10.1016/j.jpi.2022.100180","DOIUrl":"10.1016/j.jpi.2022.100180","url":null,"abstract":"<div><h3>Introduction</h3><p>The transition to digital pathology has been carried out by several laboratories across the globe, with some cases described in Portugal. In this article, we describe the transition to digital pathology in a high-volume private laboratory, considering the main challenges and opportunities.</p></div><div><h3>Material and methods</h3><p>Our process started in 2020, with laboratory workflow adaptation and we are currently using a high-capacity scanner (Aperio GT450DX) to digitize slides at 20×. The visualization system, Aperio eSlide Manager WebViewer, is integrated into the Laboratory System. The validation process followed the Royal College of Pathologists Guidelines.</p></div><div><h3>Results</h3><p>Regarding validation, the first phase detected an error rate of 6.8%, mostly due to digitization errors. Phase optimization and collaboration with technical services led to improvements in this process. In the second validation phase, most of the slides had the desired quality for evaluation, with only an error rate of 0.6%, corrected with a new scan. The interpathologist correlation had a total agreement rate of 96.87% and 3.13% partial agreement.</p></div><div><h3>Conclusion</h3><p>The implementation and validation of digital pathology was a success, being ready for prime time. The total integration of all laboratory systems and the acquisition of new equipment will maximize their use, especially with the application of artificial intelligence algorithms.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/a8/1a/main.PMC9853351.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10583650","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Automated HL7v2 LRI informatics framework for streamlining genomics-EHR data integration 用于简化基因组学EHR数据集成的自动化HL7v2 LRI信息学框架。
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2023.100330
Robert H. Dolin , Rohan Gupta , Kimberly Newsom , Bret S.E. Heale , Shailesh Gothi , Petr Starostik , Srikar Chamala

While VCF formatted files are the lingua franca of next-generation sequencing, most EHRs do not provide native VCF support. As a result, labs often must send non-structured PDF reports to the EHR. On the other hand, while FHIR adoption is growing, most EHRs support HL7 interoperability standards, particularly those based on the HL7 Version 2 (HL7v2) standard. The HL7 Version 2 genomics component of the HL7 Laboratory Results Interface (HL7v2 LRI) standard specifies a formalism for the structured communication of genomic data from lab to EHR. We previously described an open-source tool (vcf2fhir) that converts VCF files into HL7 FHIR format. In this report, we describe how the utility has been extended to output HL7v2 LRI data that contains both variants and variant annotations (e.g., predicted phenotypes and therapeutic implications). Using this HL7v2 converter, we implemented an automated pipeline for moving structured genomic data from the clinical laboratory to EHR. We developed an open source hl7v2GenomicsExtractor that converts genomic interpretation report files into a series of HL7v2 observations conformant to HL7v2 LRI. We further enhanced the converter to produce output conformant to Epic's genomic import specification and to support alternative input formats. An automated pipeline for pushing standards-based structured genomic data directly into the EHR was successfully implemented, where genetic variant data and the clinical annotations are now both available to be viewed in the EHR through Epic's genomics module. Issues encountered in the development and deployment of the HL7v2 converter primarily revolved around data variability issues, primarily lack of a standardized representation of data elements within various genomic interpretation report files. The technical implementation of a HL7v2 message transformation to feed genomic variant and clinical annotation data into an EHR has been successful. In addition to genetic variant data, the implementation described here releases the valuable asset of clinically relevant genomic annotations provided by labs from static PDFs to calculable, structured data in EHR systems.

虽然VCF格式的文件是下一代测序的通用语言,但大多数EHR不提供本地VCF支持。因此,实验室通常必须向EHR发送非结构化的PDF报告。另一方面,随着FHIR的采用越来越多,大多数EHR支持HL7互操作性标准,特别是那些基于HL7版本2(HL7v2)标准的标准。HL7实验室结果接口(HL7v2 LRI)标准的HL7版本2基因组学组件规定了从实验室到EHR的基因组数据结构化通信的形式。我们之前描述了一个开源工具(vcf2fhir),它可以将VCF文件转换为HL7FHIR格式。在本报告中,我们描述了如何将该实用程序扩展到输出HL7v2 LRI数据,该数据包含变体和变体注释(例如,预测的表型和治疗意义)。使用这个HL7v2转换器,我们实现了一个将结构化基因组数据从临床实验室转移到EHR的自动化管道。我们开发了一种开源的hl7v2GenomicsExtractor,它可以将基因组解释报告文件转换为一系列符合HL7v2 LRI的HL7v2观察结果。我们进一步增强了转换器,以产生符合Epic基因组导入规范的输出,并支持替代输入格式。成功实现了将基于标准的结构化基因组数据直接推入EHR的自动化管道,现在可以通过Epic的基因组学模块在EHR中查看遗传变异数据和临床注释。HL7v2转换器的开发和部署中遇到的问题主要围绕数据可变性问题,主要是缺乏各种基因组解释报告文件中数据元素的标准化表示。将基因组变体和临床注释数据输入EHR的HL7v2消息转换的技术实现是成功的。除了遗传变异数据外,本文所述的实现还释放了实验室提供的临床相关基因组注释的宝贵资产,从静态PDF到EHR系统中的可计算结构化数据。
{"title":"Automated HL7v2 LRI informatics framework for streamlining genomics-EHR data integration","authors":"Robert H. Dolin ,&nbsp;Rohan Gupta ,&nbsp;Kimberly Newsom ,&nbsp;Bret S.E. Heale ,&nbsp;Shailesh Gothi ,&nbsp;Petr Starostik ,&nbsp;Srikar Chamala","doi":"10.1016/j.jpi.2023.100330","DOIUrl":"10.1016/j.jpi.2023.100330","url":null,"abstract":"<div><p>While VCF formatted files are the lingua franca of next-generation sequencing, most EHRs do not provide native VCF support. As a result, labs often must send non-structured PDF reports to the EHR. On the other hand, while FHIR adoption is growing, most EHRs support HL7 interoperability standards, particularly those based on the HL7 Version 2 (HL7v2) standard. The HL7 Version 2 genomics component of the HL7 Laboratory Results Interface (HL7v2 LRI) standard specifies a formalism for the structured communication of genomic data from lab to EHR. We previously described an open-source tool (vcf2fhir) that converts VCF files into HL7 FHIR format. In this report, we describe how the utility has been extended to output HL7v2 LRI data that contains both variants and variant annotations (e.g., predicted phenotypes and therapeutic implications). Using this HL7v2 converter, we implemented an automated pipeline for moving structured genomic data from the clinical laboratory to EHR. We developed an open source hl7v2GenomicsExtractor that converts genomic interpretation report files into a series of HL7v2 observations conformant to HL7v2 LRI. We further enhanced the converter to produce output conformant to Epic's genomic import specification and to support alternative input formats. An automated pipeline for pushing standards-based structured genomic data directly into the EHR was successfully implemented, where genetic variant data and the clinical annotations are now both available to be viewed in the EHR through Epic's genomics module. Issues encountered in the development and deployment of the HL7v2 converter primarily revolved around data variability issues, primarily lack of a standardized representation of data elements within various genomic interpretation report files. The technical implementation of a HL7v2 message transformation to feed genomic variant and clinical annotation data into an EHR has been successful. In addition to genetic variant data, the implementation described here releases the valuable asset of clinically relevant genomic annotations provided by labs from static PDFs to calculable, structured data in EHR systems.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/08/9e/main.PMC10504540.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10286053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Pathology Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1