Adaptable Deep Learning and Probabilistic Graphical Model System for Semantic Segmentation

Adv. Artif. Intell. Mach. Learn. Pub Date : 1900-01-01 DOI:10.54364/aaiml.2022.1119

Matthew Avaylon, R. Sadre, Zhen Bai, T. Perciano

{"title":"Adaptable Deep Learning and Probabilistic Graphical Model System for Semantic Segmentation","authors":"Matthew Avaylon, R. Sadre, Zhen Bai, T. Perciano","doi":"10.54364/aaiml.2022.1119","DOIUrl":null,"url":null,"abstract":"Semantic segmentation algorithms based on deep learning architectures have been applied to a diverse set of problems. Consequently, new methodologies have emerged to push the state-of-the-art in this field forward, and the need for powerful user-friendly software increased significantly. The combination of Conditional Random Fields (CRFs) and Convolutional Neural Networks (CNNs) boosted the results of pixel-level classification predictions. Recent work using a fully integrated CRF-RNN layer have shown strong advantages in segmentation benchmarks over the base models. Despite this success, the rigidity of these frameworks prevents mass adaptability for complex scientific datasets and presents challenges in optimally scaling these models. In this work, we introduce a new encoder-decoder system that overcomes both these issues. We adapt multiple CNNs as encoders, allowing for the definition of multiple function parameter arguments to structure the models according to the targeted datasets and scientific problem. We leverage the flexibility of the U-Net architecture to act as a scalable decoder. The CRF-RNN layer is integrated into the decoder as an optional final layer, keeping the entire system fully compatible with back-propagation. To evaluate the performance of our implementation, we performed experiments on the Oxford-IIIT Pet Dataset and to experimental scientific data acquired via micro-computed tomography (microCT), revealing the adaptability of this framework and the performance benefits from a fully end-to-end CNN-CRF system on a both experimental and benchmark datasets.","PeriodicalId":373878,"journal":{"name":"Adv. Artif. Intell. Mach. Learn.","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Adv. Artif. Intell. Mach. Learn.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54364/aaiml.2022.1119","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Semantic segmentation algorithms based on deep learning architectures have been applied to a diverse set of problems. Consequently, new methodologies have emerged to push the state-of-the-art in this field forward, and the need for powerful user-friendly software increased significantly. The combination of Conditional Random Fields (CRFs) and Convolutional Neural Networks (CNNs) boosted the results of pixel-level classification predictions. Recent work using a fully integrated CRF-RNN layer have shown strong advantages in segmentation benchmarks over the base models. Despite this success, the rigidity of these frameworks prevents mass adaptability for complex scientific datasets and presents challenges in optimally scaling these models. In this work, we introduce a new encoder-decoder system that overcomes both these issues. We adapt multiple CNNs as encoders, allowing for the definition of multiple function parameter arguments to structure the models according to the targeted datasets and scientific problem. We leverage the flexibility of the U-Net architecture to act as a scalable decoder. The CRF-RNN layer is integrated into the decoder as an optional final layer, keeping the entire system fully compatible with back-propagation. To evaluate the performance of our implementation, we performed experiments on the Oxford-IIIT Pet Dataset and to experimental scientific data acquired via micro-computed tomography (microCT), revealing the adaptability of this framework and the performance benefits from a fully end-to-end CNN-CRF system on a both experimental and benchmark datasets.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

语义分割的自适应深度学习和概率图模型系统

基于深度学习架构的语义分割算法已经被应用于各种各样的问题。因此，出现了新的方法来推动这一领域的最新技术，并且对功能强大的用户友好软件的需求显著增加。条件随机场(CRFs)和卷积神经网络(cnn)的结合提高了像素级分类预测的结果。最近使用完全集成的CRF-RNN层的工作在分割基准测试中显示出比基本模型更强的优势。尽管取得了成功，但这些框架的刚性阻碍了对复杂科学数据集的大规模适应性，并在优化这些模型方面提出了挑战。在这项工作中，我们介绍了一种新的编码器-解码器系统，克服了这两个问题。我们采用多个cnn作为编码器，允许定义多个函数参数参数来根据目标数据集和科学问题构建模型。我们利用U-Net架构的灵活性作为可扩展的解码器。CRF-RNN层作为可选的最后一层集成到解码器中，使整个系统与反向传播完全兼容。为了评估我们实现的性能，我们在Oxford-IIIT Pet数据集和通过微计算机断层扫描(microCT)获得的实验科学数据上进行了实验，揭示了该框架的适应性以及完全端到端CNN-CRF系统在实验和基准数据集上的性能优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Adv. Artif. Intell. Mach. Learn.

自引率

0.00%

发文量