基于Transformer的多尺度特征流对齐融合在活性污泥显微图像分割中的应用

IF 2 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Signal Image and Video Processing Pub Date : 2023-11-02 DOI:10.1007/s11760-023-02836-0
Lijie Zhao, Yingying Zhang, Guogang Wang, Mingzhong Huang, Qichun Zhang, Hamid Reza Karimi
{"title":"基于Transformer的多尺度特征流对齐融合在活性污泥显微图像分割中的应用","authors":"Lijie Zhao, Yingying Zhang, Guogang Wang, Mingzhong Huang, Qichun Zhang, Hamid Reza Karimi","doi":"10.1007/s11760-023-02836-0","DOIUrl":null,"url":null,"abstract":"Abstract Accurate microscopic images segmentation of activated sludge is essential for monitoring wastewater treatment processes. However, it is a challenging task due to poor contrast, artifacts, morphological similarities, and distribution imbalance. A novel image segmentation model (FafFormer) was developed in the work based on Transformer that incorporated pyramid pooling and flow alignment fusion. Pyramid Pooling Module was used to extract multi-scale features of flocs and filamentous bacteria with different morphology in the encoder. Multi-scale features were fused by flow alignment fusion module in the decoder. The module used generated semantic flow as auxiliary information to restore boundary details and facilitate fine-grained upsampling. The Focal–Lovász Loss was designed to handle class imbalance for filamentous bacteria and flocs. Image-segmentation experiments were conducted on an activated sludge dataset from a municipal wastewater treatment plant. FafFormer showed relative superiority in accuracy and reliability, especially for filamentous bacteria compared to existing models.","PeriodicalId":54393,"journal":{"name":"Signal Image and Video Processing","volume":"107 2","pages":"0"},"PeriodicalIF":2.0000,"publicationDate":"2023-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-scale feature flow alignment fusion with Transformer for the microscopic images segmentation of activated sludge\",\"authors\":\"Lijie Zhao, Yingying Zhang, Guogang Wang, Mingzhong Huang, Qichun Zhang, Hamid Reza Karimi\",\"doi\":\"10.1007/s11760-023-02836-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract Accurate microscopic images segmentation of activated sludge is essential for monitoring wastewater treatment processes. However, it is a challenging task due to poor contrast, artifacts, morphological similarities, and distribution imbalance. A novel image segmentation model (FafFormer) was developed in the work based on Transformer that incorporated pyramid pooling and flow alignment fusion. Pyramid Pooling Module was used to extract multi-scale features of flocs and filamentous bacteria with different morphology in the encoder. Multi-scale features were fused by flow alignment fusion module in the decoder. The module used generated semantic flow as auxiliary information to restore boundary details and facilitate fine-grained upsampling. The Focal–Lovász Loss was designed to handle class imbalance for filamentous bacteria and flocs. Image-segmentation experiments were conducted on an activated sludge dataset from a municipal wastewater treatment plant. FafFormer showed relative superiority in accuracy and reliability, especially for filamentous bacteria compared to existing models.\",\"PeriodicalId\":54393,\"journal\":{\"name\":\"Signal Image and Video Processing\",\"volume\":\"107 2\",\"pages\":\"0\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2023-11-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Signal Image and Video Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/s11760-023-02836-0\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal Image and Video Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s11760-023-02836-0","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

摘要对活性污泥进行精确的显微图像分割是监测废水处理过程的关键。然而,由于对比度差、伪影、形态相似性和分布不平衡,这是一项具有挑战性的任务。在Transformer的基础上,提出了一种结合金字塔池和流向融合的图像分割模型FafFormer。利用金字塔池模块提取编码器中不同形态的絮凝体和丝状细菌的多尺度特征。利用解码器中的流向融合模块融合多尺度特征。该模块使用生成的语义流作为辅助信息,恢复边界细节,实现细粒度上采样。Focal-Lovász Loss设计用于处理丝状细菌和絮凝体的类不平衡。在某城市污水处理厂的活性污泥数据集上进行了图像分割实验。与现有模型相比,FafFormer在准确性和可靠性方面具有相对优势,特别是对丝状细菌。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Multi-scale feature flow alignment fusion with Transformer for the microscopic images segmentation of activated sludge
Abstract Accurate microscopic images segmentation of activated sludge is essential for monitoring wastewater treatment processes. However, it is a challenging task due to poor contrast, artifacts, morphological similarities, and distribution imbalance. A novel image segmentation model (FafFormer) was developed in the work based on Transformer that incorporated pyramid pooling and flow alignment fusion. Pyramid Pooling Module was used to extract multi-scale features of flocs and filamentous bacteria with different morphology in the encoder. Multi-scale features were fused by flow alignment fusion module in the decoder. The module used generated semantic flow as auxiliary information to restore boundary details and facilitate fine-grained upsampling. The Focal–Lovász Loss was designed to handle class imbalance for filamentous bacteria and flocs. Image-segmentation experiments were conducted on an activated sludge dataset from a municipal wastewater treatment plant. FafFormer showed relative superiority in accuracy and reliability, especially for filamentous bacteria compared to existing models.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Signal Image and Video Processing
Signal Image and Video Processing ENGINEERING, ELECTRICAL & ELECTRONIC-IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY
CiteScore
3.80
自引率
8.70%
发文量
328
审稿时长
6 months
期刊介绍: The journal is an interdisciplinary journal presenting the theory and practice of signal, image and video processing. It aims at: - Disseminating high level research results and engineering developments to all signal, image or video processing researchers and research groups. - Presenting practical solutions for the current signal, image and video processing problems in Engineering and Science. Subject areas covered by the journal include but are not limited to: Adaptive processing – biomedical signal processing – multimedia signal processing – communication signal processing – non-linear signal processing – array processing – statistics and statistical signal processing – modeling – filtering – data science – graph signal processing – multi-resolution signal analysis and wavelets – segmentation – coding – restoration – enhancement – storage and retrieval – colour and multi-spectral processing – scanning – displaying – printing – interpolation – image processing - video processing-motion detection and estimation – stereoscopic processing – image and video coding.
期刊最新文献
A multi-attention Uformer for low-dose CT image denoising Efficient luma modification-based chroma down-sampling and novel luma down-sampling with adaptive interpolation A principal component fusion-based thresholded bin-stretching for CT image enhancement HFRAS: design of a high-density feature representation model for effective augmentation of satellite images LocMix: local saliency-based data augmentation for image classification
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1