基于图形的主动半监督学习:水质监测案例研究

IF 8 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Advanced Engineering Informatics Pub Date : 2024-10-01 DOI:10.1016/j.aei.2024.102902
Zesen Wang, Yonggang Li, Chunhua Yang, Hongqiu Zhu, Can Zhou
{"title":"基于图形的主动半监督学习:水质监测案例研究","authors":"Zesen Wang,&nbsp;Yonggang Li,&nbsp;Chunhua Yang,&nbsp;Hongqiu Zhu,&nbsp;Can Zhou","doi":"10.1016/j.aei.2024.102902","DOIUrl":null,"url":null,"abstract":"<div><div>Process monitoring is a key technology in the field of industrial production and manufacturing, where machine learning algorithms play a crucial role. However, the cost of data collection in industrial settings is very high, which seriously limits the performance improvement of monitoring models. To address this issue, a graph-based active semi-supervised learning (GASSL) strategy is proposed, which can derive reliable monitoring models with limited labeling costs. Specifically, first, a robust unsupervised active learning (RUAL) method is proposed, which incorporates data reconstruction, low-rank representation, and manifold learning into a unified framework to select the most representative samples for labeling, avoiding the poor performance of model-based active learning algorithms under the condition of limited initial sample size. Second, to maximize the use of the remaining unlabeled samples after labeling, pseudo-labels are assigned to the unlabeled samples through label propagation, thereby further expanding the sample set. At the same time, active learning selects the most valuable samples as the labeled node set of the graph model, strengthening the performance of label propagation. Experimental results on three datasets related to water quality monitoring, including public dataset, simulation dataset, and real total nitrogen detection dataset, extensively demonstrate the effectiveness of the proposed method.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"62 ","pages":"Article 102902"},"PeriodicalIF":8.0000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Graph-based active semi-supervised learning: Case study in water quality monitoring\",\"authors\":\"Zesen Wang,&nbsp;Yonggang Li,&nbsp;Chunhua Yang,&nbsp;Hongqiu Zhu,&nbsp;Can Zhou\",\"doi\":\"10.1016/j.aei.2024.102902\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Process monitoring is a key technology in the field of industrial production and manufacturing, where machine learning algorithms play a crucial role. However, the cost of data collection in industrial settings is very high, which seriously limits the performance improvement of monitoring models. To address this issue, a graph-based active semi-supervised learning (GASSL) strategy is proposed, which can derive reliable monitoring models with limited labeling costs. Specifically, first, a robust unsupervised active learning (RUAL) method is proposed, which incorporates data reconstruction, low-rank representation, and manifold learning into a unified framework to select the most representative samples for labeling, avoiding the poor performance of model-based active learning algorithms under the condition of limited initial sample size. Second, to maximize the use of the remaining unlabeled samples after labeling, pseudo-labels are assigned to the unlabeled samples through label propagation, thereby further expanding the sample set. At the same time, active learning selects the most valuable samples as the labeled node set of the graph model, strengthening the performance of label propagation. Experimental results on three datasets related to water quality monitoring, including public dataset, simulation dataset, and real total nitrogen detection dataset, extensively demonstrate the effectiveness of the proposed method.</div></div>\",\"PeriodicalId\":50941,\"journal\":{\"name\":\"Advanced Engineering Informatics\",\"volume\":\"62 \",\"pages\":\"Article 102902\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2024-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Engineering Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1474034624005536\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034624005536","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

过程监控是工业生产和制造领域的一项关键技术,其中机器学习算法发挥着至关重要的作用。然而,工业环境中的数据收集成本非常高,严重限制了监控模型性能的提高。为解决这一问题,我们提出了一种基于图的主动半监督学习(GASSL)策略,它能以有限的标注成本推导出可靠的监控模型。具体来说,首先,提出了一种鲁棒无监督主动学习(RUAL)方法,该方法将数据重构、低秩表示和流形学习整合到一个统一的框架中,选择最具代表性的样本进行标注,避免了基于模型的主动学习算法在初始样本量有限的条件下性能不佳的问题。其次,为了最大限度地利用标注后剩余的未标注样本,通过标签传播为未标注样本分配伪标签,从而进一步扩大样本集。同时,主动学习会选择最有价值的样本作为图模型的标签节点集,从而加强标签传播的性能。在与水质监测相关的三个数据集(包括公共数据集、模拟数据集和实际总氮检测数据集)上的实验结果广泛证明了所提方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Graph-based active semi-supervised learning: Case study in water quality monitoring
Process monitoring is a key technology in the field of industrial production and manufacturing, where machine learning algorithms play a crucial role. However, the cost of data collection in industrial settings is very high, which seriously limits the performance improvement of monitoring models. To address this issue, a graph-based active semi-supervised learning (GASSL) strategy is proposed, which can derive reliable monitoring models with limited labeling costs. Specifically, first, a robust unsupervised active learning (RUAL) method is proposed, which incorporates data reconstruction, low-rank representation, and manifold learning into a unified framework to select the most representative samples for labeling, avoiding the poor performance of model-based active learning algorithms under the condition of limited initial sample size. Second, to maximize the use of the remaining unlabeled samples after labeling, pseudo-labels are assigned to the unlabeled samples through label propagation, thereby further expanding the sample set. At the same time, active learning selects the most valuable samples as the labeled node set of the graph model, strengthening the performance of label propagation. Experimental results on three datasets related to water quality monitoring, including public dataset, simulation dataset, and real total nitrogen detection dataset, extensively demonstrate the effectiveness of the proposed method.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Advanced Engineering Informatics
Advanced Engineering Informatics 工程技术-工程:综合
CiteScore
12.40
自引率
18.20%
发文量
292
审稿时长
45 days
期刊介绍: Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.
期刊最新文献
Multi-graph attention temporal convolutional network-based radius prediction in three-roller bending of thin-walled parts A stacked graph neural network with self-exciting process for robotic cognitive strategy reasoning in proactive human-robot collaborative assembly Evaluation of Augmented Reality instructions based on initial and dynamic assembly tolerance allocation schemes in precise manual assembly UAV applications in intelligent traffic: RGBT image feature registration and complementary perception CMPSO: A novel co-evolutionary multigroup particle swarm optimization for multi-mission UAVs path planning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1