MetaFL: Metamorphic fault localisation using weakly supervised deep learning

IF 1.5 4区计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING IET Software Pub Date : 2023-02-01 DOI:10.1049/sfw2.12102

Lingfeng Fu, Yan Lei, Meng Yan, Ling Xu, Zhou Xu, Xiaohong Zhang

{"title":"MetaFL: Metamorphic fault localisation using weakly supervised deep learning","authors":"Lingfeng Fu, Yan Lei, Meng Yan, Ling Xu, Zhou Xu, Xiaohong Zhang","doi":"10.1049/sfw2.12102","DOIUrl":null,"url":null,"abstract":"<p>Deep-Learning-based Fault Localisation (DLFL) leverages deep neural networks to learn the relationship between statement behaviour and program failures, showing promising results. However, since DLFL uses program failures as labels to conduct supervised learning, a labelled dataset is a requisite of applying DLFL. A failure is detected by comparing program output with a test oracle which is the standard answer for the given input. The problem is, test oracles are often difficult, or even impossible to acquire in real life, and that has severely restricted the application of DLFL since we have only unlabelled datasets in most cases. Thus, MetaFL: Metamorphic Fault Localisation Using Weakly Supervised Deep Learning is proposed, to provide a weakly supervised learning solution for DLFL. Instead of using test oracles, MetaFL uses metamorphic relations to prescribe expected behaviour of a program, and defines labels of metamorphic testing groups by verifying integrity in each group of test cases. Hence, a coarse-grained labelled dataset can be built from the originally unlabelled one, with which DLFL can work now, utilising a weakly supervised learning paradigm. The experiments show that MetaFL yields a performance comparable to plain DLFL under ideal condition (i.e. the labels of datasets are available). MetaFL successfully extends the methodology of DLFL from supervised learning to weakly supervised learning, and a fully labelled dataset is no longer mandatory for applying DLFL.</p>","PeriodicalId":50378,"journal":{"name":"IET Software","volume":"17 2","pages":"137-153"},"PeriodicalIF":1.5000,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/sfw2.12102","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Software","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/sfw2.12102","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Deep-Learning-based Fault Localisation (DLFL) leverages deep neural networks to learn the relationship between statement behaviour and program failures, showing promising results. However, since DLFL uses program failures as labels to conduct supervised learning, a labelled dataset is a requisite of applying DLFL. A failure is detected by comparing program output with a test oracle which is the standard answer for the given input. The problem is, test oracles are often difficult, or even impossible to acquire in real life, and that has severely restricted the application of DLFL since we have only unlabelled datasets in most cases. Thus, MetaFL: Metamorphic Fault Localisation Using Weakly Supervised Deep Learning is proposed, to provide a weakly supervised learning solution for DLFL. Instead of using test oracles, MetaFL uses metamorphic relations to prescribe expected behaviour of a program, and defines labels of metamorphic testing groups by verifying integrity in each group of test cases. Hence, a coarse-grained labelled dataset can be built from the originally unlabelled one, with which DLFL can work now, utilising a weakly supervised learning paradigm. The experiments show that MetaFL yields a performance comparable to plain DLFL under ideal condition (i.e. the labels of datasets are available). MetaFL successfully extends the methodology of DLFL from supervised learning to weakly supervised learning, and a fully labelled dataset is no longer mandatory for applying DLFL.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

MetaFL：使用弱监督深度学习的变质断层定位

基于深度学习的故障定位（DLFL）利用深度神经网络来学习语句行为和程序故障之间的关系，显示出有希望的结果。然而，由于DLFL使用程序故障作为标签来进行监督学习，因此标记的数据集是应用DLFL的必要条件。通过将程序输出与测试预言机进行比较来检测故障，测试预言机是给定输入的标准答案。问题是，测试预言往往很难，甚至不可能在现实生活中获得，这严重限制了DLFL的应用，因为在大多数情况下，我们只有未标记的数据集。因此，提出了MetaFL:使用弱监督深度学习的变形故障定位，为DLFL提供了一种弱监督学习解决方案。MetaFL不使用测试预言，而是使用变形关系来规定程序的预期行为，并通过验证每组测试用例的完整性来定义变形测试组的标签。因此，可以利用弱监督学习范式，从最初未标记的数据集构建粗粒度标记数据集，DLFL现在可以使用该数据集工作。实验表明，在理想条件下（即数据集的标签可用），MetaFL的性能与普通DLFL相当。MetaFL成功地将DLFL的方法从监督学习扩展到了弱监督学习，并且完全标记的数据集不再是应用DLFL的强制性数据集。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IET Software 工程技术-计算机：软件工程

CiteScore

4.20

自引率

0.00%

发文量

审稿时长

9 months

期刊介绍： IET Software publishes papers on all aspects of the software lifecycle, including design, development, implementation and maintenance. The focus of the journal is on the methods used to develop and maintain software, and their practical application. Authors are especially encouraged to submit papers on the following topics, although papers on all aspects of software engineering are welcome: Software and systems requirements engineering Formal methods, design methods, practice and experience Software architecture, aspect and object orientation, reuse and re-engineering Testing, verification and validation techniques Software dependability and measurement Human systems engineering and human-computer interaction Knowledge engineering; expert and knowledge-based systems, intelligent agents Information systems engineering Application of software engineering in industry and commerce Software engineering technology transfer Management of software development Theoretical aspects of software development Machine learning Big data and big code Cloud computing Current Special Issue. Call for papers: Knowledge Discovery for Software Development - https://digital-library.theiet.org/files/IET_SEN_CFP_KDSD.pdf Big Data Analytics for Sustainable Software Development - https://digital-library.theiet.org/files/IET_SEN_CFP_BDASSD.pdf