Toward Improved Artificial Intelligence in Requirements Engineering: Metadata for Tracing Datasets
Authors: J. Hayes, Jared Payne, Mallory Leppelmeier
Venue: 2019 IEEE 27th International Requirements Engineering Conference Workshops (REW)
Published: 2019-09-01
DOI: 10.1109/REW.2019.00052 (https://doi.org/10.1109/REW.2019.00052)
Citations: 7
Abstract
Data is the driver of artificial intelligence in requirements engineering. While some applications lend themselves to training sets that are easily accessible (such as sentiment detection, feature request classification, and requirements prioritization), other tasks face data challenges. Tracing and domain model building are examples of applications where data is hard to find, is not in the proper format, or lacks the metadata needed to support deep learning, machine learning, or other artificial intelligence techniques. This paper surveys datasets available from sources such as the Center of Excellence for Software and Systems Traceability and provides metadata that researchers or practitioners can use when deciding which datasets to use, which aspects of datasets to use, which features to use in deep learning, and more.