Creating a body of physical activity evidence to test the generalisation of annotation methods for automated evidence synthesis

Oscar Castro, E. Norris, Alison J. Wright, Emily Hayes, Ella Howes, Candice Moore, Robert West, Susan Michie
{"title":"Creating a body of physical activity evidence to test the generalisation of annotation methods for automated evidence synthesis","authors":"Oscar Castro, E. Norris, Alison J. Wright, Emily Hayes, Ella Howes, Candice Moore, Robert West, Susan Michie","doi":"10.12688/wellcomeopenres.21664.1","DOIUrl":null,"url":null,"abstract":"Background The Human Behaviour-Change Project (HBCP) aims to improve evidence synthesis in behavioural science by compiling intervention reports, annotating them according to an ontology, and using the resulting data to train information extraction and prediction algorithms. The HBCP used smoking cessation as the first ‘proof of concept’ domain but intends to extend its methodology to other behaviours. The aims of this paper are to (i) assess the extent to which methods developed for annotating smoking cessation intervention reports were generalisable to a corpus of evidence relating to a different behaviour, namely physical activity, and (ii) describe the steps involved in developing this second HBCP corpus. Methods The development of the physical activity corpus took place in four stages: (i) reviewing the suitability of smoking cessation codes already used in the HBCP, (ii) defining the selection criteria and scope of the corpus, (iii) identifying and screening records for inclusion, and (iv) annotating intervention reports using a code set of 200+ entities from the Behaviour Change Intervention Ontology. Results Stage 1 highlighted the need to modify the smoking cessation behavioural outcome codes for application to physical activity. One hundred physical activity intervention reports were reviewed, and 11 physical activity experts were consulted to inform the adapted code set. Stage 2 involved narrowing down the scope of the corpus to interventions targeting moderate-to-vigorous physical activity. In stage 3, 111 physical activity intervention reports were identified, which were then annotated in stage 4. Conclusions Smoking cessation annotation methods developed as part of the HBCP were mostly transferable to the physical activity domain. However, the codes applied to behavioural outcome variables required adaptations. This paper can help anyone interested in building a body of research to develop automated evidence synthesis methods in physical activity or for other behaviours.","PeriodicalId":508490,"journal":{"name":"Wellcome Open Research","volume":"5 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Wellcome Open Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12688/wellcomeopenres.21664.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background The Human Behaviour-Change Project (HBCP) aims to improve evidence synthesis in behavioural science by compiling intervention reports, annotating them according to an ontology, and using the resulting data to train information extraction and prediction algorithms. The HBCP used smoking cessation as the first ‘proof of concept’ domain but intends to extend its methodology to other behaviours. The aims of this paper are to (i) assess the extent to which methods developed for annotating smoking cessation intervention reports were generalisable to a corpus of evidence relating to a different behaviour, namely physical activity, and (ii) describe the steps involved in developing this second HBCP corpus. Methods The development of the physical activity corpus took place in four stages: (i) reviewing the suitability of smoking cessation codes already used in the HBCP, (ii) defining the selection criteria and scope of the corpus, (iii) identifying and screening records for inclusion, and (iv) annotating intervention reports using a code set of 200+ entities from the Behaviour Change Intervention Ontology. Results Stage 1 highlighted the need to modify the smoking cessation behavioural outcome codes for application to physical activity. One hundred physical activity intervention reports were reviewed, and 11 physical activity experts were consulted to inform the adapted code set. Stage 2 involved narrowing down the scope of the corpus to interventions targeting moderate-to-vigorous physical activity. In stage 3, 111 physical activity intervention reports were identified, which were then annotated in stage 4. Conclusions Smoking cessation annotation methods developed as part of the HBCP were mostly transferable to the physical activity domain. However, the codes applied to behavioural outcome variables required adaptations. This paper can help anyone interested in building a body of research to develop automated evidence synthesis methods in physical activity or for other behaviours.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
创建体育锻炼证据库,测试自动证据综合注释方法的通用性
背景 人类行为改变项目(HBCP)旨在通过汇编干预报告、根据本体对其进行注释以及使用由此产生的数据来训练信息提取和预测算法,从而改进行为科学中的证据综合。HBCP 将戒烟作为第一个 "概念验证 "领域,但打算将其方法扩展到其他行为。本文的目的是:(i) 评估为戒烟干预报告注释而开发的方法在多大程度上可推广到与不同行为(即体育活动)相关的证据语料库;(ii) 描述开发第二个 HBCP 语料库的步骤。方法 体力活动语料库的开发分为四个阶段:(i) 审查 HBCP 中已使用的戒烟代码的适用性,(ii) 确定语料库的选择标准和范围,(iii) 识别和筛选纳入的记录,(iv) 使用行为改变干预本体中 200 多个实体的代码集对干预报告进行注释。结果 第 1 阶段强调了修改戒烟行为结果代码以应用于体育锻炼的必要性。对 100 份体育锻炼干预报告进行了审核,并咨询了 11 位体育锻炼专家,为改编后的代码集提供信息。第 2 阶段是将语料库的范围缩小到以中强度体育锻炼为目标的干预措施。第 3 阶段确定了 111 份体育锻炼干预报告,然后在第 4 阶段对其进行注释。结论 作为 HBCP 的一部分而开发的戒烟注释方法大多可用于体育活动领域。不过,应用于行为结果变量的代码需要进行调整。本文可以帮助任何有兴趣建立研究机构的人开发体育锻炼或其他行为的自动证据综合方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
The genome sequence of the bloodfluke planorb, Biomphalaria glabrata (Say, 1818) The genome sequence of the blonde ray, Raja brachyura Lafont, 1871 The genome sequence of the Northern Bottlenose Whale, Hyperoodon ampullatus (Forster, 1770) The genome sequence of the Maiden’s Blush moth, Cyclophora punctaria (Linnaeus, 1758) The genome sequence of a jewel beetle, Agrilus biguttatus (Fabricius, 1776)
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1