Making guidelines computable
Brian S. Alper
Clinical and Public Health Guidelines, published 2024-04-04
DOI: 10.1002/gin2.12014 (open-access PDF: https://onlinelibrary.wiley.com/doi/epdf/10.1002/gin2.12014)
Citations: 0
Abstract
Guideline development is easy and efficient. With instant access to all the contributing information—all the relevant evidence, critical appraisal of the evidence by the community, values and preferences of public representatives, judgements by multidisciplinary experts, and re-usable data where others have developed recommendations for similar decisions—….
Wait. It's 2024, not 2042. Let's try that again.
Guideline development is difficult and resource-intensive. Even when the decision-making process works well, a great deal of work is involved in gathering the evidence, assessing the certainty of the evidence, determining the relative importance of the outcomes and considering contextual factors. It is sometimes easier to adapt from others who have already done this work, but their work often does not fit what we need, so we essentially recreate it using our own development methods anyway.
Some aspects of guideline development are necessarily difficult and should not be oversimplified, but there are many opportunities to reduce the work involved. For example, automating tasks that do not require human cognition, such as identifying direct links to supporting information, can greatly improve work efficiency. To realize this potential, the guideline development content will need to be available in a form the computer can process.
Computers could make guideline development more efficient. They already do, to some degree. We copy and paste instead of retyping when we can. We use autocomplete features to enter data when the machine can guess what we want to express, or dropdown lists when the choices are preset for us. We have come to expect massive increases in efficiency at times, such as rapid responses for targeted searching in large databases. Compare that to literature searching before the Internet.
But the essence of our work—understanding the evidence and judgements sufficiently to select information and use it for informing our decisions—is not grasped by the computer. We may try to apply artificial intelligence (AI) to the challenge and occasionally show a tool helps a step in the process (e.g., highlighting population, intervention, and outcome terms in the text),1 but we have yet to create an AI that understands evidence and judgements.
Imagine if we could make the evidence and judgements computable (i.e., machine-interpretable) so that the computer could create derivative concepts through calculations and logical operations. Searches would be even more efficient. Compare the precision searching for a nearby restaurant when you are travelling to finding evidence for a specific clinical outcome. You can find not only the restaurant's name but also its location, hours of operation and a link to its menu. However, if you find an article that mentions the clinical outcome in the abstract, you still need to obtain the full text, read it to extract the data and make many judgements to determine the certainty of the reported finding. The restaurant data are machine-interpretable, but the outcome data are not.
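The contrast can be made concrete: machine-interpretable evidence is structured data that a program can compute over to derive new concepts. A minimal sketch, using an illustrative record layout (the field names are hypothetical, not the actual FHIR schema):

```python
# A minimal sketch of why machine-interpretable evidence enables computation.
# The field names below are illustrative, not an actual FHIR Evidence schema.

finding = {
    "outcome": "all-cause mortality",
    "statistic_type": "risk ratio",
    "value": 0.82,
    "ci_low": 0.70,
    "ci_high": 0.96,
    "sample_size": 4230,
}

def ci_excludes_null(record, null_value=1.0):
    """Derive a new concept ('the confidence interval excludes no effect')
    directly from structured data, something a computer cannot do when the
    finding exists only as prose in a PDF."""
    return not (record["ci_low"] <= null_value <= record["ci_high"])

print(ci_excludes_null(finding))  # True: 0.70-0.96 excludes 1.0
```

A search engine holding such records could answer "find findings for this outcome where the interval excludes no effect" as precisely as a map service answers "restaurants open now".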
Efficiency would be further increased if the knowledge (evidence and judgements) were interoperable, so any computer system could use the output (reuse the work) of any other computer system. Today, a systematic reviewer or guideline developer who uses reference management software for citation management, PICO Portal for screening articles, RobotReviewer for assistance with risk of bias assessment, the Systematic Review Data Repository (SRDR+) for reporting data extraction, Cochrane RevMan for meta-analyses and GRADEpro or MAGICapp for reporting summaries of findings must re-enter the data in each of these systems. We enjoy extreme efficiency in navigation support because society has evolved ubiquitous computable forms of data exchange (see Figure 1), but we have yet to achieve this state for evidence and guidelines (see Figure 2).
The Guidelines International Network Technology Working Group (GINTech) set a goal in 2017 to achieve interoperable methods for sharing evidence and guidelines across the ‘Evidence Ecosystem’2 but had no framework for how to proceed with such a substantial undertaking. In 2018, the present author recognized how a technical standard for health data exchange, Fast Healthcare Interoperability Resources (FHIR), is overcoming the long-intractable problem of interoperability for electronic health records.3 At its core, FHIR solves the technical problem by conveying data in small digital packages called Resources, and solves the social agreement challenge by establishing global consensus through Health Level Seven International (HL7), a standards developing organization.
With this insight, we approached HL7 to extend FHIR to define a standard for data exchange for research results (evidence) and judgements related to certainty of the evidence and making recommendations (evidence-based guidance). The FHIR Resources for Evidence-Based Medicine (EBM) Knowledge Assets project (EBMonFHIR) was approved on May 16, 2018, as an HL7 project.4
There are now technical standards (standard for data exchange) for how to represent evidence and guidelines in machine-interpretable, interoperable form.5 We defined the structure for an Evidence Resource that precisely represents the variables, the study design, the statistical values, the analytic model and the certainty judgements for a single research finding.6 We defined an ArtifactAssessment Resource to represent any comment, rating or classification of a bit of knowledge (also called a knowledge artefact or digital knowledge object).7
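As a rough sketch of what the Evidence Resource makes machine-interpretable, consider one research finding expressed as structured data. The fragment below is simplified and condensed from memory of the published structure, not a validated FHIR instance; consult the specification for the authoritative element definitions:

```python
# Sketch of a single research finding in the spirit of the FHIR Evidence
# Resource. Simplified and illustrative, not a validated instance.

evidence = {
    "resourceType": "Evidence",
    "status": "active",
    "description": "Risk ratio for all-cause mortality, drug X vs placebo",
    "variableDefinition": [
        {"variableRole": {"text": "population"}, "description": "Adults with condition Y"},
        {"variableRole": {"text": "exposure"}, "description": "Drug X"},
        {"variableRole": {"text": "measuredVariable"}, "description": "All-cause mortality"},
    ],
    "statistic": [{
        "statisticType": {"text": "Risk Ratio"},
        "quantity": {"value": 0.82},
        "sampleSize": {"numberOfStudies": 6, "numberOfParticipants": 4230},
    }],
    "certainty": [{"type": {"text": "Overall certainty"}, "rating": {"text": "moderate"}}],
}

def summarize(res):
    """Pull the headline numbers out of the structured record."""
    stat = res["statistic"][0]
    return (stat["statisticType"]["text"],
            stat["quantity"]["value"],
            stat["sampleSize"]["numberOfParticipants"])

print(summarize(evidence))  # ('Risk Ratio', 0.82, 4230)
```

Because every variable, statistic and certainty judgement has its own addressable slot, downstream tools can extract, compare and recompute without a human re-reading the full text.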
Profiles enable context-specific modifications to a Resource, and we defined a RecommendationJustification Profile to use the ArtifactAssessment Resource to represent all the concepts reported in the Evidence-to-Decision framework.8 We are currently working on an Evidence-Based Medicine Implementation Guide, which describes 73 profiles of 12 Resources for computable evidence and guidance.9
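To illustrate the ArtifactAssessment idea, the sketch below shows Evidence-to-Decision judgements about a recommendation as structured content items. The element names are simplified from the ArtifactAssessment structure and the example values are invented; it is a sketch of the pattern, not a conformant RecommendationJustification instance:

```python
# Illustrative sketch of ArtifactAssessment-style content: comments and
# ratings attached to a knowledge artefact (here, a recommendation).
# Simplified element names; not a validated FHIR instance.

assessment = {
    "resourceType": "ArtifactAssessment",
    "artifactDisplay": "Recommendation 1: offer drug X to adults with condition Y",
    "content": [
        {"informationType": "rating",
         "type": {"text": "strength of recommendation"},
         "classifier": [{"text": "conditional"}]},
        {"informationType": "rating",
         "type": {"text": "certainty of evidence"},
         "classifier": [{"text": "moderate"}]},
        {"informationType": "comment",
         "type": {"text": "values and preferences"},
         "summary": "Possibly important variability in how patients weigh harms."},
    ],
}

def ratings(res):
    """Collect the machine-readable judgements, keyed by judgement type."""
    return {c["type"]["text"]: c["classifier"][0]["text"]
            for c in res["content"] if c["informationType"] == "rating"}

print(ratings(assessment))
```

The same content structure can carry any comment, rating or classification, which is why one Resource with context-specific Profiles covers peer review, risk-of-bias ratings and Evidence-to-Decision judgements alike.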
The standard for the form of data exchange (syntactic standard) is only part of the overall solution. We also need a standard for the terminology used (semantic standard), and there is no fit-for-purpose standard vocabulary for describing evidence and guidance. We are currently 69% of the way through a multiyear, multidisciplinary effort to define about 600 terms for study design, risk of bias and statistics, manifested as the Scientific Evidence Code System (SEVCO).10, 11 We also recently started similar efforts with the GRADE Working Group to define terms for certainty of evidence, strength of recommendation and evidence-to-decision framework judgements for the GRADE Ontology.12
Standards for data exchange (syntactic and semantic) are not enough. System developers need to develop or adapt computer systems to use the standards. Software tools used by researchers, methodologists, clinicians and decision-makers need to work for the user without the user having to learn FHIR or any of the underlying technical specifications.
Making guidelines computable is a compelling way to enhance the role of guidelines in the overall ecosystem (see Figure 3), but there are limitations. Many stakeholders need to agree on the precise expectations for knowledge transfer at many different points of data exchange. Neither simple voting nor regulatory mandates can establish the agreements needed to achieve the necessary functionality. Great care must be taken to avoid the illusion of accuracy or correctness that can arise from artificial precision when ambiguous language is transformed into exacting machine code. The effort to make it easy will not be easy.
Making guidelines computable is an ‘Evidence Ecosystem’-level community effort. The effort has grown since its inception in 2018, boosted in a large way in 2020 with the formation of a COVID-19 Knowledge Accelerator.13 The effort is now called Health Evidence Knowledge Accelerator (HEvKA) and has 15 working group meetings per week (see Table 1).14 There are working groups of interest to researchers, methodologists and software developers. There is no cost or contractual obligation to participate. The standards developed are open and freely available.
We have also developed a platform to support data exchange using the FHIR standard for evidence and guidance knowledge: the Fast Evidence Interoperability Resources (FEvIR) Platform.15 The FEvIR Platform is available for use now but is ‘prerelease’ and not yet scaled to handle millions of records (MEDLINE alone has about 40 million records). Viewing resources on the FEvIR Platform is open without logging in, and there are 26 Viewer Tools supporting human-friendly views of FHIR Resources. Signing in is free and is required to create content (which can then be edited only by the person who created it). There are 23 Builder Tools enabling the creation of a FHIR Resource without any working knowledge of FHIR, including a Recommendation Authoring Tool and a Guideline Authoring Tool.16, 17 There are 16 specialized tools, including Converter Tools that convert data from MEDLINE, ClinicalTrials.gov, MAGICapp and RIS to FHIR.18-21
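The Converter Tools follow a familiar pattern: parse a legacy format and emit structured FHIR. A toy sketch in that spirit, converting a RIS citation record into a simplified Citation-style fragment (the output structure is condensed for illustration, not the full FHIR Citation resource, and the record itself is just sample data):

```python
# Toy RIS-to-FHIR sketch in the spirit of the platform's Converter Tools.
# The Citation fragment emitted is simplified, not a full FHIR Citation.

RIS = """TY  - JOUR
TI  - Making guidelines computable
AU  - Alper, Brian S.
PY  - 2024
ER  - """

def ris_to_citation(text):
    # RIS lines look like "TI  - value": a two-letter tag, two spaces,
    # a hyphen, a space, then the value.
    tags = {}
    for line in text.splitlines():
        if len(line) >= 6 and line[2:6] == "  - ":
            tag, value = line[:2], line[6:].strip()
            tags.setdefault(tag, []).append(value)
    return {
        "resourceType": "Citation",
        "citedArtifact": {
            "title": [{"text": t} for t in tags.get("TI", [])],
            "contributorship": {"entry": [{"name": a} for a in tags.get("AU", [])]},
        },
    }

cit = ris_to_citation(RIS)
print(cit["citedArtifact"]["title"][0]["text"])  # Making guidelines computable
```

Once the record is in a shared structure, every downstream tool that speaks the standard can reuse it without re-entry.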
Prioritization for tool development on the FEvIR Platform is determined by participation and by resources. We anticipate that 2024 priorities will include substantial gains in features for the updating and adapting functions of the recommendation and guideline authoring tools.
With computable guidelines (i.e., specification of guideline content and guideline development content in machine-interpretable form), guideline developers will be able to spend more of their time making interpretations, judgements and decisions, and less of their time re-entering data, reformatting content to fit one system after another and searching for specific bits of information.
With all these developments, creating, updating and adapting guidelines will be much easier and much more efficient. And we will get there before 2042.
The author is the owner and CEO of Computable Publishing LLC, a small business providing consulting services and software development and hosting the FEvIR Platform; president of the Scientific Knowledge Accelerator Foundation, a nonprofit organization supporting virtual scientific knowledge accelerators such as HEvKA; and chair of the GIN Tech Working Group, a committee of the Guidelines International Network that is collaborating on standards development for data exchange to share evidence and guidance in computable form.