P Guillod, A Savvas, P N Robinson, D Nai, K N Naresh, G Ott, A Schuh, W A Sewell, M Anderson, N Matentzoglu, D Durgavarjhula, M L Xu, M J Druzdzel, J M Astle
{"title":"一种系统的方法,优先考虑诊断有用的发现,作为离散数据纳入电子健康记录,以改进临床人工智能工具和基因组研究。","authors":"P Guillod, A Savvas, P N Robinson, D Nai, K N Naresh, G Ott, A Schuh, W A Sewell, M Anderson, N Matentzoglu, D Durgavarjhula, M L Xu, M J Druzdzel, J M Astle","doi":"10.1016/j.clon.2024.103737","DOIUrl":null,"url":null,"abstract":"<p><strong>Aims: </strong>The recent widespread use of electronic health records (EHRs) has opened the possibility for innumerable artificial intelligence (AI) tools to aid in genomics, phenomics, and other research, as well as disease prevention, diagnosis, and therapy. Unfortunately, much of the data contained in EHRs are not optimally structured for even the most sophisticated AI approaches. There are very few published efforts investigating methods for recording discrete data in EHRs that would not slow current clinical workflows or ways to prioritise patient characteristics worth recording. Here, we propose an approach to identify and prioritise findings (phenotypes) useful for differentiating diseases, with an initial focus on relatively common small B-cell lymphomas.</p><p><strong>Materials and methods: </strong>A website enabling crowd-sourced recording of diseases and phenotypes was developed. An expert committee in the field of B-cell lymphomas standardised phenotype terminology for use in digital resources, and select terms were included in the Human Phenotype Ontology (HPO). A total of 100 patient lymph node biopsy samples were evaluated, and phenotypes were recorded as discrete data. Bayesian networks (BNs) were developed based on these data, and their diagnostic accuracy and ability to prioritise these phenotypes for inclusion in EHRs were assessed.</p><p><strong>Results: </strong>Out of 146 phenotypes identified from the website as potentially useful for differentiating four different lymphomas from each other and from benign lymph nodes, 70-75 were included in BNs. The diagnostic accuracy of different naïve BNs was 96.3% for non-marginal zone lymphoma cases and 50% for marginal zone lymphoma cases when all of the included phenotypes were used and 93.8% for non-marginal zone lymphoma cases and 27.5% for marginal zone lymphoma cases when only 15 phenotypes were included in the BNs.</p><p><strong>Conclusion: </strong>This pilot provides a starting point for systematic improvement and a dataset for comparing related approaches.</p>","PeriodicalId":10403,"journal":{"name":"Clinical oncology","volume":"39 ","pages":"103737"},"PeriodicalIF":3.2000,"publicationDate":"2024-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Systematic Approach to Prioritise Diagnostically Useful Findings for Inclusion in Electronic Health Records as Discrete Data to Improve Clinical Artificial Intelligence Tools and Genomic Research.\",\"authors\":\"P Guillod, A Savvas, P N Robinson, D Nai, K N Naresh, G Ott, A Schuh, W A Sewell, M Anderson, N Matentzoglu, D Durgavarjhula, M L Xu, M J Druzdzel, J M Astle\",\"doi\":\"10.1016/j.clon.2024.103737\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Aims: </strong>The recent widespread use of electronic health records (EHRs) has opened the possibility for innumerable artificial intelligence (AI) tools to aid in genomics, phenomics, and other research, as well as disease prevention, diagnosis, and therapy. Unfortunately, much of the data contained in EHRs are not optimally structured for even the most sophisticated AI approaches. There are very few published efforts investigating methods for recording discrete data in EHRs that would not slow current clinical workflows or ways to prioritise patient characteristics worth recording. Here, we propose an approach to identify and prioritise findings (phenotypes) useful for differentiating diseases, with an initial focus on relatively common small B-cell lymphomas.</p><p><strong>Materials and methods: </strong>A website enabling crowd-sourced recording of diseases and phenotypes was developed. An expert committee in the field of B-cell lymphomas standardised phenotype terminology for use in digital resources, and select terms were included in the Human Phenotype Ontology (HPO). A total of 100 patient lymph node biopsy samples were evaluated, and phenotypes were recorded as discrete data. Bayesian networks (BNs) were developed based on these data, and their diagnostic accuracy and ability to prioritise these phenotypes for inclusion in EHRs were assessed.</p><p><strong>Results: </strong>Out of 146 phenotypes identified from the website as potentially useful for differentiating four different lymphomas from each other and from benign lymph nodes, 70-75 were included in BNs. The diagnostic accuracy of different naïve BNs was 96.3% for non-marginal zone lymphoma cases and 50% for marginal zone lymphoma cases when all of the included phenotypes were used and 93.8% for non-marginal zone lymphoma cases and 27.5% for marginal zone lymphoma cases when only 15 phenotypes were included in the BNs.</p><p><strong>Conclusion: </strong>This pilot provides a starting point for systematic improvement and a dataset for comparing related approaches.</p>\",\"PeriodicalId\":10403,\"journal\":{\"name\":\"Clinical oncology\",\"volume\":\"39 \",\"pages\":\"103737\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-12-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Clinical oncology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.clon.2024.103737\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical oncology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.clon.2024.103737","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
A Systematic Approach to Prioritise Diagnostically Useful Findings for Inclusion in Electronic Health Records as Discrete Data to Improve Clinical Artificial Intelligence Tools and Genomic Research.
Aims: The recent widespread use of electronic health records (EHRs) has opened the possibility for innumerable artificial intelligence (AI) tools to aid in genomics, phenomics, and other research, as well as disease prevention, diagnosis, and therapy. Unfortunately, much of the data contained in EHRs are not optimally structured for even the most sophisticated AI approaches. There are very few published efforts investigating methods for recording discrete data in EHRs that would not slow current clinical workflows or ways to prioritise patient characteristics worth recording. Here, we propose an approach to identify and prioritise findings (phenotypes) useful for differentiating diseases, with an initial focus on relatively common small B-cell lymphomas.
Materials and methods: A website enabling crowd-sourced recording of diseases and phenotypes was developed. An expert committee in the field of B-cell lymphomas standardised phenotype terminology for use in digital resources, and select terms were included in the Human Phenotype Ontology (HPO). A total of 100 patient lymph node biopsy samples were evaluated, and phenotypes were recorded as discrete data. Bayesian networks (BNs) were developed based on these data, and their diagnostic accuracy and ability to prioritise these phenotypes for inclusion in EHRs were assessed.
Results: Out of 146 phenotypes identified from the website as potentially useful for differentiating four different lymphomas from each other and from benign lymph nodes, 70-75 were included in BNs. The diagnostic accuracy of different naïve BNs was 96.3% for non-marginal zone lymphoma cases and 50% for marginal zone lymphoma cases when all of the included phenotypes were used and 93.8% for non-marginal zone lymphoma cases and 27.5% for marginal zone lymphoma cases when only 15 phenotypes were included in the BNs.
Conclusion: This pilot provides a starting point for systematic improvement and a dataset for comparing related approaches.
期刊介绍:
Clinical Oncology is an International cancer journal covering all aspects of the clinical management of cancer patients, reflecting a multidisciplinary approach to therapy. Papers, editorials and reviews are published on all types of malignant disease embracing, pathology, diagnosis and treatment, including radiotherapy, chemotherapy, surgery, combined modality treatment and palliative care. Research and review papers covering epidemiology, radiobiology, radiation physics, tumour biology, and immunology are also published, together with letters to the editor, case reports and book reviews.