Jennifer Webster, Jennifer Ghith, Orion Penner, Christopher H Lieu, Bob J A Schijvenaars
{"title":"Using Artificial Intelligence to Support Informed Decision-Making on <i>BRAF</i> Mutation Testing.","authors":"Jennifer Webster, Jennifer Ghith, Orion Penner, Christopher H Lieu, Bob J A Schijvenaars","doi":"10.1200/PO.23.00685","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Precision oncology relies on accurate and interpretable reporting of testing and mutation rates. Focusing on the <i>BRAFV600</i> mutations in advanced colorectal carcinoma, non-small-cell lung carcinoma, and cutaneous melanoma, we developed a platform displaying testing and mutation rates reported in the literature, which we annotated using an artificial intelligence (AI) and natural language processing (NLP) pipeline.</p><p><strong>Methods: </strong>Using AI, we identified publications that likely reported a testing or mutation rate, filtered publications for cancer type, and identified sentences that likely reported rates. Rates and covariates were subsequently manually curated by three experts. The AI performance was evaluated using precision and recall metrics. We used an interactive platform to explore and present the annotated testing and mutation rates by certain study characteristics.</p><p><strong>Results: </strong>The interactive dashboard, accessible at the BRAF dimensions website, enables users to filter mutation and testing rates with relevant options (eg, country of study, study type, mutation type) and to visualize annotated rates. The AI pipeline demonstrated excellent filtering performance (>90% precision and recall for all target cancer types) and moderate performance for sentence classification (53%-99% precision; ≥75% recall). The manual annotation of testing and mutation rates revealed inter-rater disagreement (testing rate, 19%; mutation rate, 70%), indicating unclear or nonstandard reporting of rates in some publications.</p><p><strong>Conclusion: </strong>Our AI-driven NLP pipeline demonstrated the potential for annotating biomarker testing and mutation rates. The difficulties we encountered highlight the need for more advanced AI-powered literature searching and data extraction, and more consistent reporting of testing rates. These improvements would reduce the risk of misinterpretation or misunderstanding of testing and mutation rates by AI-based technologies and the health care community, with beneficial impacts on clinical decision-making, research, and trial design.</p>","PeriodicalId":14797,"journal":{"name":"JCO precision oncology","volume":null,"pages":null},"PeriodicalIF":5.3000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JCO precision oncology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1200/PO.23.00685","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/10/30 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: Precision oncology relies on accurate and interpretable reporting of testing and mutation rates. Focusing on the BRAFV600 mutations in advanced colorectal carcinoma, non-small-cell lung carcinoma, and cutaneous melanoma, we developed a platform displaying testing and mutation rates reported in the literature, which we annotated using an artificial intelligence (AI) and natural language processing (NLP) pipeline.
Methods: Using AI, we identified publications that likely reported a testing or mutation rate, filtered publications for cancer type, and identified sentences that likely reported rates. Rates and covariates were subsequently manually curated by three experts. The AI performance was evaluated using precision and recall metrics. We used an interactive platform to explore and present the annotated testing and mutation rates by certain study characteristics.
Results: The interactive dashboard, accessible at the BRAF dimensions website, enables users to filter mutation and testing rates with relevant options (eg, country of study, study type, mutation type) and to visualize annotated rates. The AI pipeline demonstrated excellent filtering performance (>90% precision and recall for all target cancer types) and moderate performance for sentence classification (53%-99% precision; ≥75% recall). The manual annotation of testing and mutation rates revealed inter-rater disagreement (testing rate, 19%; mutation rate, 70%), indicating unclear or nonstandard reporting of rates in some publications.
Conclusion: Our AI-driven NLP pipeline demonstrated the potential for annotating biomarker testing and mutation rates. The difficulties we encountered highlight the need for more advanced AI-powered literature searching and data extraction, and more consistent reporting of testing rates. These improvements would reduce the risk of misinterpretation or misunderstanding of testing and mutation rates by AI-based technologies and the health care community, with beneficial impacts on clinical decision-making, research, and trial design.