{"title":"63. An introduction to publicly available AI-assisted chatbot-style search engines for cancer variant curation","authors":"Beth Pitel, Antonina Wojcik, Christy Koellner, Claire Teigen, Katherine Geiersbach, Patricia Greipp, Xinjie Xu, Cinthya Zepeda Mendoza","doi":"10.1016/j.cancergen.2024.08.065","DOIUrl":null,"url":null,"abstract":"<div><div>Publicly available artificial intelligence (AI) chatbot-style search engines are gaining popularity for various applications, ranging from writing poetry to determining oncogenic cancer variants and genes. However, to our knowledge, a systematic evaluation of the effectiveness of these tools in cancer variant interpretation is lacking in current literature.</div><div>In this proof-of-concept study, four free online AI-assisted search engines (ChatGPT, Perplexity AI, Claude AI, and Llama2) were given simple standardized queries to investigate the clinical relevance of multiple gene variants observed in lung adenocarcinoma, glioma, and acute myeloid leukemia. The queries were structured as follows: 'What is the clinical significance of [GENE] [protein-level (p.) nomenclature] in [cancer type]?'</div><div>As anticipated, variants of uncertain significance (VUS) illustrated challenges for using AI-assisted search engines in cancer variant interpretation. Each tool incorrectly attributed oncogenicity to at least 1 of the 6 VUS investigated: Perplexity AI (1/6 VUS incorrectly represented as oncogenic), ChatGPT (2/6), Llama2 (4/6), Claude AI (5/6).</div><div>The overestimation of oncogenicity in these tools may be driven by conditioning of these AI-assisted search engines by past and current users for positive assignation attributes or from application of a response format with incorrect extrapolation of studies describing variants in the same gene without the ability to draw nuanced conclusions from studies focusing on different aspects of gene function. While there are challenges in using AI-assisted search engines in the clinical genomic space currently, this rapidly improving technology could provide a useful supplement for cancer variant analysts when combined with caution and expert human oversight.</div></div>","PeriodicalId":49225,"journal":{"name":"Cancer Genetics","volume":null,"pages":null},"PeriodicalIF":1.4000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Genetics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210776224001030","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Publicly available artificial intelligence (AI) chatbot-style search engines are gaining popularity for various applications, ranging from writing poetry to determining oncogenic cancer variants and genes. However, to our knowledge, a systematic evaluation of the effectiveness of these tools in cancer variant interpretation is lacking in current literature.
In this proof-of-concept study, four free online AI-assisted search engines (ChatGPT, Perplexity AI, Claude AI, and Llama2) were given simple standardized queries to investigate the clinical relevance of multiple gene variants observed in lung adenocarcinoma, glioma, and acute myeloid leukemia. The queries were structured as follows: 'What is the clinical significance of [GENE] [protein-level (p.) nomenclature] in [cancer type]?'
As anticipated, variants of uncertain significance (VUS) illustrated challenges for using AI-assisted search engines in cancer variant interpretation. Each tool incorrectly attributed oncogenicity to at least 1 of the 6 VUS investigated: Perplexity AI (1/6 VUS incorrectly represented as oncogenic), ChatGPT (2/6), Llama2 (4/6), Claude AI (5/6).
The overestimation of oncogenicity in these tools may be driven by conditioning of these AI-assisted search engines by past and current users for positive assignation attributes or from application of a response format with incorrect extrapolation of studies describing variants in the same gene without the ability to draw nuanced conclusions from studies focusing on different aspects of gene function. While there are challenges in using AI-assisted search engines in the clinical genomic space currently, this rapidly improving technology could provide a useful supplement for cancer variant analysts when combined with caution and expert human oversight.
期刊介绍:
The aim of Cancer Genetics is to publish high quality scientific papers on the cellular, genetic and molecular aspects of cancer, including cancer predisposition and clinical diagnostic applications. Specific areas of interest include descriptions of new chromosomal, molecular or epigenetic alterations in benign and malignant diseases; novel laboratory approaches for identification and characterization of chromosomal rearrangements or genomic alterations in cancer cells; correlation of genetic changes with pathology and clinical presentation; and the molecular genetics of cancer predisposition. To reach a basic science and clinical multidisciplinary audience, we welcome original full-length articles, reviews, meeting summaries, brief reports, and letters to the editor.