{"title":"Artificial intelligence-aided data mining of medical records for cancer detection and screening","authors":"Amalie Dahl Haue, Jessica Xin Hjaltelin, Peter Christoffer Holm, Davide Placido, S⊘ren Brunak","doi":"10.1016/s1470-2045(24)00277-8","DOIUrl":null,"url":null,"abstract":"The application of artificial intelligence methods to electronic patient records paves the way for large-scale analysis of multimodal data. Such population-wide data describing deep phenotypes composed of thousands of features are now being leveraged to create data-driven algorithms, which in turn has led to improved methods for early cancer detection and screening. Remaining challenges include establishment of infrastructures for prospective testing of such methods, ways to assess biases given the data, and gathering of sufficiently large and diverse datasets that reflect disease heterogeneities across populations. This Review provides an overview of artificial intelligence methods designed to detect cancer early, including key aspects of concern (eg, the problem of data drift—when the underlying health-care data change over time), ethical aspects, and discrepancies between access to cancer screening in high-income countries versus low-income and middle-income countries.","PeriodicalId":22865,"journal":{"name":"The Lancet Oncology","volume":"13 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Lancet Oncology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/s1470-2045(24)00277-8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The application of artificial intelligence methods to electronic patient records paves the way for large-scale analysis of multimodal data. Such population-wide data describing deep phenotypes composed of thousands of features are now being leveraged to create data-driven algorithms, which in turn has led to improved methods for early cancer detection and screening. Remaining challenges include establishment of infrastructures for prospective testing of such methods, ways to assess biases given the data, and gathering of sufficiently large and diverse datasets that reflect disease heterogeneities across populations. This Review provides an overview of artificial intelligence methods designed to detect cancer early, including key aspects of concern (eg, the problem of data drift—when the underlying health-care data change over time), ethical aspects, and discrepancies between access to cancer screening in high-income countries versus low-income and middle-income countries.