Rahul Alapati MD, Bryan Renslo MD, Sarah F. Wagoner MD, Omar Karadaghy MD, Aisha Serpedin BS, Yeo Eun Kim BS, Maria Feucht MD, Naomi Wang BA, Uma Ramesh BA, Antonio Bon Nieves BS, Amelia Lawrence BS, Celina Virgen MD, MPH, Tuleen Sawaf MD, Anaïs Rameau MD, Andrés M. Bur MD
{"title":"Assessing the Reporting Quality of Machine Learning Algorithms in Head and Neck Oncology","authors":"Rahul Alapati MD, Bryan Renslo MD, Sarah F. Wagoner MD, Omar Karadaghy MD, Aisha Serpedin BS, Yeo Eun Kim BS, Maria Feucht MD, Naomi Wang BA, Uma Ramesh BA, Antonio Bon Nieves BS, Amelia Lawrence BS, Celina Virgen MD, MPH, Tuleen Sawaf MD, Anaïs Rameau MD, Andrés M. Bur MD","doi":"10.1002/lary.31756","DOIUrl":null,"url":null,"abstract":"<div>\n \n <section>\n \n <h3> Objective</h3>\n \n <p>This study aimed to assess reporting quality of machine learning (ML) algorithms in the head and neck oncology literature using the TRIPOD-AI criteria.</p>\n </section>\n \n <section>\n \n <h3> Data Sources</h3>\n \n <p>A comprehensive search was conducted using PubMed, Scopus, Embase, and Cochrane Database of Systematic Reviews, incorporating search terms related to “artificial intelligence,” “machine learning,” “deep learning,” “neural network,” and various head and neck neoplasms.</p>\n </section>\n \n <section>\n \n <h3> Review Methods</h3>\n \n <p>Two independent reviewers analyzed each published study for adherence to the 65-point TRIPOD-AI criteria. Items were classified as “Yes,” “No,” or “NA” for each publication. The proportion of studies satisfying each TRIPOD-AI criterion was calculated. Additionally, the evidence level for each study was evaluated independently by two reviewers using the Oxford Centre for Evidence-Based Medicine (OCEBM) Levels of Evidence. Discrepancies were reconciled through discussion until consensus was reached.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>The study highlights the need for improvements in ML algorithm reporting in head and neck oncology. This includes more comprehensive descriptions of datasets, standardization of model performance reporting, and increased sharing of ML models, data, and code with the research community. Adoption of TRIPOD-AI is necessary for achieving standardized ML research reporting in head and neck oncology.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>Current reporting of ML algorithms hinders clinical application, reproducibility, and understanding of the data used for model training. To overcome these limitations and improve patient and clinician trust, ML developers should provide open access to models, code, and source data, fostering iterative progress through community critique, thus enhancing model accuracy and mitigating biases.</p>\n </section>\n \n <section>\n \n <h3> Level of Evidence</h3>\n \n <p>NA <i>Laryngoscope</i>, 135:687–694, 2025</p>\n </section>\n </div>","PeriodicalId":49921,"journal":{"name":"Laryngoscope","volume":"135 2","pages":"687-694"},"PeriodicalIF":2.0000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Laryngoscope","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/lary.31756","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
This study aimed to assess reporting quality of machine learning (ML) algorithms in the head and neck oncology literature using the TRIPOD-AI criteria.
Data Sources
A comprehensive search was conducted using PubMed, Scopus, Embase, and Cochrane Database of Systematic Reviews, incorporating search terms related to “artificial intelligence,” “machine learning,” “deep learning,” “neural network,” and various head and neck neoplasms.
Review Methods
Two independent reviewers analyzed each published study for adherence to the 65-point TRIPOD-AI criteria. Items were classified as “Yes,” “No,” or “NA” for each publication. The proportion of studies satisfying each TRIPOD-AI criterion was calculated. Additionally, the evidence level for each study was evaluated independently by two reviewers using the Oxford Centre for Evidence-Based Medicine (OCEBM) Levels of Evidence. Discrepancies were reconciled through discussion until consensus was reached.
Results
The study highlights the need for improvements in ML algorithm reporting in head and neck oncology. This includes more comprehensive descriptions of datasets, standardization of model performance reporting, and increased sharing of ML models, data, and code with the research community. Adoption of TRIPOD-AI is necessary for achieving standardized ML research reporting in head and neck oncology.
Conclusion
Current reporting of ML algorithms hinders clinical application, reproducibility, and understanding of the data used for model training. To overcome these limitations and improve patient and clinician trust, ML developers should provide open access to models, code, and source data, fostering iterative progress through community critique, thus enhancing model accuracy and mitigating biases.
期刊介绍:
The Laryngoscope has been the leading source of information on advances in the diagnosis and treatment of head and neck disorders since 1890. The Laryngoscope is the first choice among otolaryngologists for publication of their important findings and techniques. Each monthly issue of The Laryngoscope features peer-reviewed medical, clinical, and research contributions in general otolaryngology, allergy/rhinology, otology/neurotology, laryngology/bronchoesophagology, head and neck surgery, sleep medicine, pediatric otolaryngology, facial plastics and reconstructive surgery, oncology, and communicative disorders. Contributions include papers and posters presented at the Annual and Section Meetings of the Triological Society, as well as independent papers, "How I Do It", "Triological Best Practice" articles, and contemporary reviews. Theses authored by the Triological Society’s new Fellows as well as papers presented at meetings of the American Laryngological Association are published in The Laryngoscope.
• Broncho-esophagology
• Communicative disorders
• Head and neck surgery
• Plastic and reconstructive facial surgery
• Oncology
• Speech and hearing defects