Machine learning vs. rule-based methods for document classification of electronic health records within mental health care—A systematic literature review
Emil Rijcken , Kalliopi Zervanou , Pablo Mosteiro , Floortje Scheepers , Marco Spruit , Uzay Kaymak
{"title":"Machine learning vs. rule-based methods for document classification of electronic health records within mental health care—A systematic literature review","authors":"Emil Rijcken , Kalliopi Zervanou , Pablo Mosteiro , Floortje Scheepers , Marco Spruit , Uzay Kaymak","doi":"10.1016/j.nlp.2025.100129","DOIUrl":null,"url":null,"abstract":"<div><div>Document classification is a widely used task for analyzing mental healthcare texts. This systematic literature review focuses on the document classification of electronic health records in mental healthcare. Over the last decade, there has been a shift from rule-based to machine-learning methods. Despite this shift, no systematic comparison of these two approaches exists for mental healthcare applications. This review examines the evolution, applications, and performance of these methods over time. We find that for most of the last decade, rule-based methods have outperformed machine-learning approaches. However, with the development of more advanced machine-learning techniques, performance has improved. In particular, Transformer-based models enable machine learning approaches to outperform rule-based methods for the first time.</div></div>","PeriodicalId":100944,"journal":{"name":"Natural Language Processing Journal","volume":"10 ","pages":"Article 100129"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Natural Language Processing Journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949719125000056","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Document classification is a widely used task for analyzing mental healthcare texts. This systematic literature review focuses on the document classification of electronic health records in mental healthcare. Over the last decade, there has been a shift from rule-based to machine-learning methods. Despite this shift, no systematic comparison of these two approaches exists for mental healthcare applications. This review examines the evolution, applications, and performance of these methods over time. We find that for most of the last decade, rule-based methods have outperformed machine-learning approaches. However, with the development of more advanced machine-learning techniques, performance has improved. In particular, Transformer-based models enable machine learning approaches to outperform rule-based methods for the first time.