{"title":"Age-at-death estimation based on algorithms for multi-class discriminant analysis using binary and ordinal predictors","authors":"Efthymia Nikita , Panos Nikitas","doi":"10.1016/j.jflm.2025.102848","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>The objective of this study is to develop algorithms for multi-class discriminant analysis using binary and/or ordinal predictors that can be used effectively to age-at-death estimation when expressed in terms of binary/ordinal age markers.</div></div><div><h3>Method</h3><div>The algorithms examined are based on the crude assumption that the predictors are uncorrelated or on the latent model which assumes that the discrete predictors code an underlying multivariate normal distribution. The tetrachoric/polychoric correlations of this distribution are either estimated from the training dataset and used without or with correction to fall within feasible correlation bounds or are extracted from algorithms used to generate correlated binary/ordinal data.</div></div><div><h3>Results</h3><div>It was found that, irrespective of the origin of the dataset analyzed, the crude algorithms may give poor results only when applied to ordinal datasets with very strong intercorrelated predictors. In what concerns the classification performance of the algorithms based on the latent model, we did not detect any statistically significant differences; they all perform similarly. The application of the algorithms to age-at-death estimation showed that the total classification accuracy is overall satisfactory even in datasets with small sample sizes, but the cross-validated accuracy is low when sample sizes are small.</div></div><div><h3>Conclusion</h3><div>In age-at-death estimations we can use any algorithm from those we studied except for the crude algorithms. However, to increase the classification accuracy, we should increase the size of the classes. Under this prerequisite, we may achieve very high cross-validated classification accuracies, in most cases higher than 90 %.</div></div>","PeriodicalId":16098,"journal":{"name":"Journal of forensic and legal medicine","volume":"111 ","pages":"Article 102848"},"PeriodicalIF":1.2000,"publicationDate":"2025-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of forensic and legal medicine","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1752928X25000496","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MEDICINE, LEGAL","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
The objective of this study is to develop algorithms for multi-class discriminant analysis using binary and/or ordinal predictors that can be used effectively to age-at-death estimation when expressed in terms of binary/ordinal age markers.
Method
The algorithms examined are based on the crude assumption that the predictors are uncorrelated or on the latent model which assumes that the discrete predictors code an underlying multivariate normal distribution. The tetrachoric/polychoric correlations of this distribution are either estimated from the training dataset and used without or with correction to fall within feasible correlation bounds or are extracted from algorithms used to generate correlated binary/ordinal data.
Results
It was found that, irrespective of the origin of the dataset analyzed, the crude algorithms may give poor results only when applied to ordinal datasets with very strong intercorrelated predictors. In what concerns the classification performance of the algorithms based on the latent model, we did not detect any statistically significant differences; they all perform similarly. The application of the algorithms to age-at-death estimation showed that the total classification accuracy is overall satisfactory even in datasets with small sample sizes, but the cross-validated accuracy is low when sample sizes are small.
Conclusion
In age-at-death estimations we can use any algorithm from those we studied except for the crude algorithms. However, to increase the classification accuracy, we should increase the size of the classes. Under this prerequisite, we may achieve very high cross-validated classification accuracies, in most cases higher than 90 %.
期刊介绍:
The Journal of Forensic and Legal Medicine publishes topical articles on aspects of forensic and legal medicine. Specifically the Journal supports research that explores the medical principles of care and forensic assessment of individuals, whether adult or child, in contact with the judicial system. It is a fully peer-review hybrid journal with a broad international perspective.
The Journal accepts submissions of original research, review articles, and pertinent case studies, editorials, and commentaries in relevant areas of Forensic and Legal Medicine, Context of Practice, and Education and Training.
The Journal adheres to strict publication ethical guidelines, and actively supports a culture of inclusive and representative publication.