Konrad Kwolek, Artur Gądek, Kamil Kwolek, Radek Kolecki, Henryk Liszka
{"title":"Automated decision support for Hallux Valgus treatment options using anteroposterior foot radiographs.","authors":"Konrad Kwolek, Artur Gądek, Kamil Kwolek, Radek Kolecki, Henryk Liszka","doi":"10.5312/wjo.v14.i11.800","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Assessment of the potential utility of deep learning with subsequent image analysis to automate the measurement of hallux valgus and intermetatarsal angles from radiographs to serve as a preoperative aid in establishing hallux valgus severity for clinical decision-making.</p><p><strong>Aim: </strong>To investigate the accuracy of automated measurements of angles of hallux valgus from radiographs for further integration with the preoperative planning process.</p><p><strong>Methods: </strong>The data comprises 265 consecutive digital anteroposterior weightbearing foot radiographs. 181 radiographs were utilized for training (161) and validating (20) a U-Net neural network to achieve a mean Sørensen-Dice index > 97% on bone segmentation. 84 test radiographs were used for manual (computer assisted) and automated measurements of hallux valgus severity determined by hallux valgus (HVA) and intermetatarsal angles (IMA). The reliability of manual and computer-based measurements was calculated using the interclass correlation coefficient (ICC) and standard error of measurement (SEM). Inter- and intraobserver reliability coefficients were also compared. An operative treatment recommendation was then applied to compare results between automated and manual angle measurements.</p><p><strong>Results: </strong>Very high reliability was achieved for HVA and IMA between the manual measurements of three independent clinicians. For HVA, the ICC between manual measurements was 0.96-0.99. For IMA, ICC was 0.78-0.95. Comparing manual against automated computer measurement, the reliability was high as well. For HVA, absolute agreement ICC and consistency ICC were 0.97, and SEM was 0.32. For IMA, absolute agreement ICC was 0.75, consistency ICC was 0.89, and SEM was 0.21. Additionally, a strong correlation (0.80) was observed between our approach and traditional clinical adjudication for preoperative planning of hallux valgus, according to an operative treatment algorithm proposed by EFORT.</p><p><strong>Conclusion: </strong>The proposed automated, artificial intelligence assisted determination of hallux valgus angles based on deep learning holds great potential as an accurate and efficient tool, with comparable accuracy to manual measurements by expert clinicians. Our approach can be effectively implemented in clinical practice to determine the angles of hallux valgus from radiographs, classify the deformity severity, streamline preoperative decision-making prior to corrective surgery.</p>","PeriodicalId":47843,"journal":{"name":"World Journal of Orthopedics","volume":null,"pages":null},"PeriodicalIF":2.0000,"publicationDate":"2023-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10698342/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"World Journal of Orthopedics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5312/wjo.v14.i11.800","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ORTHOPEDICS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Assessment of the potential utility of deep learning with subsequent image analysis to automate the measurement of hallux valgus and intermetatarsal angles from radiographs to serve as a preoperative aid in establishing hallux valgus severity for clinical decision-making.
Aim: To investigate the accuracy of automated measurements of angles of hallux valgus from radiographs for further integration with the preoperative planning process.
Methods: The data comprises 265 consecutive digital anteroposterior weightbearing foot radiographs. 181 radiographs were utilized for training (161) and validating (20) a U-Net neural network to achieve a mean Sørensen-Dice index > 97% on bone segmentation. 84 test radiographs were used for manual (computer assisted) and automated measurements of hallux valgus severity determined by hallux valgus (HVA) and intermetatarsal angles (IMA). The reliability of manual and computer-based measurements was calculated using the interclass correlation coefficient (ICC) and standard error of measurement (SEM). Inter- and intraobserver reliability coefficients were also compared. An operative treatment recommendation was then applied to compare results between automated and manual angle measurements.
Results: Very high reliability was achieved for HVA and IMA between the manual measurements of three independent clinicians. For HVA, the ICC between manual measurements was 0.96-0.99. For IMA, ICC was 0.78-0.95. Comparing manual against automated computer measurement, the reliability was high as well. For HVA, absolute agreement ICC and consistency ICC were 0.97, and SEM was 0.32. For IMA, absolute agreement ICC was 0.75, consistency ICC was 0.89, and SEM was 0.21. Additionally, a strong correlation (0.80) was observed between our approach and traditional clinical adjudication for preoperative planning of hallux valgus, according to an operative treatment algorithm proposed by EFORT.
Conclusion: The proposed automated, artificial intelligence assisted determination of hallux valgus angles based on deep learning holds great potential as an accurate and efficient tool, with comparable accuracy to manual measurements by expert clinicians. Our approach can be effectively implemented in clinical practice to determine the angles of hallux valgus from radiographs, classify the deformity severity, streamline preoperative decision-making prior to corrective surgery.