Barr N Hadar, Zvonimir Poljak, Brenda Bonnett, Jason Coe, Elizabeth A Stone, Theresa M Bernardo
{"title":"Machine learning predicts selected cat diseases using insurance data amid challenges in interpretability.","authors":"Barr N Hadar, Zvonimir Poljak, Brenda Bonnett, Jason Coe, Elizabeth A Stone, Theresa M Bernardo","doi":"10.2460/ajvr.24.09.0282","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To develop models for prediction of the onset of specific diseases in cats using pet insurance data and to evaluate their predictive performance.</p><p><strong>Methods: </strong>Agria Pet Insurance data from almost 550,000 cats (2011 to 2016) were analyzed and used to train predictive models for periodontal disease and skin tumors using breed, sex, and insurance claim history. Random downsampling and 1:1 matching by age, insurance duration, and time at risk balanced the dataset. Variables were then further processed, with random forest and conditional logistic regression used for analysis. Model accuracy was assessed through leave-one-out cross-validation, while variable importance plots, partial dependence plots, and coefficients were used for model interpretation.</p><p><strong>Results: </strong>Model accuracy ranged from 81.9% to 88.2% (P < .01, baseline 50%). Key predictors included prior insurance claims for \"digestive,\" \"whole body symptom,\" \"skin,\" and \"injury conditions,\" which may be nonspecific and predictive of various diseases. Maine Coon, Siamese, and Burmese cats were associated with periodontal disease-positive predictions, while domestic cats were linked with negative predictions. For skin tumors, Norwegian Forest Cats, Devon Rex and Sphynx cats, and Maine Coon cats were associated with positive predictions, whereas Birman and domestic cats were linked with negative predictions.</p><p><strong>Conclusions: </strong>This study presents a method of machine learning predictive analysis on pet insurance data, although more comprehensive medical information and approaches accounting for data characteristics may be necessary to develop clearer predictors.</p><p><strong>Clinical relevance: </strong>To prevent or detect these conditions early, veterinarians can use the breed risk results to guide clients, especially those with high-risk breeds, by offering early advice on lifestyle and monitoring.</p>","PeriodicalId":7754,"journal":{"name":"American journal of veterinary research","volume":" ","pages":"1-11"},"PeriodicalIF":1.3000,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of veterinary research","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.2460/ajvr.24.09.0282","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"VETERINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: To develop models for prediction of the onset of specific diseases in cats using pet insurance data and to evaluate their predictive performance.
Methods: Agria Pet Insurance data from almost 550,000 cats (2011 to 2016) were analyzed and used to train predictive models for periodontal disease and skin tumors using breed, sex, and insurance claim history. Random downsampling and 1:1 matching by age, insurance duration, and time at risk balanced the dataset. Variables were then further processed, with random forest and conditional logistic regression used for analysis. Model accuracy was assessed through leave-one-out cross-validation, while variable importance plots, partial dependence plots, and coefficients were used for model interpretation.
Results: Model accuracy ranged from 81.9% to 88.2% (P < .01, baseline 50%). Key predictors included prior insurance claims for "digestive," "whole body symptom," "skin," and "injury conditions," which may be nonspecific and predictive of various diseases. Maine Coon, Siamese, and Burmese cats were associated with periodontal disease-positive predictions, while domestic cats were linked with negative predictions. For skin tumors, Norwegian Forest Cats, Devon Rex and Sphynx cats, and Maine Coon cats were associated with positive predictions, whereas Birman and domestic cats were linked with negative predictions.
Conclusions: This study presents a method of machine learning predictive analysis on pet insurance data, although more comprehensive medical information and approaches accounting for data characteristics may be necessary to develop clearer predictors.
Clinical relevance: To prevent or detect these conditions early, veterinarians can use the breed risk results to guide clients, especially those with high-risk breeds, by offering early advice on lifestyle and monitoring.
期刊介绍:
The American Journal of Veterinary Research supports the collaborative exchange of information between researchers and clinicians by publishing novel research findings that bridge the gulf between basic research and clinical practice or that help to translate laboratory research and preclinical studies to the development of clinical trials and clinical practice. The journal welcomes submission of high-quality original studies and review articles in a wide range of scientific fields, including anatomy, anesthesiology, animal welfare, behavior, epidemiology, genetics, heredity, infectious disease, molecular biology, oncology, pharmacology, pathogenic mechanisms, physiology, surgery, theriogenology, toxicology, and vaccinology. Species of interest include production animals, companion animals, equids, exotic animals, birds, reptiles, and wild and marine animals. Reports of laboratory animal studies and studies involving the use of animals as experimental models of human diseases are considered only when the study results are of demonstrable benefit to the species used in the research or to another species of veterinary interest. Other fields of interest or animals species are not necessarily excluded from consideration, but such reports must focus on novel research findings. Submitted papers must make an original and substantial contribution to the veterinary medicine knowledge base; preliminary studies are not appropriate.