Panuwat Trairatphisan, Lena Dorsheimer, Peter Monecke, Jan Wenzel, Rubin James, Andreas Czich, Yasmin Dietz-Baum, Friedemann Schmidt
{"title":"Machine learning enhances genotoxicity assessment using MultiFlow® DNA damage assay.","authors":"Panuwat Trairatphisan, Lena Dorsheimer, Peter Monecke, Jan Wenzel, Rubin James, Andreas Czich, Yasmin Dietz-Baum, Friedemann Schmidt","doi":"10.1002/em.22648","DOIUrl":null,"url":null,"abstract":"<p><p>Genotoxicity is a critical determinant for assessing the safety of pharmaceutical drugs, their metabolites, and impurities. Among genotoxicity tests, mechanistic assays such as the MultiFlow® DNA damage assay (MFA) allows the investigations on mode of action (MoA) of DNA damage through four mechanistic markers recorded at two time points. Previous studies have shown that machine learning (ML) can enhance precision on classifying the MoA of genotoxicants. Nevertheless, these approaches need to be tailored to specific chemical spaces and lab conditions for accurate risk assessment. In this study, we applied various state-of-the-art ML algorithms available in an open-source R package (caret) to build MFA-ML models using data from Bryce et al. (2016). The best model achieved 95% accuracy on the training dataset and correctly predicted genotoxicity in 16 out of 17 cases in the test dataset. Incorporating molecular descriptors properties from established in silico models demonstrated further improved performance of the approach to cover challenging examples of pharmaceuticals exhibiting a pharmacological mode of action that could interfere with the biomarker response. Further model validation on an external test set with 49 non-overlapped compounds showed a high model accuracy at 92%. Additionally, a tailored graphical user interface was developed using a freely available R package (shiny) to support visual analysis of MFA data including MoA predictions, facilitating broad usage by laboratory scientists. Lastly, a perspective on the integration of MoA predictions as additional evidence into a genotoxicity assessment workflow is proposed.</p>","PeriodicalId":11791,"journal":{"name":"Environmental and Molecular Mutagenesis","volume":" ","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2024-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental and Molecular Mutagenesis","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1002/em.22648","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Genotoxicity is a critical determinant for assessing the safety of pharmaceutical drugs, their metabolites, and impurities. Among genotoxicity tests, mechanistic assays such as the MultiFlow® DNA damage assay (MFA) allows the investigations on mode of action (MoA) of DNA damage through four mechanistic markers recorded at two time points. Previous studies have shown that machine learning (ML) can enhance precision on classifying the MoA of genotoxicants. Nevertheless, these approaches need to be tailored to specific chemical spaces and lab conditions for accurate risk assessment. In this study, we applied various state-of-the-art ML algorithms available in an open-source R package (caret) to build MFA-ML models using data from Bryce et al. (2016). The best model achieved 95% accuracy on the training dataset and correctly predicted genotoxicity in 16 out of 17 cases in the test dataset. Incorporating molecular descriptors properties from established in silico models demonstrated further improved performance of the approach to cover challenging examples of pharmaceuticals exhibiting a pharmacological mode of action that could interfere with the biomarker response. Further model validation on an external test set with 49 non-overlapped compounds showed a high model accuracy at 92%. Additionally, a tailored graphical user interface was developed using a freely available R package (shiny) to support visual analysis of MFA data including MoA predictions, facilitating broad usage by laboratory scientists. Lastly, a perspective on the integration of MoA predictions as additional evidence into a genotoxicity assessment workflow is proposed.
期刊介绍:
Environmental and Molecular Mutagenesis publishes original research manuscripts, reviews and commentaries on topics related to six general areas, with an emphasis on subject matter most suited for the readership of EMM as outlined below. The journal is intended for investigators in fields such as molecular biology, biochemistry, microbiology, genetics and epigenetics, genomics and epigenomics, cancer research, neurobiology, heritable mutation, radiation biology, toxicology, and molecular & environmental epidemiology.