Jia-Nan Ren, Qiang Chen, Hong-Yu-Xiang Ye, Cheng Cao, Ya-Min Guo, Jin-Rong Yang, Hao Wang, Muhammad Zafar Irshad Khan, Jian-Zhong Chen
{"title":"FGTN: Fragment-based graph transformer network for predicting reproductive toxicity","authors":"Jia-Nan Ren, Qiang Chen, Hong-Yu-Xiang Ye, Cheng Cao, Ya-Min Guo, Jin-Rong Yang, Hao Wang, Muhammad Zafar Irshad Khan, Jian-Zhong Chen","doi":"10.1007/s00204-024-03866-4","DOIUrl":null,"url":null,"abstract":"<div><p>Reproductive toxicity is one of the important issues in chemical safety. Traditional laboratory testing methods are costly and time-consuming with raised ethical issues. Only a few in silico models have been reported to predict human reproductive toxicity, but none of them make full use of the topological information of compounds. In addition, most existing atom-based graph neural network methods focus on attributing model predictions to individual nodes or edges rather than chemically meaningful fragments or substructures. In current studies, we develop a novel fragment-based graph transformer network (FGTN) approach to generate the QSAR model of human reproductive toxicity by considering internal topological structure information of compounds. In the FGTN model, the compound is represented by a graph architecture using fragments to be nodes and bonds linking two fragments to be edges. A super molecule-level node is further proposed to connect all fragment nodes by undirected edges, obtaining global molecular features from fragment embeddings. The FGTN model achieved an accuracy (ACC) of 0.861 and an area under the receiver operating characteristic curve (AUC) value of 0.914 on nonredundant blind tests, outperforming traditional fingerprint-based machine learning models and atom-based GCN model. The FGTN model can attribute toxic predictions to fragments, generating specific structural alerts for the positive compound. Moreover, FGTN may also have the capability to distinguish various chemical isomers. We believe that FGTN can be used as a reliable and effective tool for human reproductive toxicity prediction in contribution to the advancement of chemical safety assessment.</p></div>","PeriodicalId":8329,"journal":{"name":"Archives of Toxicology","volume":"98 12","pages":"4077 - 4092"},"PeriodicalIF":4.8000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Archives of Toxicology","FirstCategoryId":"3","ListUrlMain":"https://link.springer.com/article/10.1007/s00204-024-03866-4","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TOXICOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Reproductive toxicity is one of the important issues in chemical safety. Traditional laboratory testing methods are costly and time-consuming with raised ethical issues. Only a few in silico models have been reported to predict human reproductive toxicity, but none of them make full use of the topological information of compounds. In addition, most existing atom-based graph neural network methods focus on attributing model predictions to individual nodes or edges rather than chemically meaningful fragments or substructures. In current studies, we develop a novel fragment-based graph transformer network (FGTN) approach to generate the QSAR model of human reproductive toxicity by considering internal topological structure information of compounds. In the FGTN model, the compound is represented by a graph architecture using fragments to be nodes and bonds linking two fragments to be edges. A super molecule-level node is further proposed to connect all fragment nodes by undirected edges, obtaining global molecular features from fragment embeddings. The FGTN model achieved an accuracy (ACC) of 0.861 and an area under the receiver operating characteristic curve (AUC) value of 0.914 on nonredundant blind tests, outperforming traditional fingerprint-based machine learning models and atom-based GCN model. The FGTN model can attribute toxic predictions to fragments, generating specific structural alerts for the positive compound. Moreover, FGTN may also have the capability to distinguish various chemical isomers. We believe that FGTN can be used as a reliable and effective tool for human reproductive toxicity prediction in contribution to the advancement of chemical safety assessment.
期刊介绍:
Archives of Toxicology provides up-to-date information on the latest advances in toxicology. The journal places particular emphasis on studies relating to defined effects of chemicals and mechanisms of toxicity, including toxic activities at the molecular level, in humans and experimental animals. Coverage includes new insights into analysis and toxicokinetics and into forensic toxicology. Review articles of general interest to toxicologists are an additional important feature of the journal.