Ye Lin, Xin Yang, Mingxuan Zhang, Jinyan Cheng, Hai Lin, Qi Zhao
{"title":"CLSSATP: Contrastive learning and self-supervised learning model for aquatic toxicity prediction","authors":"Ye Lin, Xin Yang, Mingxuan Zhang, Jinyan Cheng, Hai Lin, Qi Zhao","doi":"10.1016/j.aquatox.2025.107244","DOIUrl":null,"url":null,"abstract":"As compound concentrations in aquatic environments increase, the habitat degradation of aquatic organisms underscores the growing importance of studying the impact of chemicals on diverse aquatic populations. Understanding the potential impacts of different chemical substances on different species is a necessary requirement for protecting the environment and ensuring sustainable human development. In this regard, deep learning methods offer significant advantages over traditional experimental approaches in terms of cost, accuracy, and generalization ability. This research introduces CLSSATP, an efficient contrastive self-supervised learning deep neural network prediction model for organic toxicity. The model integrates two modules, a self-supervised learning module using molecular fingerprints for representation, and a contrastive learning module utilizing molecular graphs. Through dual-perspective learning, the model gains clear insights into the structural and property relationships of molecules. The experiment results indicate that our model outperforms comparative methods, demonstrating the effectiveness of our proposed architecture. Moreover, ablation experiments show that the self-supervised module and contrastive learning module respectively provide average performance improvements of 9.43 % and 10.98 % to CLSSATP. Furthermore, by visualizing the representations of our model, we observe that it correctly identifies the substructures that determine the molecular properties, granting itself with interpretability. In conclusion, CLSSATP offers a novel and effective perspective for future research in aquatic toxicity assessment. All of codes and datasets are freely available online at <ce:inter-ref xlink:href=\"https://github.co\" xlink:type=\"simple\">https://github.co</ce:inter-ref><ce:inter-ref xlink:href=\"http://m/z\" xlink:type=\"simple\"><ce:italic>m/z</ce:italic></ce:inter-ref><ce:inter-ref xlink:href=\"http://haoqi106/CLSSATP\" xlink:type=\"simple\">haoqi106/CLSSATP</ce:inter-ref>.","PeriodicalId":248,"journal":{"name":"Aquatic Toxicology","volume":"76 1","pages":""},"PeriodicalIF":4.1000,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aquatic Toxicology","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.aquatox.2025.107244","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MARINE & FRESHWATER BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
As compound concentrations in aquatic environments increase, the habitat degradation of aquatic organisms underscores the growing importance of studying the impact of chemicals on diverse aquatic populations. Understanding the potential impacts of different chemical substances on different species is a necessary requirement for protecting the environment and ensuring sustainable human development. In this regard, deep learning methods offer significant advantages over traditional experimental approaches in terms of cost, accuracy, and generalization ability. This research introduces CLSSATP, an efficient contrastive self-supervised learning deep neural network prediction model for organic toxicity. The model integrates two modules, a self-supervised learning module using molecular fingerprints for representation, and a contrastive learning module utilizing molecular graphs. Through dual-perspective learning, the model gains clear insights into the structural and property relationships of molecules. The experiment results indicate that our model outperforms comparative methods, demonstrating the effectiveness of our proposed architecture. Moreover, ablation experiments show that the self-supervised module and contrastive learning module respectively provide average performance improvements of 9.43 % and 10.98 % to CLSSATP. Furthermore, by visualizing the representations of our model, we observe that it correctly identifies the substructures that determine the molecular properties, granting itself with interpretability. In conclusion, CLSSATP offers a novel and effective perspective for future research in aquatic toxicity assessment. All of codes and datasets are freely available online at https://github.com/zhaoqi106/CLSSATP.
期刊介绍:
Aquatic Toxicology publishes significant contributions that increase the understanding of the impact of harmful substances (including natural and synthetic chemicals) on aquatic organisms and ecosystems.
Aquatic Toxicology considers both laboratory and field studies with a focus on marine/ freshwater environments. We strive to attract high quality original scientific papers, critical reviews and expert opinion papers in the following areas: Effects of harmful substances on molecular, cellular, sub-organismal, organismal, population, community, and ecosystem level; Toxic Mechanisms; Genetic disturbances, transgenerational effects, behavioral and adaptive responses; Impacts of harmful substances on structure, function of and services provided by aquatic ecosystems; Mixture toxicity assessment; Statistical approaches to predict exposure to and hazards of contaminants
The journal also considers manuscripts in other areas, such as the development of innovative concepts, approaches, and methodologies, which promote the wider application of toxicological datasets to the protection of aquatic environments and inform ecological risk assessments and decision making by relevant authorities.