Irini Furxhi, Michal Kalapus, Anna Costa, Tomasz Puzyn
{"title":"Artificial augmented dataset for the enhancement of nano-QSARs models. A methodology based on topological projections.","authors":"Irini Furxhi, Michal Kalapus, Anna Costa, Tomasz Puzyn","doi":"10.1080/17435390.2023.2268163","DOIUrl":null,"url":null,"abstract":"<p><p>Nanoinformatics demands accurate predictive models to assess the potential hazards of nanomaterials (NMs). However, limited data availability and the diverse nature of NMs physicochemical properties and their interaction with biological media, hinder the development of robust nano-Quantitative Structure-Activity Relationship (QSAR) models. This article proposes an approach that combines artificially data generation techniques and topological projections to address the challenges of insufficient dataset sizes and their limited representativeness of the chemical space. By leveraging the rich information embedded in the topological features, this methodology enhances the representation of the chemical space, enabling a more an exploration of the structure-activity relationships. We demonstrate the efficacy of our approach through extensive experiments, employing various machine learning regression algorithms to validate the methodology. Finally, we compare two different resampling approaches based on different modeling scenarios. The results showcase a significant improved predictive performance of QSAR models demonstrating a promising strategy to overcome the limitations of small datasets in the field of nanoinformatics. The proposed approach offers noteworthy potential for advancing nanoinformatics research within the nanosafety domain by enabling the development of more accurate predictive models for assessing the potential hazards associated with NMs.</p>","PeriodicalId":18899,"journal":{"name":"Nanotoxicology","volume":" ","pages":"529-544"},"PeriodicalIF":3.6000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nanotoxicology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/17435390.2023.2268163","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/12/1 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"NANOSCIENCE & NANOTECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Nanoinformatics demands accurate predictive models to assess the potential hazards of nanomaterials (NMs). However, limited data availability and the diverse nature of NMs physicochemical properties and their interaction with biological media, hinder the development of robust nano-Quantitative Structure-Activity Relationship (QSAR) models. This article proposes an approach that combines artificially data generation techniques and topological projections to address the challenges of insufficient dataset sizes and their limited representativeness of the chemical space. By leveraging the rich information embedded in the topological features, this methodology enhances the representation of the chemical space, enabling a more an exploration of the structure-activity relationships. We demonstrate the efficacy of our approach through extensive experiments, employing various machine learning regression algorithms to validate the methodology. Finally, we compare two different resampling approaches based on different modeling scenarios. The results showcase a significant improved predictive performance of QSAR models demonstrating a promising strategy to overcome the limitations of small datasets in the field of nanoinformatics. The proposed approach offers noteworthy potential for advancing nanoinformatics research within the nanosafety domain by enabling the development of more accurate predictive models for assessing the potential hazards associated with NMs.
期刊介绍:
Nanotoxicology invites contributions addressing research relating to the potential for human and environmental exposure, hazard and risk associated with the use and development of nano-structured materials. In this context, the term nano-structured materials has a broad definition, including ‘materials with at least one dimension in the nanometer size range’. These nanomaterials range from nanoparticles and nanomedicines, to nano-surfaces of larger materials and composite materials. The range of nanomaterials in use and under development is extremely diverse, so this journal includes a range of materials generated for purposeful delivery into the body (food, medicines, diagnostics and prosthetics), to consumer products (e.g. paints, cosmetics, electronics and clothing), and particles designed for environmental applications (e.g. remediation). It is the nano-size range if these materials which unifies them and defines the scope of Nanotoxicology .
While the term ‘toxicology’ indicates risk, the journal Nanotoxicology also aims to encompass studies that enhance safety during the production, use and disposal of nanomaterials. Well-controlled studies demonstrating a lack of exposure, hazard or risk associated with nanomaterials, or studies aiming to improve biocompatibility are welcomed and encouraged, as such studies will lead to an advancement of nanotechnology. Furthermore, many nanoparticles are developed with the intention to improve human health (e.g. antimicrobial agents), and again, such articles are encouraged. In order to promote quality, Nanotoxicology will prioritise publications that have demonstrated characterisation of the nanomaterials investigated.