{"title":"通过高通量建模对非目标分析确定的分子式进行优先排序:应用于从房屋灰尘中识别具有高人类积累潜力的化合物。","authors":"Zhizhen Zhang, Li Li, Hui Peng and Frank Wania","doi":"10.1039/D3EM00317E","DOIUrl":null,"url":null,"abstract":"<p >Because it is typically not possible to pursue compound identification efforts for all chemical features detected during non-target analysis (NTA), the need for prioritization arises. Here we propose a strategy that ranks chemical features detected in environmental samples based on a model-derived metric that quantifies a feature's attribute that makes it desirable to elucidate its structure, <em>e.g.</em>, a high potential for bioaccumulation in humans or wildlife. The procedure involves the identification of isomers that could plausibly represent the molecular formulae assigned to NTA-detected chemical features. For each isomer, the prioritization metric is calculated using properties predicted with high-throughput methods. After the molecular formulae are ranked based on the average values of the prioritization metric calculated for all isomers assigned to a formula, the highest ranked molecular formulae are prioritized for structure elucidation. We applied this workflow to features identified in house dust, using the ratio of chemical intake through dust ingestion to chemical concentration in blood (dose-to-concentration ratio, DCR) as the prioritization metric. Collections of isomers for the molecular formulae were assembled from the PubChem database and DCR was estimated using partitioning and biotransformation properties predicted for each isomer using quantitative structure property relationships. The ten top-ranked molecular formulae with notably lower average DCR-values represented mostly compounds already known to be indoor pollutants of concern, such as two polybrominated diphenyl ethers, bis(2-ethylhexyl) tetrabromophthalate, tetrabromobisphenol A, tris(1,3-dichloroisopropyl)phosphate and the azo dye disperse blue 373.</p>","PeriodicalId":74,"journal":{"name":"Environmental Science: Processes & Impacts","volume":" 11","pages":" 1817-1829"},"PeriodicalIF":4.3000,"publicationDate":"2023-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prioritizing molecular formulae identified by non-target analysis through high-throughput modelling: application to identify compounds with high human accumulation potential from house dust\",\"authors\":\"Zhizhen Zhang, Li Li, Hui Peng and Frank Wania\",\"doi\":\"10.1039/D3EM00317E\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >Because it is typically not possible to pursue compound identification efforts for all chemical features detected during non-target analysis (NTA), the need for prioritization arises. Here we propose a strategy that ranks chemical features detected in environmental samples based on a model-derived metric that quantifies a feature's attribute that makes it desirable to elucidate its structure, <em>e.g.</em>, a high potential for bioaccumulation in humans or wildlife. The procedure involves the identification of isomers that could plausibly represent the molecular formulae assigned to NTA-detected chemical features. For each isomer, the prioritization metric is calculated using properties predicted with high-throughput methods. After the molecular formulae are ranked based on the average values of the prioritization metric calculated for all isomers assigned to a formula, the highest ranked molecular formulae are prioritized for structure elucidation. We applied this workflow to features identified in house dust, using the ratio of chemical intake through dust ingestion to chemical concentration in blood (dose-to-concentration ratio, DCR) as the prioritization metric. Collections of isomers for the molecular formulae were assembled from the PubChem database and DCR was estimated using partitioning and biotransformation properties predicted for each isomer using quantitative structure property relationships. The ten top-ranked molecular formulae with notably lower average DCR-values represented mostly compounds already known to be indoor pollutants of concern, such as two polybrominated diphenyl ethers, bis(2-ethylhexyl) tetrabromophthalate, tetrabromobisphenol A, tris(1,3-dichloroisopropyl)phosphate and the azo dye disperse blue 373.</p>\",\"PeriodicalId\":74,\"journal\":{\"name\":\"Environmental Science: Processes & Impacts\",\"volume\":\" 11\",\"pages\":\" 1817-1829\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2023-10-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Science: Processes & Impacts\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://pubs.rsc.org/en/content/articlelanding/2023/em/d3em00317e\",\"RegionNum\":3,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Science: Processes & Impacts","FirstCategoryId":"93","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2023/em/d3em00317e","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
Prioritizing molecular formulae identified by non-target analysis through high-throughput modelling: application to identify compounds with high human accumulation potential from house dust
Because it is typically not possible to pursue compound identification efforts for all chemical features detected during non-target analysis (NTA), the need for prioritization arises. Here we propose a strategy that ranks chemical features detected in environmental samples based on a model-derived metric that quantifies a feature's attribute that makes it desirable to elucidate its structure, e.g., a high potential for bioaccumulation in humans or wildlife. The procedure involves the identification of isomers that could plausibly represent the molecular formulae assigned to NTA-detected chemical features. For each isomer, the prioritization metric is calculated using properties predicted with high-throughput methods. After the molecular formulae are ranked based on the average values of the prioritization metric calculated for all isomers assigned to a formula, the highest ranked molecular formulae are prioritized for structure elucidation. We applied this workflow to features identified in house dust, using the ratio of chemical intake through dust ingestion to chemical concentration in blood (dose-to-concentration ratio, DCR) as the prioritization metric. Collections of isomers for the molecular formulae were assembled from the PubChem database and DCR was estimated using partitioning and biotransformation properties predicted for each isomer using quantitative structure property relationships. The ten top-ranked molecular formulae with notably lower average DCR-values represented mostly compounds already known to be indoor pollutants of concern, such as two polybrominated diphenyl ethers, bis(2-ethylhexyl) tetrabromophthalate, tetrabromobisphenol A, tris(1,3-dichloroisopropyl)phosphate and the azo dye disperse blue 373.
期刊介绍:
Environmental Science: Processes & Impacts publishes high quality papers in all areas of the environmental chemical sciences, including chemistry of the air, water, soil and sediment. We welcome studies on the environmental fate and effects of anthropogenic and naturally occurring contaminants, both chemical and microbiological, as well as related natural element cycling processes.