欧盟海洋观测系统溶解度预测挑战赛的概要和背景。

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS ACS Applied Bio Materials Pub Date : 2024-03-20 DOI:10.1016/j.slasd.2024.100155
Wenyu Wang , Jing Tang , Andrea Zaliani
{"title":"欧盟海洋观测系统溶解度预测挑战赛的概要和背景。","authors":"Wenyu Wang ,&nbsp;Jing Tang ,&nbsp;Andrea Zaliani","doi":"10.1016/j.slasd.2024.100155","DOIUrl":null,"url":null,"abstract":"<div><p>In June 2022, EU-OS came to the decision to make public a solubility data set of 100+K compounds obtained from several of the EU-OS proprietary screening compound collections. Leveraging on the interest of SLAS for screening scientific development it was decided to launch a joint EUOS-SLAS competition within the chemoinformatics and machine learning (ML) communities. The competition was open to real world computation experts, for the best, most predictive, classification model of compound solubility. The aim of the competition was multiple: from a practical side, the winning model should then serve as a cornerstone for future solubility predictions having used the largest training set so far publicly available. From a higher project perspective, the intent was to focus the energies and experiences, even if professionally not precisely coming from Pharma R&amp;D; to address the issue of how to predict compound solubility. Here we report how the competition was ideated and the practical aspects of conducting it within the Kaggle framework, leveraging of the versatility and the open-source nature of this data science platform. Consideration on results and challenges encountered have been also examined.</p></div>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2472555224000170/pdfft?md5=31adf40156a4682f7a09408654d5d462&pid=1-s2.0-S2472555224000170-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Outline and background for the EU-OS solubility prediction challenge\",\"authors\":\"Wenyu Wang ,&nbsp;Jing Tang ,&nbsp;Andrea Zaliani\",\"doi\":\"10.1016/j.slasd.2024.100155\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In June 2022, EU-OS came to the decision to make public a solubility data set of 100+K compounds obtained from several of the EU-OS proprietary screening compound collections. Leveraging on the interest of SLAS for screening scientific development it was decided to launch a joint EUOS-SLAS competition within the chemoinformatics and machine learning (ML) communities. The competition was open to real world computation experts, for the best, most predictive, classification model of compound solubility. The aim of the competition was multiple: from a practical side, the winning model should then serve as a cornerstone for future solubility predictions having used the largest training set so far publicly available. From a higher project perspective, the intent was to focus the energies and experiences, even if professionally not precisely coming from Pharma R&amp;D; to address the issue of how to predict compound solubility. Here we report how the competition was ideated and the practical aspects of conducting it within the Kaggle framework, leveraging of the versatility and the open-source nature of this data science platform. Consideration on results and challenges encountered have been also examined.</p></div>\",\"PeriodicalId\":2,\"journal\":{\"name\":\"ACS Applied Bio Materials\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-03-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2472555224000170/pdfft?md5=31adf40156a4682f7a09408654d5d462&pid=1-s2.0-S2472555224000170-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS Applied Bio Materials\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2472555224000170\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATERIALS SCIENCE, BIOMATERIALS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2472555224000170","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0

摘要

2022 年 6 月,EU-OS 决定公开从几个 EU-OS 专有筛选化合物库中获得的 100 多 K 个化合物的溶解度数据集。利用 SLAS 对筛选科学发展的兴趣,决定在化学信息学和机器学习 (ML) 社区内发起 EUOS-SLAS 联合竞赛。比赛面向现实世界中的计算专家,旨在选出最佳、最具预测性的化合物溶解度分类模型。比赛的目的是多方面的:从实用角度看,获胜模型应作为未来溶解度预测的基石,并使用迄今为止可公开获得的最大训练集。从更高的项目角度来看,目的是集中精力和经验(即使在专业上并非完全来自制药研发),解决如何预测化合物溶解度的问题。在此,我们将报告如何在 Kaggle 框架内构思比赛,以及利用这一数据科学平台的多功能性和开源性开展比赛的实际情况。此外,我们还探讨了比赛结果和遇到的挑战。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Outline and background for the EU-OS solubility prediction challenge

In June 2022, EU-OS came to the decision to make public a solubility data set of 100+K compounds obtained from several of the EU-OS proprietary screening compound collections. Leveraging on the interest of SLAS for screening scientific development it was decided to launch a joint EUOS-SLAS competition within the chemoinformatics and machine learning (ML) communities. The competition was open to real world computation experts, for the best, most predictive, classification model of compound solubility. The aim of the competition was multiple: from a practical side, the winning model should then serve as a cornerstone for future solubility predictions having used the largest training set so far publicly available. From a higher project perspective, the intent was to focus the energies and experiences, even if professionally not precisely coming from Pharma R&D; to address the issue of how to predict compound solubility. Here we report how the competition was ideated and the practical aspects of conducting it within the Kaggle framework, leveraging of the versatility and the open-source nature of this data science platform. Consideration on results and challenges encountered have been also examined.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
ACS Applied Bio Materials
ACS Applied Bio Materials Chemistry-Chemistry (all)
CiteScore
9.40
自引率
2.10%
发文量
464
期刊最新文献
A Systematic Review of Sleep Disturbance in Idiopathic Intracranial Hypertension. Advancing Patient Education in Idiopathic Intracranial Hypertension: The Promise of Large Language Models. Anti-Myelin-Associated Glycoprotein Neuropathy: Recent Developments. Approach to Managing the Initial Presentation of Multiple Sclerosis: A Worldwide Practice Survey. Association Between LACE+ Index Risk Category and 90-Day Mortality After Stroke.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1