AI4DR: Development and implementation of an annotation system for high-throughput dose-response experiments

Marc Bianciotto , Lionel Colliandre , Kun Mi , Isabelle Schreiber , Cécile Delorme , Stéphanie Vougier , Hervé Minoux
{"title":"AI4DR: Development and implementation of an annotation system for high-throughput dose-response experiments","authors":"Marc Bianciotto ,&nbsp;Lionel Colliandre ,&nbsp;Kun Mi ,&nbsp;Isabelle Schreiber ,&nbsp;Cécile Delorme ,&nbsp;Stéphanie Vougier ,&nbsp;Hervé Minoux","doi":"10.1016/j.ailsci.2023.100063","DOIUrl":null,"url":null,"abstract":"<div><p>One of the common strategies to identify novel chemical matter in drug discovery consists in performing a High Throughput Screening (HTS). However, the large amount of data generated at the dose-response (DR) step of an HTS campaign requires a careful analysis to detect artifacts and correct erroneous datapoints before validating the experiments. This step which requires to review each DR experiment can be time consuming and prone to human errors or inconsistencies. AI4DR is a system that has been developed for the classification of DR curves based on a Convolutional Neural Network (CNN) acting on normalized images of the DR curves. AI4DR allows the annotation in minutes of thousands of curves among 14 categories to help the High Throughput Screening biologists in their analyses. Several categories are associated with active and inactive compounds, other categories correspond to features of interest such as the presence of noise, a weaker effect at high doses, or a suspiciously weak or strong slope at the inflexion point of the DR curves of actives. The classifier has been trained on an algorithmically generated dataset curated and refined by experts, tested using real screening campaigns and improved using thousands of annotations by experts. The solution is deployed using a MLFlow model server interfaced with the Genedata Screener data analysis software used by the end users. AI4DR improves the consistency, the robustness, and the speed of HTS data analysis as well as reducing the human effort to identify faster new medicines for patients.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"3 ","pages":"Article 100063"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence in the life sciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667318523000077","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

One of the common strategies to identify novel chemical matter in drug discovery consists in performing a High Throughput Screening (HTS). However, the large amount of data generated at the dose-response (DR) step of an HTS campaign requires a careful analysis to detect artifacts and correct erroneous datapoints before validating the experiments. This step which requires to review each DR experiment can be time consuming and prone to human errors or inconsistencies. AI4DR is a system that has been developed for the classification of DR curves based on a Convolutional Neural Network (CNN) acting on normalized images of the DR curves. AI4DR allows the annotation in minutes of thousands of curves among 14 categories to help the High Throughput Screening biologists in their analyses. Several categories are associated with active and inactive compounds, other categories correspond to features of interest such as the presence of noise, a weaker effect at high doses, or a suspiciously weak or strong slope at the inflexion point of the DR curves of actives. The classifier has been trained on an algorithmically generated dataset curated and refined by experts, tested using real screening campaigns and improved using thousands of annotations by experts. The solution is deployed using a MLFlow model server interfaced with the Genedata Screener data analysis software used by the end users. AI4DR improves the consistency, the robustness, and the speed of HTS data analysis as well as reducing the human effort to identify faster new medicines for patients.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
AI4DR:高通量剂量反应实验注释系统的开发与实现
在药物发现中识别新化学物质的常见策略之一是进行高通量筛选(HTS)。然而,在HTS活动的剂量-反应(DR)步骤中产生的大量数据需要在验证实验之前进行仔细分析,以检测伪影并纠正错误的数据点。这一步骤需要审查每个DR实验,可能非常耗时,而且容易出现人为错误或不一致。AI4DR是一种基于卷积神经网络(CNN)的DR曲线分类系统,它作用于DR曲线的归一化图像。AI4DR允许在几分钟内注释14个类别中的数千条曲线,以帮助高通量筛选生物学家进行分析。一些类别与活性和非活性化合物有关,其他类别对应于感兴趣的特征,例如噪声的存在,高剂量时较弱的效应,或活性物质DR曲线拐点处可疑的弱或强斜率。分类器已经在由专家策划和改进的算法生成的数据集上进行了训练,使用真实的筛选活动进行了测试,并使用专家的数千个注释进行了改进。该解决方案使用MLFlow模型服务器与最终用户使用的Genedata Screener数据分析软件进行部署。AI4DR提高了HTS数据分析的一致性、稳健性和速度,并减少了为更快地为患者识别新药而付出的人力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Artificial intelligence in the life sciences
Artificial intelligence in the life sciences Pharmacology, Biochemistry, Genetics and Molecular Biology (General), Computer Science Applications, Health Informatics, Drug Discovery, Veterinary Science and Veterinary Medicine (General)
CiteScore
5.00
自引率
0.00%
发文量
0
审稿时长
15 days
期刊最新文献
Actively protective combinatorial analysis: A scalable novel method for detecting variants that contribute to reduced disease prevalence in high-risk individuals Explainable artificial intelligence for targeted protein degradation predictions The path to adoption of open source AI for drug discovery in Africa Editorial Board Rethinking the 'best method' paradigm: The effectiveness of hybrid and multidisciplinary approaches in chemoinformatics
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1