AI4DR: Development and implementation of an annotation system for high-throughput dose-response experiments

IF 5.4 Artificial intelligence in the life sciences Pub Date : 2023-02-06 DOI:10.1016/j.ailsci.2023.100063

Marc Bianciotto , Lionel Colliandre , Kun Mi , Isabelle Schreiber , Cécile Delorme , Stéphanie Vougier , Hervé Minoux

{"title":"AI4DR: Development and implementation of an annotation system for high-throughput dose-response experiments","authors":"Marc Bianciotto , Lionel Colliandre , Kun Mi , Isabelle Schreiber , Cécile Delorme , Stéphanie Vougier , Hervé Minoux","doi":"10.1016/j.ailsci.2023.100063","DOIUrl":null,"url":null,"abstract":"<div><p>One of the common strategies to identify novel chemical matter in drug discovery consists in performing a High Throughput Screening (HTS). However, the large amount of data generated at the dose-response (DR) step of an HTS campaign requires a careful analysis to detect artifacts and correct erroneous datapoints before validating the experiments. This step which requires to review each DR experiment can be time consuming and prone to human errors or inconsistencies. AI4DR is a system that has been developed for the classification of DR curves based on a Convolutional Neural Network (CNN) acting on normalized images of the DR curves. AI4DR allows the annotation in minutes of thousands of curves among 14 categories to help the High Throughput Screening biologists in their analyses. Several categories are associated with active and inactive compounds, other categories correspond to features of interest such as the presence of noise, a weaker effect at high doses, or a suspiciously weak or strong slope at the inflexion point of the DR curves of actives. The classifier has been trained on an algorithmically generated dataset curated and refined by experts, tested using real screening campaigns and improved using thousands of annotations by experts. The solution is deployed using a MLFlow model server interfaced with the Genedata Screener data analysis software used by the end users. AI4DR improves the consistency, the robustness, and the speed of HTS data analysis as well as reducing the human effort to identify faster new medicines for patients.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"3 ","pages":"Article 100063"},"PeriodicalIF":5.4000,"publicationDate":"2023-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence in the life sciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667318523000077","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

One of the common strategies to identify novel chemical matter in drug discovery consists in performing a High Throughput Screening (HTS). However, the large amount of data generated at the dose-response (DR) step of an HTS campaign requires a careful analysis to detect artifacts and correct erroneous datapoints before validating the experiments. This step which requires to review each DR experiment can be time consuming and prone to human errors or inconsistencies. AI4DR is a system that has been developed for the classification of DR curves based on a Convolutional Neural Network (CNN) acting on normalized images of the DR curves. AI4DR allows the annotation in minutes of thousands of curves among 14 categories to help the High Throughput Screening biologists in their analyses. Several categories are associated with active and inactive compounds, other categories correspond to features of interest such as the presence of noise, a weaker effect at high doses, or a suspiciously weak or strong slope at the inflexion point of the DR curves of actives. The classifier has been trained on an algorithmically generated dataset curated and refined by experts, tested using real screening campaigns and improved using thousands of annotations by experts. The solution is deployed using a MLFlow model server interfaced with the Genedata Screener data analysis software used by the end users. AI4DR improves the consistency, the robustness, and the speed of HTS data analysis as well as reducing the human effort to identify faster new medicines for patients.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

AI4DR：高通量剂量反应实验注释系统的开发与实现

在药物发现中识别新化学物质的常见策略之一是进行高通量筛选(HTS)。然而，在HTS活动的剂量-反应(DR)步骤中产生的大量数据需要在验证实验之前进行仔细分析，以检测伪影并纠正错误的数据点。这一步骤需要审查每个DR实验，可能非常耗时，而且容易出现人为错误或不一致。AI4DR是一种基于卷积神经网络(CNN)的DR曲线分类系统，它作用于DR曲线的归一化图像。AI4DR允许在几分钟内注释14个类别中的数千条曲线，以帮助高通量筛选生物学家进行分析。一些类别与活性和非活性化合物有关，其他类别对应于感兴趣的特征，例如噪声的存在，高剂量时较弱的效应，或活性物质DR曲线拐点处可疑的弱或强斜率。分类器已经在由专家策划和改进的算法生成的数据集上进行了训练，使用真实的筛选活动进行了测试，并使用专家的数千个注释进行了改进。该解决方案使用MLFlow模型服务器与最终用户使用的Genedata Screener数据分析软件进行部署。AI4DR提高了HTS数据分析的一致性、稳健性和速度，并减少了为更快地为患者识别新药而付出的人力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Artificial intelligence in the life sciences Pharmacology, Biochemistry, Genetics and Molecular Biology (General), Computer Science Applications, Health Informatics, Drug Discovery, Veterinary Science and Veterinary Medicine (General)

CiteScore

5.00

自引率

0.00%

发文量

审稿时长

15 days