Machine Learning-Assisted High-Throughput Screening for Anti-MRSA Compounds

IF 3.4 3区生物学 Q2 BIOCHEMICAL RESEARCH METHODS IEEE/ACM Transactions on Computational Biology and Bioinformatics Pub Date : 2024-07-26 DOI:10.1109/TCBB.2024.3434340

Fadi Shehadeh;LewisOscar Felix;Markos Kalligeros;Adnan Shehadeh;Beth Burgwyn Fuchs;Frederick M. Ausubel;Paul P. Sotiriadis;Eleftherios Mylonakis

{"title":"Machine Learning-Assisted High-Throughput Screening for Anti-MRSA Compounds","authors":"Fadi Shehadeh;LewisOscar Felix;Markos Kalligeros;Adnan Shehadeh;Beth Burgwyn Fuchs;Frederick M. Ausubel;Paul P. Sotiriadis;Eleftherios Mylonakis","doi":"10.1109/TCBB.2024.3434340","DOIUrl":null,"url":null,"abstract":"Background: Antimicrobial resistance is a major public health threat, and new agents are needed. Computational approaches have been proposed to reduce the cost and time needed for compound screening. Aims: A machine learning (ML) model was developed for the \n<italic>in silico</i>\n screening of low molecular weight molecules. Methods: We used the results of a high-throughput \n<italic>Caenorhabditis elegans</i>\n methicillin-resistant \n<italic>Staphylococcus aureus</i>\n (MRSA) liquid infection assay to develop ML models for compound prioritization and quality control. Results: The compound prioritization model achieved an AUC of 0.795 with a sensitivity of 81% and a specificity of 70%. When applied to a validation set of 22,768 compounds, the model identified 81% of the active compounds identified by high-throughput screening (HTS) among only 30.6% of the total 22,768 compounds, resulting in a 2.67-fold increase in hit rate. When we retrained the model on all the compounds of the HTS dataset, it further identified 45 discordant molecules classified as non-hits by the HTS, with 42/45 (93%) having known antimicrobial activity. Conclusion: Our ML approach can be used to increase HTS efficiency by reducing the number of compounds that need to be physically screened and identifying potential missed hits, making HTS more accessible and reducing barriers to entry.","PeriodicalId":13344,"journal":{"name":"IEEE/ACM Transactions on Computational Biology and Bioinformatics","volume":"21 6","pages":"1911-1921"},"PeriodicalIF":3.4000,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE/ACM Transactions on Computational Biology and Bioinformatics","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10609973/","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Antimicrobial resistance is a major public health threat, and new agents are needed. Computational approaches have been proposed to reduce the cost and time needed for compound screening. Aims: A machine learning (ML) model was developed for the in silico screening of low molecular weight molecules. Methods: We used the results of a high-throughput Caenorhabditis elegans methicillin-resistant Staphylococcus aureus (MRSA) liquid infection assay to develop ML models for compound prioritization and quality control. Results: The compound prioritization model achieved an AUC of 0.795 with a sensitivity of 81% and a specificity of 70%. When applied to a validation set of 22,768 compounds, the model identified 81% of the active compounds identified by high-throughput screening (HTS) among only 30.6% of the total 22,768 compounds, resulting in a 2.67-fold increase in hit rate. When we retrained the model on all the compounds of the HTS dataset, it further identified 45 discordant molecules classified as non-hits by the HTS, with 42/45 (93%) having known antimicrobial activity. Conclusion: Our ML approach can be used to increase HTS efficiency by reducing the number of compounds that need to be physically screened and identifying potential missed hits, making HTS more accessible and reducing barriers to entry.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

机器学习辅助高通量筛选抗 MRSA 化合物。

背景：抗菌药耐药性是一个重大的公共卫生威胁，需要新的制剂。目的：我们开发了一种机器学习（ML）模型，用于对低分子量分子进行硅学筛选：我们利用高通量秀丽隐杆线虫耐甲氧西林金黄色葡萄球菌（MRSA）液体感染试验的结果，开发了用于化合物优先排序和质量控制的机器学习模型：化合物优先排序模型的 AUC 为 0.795，灵敏度为 81%，特异度为 70%。当应用于由 22,768 个化合物组成的验证集时，该模型仅从总数 22,768 个化合物中的 30.6% 中识别出了 81% 通过高通量筛选 (HTS) 确定的活性化合物，从而使命中率提高了 2.67 倍。当我们在高通量筛选数据集的所有化合物上重新训练模型时，它进一步识别出了 45 个被高通量筛选归类为非命中的不和谐分子，其中 42/45 （93%）具有已知的抗菌活性：我们的 ML 方法可用于提高 HTS 效率，减少需要物理筛选的化合物数量，并识别潜在的漏检分子，从而使 HTS 更容易获得并降低进入门槛。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE/ACM Transactions on Computational Biology and Bioinformatics 工程技术-计算机：跨学科应用

CiteScore

7.50

自引率

6.70%

发文量

479

审稿时长

3 months

期刊介绍： IEEE/ACM Transactions on Computational Biology and Bioinformatics emphasizes the algorithmic, mathematical, statistical and computational methods that are central in bioinformatics and computational biology; the development and testing of effective computer programs in bioinformatics; the development of biological databases; and important biological results that are obtained from the use of these methods, programs and databases; the emerging field of Systems Biology, where many forms of data are used to create a computer-based model of a complex biological system