首页 > 最新文献

Artificial intelligence in the life sciences最新文献

英文 中文
Analysis of Swin-UNet vision transformer for Inferior Vena Cava filter segmentation from CT scans Swin-UNet视觉变换器用于下腔静脉CT滤波分割的分析
Pub Date : 2023-08-18 DOI: 10.1016/j.ailsci.2023.100084
Rahul Gomes , Tyler Pham , Nichol He , Connor Kamrowski , Joseph Wildenberg

Purpose

The purpose of this study is to develop an accurate deep learning model capable of Inferior Vena Cava (IVC) filter segmentation from CT scans. The study does a comparative assessment of the impact of Residual Networks (ResNets) complemented with reduced convolutional layer depth and also analyzes the impact of using vision transformer architectures without performance degradation.

Materials and Methods

This experimental retrospective study on 84 CT scans consisting of 54618 slices involves design, implementation, and evaluation of segmentation algorithm which can be used to generate a clinical report for the presence of IVC filters on abdominal CT scans performed for any reason. Several variants of patch-based 3D-Convolutional Neural Network (CNN) and the Swin UNet Transformer (Swin-UNETR) are used to retrieve the signature of IVC filters. The Dice Score is used as a metric to compare the performance of the segmentation models.

Results

Model trained on UNet variant using four ResNet layers showed a higher segmentation performance achieving median Dice = 0.92 [Interquartile range(IQR): 0.85, 0.93] compared to the plain UNet model with four layers having median Dice = 0.89 [IQR: 0.83, 0.92]. Segmentation results from ResNet with two layers achieved a median Dice = 0.93 [IQR: 0.87, 0.94] which was higher than the plain UNet model with two layers at median Dice = 0.87 [IQR: 0.77, 0.90]. Models trained using SWIN-based transformers performed significantly better in both training and validation datasets compared to the four CNN variants. The validation median Dice was highest in 4 layer Swin UNETR at 0.88 followed by 2 layer Swin UNETR at 0.85.

Conclusion

Utilization of vision based transformer Swin-UNETR results in segmentation output with both low bias and variance thereby solving a real-world problem within healthcare for advanced Artificial Intelligence (AI) image processing and recognition. The Swin UNETR will reduce the time spent manually tracking IVC filters by centralizing within the electronic health record. Link to GitHub repository.

目的建立一种精确的深度学习模型,用于下腔静脉(IVC) CT图像的滤波分割。该研究对残差网络(ResNets)与减少卷积层深度相结合的影响进行了比较评估,并分析了在不降低性能的情况下使用视觉转换器架构的影响。材料和方法本实验回顾性研究了84个CT扫描,包括54618个切片,涉及分割算法的设计、实现和评估,该算法可用于生成临床报告,用于任何原因进行的腹部CT扫描中存在IVC过滤器。基于补丁的三维卷积神经网络(CNN)和Swin UNet变压器(swan - unetr)的几种变体被用于检索IVC滤波器的特征。Dice Score被用作比较分割模型性能的指标。结果使用4个ResNet层训练的UNet变体模型与使用4个ResNet层训练的UNet模型相比,具有更高的分割性能,达到中位数Dice = 0.92[四分位间距(IQR): 0.85, 0.93],而普通UNet模型的中位数Dice = 0.89 [IQR: 0.83, 0.92]。ResNet两层分割结果的中位数Dice = 0.93 [IQR: 0.87, 0.94],高于普通UNet两层模型的中位数Dice = 0.87 [IQR: 0.77, 0.90]。与四种CNN变体相比,使用基于swn的变压器训练的模型在训练和验证数据集中的表现都要好得多。4层Swin UNETR的验证中位数骰子最高,为0.88,其次是2层Swin UNETR,为0.85。结论使用基于视觉的swun - unetr变压器可以获得低偏差和方差的分割输出,从而解决了先进人工智能(AI)图像处理和识别在医疗保健中的现实问题。Swin UNETR将通过集中在电子健康记录内减少人工跟踪IVC过滤器所花费的时间。链接到GitHub仓库。
{"title":"Analysis of Swin-UNet vision transformer for Inferior Vena Cava filter segmentation from CT scans","authors":"Rahul Gomes ,&nbsp;Tyler Pham ,&nbsp;Nichol He ,&nbsp;Connor Kamrowski ,&nbsp;Joseph Wildenberg","doi":"10.1016/j.ailsci.2023.100084","DOIUrl":"10.1016/j.ailsci.2023.100084","url":null,"abstract":"<div><h3>Purpose</h3><p>The purpose of this study is to develop an accurate deep learning model capable of Inferior Vena Cava (IVC) filter segmentation from CT scans. The study does a comparative assessment of the impact of Residual Networks (ResNets) complemented with reduced convolutional layer depth and also analyzes the impact of using vision transformer architectures without performance degradation.</p></div><div><h3>Materials and Methods</h3><p>This experimental retrospective study on 84 CT scans consisting of 54618 slices involves design, implementation, and evaluation of segmentation algorithm which can be used to generate a clinical report for the presence of IVC filters on abdominal CT scans performed for any reason. Several variants of patch-based 3D-Convolutional Neural Network (CNN) and the Swin UNet Transformer (Swin-UNETR) are used to retrieve the signature of IVC filters. The Dice Score is used as a metric to compare the performance of the segmentation models.</p></div><div><h3>Results</h3><p>Model trained on UNet variant using four ResNet layers showed a higher segmentation performance achieving median Dice = 0.92 [Interquartile range(IQR): 0.85, 0.93] compared to the plain UNet model with four layers having median Dice = 0.89 [IQR: 0.83, 0.92]. Segmentation results from ResNet with two layers achieved a median Dice = 0.93 [IQR: 0.87, 0.94] which was higher than the plain UNet model with two layers at median Dice = 0.87 [IQR: 0.77, 0.90]. Models trained using SWIN-based transformers performed significantly better in both training and validation datasets compared to the four CNN variants. The validation median Dice was highest in 4 layer Swin UNETR at 0.88 followed by 2 layer Swin UNETR at 0.85.</p></div><div><h3>Conclusion</h3><p>Utilization of vision based transformer Swin-UNETR results in segmentation output with both low bias and variance thereby solving a real-world problem within healthcare for advanced Artificial Intelligence (AI) image processing and recognition. The Swin UNETR will reduce the time spent manually tracking IVC filters by centralizing within the electronic health record. Link to <span>GitHub</span><svg><path></path></svg> repository.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"4 ","pages":"Article 100084"},"PeriodicalIF":0.0,"publicationDate":"2023-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46348564","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep neural network architectures for cardiac image segmentation 用于心脏图像分割的深度神经网络结构
Pub Date : 2023-08-09 DOI: 10.1016/j.ailsci.2023.100083
Jasmine El-Taraboulsi , Claudia P. Cabrera , Caroline Roney , Nay Aung

Imaging plays a fundamental role in the effective diagnosis, staging, management, and monitoring of various cardiac pathologies. Successful radiological analysis relies on accurate image segmentation, a technically arduous process, prone to human-error. To overcome the laborious and time-consuming nature of cardiac image analysis, deep learning approaches have been developed, enabling the accurate, time-efficient, and highly personalised diagnosis, staging and management of cardiac pathologies.

Here, we present a review of over 60 papers, proposing deep learning models for cardiac image segmentation. We summarise the theoretical basis of Convolutional Neural Networks, Fully Convolutional Neural Networks, U-Net, V-Net, No-New-U-Net (nnU-Net), Transformer Networks, DeepLab, Generative Adversarial Networks, Auto Encoders and Recurrent Neural Networks. In addition, we identify pertinent performance-enhancing measures including adaptive convolutional kernels, atrous convolutions, attention gates, and deep supervision modules.

Top-performing models in ventricular, myocardial, atrial and aortic segmentation are explored, highlighting U-Net and nnU-Net-based model architectures achieving state-of-the art segmentation accuracies. Additionally, key gaps in the current research and technology are identified, and areas of future research are suggested, aiming to guide the innovation and clinical adoption of automated cardiac segmentation methods.

影像在各种心脏疾病的有效诊断、分期、管理和监测中起着重要作用。成功的放射分析依赖于准确的图像分割,这是一个技术上艰巨的过程,容易出现人为错误。为了克服心脏图像分析的费力和耗时的性质,深度学习方法已经被开发出来,能够准确,高效,高度个性化的心脏病理诊断,分期和管理。在这里,我们回顾了60多篇论文,提出了用于心脏图像分割的深度学习模型。我们总结了卷积神经网络、全卷积神经网络、U-Net、V-Net、No-New-U-Net (nnU-Net)、变压器网络、DeepLab、生成对抗网络、自动编码器和循环神经网络的理论基础。此外,我们还确定了相关的性能增强措施,包括自适应卷积核、亚属性卷积、注意门和深度监督模块。探索了心室、心肌、心房和主动脉分割中表现最好的模型,突出了基于U-Net和nnu - net的模型架构,实现了最先进的分割精度。此外,指出了当前研究和技术的关键差距,并提出了未来的研究领域,旨在指导自动化心脏分割方法的创新和临床应用。
{"title":"Deep neural network architectures for cardiac image segmentation","authors":"Jasmine El-Taraboulsi ,&nbsp;Claudia P. Cabrera ,&nbsp;Caroline Roney ,&nbsp;Nay Aung","doi":"10.1016/j.ailsci.2023.100083","DOIUrl":"10.1016/j.ailsci.2023.100083","url":null,"abstract":"<div><p>Imaging plays a fundamental role in the effective diagnosis, staging, management, and monitoring of various cardiac pathologies. Successful radiological analysis relies on accurate image segmentation, a technically arduous process, prone to human-error. To overcome the laborious and time-consuming nature of cardiac image analysis, deep learning approaches have been developed, enabling the accurate, time-efficient, and highly personalised diagnosis, staging and management of cardiac pathologies.</p><p>Here, we present a review of over 60 papers, proposing deep learning models for cardiac image segmentation. We summarise the theoretical basis of Convolutional Neural Networks, Fully Convolutional Neural Networks, U-Net, V-Net, No-New-U-Net (nnU-Net), Transformer Networks, DeepLab, Generative Adversarial Networks, Auto Encoders and Recurrent Neural Networks. In addition, we identify pertinent performance-enhancing measures including adaptive convolutional kernels, atrous convolutions, attention gates, and deep supervision modules.</p><p>Top-performing models in ventricular, myocardial, atrial and aortic segmentation are explored, highlighting U-Net and nnU-Net-based model architectures achieving state-of-the art segmentation accuracies. Additionally, key gaps in the current research and technology are identified, and areas of future research are suggested, aiming to guide the innovation and clinical adoption of automated cardiac segmentation methods.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"4 ","pages":"Article 100083"},"PeriodicalIF":0.0,"publicationDate":"2023-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45888732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modeling and survival exploration of breast carcinoma: A statistical, maximum likelihood estimation, and artificial neural network perspective 乳腺癌的建模和生存探索:统计学、最大似然估计和人工神经网络的视角
Pub Date : 2023-07-17 DOI: 10.1016/j.ailsci.2023.100082
Anum Shafiq , Andaç Batur Çolak , Tabassum Naz Sindhu , Showkat Ahmad Lone , Tahani A. Abushal

The core objective of this research is to describe the behavior of the distribution using the MLE method to estimate its parameters, as well as to determine the optimal Artificial Neural Network method by comparing it to the maximum likelihood estimation method and applying it to real data for breast cancer patients to determine survival, risk, and other survival study functions of the log-logistic distribution. The parameters were defined in the input layer of the artificial neural network developed for the purpose of survival analysis and reliability function, hazard rate function, probability density function, reserved hazard rate function, Mills ratio, Odd function and CHR values were obtained in the output layer. The findings show that risk function increases with the increase in the time of infection and then decreases for a group of breast cancer patients under study, which corresponds to the theoretical properties of this according to the practical conclusions. The examination of survival analysis reveals that practical conclusions correspond to the theoretical properties of log-logistic distribution. Artificial neural networks have proven to be one of the ideal tools that can be used to predict various vital parameters, especially survival of cancer patients, with their high predictive capabilities.

本研究的核心目标是利用最大似然估计方法来描述分布的行为,并将其与最大似然估计方法进行比较,并将其应用于乳腺癌患者的实际数据,确定log-logistic分布的生存、风险等生存研究函数,从而确定最优的人工神经网络方法。在为生存分析而开发的人工神经网络的输入层定义参数,并在输出层获得可靠性函数、风险率函数、概率密度函数、保留风险率函数、Mills比、Odd函数和CHR值。研究结果表明,在所研究的一组乳腺癌患者中,风险函数随着感染时间的增加而增加,然后降低,这与实际结论的理论性质相对应。对生存分析的检验表明,实际结论符合逻辑-logistic分布的理论性质。人工神经网络已被证明是预测各种重要参数,特别是癌症患者生存的理想工具之一,具有很高的预测能力。
{"title":"Modeling and survival exploration of breast carcinoma: A statistical, maximum likelihood estimation, and artificial neural network perspective","authors":"Anum Shafiq ,&nbsp;Andaç Batur Çolak ,&nbsp;Tabassum Naz Sindhu ,&nbsp;Showkat Ahmad Lone ,&nbsp;Tahani A. Abushal","doi":"10.1016/j.ailsci.2023.100082","DOIUrl":"10.1016/j.ailsci.2023.100082","url":null,"abstract":"<div><p>The core objective of this research is to describe the behavior of the distribution using the MLE method to estimate its parameters, as well as to determine the optimal Artificial Neural Network method by comparing it to the maximum likelihood estimation method and applying it to real data for breast cancer patients to determine survival, risk, and other survival study functions of the log-logistic distribution. The parameters were defined in the input layer of the artificial neural network developed for the purpose of survival analysis and reliability function, hazard rate function, probability density function, reserved hazard rate function, Mills ratio, Odd function and CHR values were obtained in the output layer. The findings show that risk function increases with the increase in the time of infection and then decreases for a group of breast cancer patients under study, which corresponds to the theoretical properties of this according to the practical conclusions. The examination of survival analysis reveals that practical conclusions correspond to the theoretical properties of log-logistic distribution. Artificial neural networks have proven to be one of the ideal tools that can be used to predict various vital parameters, especially survival of cancer patients, with their high predictive capabilities.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"4 ","pages":"Article 100082"},"PeriodicalIF":0.0,"publicationDate":"2023-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41536157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
piCRISPR: Physically informed deep learning models for CRISPR/Cas9 off-target cleavage prediction piCRISPR:用于CRISPR/Cas9脱靶切割预测的物理信息深度学习模型
Pub Date : 2023-05-15 DOI: 10.1016/j.ailsci.2023.100075
Florian Störtz, Jeffrey K. Mak, Peter Minary

CRISPR/Cas programmable nuclease systems have become ubiquitous in the field of gene editing. With progressing development, applications in in vivo therapeutic gene editing are increasingly within reach, yet limited by possible adverse side effects from unwanted edits. Recent years have thus seen continuous development of off-target prediction algorithms trained on in vitro cleavage assay data gained from immortalised cell lines. It has been shown that in contrast to experimental epigenetic features, computed physically informed features are so far underutilised despite bearing considerably larger correlation with cleavage activity. Here, we implement state-of-the-art deep learning algorithms and feature encodings for off-target prediction with emphasis on physically informed features that capture the biological environment of the cleavage site, hence terming our approach piCRISPR. Features were gained from the large, diverse crisprSQL off-target cleavage dataset. We find that our best-performing models highlight the importance of sequence context and chromatin accessibility for cleavage prediction and compare favourably with literature standard prediction performance. We further show that our novel, environmentally sensitive features are crucial to accurate prediction on sequence-identical locus pairs, making them highly relevant for clinical guide design. The source code and trained models can be found ready to use at github.com/florianst/picrispr.

CRISPR/Cas可编程核酸酶系统在基因编辑领域已经变得无处不在。随着开发的进展,体内治疗性基因编辑的应用越来越触手可及,但受到不必要编辑可能产生的副作用的限制。因此,近年来,在从永生化细胞系获得的体外切割测定数据上训练的脱靶预测算法不断发展。研究表明,与实验表观遗传学特征相比,尽管计算的物理信息特征与切割活性具有相当大的相关性,但迄今为止尚未得到充分利用。在这里,我们实现了最先进的深度学习算法和特征编码,用于脱靶预测,重点是捕捉切割位点生物环境的物理信息特征,从而确定了我们的方法piCRISPR。特征是从大型、多样化的crisprSQL脱靶切割数据集中获得的。我们发现,我们表现最好的模型强调了序列上下文和染色质可及性对切割预测的重要性,并与文献标准预测性能相比较。我们进一步表明,我们新颖的环境敏感特征对于准确预测序列相同的基因座对至关重要,使其与临床指南设计高度相关。源代码和经过训练的模型可以在github.com/florians/picrispr上找到。
{"title":"piCRISPR: Physically informed deep learning models for CRISPR/Cas9 off-target cleavage prediction","authors":"Florian Störtz,&nbsp;Jeffrey K. Mak,&nbsp;Peter Minary","doi":"10.1016/j.ailsci.2023.100075","DOIUrl":"https://doi.org/10.1016/j.ailsci.2023.100075","url":null,"abstract":"<div><p>CRISPR/Cas programmable nuclease systems have become ubiquitous in the field of gene editing. With progressing development, applications in <em>in vivo</em> therapeutic gene editing are increasingly within reach, yet limited by possible adverse side effects from unwanted edits. Recent years have thus seen continuous development of off-target prediction algorithms trained on <em>in vitro</em> cleavage assay data gained from immortalised cell lines. It has been shown that in contrast to experimental epigenetic features, computed physically informed features are so far underutilised despite bearing considerably larger correlation with cleavage activity. Here, we implement state-of-the-art deep learning algorithms and feature encodings for off-target prediction with emphasis on <em>physically informed</em> features that capture the biological environment of the cleavage site, hence terming our approach piCRISPR. Features were gained from the large, diverse crisprSQL off-target cleavage dataset. We find that our best-performing models highlight the importance of sequence context and chromatin accessibility for cleavage prediction and compare favourably with literature standard prediction performance. We further show that our novel, environmentally sensitive features are crucial to accurate prediction on sequence-identical locus pairs, making them highly relevant for clinical guide design. The source code and trained models can be found ready to use at <span>github.com/florianst/picrispr</span><svg><path></path></svg>.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"3 ","pages":"Article 100075"},"PeriodicalIF":0.0,"publicationDate":"2023-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49774977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Trends and challenges in chemoinformatics research in Latin America 拉丁美洲化学信息学研究的趋势和挑战
Pub Date : 2023-05-11 DOI: 10.1016/j.ailsci.2023.100077
Jazmín Miranda-Salas , Carlos Peña-Varas , Ignacio Valenzuela Martínez , Dionisio A. Olmedo , William J. Zamora , Miguel Angel Chávez-Fumagalli , Daniela Q. Azevedo , Rachel Oliveira Castilho , Vinicius G. Maltarollo , David Ramírez , José L. Medina-Franco

Chemoinformatics is an independent inter-discipline with a broad impact in drug design and discovery, medicinal chemistry, biochemistry, analytical and organic chemistry, natural products, and several other areas in chemistry. Through collaborations, scientific exchanges, and participation in international research networks, Latin American scientists have contributed to the development of this subject. The aim of this perspective is to discuss the status and progress of the chemoinformatic discipline in Latin America. We team up to provide an author´s perspective on the topics that have been investigated and published over the past twelve years, collaborations between Latin America researchers and others worldwide, contributions to open-access chemoinformatic tools such as web servers, and educational-related resources and events, such as scientific conferences. We conclude that linking and fostering collaboration within each nation as well as among other Latin American nations and globally is made possible by open science and the democratization of science. We also outline strategic actions that can boost the development and practice of chemoinformatic in the region and enhance the interaction between Latin American countries and the rest of the world.

化学信息学是一门独立的交叉学科,在药物设计和发现、药物化学、生物化学、分析和有机化学、天然产物以及化学的其他几个领域都有广泛的影响。通过合作、科学交流和参与国际研究网络,拉丁美洲科学家为这一学科的发展做出了贡献。本展望的目的是讨论拉丁美洲化学信息学学科的现状和进展。我们合作提供作者对过去12年来研究和发表的主题的观点,拉丁美洲研究人员和世界各地其他研究人员之间的合作,对开放获取的化学信息学工具(如web服务器)的贡献,以及与教育相关的资源和事件(如科学会议)。我们的结论是,通过开放科学和科学民主化,可以在每个国家内部、在其他拉丁美洲国家之间以及在全球范围内建立联系并促进合作。我们还概述了可以促进该地区化学信息学发展和实践的战略行动,并加强拉丁美洲国家与世界其他国家之间的互动。
{"title":"Trends and challenges in chemoinformatics research in Latin America","authors":"Jazmín Miranda-Salas ,&nbsp;Carlos Peña-Varas ,&nbsp;Ignacio Valenzuela Martínez ,&nbsp;Dionisio A. Olmedo ,&nbsp;William J. Zamora ,&nbsp;Miguel Angel Chávez-Fumagalli ,&nbsp;Daniela Q. Azevedo ,&nbsp;Rachel Oliveira Castilho ,&nbsp;Vinicius G. Maltarollo ,&nbsp;David Ramírez ,&nbsp;José L. Medina-Franco","doi":"10.1016/j.ailsci.2023.100077","DOIUrl":"10.1016/j.ailsci.2023.100077","url":null,"abstract":"<div><p>Chemoinformatics is an independent inter-discipline with a broad impact in drug design and discovery, medicinal chemistry, biochemistry, analytical and organic chemistry, natural products, and several other areas in chemistry. Through collaborations, scientific exchanges, and participation in international research networks, Latin American scientists have contributed to the development of this subject. The aim of this perspective is to discuss the status and progress of the chemoinformatic discipline in Latin America. We team up to provide an author´s perspective on the topics that have been investigated and published over the past twelve years, collaborations between Latin America researchers and others worldwide, contributions to open-access chemoinformatic tools such as web servers, and educational-related resources and events, such as scientific conferences. We conclude that linking and fostering collaboration within each nation as well as among other Latin American nations and globally is made possible by open science and the democratization of science. We also outline strategic actions that can boost the development and practice of chemoinformatic in the region and enhance the interaction between Latin American countries and the rest of the world.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"3 ","pages":"Article 100077"},"PeriodicalIF":0.0,"publicationDate":"2023-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42646088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Bayesian optimization for ternary complex prediction (BOTCP) 基于贝叶斯优化的三元复变预测
Pub Date : 2023-04-19 DOI: 10.1016/j.ailsci.2023.100072
Arjun Rao , Tin M. Tunjic , Michael Brunsteiner , Michael Müller, Hosein Fooladi, Chiara Gasbarri, Noah Weber

Proximity-inducing compounds (PICs) are an emergent drug technology through which a protein of interest (POI), often a drug target, is brought into the vicinity of a second protein which modifies the POI’s function, abundance or localisation, giving rise to a therapeutic effect. One of the best-known examples for such compounds are heterobifunctional molecules known as proteolysis targeting chimeras (PROTACs). PROTACs reduce the abundance of the target protein by establishing proximity to an E3 ligase which labels the protein for degradation via the ubiquitin-proteasomal pathway. Design of PROTACs in silico requires the computational prediction of the ternary complex consisting of POI, PROTAC molecule, and the E3 ligase.

We present a novel machine learning-based method for predicting PROTAC-mediated ternary complex structures using Bayesian optimization. We show how a fitness score combining an estimation of protein-protein interactions with PROTAC conformation energy calculations enables the sample-efficient exploration of candidate structures. Furthermore, our method presents two novel scores for filtering and reranking which take PROTAC stability (Autodock-Vina based PROTAC stability score) and protein interaction restraints (the TCP-AIR score) into account. We evaluate our method using DockQ scores on a number of available ternary complex structures (including previously unevaluated cases) and demonstrate that even with a clustering that requires members to have a high similarity, i.e., with smaller clusters, we can assign high ranks to those clusters that contain poses close to the experimentally determined native structure of the ternary complexes. We also demonstrate the resultant improved yield of near-native poses3 in these clusters.

邻近诱导化合物(PIC)是一种新兴的药物技术,通过该技术,将感兴趣的蛋白质(POI)(通常是药物靶点)引入第二种蛋白质附近,从而改变POI的功能、丰度或定位,从而产生治疗效果。这类化合物最著名的例子之一是被称为蛋白水解靶向嵌合体(PROTACs)的异双功能分子。PROTAC通过建立与E3连接酶的接近度来降低靶蛋白的丰度,该连接酶通过泛素-蛋白酶体途径标记蛋白进行降解。在计算机中设计PROTAC需要对由POI、PROTAC分子和E3连接酶组成的三元复合物进行计算预测。我们提出了一种新的基于机器学习的方法,用于使用贝叶斯优化预测PROTAC介导的三元复杂结构。我们展示了将蛋白质-蛋白质相互作用的估计与PROTAC构象能量计算相结合的适应度得分如何能够有效地探索候选结构。此外,我们的方法提出了两种新的过滤和重新排序分数,其中考虑了PROTAC稳定性(基于Autodock-Vina的PROTAC稳定分数)和蛋白质相互作用限制(TCP-AIR分数)。我们使用DockQ评分对许多可用的三元复杂结构(包括以前未评估的情况)评估了我们的方法,并证明即使使用需要成员具有高度相似性的聚类,即使用较小的聚类,我们可以为那些包含接近实验确定的三元配合物的天然结构的位姿的团簇分配高阶。我们还证明了在这些簇中近本机偏序3的改进产量。
{"title":"Bayesian optimization for ternary complex prediction (BOTCP)","authors":"Arjun Rao ,&nbsp;Tin M. Tunjic ,&nbsp;Michael Brunsteiner ,&nbsp;Michael Müller,&nbsp;Hosein Fooladi,&nbsp;Chiara Gasbarri,&nbsp;Noah Weber","doi":"10.1016/j.ailsci.2023.100072","DOIUrl":"https://doi.org/10.1016/j.ailsci.2023.100072","url":null,"abstract":"<div><p>Proximity-inducing compounds (PICs) are an emergent drug technology through which a protein of interest (POI), often a drug target, is brought into the vicinity of a second protein which modifies the POI’s function, abundance or localisation, giving rise to a therapeutic effect. One of the best-known examples for such compounds are heterobifunctional molecules known as proteolysis targeting chimeras (PROTACs). PROTACs reduce the abundance of the target protein by establishing proximity to an E3 ligase which labels the protein for degradation via the ubiquitin-proteasomal pathway. Design of PROTACs in silico requires the computational prediction of the ternary complex consisting of POI, PROTAC molecule, and the E3 ligase.</p><p>We present a novel machine learning-based method for predicting PROTAC-mediated ternary complex structures using Bayesian optimization. We show how a fitness score combining an estimation of protein-protein interactions with PROTAC conformation energy calculations enables the sample-efficient exploration of candidate structures. Furthermore, our method presents two novel scores for filtering and reranking which take PROTAC stability (Autodock-Vina based PROTAC stability score) and protein interaction restraints (the TCP-AIR score) into account. We evaluate our method using DockQ scores on a number of available ternary complex structures (including previously unevaluated cases) and demonstrate that even with a clustering that requires members to have a high similarity, i.e., with smaller clusters, we can assign high ranks to those clusters that contain poses close to the experimentally determined native structure of the ternary complexes. We also demonstrate the resultant improved yield of near-native poses<span><sup>3</sup></span> in these clusters.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"3 ","pages":"Article 100072"},"PeriodicalIF":0.0,"publicationDate":"2023-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49775003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Designing microplate layouts using artificial intelligence 利用人工智能设计微孔板布局
Pub Date : 2023-04-14 DOI: 10.1016/j.ailsci.2023.100073
María Andreína Francisco Rodríguez, Jordi Carreras Puigvert, Ola Spjuth

Microplates are indispensable in large-scale biomedical experiments but the physical location of samples and controls on the microplate can significantly affect the resulting data and quality metric values. We introduce a new method based on constraint programming for designing microplate layouts that reduces unwanted bias and limits the impact of batch effects after error correction and normalisation. We demonstrate that our method applied to dose-response experiments leads to more accurate regression curves and lower errors when estimating IC50/EC50, and for drug screening leads to increased precision, when compared to random layouts. It also reduces the risk of inflated scores from common microplate quality assessment metrics such as Z factor and SSMD. We make our method available via a suite of tools (PLAID) including a reference constraint model, a web application, and Python notebooks to evaluate and compare designs when planning microplate experiments.

微孔板在大规模生物医学实验中是必不可少的,但样品和对照物在微孔板上的物理位置会显著影响所得数据和质量度量值。我们介绍了一种基于约束编程的微板布局设计新方法,该方法减少了不必要的偏差,并限制了纠错和归一化后批次效应的影响。我们证明,与随机布局相比,我们的方法应用于剂量反应实验,在估计IC50/EC50时会产生更准确的回归曲线和更低的误差,而药物筛选则会提高精度。它还降低了常见微板质量评估指标(如Z’因子和SSMD)分数膨胀的风险。我们通过一套工具(PLAID)提供了我们的方法,包括参考约束模型、网络应用程序和Python笔记本,以在规划微板实验时评估和比较设计。
{"title":"Designing microplate layouts using artificial intelligence","authors":"María Andreína Francisco Rodríguez,&nbsp;Jordi Carreras Puigvert,&nbsp;Ola Spjuth","doi":"10.1016/j.ailsci.2023.100073","DOIUrl":"https://doi.org/10.1016/j.ailsci.2023.100073","url":null,"abstract":"<div><p>Microplates are indispensable in large-scale biomedical experiments but the physical location of samples and controls on the microplate can significantly affect the resulting data and quality metric values. We introduce a new method based on constraint programming for designing microplate layouts that reduces unwanted bias and limits the impact of batch effects after error correction and normalisation. We demonstrate that our method applied to dose-response experiments leads to more accurate regression curves and lower errors when estimating <span><math><msub><mtext>IC</mtext><mn>50</mn></msub></math></span>/<span><math><msub><mtext>EC</mtext><mn>50</mn></msub></math></span>, and for drug screening leads to increased precision, when compared to random layouts. It also reduces the risk of inflated scores from common microplate quality assessment metrics such as <span><math><msup><mi>Z</mi><mo>′</mo></msup></math></span> factor and SSMD. We make our method available via a suite of tools (PLAID) including a reference constraint model, a web application, and Python notebooks to evaluate and compare designs when planning microplate experiments.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"3 ","pages":"Article 100073"},"PeriodicalIF":0.0,"publicationDate":"2023-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49774976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep metric learning for the classification of MALDI-TOF spectral signatures from multiple species of neotropical disease vectors 多种新热带病媒MALDI-TOF谱特征分类的深度度量学习
Pub Date : 2023-04-06 DOI: 10.1016/j.ailsci.2023.100071
Fernando Merchan , Kenji Contreras , Rolando A. Gittens , Jose R. Loaiza , Javier E. Sanchez-Galan

Deep Learning techniques have significant advantages for mass spectral classification, such as parallelized signal correction and feature extraction. Deep Metric Learning models combine Metric Learning to determine the degree of similarity or difference between a set of mass spectra with the generalization power of Deep Learning to improve feature extraction even further. The two most popular of these models combine multiple neural networks with identical architectures and are commonly called Siamese (SNN) and Triplet Neural Networks (TNN). Herein, using both SNNs and TNNs, we intended to taxonomically categorize two sets of previously-validated mass spectra that corresponded to 30 species of Neotropical arthropods in the Culicidae and Ixodidae families, some of which are disease vectors. The effectiveness of SNNs and TNNs to correctly classify 826 spectra from 12 mosquito species and 310 spectra from 18 species of hard ticks was highly effective, with both algorithms performing with minimal average loss during cross-validation. SNNs produced accuracy rates for ticks and mosquitoes of 91.22% and 94.46%, respectively, while accuracy rates of 93% and 99% were obtained with TNNs. Our results indicate that Deep Metric Learning is a practical machine learning tool for quickly and precisely classifying MALDI-TOF-generated mass spectra of Neotropical and public-health-relevant arthropod species.

深度学习技术在质谱分类中具有显著的优势,如并行信号校正和特征提取。深度度量学习模型将度量学习与深度学习的泛化能力相结合,以确定一组质谱之间的相似或差异程度,从而进一步改进特征提取。其中最流行的两种模型将具有相同架构的多个神经网络组合在一起,通常称为Siamese (SNN)和Triplet neural networks (TNN)。本文利用snn和tnn对库蚊科和伊蚊科30种新热带节肢动物的两组经验证的质谱进行了分类,其中一些是病媒动物。snn和tnn对12种蚊子的826种光谱和18种硬蜱的310种光谱的正确分类效果非常好,交叉验证时两种算法的平均损失都很小。snn对蜱和蚊的准确率分别为91.22%和94.46%,tnn对蜱和蚊的准确率分别为93%和99%。我们的结果表明,深度度量学习是一种实用的机器学习工具,可以快速准确地对maldi - tof生成的新热带和公共卫生相关节肢动物物种的质谱进行分类。
{"title":"Deep metric learning for the classification of MALDI-TOF spectral signatures from multiple species of neotropical disease vectors","authors":"Fernando Merchan ,&nbsp;Kenji Contreras ,&nbsp;Rolando A. Gittens ,&nbsp;Jose R. Loaiza ,&nbsp;Javier E. Sanchez-Galan","doi":"10.1016/j.ailsci.2023.100071","DOIUrl":"10.1016/j.ailsci.2023.100071","url":null,"abstract":"<div><p>Deep Learning techniques have significant advantages for mass spectral classification, such as parallelized signal correction and feature extraction. Deep Metric Learning models combine Metric Learning to determine the degree of similarity or difference between a set of mass spectra with the generalization power of Deep Learning to improve feature extraction even further. The two most popular of these models combine multiple neural networks with identical architectures and are commonly called Siamese (SNN) and Triplet Neural Networks (TNN). Herein, using both SNNs and TNNs, we intended to taxonomically categorize two sets of previously-validated mass spectra that corresponded to 30 species of Neotropical arthropods in the Culicidae and Ixodidae families, some of which are disease vectors. The effectiveness of SNNs and TNNs to correctly classify 826 spectra from 12 mosquito species and 310 spectra from 18 species of hard ticks was highly effective, with both algorithms performing with minimal average loss during cross-validation. SNNs produced accuracy rates for ticks and mosquitoes of 91.22% and 94.46%, respectively, while accuracy rates of 93% and 99% were obtained with TNNs. Our results indicate that Deep Metric Learning is a practical machine learning tool for quickly and precisely classifying MALDI-TOF-generated mass spectra of Neotropical and public-health-relevant arthropod species.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"3 ","pages":"Article 100071"},"PeriodicalIF":0.0,"publicationDate":"2023-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41748999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Conformal efficiency as a metric for comparative model assessment befitting federated learning 适形效率作为适合联邦学习的比较模型评估的度量
Pub Date : 2023-04-01 DOI: 10.1016/j.ailsci.2023.100070
Wouter Heyndrickx , Adam Arany , Jaak Simm , Anastasia Pentina , Noé Sturm , Lina Humbeck , Lewis Mervin , Adam Zalewski , Martijn Oldenhof , Peter Schmidtke , Lukas Friedrich , Regis Loeb , Arina Afanasyeva , Ansgar Schuffenhauer , Yves Moreau , Hugo Ceulemans

In a drug discovery setting, pharmaceutical companies own substantial but confidential datasets. The MELLODDY project developed a privacy-preserving federated machine learning solution and deployed it at an unprecedented scale. Each partner built models for their own private assays that benefitted from a shared representation. Established predictive performance metrics such as AUC ROC or AUC PR are constrained to unseen labeled chemical space and cannot gage performance gains in unlabeled chemical space. Federated learning indirectly extends labeled space, but in a privacy-preserving context, a partner cannot use this label extension for performance assessment. Metrics that estimate uncertainty on a prediction can be calculated even where no label is known. Practically, the chemical space covered with predictions above an uncertainty threshold, reflects the applicability domain of a model. After establishing a link to established performance metrics, we propose the efficiency from the conformal prediction framework (‘conformal efficiency’) as a proxy to the applicability domain size. A documented extension of the applicability domain would qualify as a tangible benefit from federated learning. In interim assessments, MELLODDY partners reported a median increase in conformal efficiency of the federated over the single-partner model of 5.5% (with increases up to 9.7%). Subject to distributional conditions, that efficiency increase can be directly interpreted as the expected increase in conformal i.e. low uncertainty predictions. In conclusion, we present the first indication that privacy-preserving federated machine learning across massive drug-discovery datasets from ten pharma partners indeed extends the applicability domain of property prediction models.

在药物研发环境中,制药公司拥有大量但保密的数据集。MELLODDY项目开发了一种保护隐私的联邦机器学习解决方案,并以前所未有的规模进行了部署。每个合作伙伴都为自己的私人分析建立了模型,这些模型受益于共享的表示。已建立的预测性能指标(如AUC ROC或AUC PR)仅限于未见标记的化学空间,无法衡量未标记的化学空间中的性能增益。联邦学习间接地扩展了标记空间,但是在保护隐私的上下文中,合作伙伴不能使用这个标签扩展进行性能评估。即使在没有已知标签的情况下,也可以计算出估计预测不确定性的度量。实际上,化学空间覆盖着超过不确定性阈值的预测,反映了模型的适用范围。在建立了与已建立的性能指标的联系之后,我们提出了共形预测框架的效率(“共形效率”)作为适用领域大小的代理。适用性领域的文档化扩展将符合联邦学习的实际好处。在中期评估中,MELLODDY合作伙伴报告联合的适形效率中位数比单一合作伙伴模型提高了5.5%(最高可达9.7%)。根据分布条件,效率的提高可以直接解释为保形预测(即低不确定性预测)的预期增加。总之,我们提出了第一个迹象,表明来自十个制药合作伙伴的大规模药物发现数据集的隐私保护联合机器学习确实扩展了属性预测模型的适用范围。
{"title":"Conformal efficiency as a metric for comparative model assessment befitting federated learning","authors":"Wouter Heyndrickx ,&nbsp;Adam Arany ,&nbsp;Jaak Simm ,&nbsp;Anastasia Pentina ,&nbsp;Noé Sturm ,&nbsp;Lina Humbeck ,&nbsp;Lewis Mervin ,&nbsp;Adam Zalewski ,&nbsp;Martijn Oldenhof ,&nbsp;Peter Schmidtke ,&nbsp;Lukas Friedrich ,&nbsp;Regis Loeb ,&nbsp;Arina Afanasyeva ,&nbsp;Ansgar Schuffenhauer ,&nbsp;Yves Moreau ,&nbsp;Hugo Ceulemans","doi":"10.1016/j.ailsci.2023.100070","DOIUrl":"10.1016/j.ailsci.2023.100070","url":null,"abstract":"<div><p>In a drug discovery setting, pharmaceutical companies own substantial but confidential datasets. The MELLODDY project developed a privacy-preserving federated machine learning solution and deployed it at an unprecedented scale. Each partner built models for their own private assays that benefitted from a shared representation. Established predictive performance metrics such as AUC ROC or AUC PR are constrained to unseen labeled chemical space and cannot gage performance gains in unlabeled chemical space. Federated learning indirectly extends labeled space, but in a privacy-preserving context, a partner cannot use this label extension for performance assessment. Metrics that estimate uncertainty on a prediction can be calculated even where no label is known. Practically, the chemical space covered with predictions above an uncertainty threshold, reflects the applicability domain of a model. After establishing a link to established performance metrics, we propose the efficiency from the conformal prediction framework (‘conformal efficiency’) as a proxy to the applicability domain size. A documented extension of the applicability domain would qualify as a tangible benefit from federated learning. In interim assessments, MELLODDY partners reported a median increase in conformal efficiency of the federated over the single-partner model of 5.5% (with increases up to 9.7%). Subject to distributional conditions, that efficiency increase can be directly interpreted as the expected increase in conformal i.e. low uncertainty predictions. In conclusion, we present the first indication that privacy-preserving federated machine learning across massive drug-discovery datasets from ten pharma partners indeed extends the applicability domain of property prediction models.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"3 ","pages":"Article 100070"},"PeriodicalIF":0.0,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42954871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Pharmaceutical patent landscaping: A novel approach to understand patents from the drug discovery perspective 药物专利景观:一种从药物发现角度理解专利的新方法
Pub Date : 2023-03-31 DOI: 10.1016/j.ailsci.2023.100069
Yojana Gadiya , Philip Gribbon , Martin Hofmann-Apitius , Andrea Zaliani

Patents play a crucial role in the drug discovery process by providing legal protection for discoveries and incentivising investments in research and development. By identifying patterns within patent data resources, researchers can gain insight into the market trends and priorities of the pharmaceutical and biotechnology industries, as well as provide additional perspectives on more fundamental aspects such as the emergence of potential new drug targets. In this paper, we used the patent enrichment tool, PEMT, to extract, integrate, and analyse patent literature for rare diseases (RD) and Alzheimer's disease (AD). This is followed by a systematic review of the underlying patent landscape to decipher trends and applications in patents for these diseases. To do so, we discuss prominent organisations involved in drug discovery research in AD and RD. This allows us to gain an understanding of the importance of AD and RD from specific organisational (pharmaceutical or university) perspectives. Next, we analyse the historical focus of patents in relation to individual therapeutic targets and correlate them with market scenarios allowing the identification of prominent targets for a disease. Lastly, we identified drug repurposing activities within the two diseases with the help of patents. This resulted in identifying existing repurposed drugs and novel potential therapeutic approaches applicable to the indication areas. The study demonstrates the expanded applicability of patent documents from legal to drug discovery, design, and research, thus, providing a valuable resource for future drug discovery efforts. Moreover, this study is an attempt towards understanding the importance of data underlying patent documents and raising the need for preparing the data for machine learning-based applications.

专利通过为发现提供法律保护和激励研发投资,在药物发现过程中发挥着至关重要的作用。通过识别专利数据资源中的模式,研究人员可以深入了解制药和生物技术行业的市场趋势和优先事项,并对潜在新药靶点的出现等更基本的方面提供更多的视角。在本文中,我们使用专利富集工具PEMT来提取、整合和分析罕见病(RD)和阿尔茨海默病(AD)的专利文献。接下来是对潜在专利前景的系统审查,以解读这些疾病专利的趋势和应用。为此,我们讨论了参与AD和RD药物发现研究的知名组织。这使我们能够从特定的组织(制药或大学)角度了解AD和RD的重要性。接下来,我们分析了专利与个体治疗靶点相关的历史焦点,并将其与市场情景相关联,从而确定疾病的突出靶点。最后,我们在专利的帮助下确定了这两种疾病中的药物再利用活动。这导致确定了适用于适应症领域的现有再利用药物和新的潜在治疗方法。该研究表明,专利文件的适用性从法律扩展到药物发现、设计和研究,从而为未来的药物发现工作提供了宝贵的资源。此外,这项研究试图理解专利文件中数据的重要性,并提出为基于机器学习的应用准备数据的必要性。
{"title":"Pharmaceutical patent landscaping: A novel approach to understand patents from the drug discovery perspective","authors":"Yojana Gadiya ,&nbsp;Philip Gribbon ,&nbsp;Martin Hofmann-Apitius ,&nbsp;Andrea Zaliani","doi":"10.1016/j.ailsci.2023.100069","DOIUrl":"https://doi.org/10.1016/j.ailsci.2023.100069","url":null,"abstract":"<div><p>Patents play a crucial role in the drug discovery process by providing legal protection for discoveries and incentivising investments in research and development. By identifying patterns within patent data resources, researchers can gain insight into the market trends and priorities of the pharmaceutical and biotechnology industries, as well as provide additional perspectives on more fundamental aspects such as the emergence of potential new drug targets. In this paper, we used the patent enrichment tool, PEMT, to extract, integrate, and analyse patent literature for rare diseases (RD) and Alzheimer's disease (AD). This is followed by a systematic review of the underlying patent landscape to decipher trends and applications in patents for these diseases. To do so, we discuss prominent organisations involved in drug discovery research in AD and RD. This allows us to gain an understanding of the importance of AD and RD from specific organisational (pharmaceutical or university) perspectives. Next, we analyse the historical focus of patents in relation to individual therapeutic targets and correlate them with market scenarios allowing the identification of prominent targets for a disease. Lastly, we identified drug repurposing activities within the two diseases with the help of patents. This resulted in identifying existing repurposed drugs and novel potential therapeutic approaches applicable to the indication areas. The study demonstrates the expanded applicability of patent documents from legal to drug discovery, design, and research, thus, providing a valuable resource for future drug discovery efforts. Moreover, this study is an attempt towards understanding the importance of data underlying patent documents and raising the need for preparing the data for machine learning-based applications.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":"3 ","pages":"Article 100069"},"PeriodicalIF":0.0,"publicationDate":"2023-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49774974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial intelligence in the life sciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1