Jack D Murray, Harriet Bennett-Lenane, Patrick J O'Dwyer, Brendan T Griffin
{"title":"Establishing a Pharmacoinformatics Repository of Approved Medicines: A Database to Support Drug Product Development.","authors":"Jack D Murray, Harriet Bennett-Lenane, Patrick J O'Dwyer, Brendan T Griffin","doi":"10.1021/acs.molpharmaceut.4c00991","DOIUrl":null,"url":null,"abstract":"<p><p>Advanced predictive modeling approaches have harnessed data to fuel important innovations at all stages of drug development. However, the need for a machine-readable drug product library which consolidates many aspects of formulation design and performance remains largely unmet. This study presents a scripted, reproducible approach to database curation and explores its potential to streamline oral medicine development. The Product Information files for all centrally authorized drug products containing a small molecule active ingredient were retrieved programmatically from the European Medicines Agency Web site. Text processing isolated relevant information, including the maximum clinical dose, dosage form, route of administration, excipients, and pharmacokinetic performance. Chemical and bioactivity data were integrated through automated linking to external curated databases. The capability of this database to inform oral medicine development was assessed in the context of drug-likeness evaluation, excipient selection, and prediction of oral fraction absorbed. Existing filters of drug-likeness, such as the Rule of Five, were found to poorly capture the chemical space of marketed oral drug products. Association rule learning identified frequent patterns in tablet formulation compositions that can be used to establish excipient combinations that have seen clinical success. Binary prediction models of oral fraction absorbed constructed exclusively from regulatory data achieved acceptable performance (balanced accuracy<sub>test</sub> = 0.725), demonstrating its modelability and potential for use during early stage molecule prioritization tasks. This study illustrates the impact of highly linked drug product data in accelerating clinical translation and underlines the ongoing need for accuracy and completeness of data reported in the regulatory datasphere.</p>","PeriodicalId":52,"journal":{"name":"Molecular Pharmaceutics","volume":" ","pages":""},"PeriodicalIF":4.5000,"publicationDate":"2024-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Pharmaceutics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1021/acs.molpharmaceut.4c00991","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Advanced predictive modeling approaches have harnessed data to fuel important innovations at all stages of drug development. However, the need for a machine-readable drug product library which consolidates many aspects of formulation design and performance remains largely unmet. This study presents a scripted, reproducible approach to database curation and explores its potential to streamline oral medicine development. The Product Information files for all centrally authorized drug products containing a small molecule active ingredient were retrieved programmatically from the European Medicines Agency Web site. Text processing isolated relevant information, including the maximum clinical dose, dosage form, route of administration, excipients, and pharmacokinetic performance. Chemical and bioactivity data were integrated through automated linking to external curated databases. The capability of this database to inform oral medicine development was assessed in the context of drug-likeness evaluation, excipient selection, and prediction of oral fraction absorbed. Existing filters of drug-likeness, such as the Rule of Five, were found to poorly capture the chemical space of marketed oral drug products. Association rule learning identified frequent patterns in tablet formulation compositions that can be used to establish excipient combinations that have seen clinical success. Binary prediction models of oral fraction absorbed constructed exclusively from regulatory data achieved acceptable performance (balanced accuracytest = 0.725), demonstrating its modelability and potential for use during early stage molecule prioritization tasks. This study illustrates the impact of highly linked drug product data in accelerating clinical translation and underlines the ongoing need for accuracy and completeness of data reported in the regulatory datasphere.
期刊介绍:
Molecular Pharmaceutics publishes the results of original research that contributes significantly to the molecular mechanistic understanding of drug delivery and drug delivery systems. The journal encourages contributions describing research at the interface of drug discovery and drug development.
Scientific areas within the scope of the journal include physical and pharmaceutical chemistry, biochemistry and biophysics, molecular and cellular biology, and polymer and materials science as they relate to drug and drug delivery system efficacy. Mechanistic Drug Delivery and Drug Targeting research on modulating activity and efficacy of a drug or drug product is within the scope of Molecular Pharmaceutics. Theoretical and experimental peer-reviewed research articles, communications, reviews, and perspectives are welcomed.