A Data-Driven Approach for Leveraging Inline and Offline Data to Determine the Causes of Monoclonal Antibody Productivity Reduction in the Commercial-Scale Cell Culture Process.
Sheng Zhang, Hang Chen, Yuxiang Wan, Haibin Wang, Haibin Qu
{"title":"A Data-Driven Approach for Leveraging Inline and Offline Data to Determine the Causes of Monoclonal Antibody Productivity Reduction in the Commercial-Scale Cell Culture Process.","authors":"Sheng Zhang, Hang Chen, Yuxiang Wan, Haibin Wang, Haibin Qu","doi":"10.3390/pharmaceutics16081082","DOIUrl":null,"url":null,"abstract":"<p><p>The monoclonal antibody (mAb) manufacturing process comes with high profits and high costs, and thus mAb productivity is of vital importance. However, many factors can impact the cell culture process, and lead to mAb productivity reduction. Nowadays, the biopharma industry is actively employing manufacturing information systems, which enable the integration of both online data and offline data. Although the volume of data is large, related data mining studies for mAb productivity improvement are rare. Therefore, a data-driven approach is proposed in this study to leverage both the inline and offline data of the cell culture process to discover the causes of mAb productivity reduction. The approach consists of four steps, namely data preprocessing, phase division, feature extraction and fusion, and cluster comparing. First, data quality issues are solved during the data preprocessing step. Next, the inline data are divided into several phases based on the moving window <i>k</i>-nearest neighbor method. Then, the inline data features are extracted via functional data analysis and combined with the offline data features. Finally, the causes of mAb productivity reduction are identified using the contrasting clusters via the principal component analysis method. A commercial-scale cell culture process case study is provided in this research to verify the effectiveness of the approach. Data from 35 batches were collected, and each batch contained nine inline variables and seven offline variables. The causes of mAb productivity reduction were identified to be the lack of nutrients, and recommended actions were taken according to the result, which was subsequently proven by six validation batches.</p>","PeriodicalId":19894,"journal":{"name":"Pharmaceutics","volume":null,"pages":null},"PeriodicalIF":4.9000,"publicationDate":"2024-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11359819/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pharmaceutics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3390/pharmaceutics16081082","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0
Abstract
The monoclonal antibody (mAb) manufacturing process comes with high profits and high costs, and thus mAb productivity is of vital importance. However, many factors can impact the cell culture process, and lead to mAb productivity reduction. Nowadays, the biopharma industry is actively employing manufacturing information systems, which enable the integration of both online data and offline data. Although the volume of data is large, related data mining studies for mAb productivity improvement are rare. Therefore, a data-driven approach is proposed in this study to leverage both the inline and offline data of the cell culture process to discover the causes of mAb productivity reduction. The approach consists of four steps, namely data preprocessing, phase division, feature extraction and fusion, and cluster comparing. First, data quality issues are solved during the data preprocessing step. Next, the inline data are divided into several phases based on the moving window k-nearest neighbor method. Then, the inline data features are extracted via functional data analysis and combined with the offline data features. Finally, the causes of mAb productivity reduction are identified using the contrasting clusters via the principal component analysis method. A commercial-scale cell culture process case study is provided in this research to verify the effectiveness of the approach. Data from 35 batches were collected, and each batch contained nine inline variables and seven offline variables. The causes of mAb productivity reduction were identified to be the lack of nutrients, and recommended actions were taken according to the result, which was subsequently proven by six validation batches.
单克隆抗体(mAb)生产过程利润高、成本高,因此 mAb 的生产率至关重要。然而,许多因素都会影响细胞培养过程,导致 mAb 生产率降低。如今,生物制药行业正在积极采用生产信息系统,以实现在线数据和离线数据的整合。虽然数据量很大,但针对提高 mAb 生产率的相关数据挖掘研究却很少见。因此,本研究提出了一种数据驱动方法,利用细胞培养过程的在线和离线数据来发现 mAb 生产率降低的原因。该方法包括四个步骤,即数据预处理、阶段划分、特征提取和融合以及聚类比较。首先,在数据预处理步骤中解决数据质量问题。然后,根据移动窗口 K 近邻法将内联数据分为几个阶段。然后,通过功能数据分析提取内联数据特征,并与离线数据特征相结合。最后,通过主成分分析方法,利用对比聚类找出 mAb 生产率降低的原因。本研究提供了一个商业规模的细胞培养过程案例研究,以验证该方法的有效性。研究收集了 35 个批次的数据,每个批次包含 9 个在线变量和 7 个离线变量。研究确定了 mAb 生产率降低的原因是缺乏营养,并根据结果建议采取相应措施,随后通过六个验证批次进行了验证。
PharmaceuticsPharmacology, Toxicology and Pharmaceutics-Pharmaceutical Science
CiteScore
7.90
自引率
11.10%
发文量
2379
审稿时长
16.41 days
期刊介绍:
Pharmaceutics (ISSN 1999-4923) is an open access journal which provides an advanced forum for the science and technology of pharmaceutics and biopharmaceutics. It publishes reviews, regular research papers, communications, and short notes. Covered topics include pharmacokinetics, toxicokinetics, pharmacodynamics, pharmacogenetics and pharmacogenomics, and pharmaceutical formulation. Our aim is to encourage scientists to publish their experimental and theoretical details in as much detail as possible. There is no restriction on the length of the papers. The full experimental details must be provided so that the results can be reproduced.