Cha Yong Jong, Geordi Tristan, Lee Jun Jie Felix, Eunice Wan Qi Yeap, Srinivas Reddy Dubbaka, Harsha Nagesh Rao, Shin Yee Wong
{"title":"Systematic Assessment of Calibration Strategies in Spectroscopic Analysis: A Case Study of Paracetamol Crystallization","authors":"Cha Yong Jong, Geordi Tristan, Lee Jun Jie Felix, Eunice Wan Qi Yeap, Srinivas Reddy Dubbaka, Harsha Nagesh Rao, Shin Yee Wong","doi":"10.1021/acs.oprd.4c00496","DOIUrl":null,"url":null,"abstract":"Converting spectral data to concentration is beneficial for effective crystallization process monitoring, enabling timely insights into supersaturation profiles. Calibration models are essential in this process, as they transform spectral information into concentration data. While various calibration strategies exist in the literature, they typically involve three stages: Stage 1 for baseline correction, Stage 2 for regressor selection, and Stage 3 for model form selection. In this study, we systematically evaluated all common strategies within each stage, combining them through a Design of Experiments (DoE) approach using a single paracetamol (PCM) and <i>p</i>-acetoxyacetanilide (PAA) crystallization system. The results showed that Savitzky–Golay Second Derivative (SGSD) performed best for baseline correction (Stage 1), while selecting spectral data from a specific range yielded the highest accuracy in regressor selection (Stage 2). For model selection (Stage 3), Partial Least Squares Regression (PLSR), Principal Component Regression (PCR), and Artificial Neural Network (ANN) were assessed with three optimized models deployed to monitor four crystallization runs in real time. During deployment, PLSR demonstrated the most moderate concentration prediction. However, when comparing all three model forms, the standard deviation of predicted concentrations ranged from 4% to 6% for PCM and 10% to 30% for PAA, with similar performance across all models. Validation against offline High-Performance Liquid Chromatography (HPLC) data showed relative errors of 0–12% for PCM, while PAA predictions had higher errors ranging from 0 to 50<sup>+</sup>%, largely due to PAA’s lower concentration range (10–20 g/L) compared to that of PCM (100–350 g/L). These findings indicate that while online models provide useful real-time approximations, precise measurements still require offline validation.","PeriodicalId":55,"journal":{"name":"Organic Process Research & Development","volume":"11 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Organic Process Research & Development","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.oprd.4c00496","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0
Abstract
Converting spectral data to concentration is beneficial for effective crystallization process monitoring, enabling timely insights into supersaturation profiles. Calibration models are essential in this process, as they transform spectral information into concentration data. While various calibration strategies exist in the literature, they typically involve three stages: Stage 1 for baseline correction, Stage 2 for regressor selection, and Stage 3 for model form selection. In this study, we systematically evaluated all common strategies within each stage, combining them through a Design of Experiments (DoE) approach using a single paracetamol (PCM) and p-acetoxyacetanilide (PAA) crystallization system. The results showed that Savitzky–Golay Second Derivative (SGSD) performed best for baseline correction (Stage 1), while selecting spectral data from a specific range yielded the highest accuracy in regressor selection (Stage 2). For model selection (Stage 3), Partial Least Squares Regression (PLSR), Principal Component Regression (PCR), and Artificial Neural Network (ANN) were assessed with three optimized models deployed to monitor four crystallization runs in real time. During deployment, PLSR demonstrated the most moderate concentration prediction. However, when comparing all three model forms, the standard deviation of predicted concentrations ranged from 4% to 6% for PCM and 10% to 30% for PAA, with similar performance across all models. Validation against offline High-Performance Liquid Chromatography (HPLC) data showed relative errors of 0–12% for PCM, while PAA predictions had higher errors ranging from 0 to 50+%, largely due to PAA’s lower concentration range (10–20 g/L) compared to that of PCM (100–350 g/L). These findings indicate that while online models provide useful real-time approximations, precise measurements still require offline validation.
期刊介绍:
The journal Organic Process Research & Development serves as a communication tool between industrial chemists and chemists working in universities and research institutes. As such, it reports original work from the broad field of industrial process chemistry but also presents academic results that are relevant, or potentially relevant, to industrial applications. Process chemistry is the science that enables the safe, environmentally benign and ultimately economical manufacturing of organic compounds that are required in larger amounts to help address the needs of society. Consequently, the Journal encompasses every aspect of organic chemistry, including all aspects of catalysis, synthetic methodology development and synthetic strategy exploration, but also includes aspects from analytical and solid-state chemistry and chemical engineering, such as work-up tools,process safety, or flow-chemistry. The goal of development and optimization of chemical reactions and processes is their transfer to a larger scale; original work describing such studies and the actual implementation on scale is highly relevant to the journal. However, studies on new developments from either industry, research institutes or academia that have not yet been demonstrated on scale, but where an industrial utility can be expected and where the study has addressed important prerequisites for a scale-up and has given confidence into the reliability and practicality of the chemistry, also serve the mission of OPR&D as a communication tool between the different contributors to the field.