Vishal Kondal, Antil Jain, Monika Garg, Sundeep Kumar, Amit Kumar Singh, Rakesh Bhardwaj, G. P. Singh
{"title":"利用近红外透射光谱对小麦谷物蛋白质建模进行间隙导数优化","authors":"Vishal Kondal, Antil Jain, Monika Garg, Sundeep Kumar, Amit Kumar Singh, Rakesh Bhardwaj, G. P. Singh","doi":"10.1002/cche.10795","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background and Objective</h3>\n \n <p>Near-infrared spectroscopy is an established tool for the estimation of different nutrients in diverse sample matrices. Of these, near-infrared transmittance (NIT) has very wide usage in whole grain analysis for oil, protein, and other macronutrients. NIT spectra obtained from samples are regressed with actual laboratory values for developing prediction models. However, the spectra obtained are sloppy, slightly noisy, and show baseline drifts. To increase the resolution and signal-to-noise ratio, derivatives are common preprocessing tools, typically implemented along with smoothing.</p>\n </section>\n \n <section>\n \n <h3> Findings</h3>\n \n <p>A systematic study on different derivatives (1, 2, and 3) and gaps (2–90) was performed. The germplasm set with high variability for protein content (8.63%–19.56%) was used, and regression models were developed using the modified partial least squares method. Among all, the second-order derivative gave best-fit models; hence, the results of the gap with second-order derivatives are studied in detail. The plot of <i>R</i><sup>2</sup> for external validation set with different gaps at second-order gave three peaks, namely, at 47, 60, (69, 70, 71) where the highest <i>R</i><sup>2</sup> (0.985) was obtained for the third peak having three consecutive gap segments.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>Hence, math treatment (2, 70, 2, 1) was finalized considering stability where a high residual prediction deviation of 7.149 and a low bias of (0.021) was obtained. A paired <i>t</i> test and reliability test between predicted and laboratory values confirmed nonsignificant differences between them. Thus, the developed model is robust and precise and can be utilized in high throughput screening of wheat germplasm.</p>\n </section>\n \n <section>\n \n <h3> Significance and Novelty</h3>\n \n <p>Better performance at second derivative and higher gap can be used for developing robust models with low bias by avoiding multi-collinearity, which is usually a limitation in multi-variate analysis.</p>\n </section>\n </div>","PeriodicalId":9807,"journal":{"name":"Cereal Chemistry","volume":"101 5","pages":"991-999"},"PeriodicalIF":2.2000,"publicationDate":"2024-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Gap derivative optimization for modeling wheat grain protein using near-infrared transmission spectroscopy\",\"authors\":\"Vishal Kondal, Antil Jain, Monika Garg, Sundeep Kumar, Amit Kumar Singh, Rakesh Bhardwaj, G. P. Singh\",\"doi\":\"10.1002/cche.10795\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <h3> Background and Objective</h3>\\n \\n <p>Near-infrared spectroscopy is an established tool for the estimation of different nutrients in diverse sample matrices. Of these, near-infrared transmittance (NIT) has very wide usage in whole grain analysis for oil, protein, and other macronutrients. NIT spectra obtained from samples are regressed with actual laboratory values for developing prediction models. However, the spectra obtained are sloppy, slightly noisy, and show baseline drifts. To increase the resolution and signal-to-noise ratio, derivatives are common preprocessing tools, typically implemented along with smoothing.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Findings</h3>\\n \\n <p>A systematic study on different derivatives (1, 2, and 3) and gaps (2–90) was performed. The germplasm set with high variability for protein content (8.63%–19.56%) was used, and regression models were developed using the modified partial least squares method. Among all, the second-order derivative gave best-fit models; hence, the results of the gap with second-order derivatives are studied in detail. The plot of <i>R</i><sup>2</sup> for external validation set with different gaps at second-order gave three peaks, namely, at 47, 60, (69, 70, 71) where the highest <i>R</i><sup>2</sup> (0.985) was obtained for the third peak having three consecutive gap segments.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Conclusion</h3>\\n \\n <p>Hence, math treatment (2, 70, 2, 1) was finalized considering stability where a high residual prediction deviation of 7.149 and a low bias of (0.021) was obtained. A paired <i>t</i> test and reliability test between predicted and laboratory values confirmed nonsignificant differences between them. Thus, the developed model is robust and precise and can be utilized in high throughput screening of wheat germplasm.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Significance and Novelty</h3>\\n \\n <p>Better performance at second derivative and higher gap can be used for developing robust models with low bias by avoiding multi-collinearity, which is usually a limitation in multi-variate analysis.</p>\\n </section>\\n </div>\",\"PeriodicalId\":9807,\"journal\":{\"name\":\"Cereal Chemistry\",\"volume\":\"101 5\",\"pages\":\"991-999\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-06-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cereal Chemistry\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cche.10795\",\"RegionNum\":4,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"CHEMISTRY, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cereal Chemistry","FirstCategoryId":"97","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cche.10795","RegionNum":4,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
Gap derivative optimization for modeling wheat grain protein using near-infrared transmission spectroscopy
Background and Objective
Near-infrared spectroscopy is an established tool for the estimation of different nutrients in diverse sample matrices. Of these, near-infrared transmittance (NIT) has very wide usage in whole grain analysis for oil, protein, and other macronutrients. NIT spectra obtained from samples are regressed with actual laboratory values for developing prediction models. However, the spectra obtained are sloppy, slightly noisy, and show baseline drifts. To increase the resolution and signal-to-noise ratio, derivatives are common preprocessing tools, typically implemented along with smoothing.
Findings
A systematic study on different derivatives (1, 2, and 3) and gaps (2–90) was performed. The germplasm set with high variability for protein content (8.63%–19.56%) was used, and regression models were developed using the modified partial least squares method. Among all, the second-order derivative gave best-fit models; hence, the results of the gap with second-order derivatives are studied in detail. The plot of R2 for external validation set with different gaps at second-order gave three peaks, namely, at 47, 60, (69, 70, 71) where the highest R2 (0.985) was obtained for the third peak having three consecutive gap segments.
Conclusion
Hence, math treatment (2, 70, 2, 1) was finalized considering stability where a high residual prediction deviation of 7.149 and a low bias of (0.021) was obtained. A paired t test and reliability test between predicted and laboratory values confirmed nonsignificant differences between them. Thus, the developed model is robust and precise and can be utilized in high throughput screening of wheat germplasm.
Significance and Novelty
Better performance at second derivative and higher gap can be used for developing robust models with low bias by avoiding multi-collinearity, which is usually a limitation in multi-variate analysis.
期刊介绍:
Cereal Chemistry publishes high-quality papers reporting novel research and significant conceptual advances in genetics, biotechnology, composition, processing, and utilization of cereal grains (barley, maize, millet, oats, rice, rye, sorghum, triticale, and wheat), pulses (beans, lentils, peas, etc.), oilseeds, and specialty crops (amaranth, flax, quinoa, etc.). Papers advancing grain science in relation to health, nutrition, pet and animal food, and safety, along with new methodologies, instrumentation, and analysis relating to these areas are welcome, as are research notes and topical review papers.
The journal generally does not accept papers that focus on nongrain ingredients, technology of a commercial or proprietary nature, or that confirm previous research without extending knowledge. Papers that describe product development should include discussion of underlying theoretical principles.