Feature selection and ensemble of regression models for predicting the protein macromolecule dissolution profile

Varun Ojha, K. Jackowski, A. Abraham, V. Snás̃el
{"title":"Feature selection and ensemble of regression models for predicting the protein macromolecule dissolution profile","authors":"Varun Ojha, K. Jackowski, A. Abraham, V. Snás̃el","doi":"10.1109/NABIC.2014.6921864","DOIUrl":null,"url":null,"abstract":"Predicting the dissolution rate of proteins plays a significant role in pharmaceutical/medical applications. The rate of dissolution of Poly Lactic-co-Glycolic Acid (PLGA) micro- and nanoparticles is influenced by several factors. Considering all factors leads to a dataset with three hundred features, making the prediction difficult and inaccurate. Our present study consists of three phases. Firstly, dimensionality reduction techniques are applied in order to simplify the task and eliminate irrelevant and redundant attributes. Subsequently, a heterogeneous pool of several classical regression algorithms is created and evaluated. Regression algorithms in the pool are independently trained to identify the problem at hand. Finally, we test several ensemble methods in order to elevate the accuracy of the prediction. The Evolutionary Weighted Ensemble method proposed in this paper offered the lowest RMSE and significantly outperformed competing classical algorithms and other ensemble techniques.","PeriodicalId":209716,"journal":{"name":"2014 Sixth World Congress on Nature and Biologically Inspired Computing (NaBIC 2014)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2014-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 Sixth World Congress on Nature and Biologically Inspired Computing (NaBIC 2014)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NABIC.2014.6921864","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Predicting the dissolution rate of proteins plays a significant role in pharmaceutical/medical applications. The rate of dissolution of Poly Lactic-co-Glycolic Acid (PLGA) micro- and nanoparticles is influenced by several factors. Considering all factors leads to a dataset with three hundred features, making the prediction difficult and inaccurate. Our present study consists of three phases. Firstly, dimensionality reduction techniques are applied in order to simplify the task and eliminate irrelevant and redundant attributes. Subsequently, a heterogeneous pool of several classical regression algorithms is created and evaluated. Regression algorithms in the pool are independently trained to identify the problem at hand. Finally, we test several ensemble methods in order to elevate the accuracy of the prediction. The Evolutionary Weighted Ensemble method proposed in this paper offered the lowest RMSE and significantly outperformed competing classical algorithms and other ensemble techniques.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
预测蛋白质大分子溶解谱的特征选择和回归模型集合
预测蛋白质的溶解速率在制药/医疗应用中具有重要作用。聚乳酸-羟基乙酸(PLGA)微粒子和纳米粒子的溶解速率受多种因素的影响。考虑到所有因素会导致一个有300个特征的数据集,这使得预测变得困难和不准确。我们目前的研究分为三个阶段。首先,采用降维技术简化任务,剔除不相关和冗余的属性;随后,创建并评估了几种经典回归算法的异构池。池中的回归算法是独立训练的,以识别手头的问题。最后,为了提高预测的准确性,我们对几种集成方法进行了测试。本文提出的进化加权集成方法具有最低的均方根误差,显著优于经典算法和其他集成技术。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Feedforward and feedback optimal vibration rejection for active suspension discrete-time systems under in-vehicle networks On the efficiency of Multi-core Grammatical Evolution (MCGE) evolving multi-core parallel programs Fuzzy c-means with wavelet filtration for MR image segmentation Towards an autonomous multistate biomolecular devices built on DNA Energy optimization for task scheduling in distributed systems by an Artificial Bee Colony approach
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1