CPU and GPU based acceleration of high-dimensional population balance models via the vectorization and parallelization of multivariate aggregation and breakage integral terms

IF 3.9 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Computers & Chemical Engineering Pub Date : 2025-02-15 DOI:10.1016/j.compchemeng.2025.109037
Ashley Dan , Urjit Patil , Abhinav De , Bhavani Nandhini Mummidi Manuraj , Rohit Ramachandran
{"title":"CPU and GPU based acceleration of high-dimensional population balance models via the vectorization and parallelization of multivariate aggregation and breakage integral terms","authors":"Ashley Dan ,&nbsp;Urjit Patil ,&nbsp;Abhinav De ,&nbsp;Bhavani Nandhini Mummidi Manuraj ,&nbsp;Rohit Ramachandran","doi":"10.1016/j.compchemeng.2025.109037","DOIUrl":null,"url":null,"abstract":"<div><div>The development of mathematical models for physical systems often necessitates the use of high-dimensional spaces and fine discretizations to accurately capture complex dynamics. These models, which involve large matrices and extensive mathematical operations, tend to be computationally intensive, leading to slow execution times. In this study, we analyzed various acceleration strategies by comparing the simulation accuracy, computational time, and resource utilization of various vectorization and parallelization methods on both CPUs and GPUs, using a multi-dimensional Population Balance Model simulated in MATLAB and Python. Our findings revealed that GPU-based vectorization provided the highest performance, achieving a 40-fold speedup compared to the serial implementations. Unlike simulations on CPUs, where run time is often limited by processing power, GPUs simulations are limited by the available memory, especially at high resolution. This work highlights the importance of using appropriate resources and code optimization strategies to reduce computational time, for development of an efficient model.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"196 ","pages":"Article 109037"},"PeriodicalIF":3.9000,"publicationDate":"2025-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135425000419","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

The development of mathematical models for physical systems often necessitates the use of high-dimensional spaces and fine discretizations to accurately capture complex dynamics. These models, which involve large matrices and extensive mathematical operations, tend to be computationally intensive, leading to slow execution times. In this study, we analyzed various acceleration strategies by comparing the simulation accuracy, computational time, and resource utilization of various vectorization and parallelization methods on both CPUs and GPUs, using a multi-dimensional Population Balance Model simulated in MATLAB and Python. Our findings revealed that GPU-based vectorization provided the highest performance, achieving a 40-fold speedup compared to the serial implementations. Unlike simulations on CPUs, where run time is often limited by processing power, GPUs simulations are limited by the available memory, especially at high resolution. This work highlights the importance of using appropriate resources and code optimization strategies to reduce computational time, for development of an efficient model.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于CPU和GPU的高维人口平衡模型的矢量化和并行化多变量聚集和破碎积分项加速
物理系统数学模型的发展往往需要使用高维空间和精细离散化来准确地捕捉复杂的动力学。这些模型涉及大型矩阵和广泛的数学运算,往往需要大量的计算,导致执行时间较慢。在本研究中,我们使用MATLAB和Python模拟的多维人口平衡模型,通过比较各种矢量化和并行化方法在cpu和gpu上的仿真精度、计算时间和资源利用率,分析了各种加速策略。我们的研究结果表明,基于gpu的矢量化提供了最高的性能,与串行实现相比,实现了40倍的加速。与cpu上的模拟不同,cpu上的运行时间通常受到处理能力的限制,gpu模拟受到可用内存的限制,特别是在高分辨率下。这项工作强调了使用适当的资源和代码优化策略来减少计算时间的重要性,以开发有效的模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Computers & Chemical Engineering
Computers & Chemical Engineering 工程技术-工程:化工
CiteScore
8.70
自引率
14.00%
发文量
374
审稿时长
70 days
期刊介绍: Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.
期刊最新文献
YANNs: Y-wise affine neural networks for exact and efficient representations of piecewise linear functions Unlocking reactive power potential of industrial processes for voltage support through scheduling optimization Superstructure modeling and optimization of dynamic processes applied to high-performance liquid chromatography with recycling Partial least-squares model adaptation by bootstrap resampling PSFCL: A Probabilistic Slow Feature Contrastive Learning approach for incipient fault diagnosis in industrial processes
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1