用于多线程数据处理的 Python 库的性能评估

Serhii Krivtsov, Yurii Parfeniuk, K. Bazilevych, I. Meniailov, D. Chumachenko
{"title":"用于多线程数据处理的 Python 库的性能评估","authors":"Serhii Krivtsov, Yurii Parfeniuk, K. Bazilevych, I. Meniailov, D. Chumachenko","doi":"10.20998/2522-9052.2024.1.05","DOIUrl":null,"url":null,"abstract":"Topicality. The rapid growth of data in various domains has necessitated the development of efficient tools and libraries for data processing and analysis. Python, a popular programming language for data analysis, offers several libraries, such as NumPy and Numba, for numerical computations. However, there is a lack of comprehensive studies comparing the performance of these libraries across different tasks and data sizes. The aim of the study. This study aims to fill this gap by comparing the performance of Python, NumPy, Numba, and Numba.Cuda across different tasks and data sizes. Additionally, it evaluates the impact of multithreading and GPU utilization on computation speed. Research results. The results indicate that Numba and Numba.Cuda significantly optimizes the performance of Python applications, especially for functions involving loops and array operations. Moreover, GPU and multithreading in Python further enhance computation speed, although with certain limitations and considerations. Conclusion. This study contributes to the field by providing valuable insights into the performance of different Python libraries and the effectiveness of GPU and multithreading in Python, thereby aiding researchers and practitioners in selecting the most suitable tools for their computational needs.","PeriodicalId":275587,"journal":{"name":"Advanced Information Systems","volume":"47 20","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PERFORMANCE EVALUATION OF PYTHON LIBRARIES FOR MULTITHREADING DATA PROCESSING\",\"authors\":\"Serhii Krivtsov, Yurii Parfeniuk, K. Bazilevych, I. Meniailov, D. Chumachenko\",\"doi\":\"10.20998/2522-9052.2024.1.05\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Topicality. The rapid growth of data in various domains has necessitated the development of efficient tools and libraries for data processing and analysis. Python, a popular programming language for data analysis, offers several libraries, such as NumPy and Numba, for numerical computations. However, there is a lack of comprehensive studies comparing the performance of these libraries across different tasks and data sizes. The aim of the study. This study aims to fill this gap by comparing the performance of Python, NumPy, Numba, and Numba.Cuda across different tasks and data sizes. Additionally, it evaluates the impact of multithreading and GPU utilization on computation speed. Research results. The results indicate that Numba and Numba.Cuda significantly optimizes the performance of Python applications, especially for functions involving loops and array operations. Moreover, GPU and multithreading in Python further enhance computation speed, although with certain limitations and considerations. Conclusion. This study contributes to the field by providing valuable insights into the performance of different Python libraries and the effectiveness of GPU and multithreading in Python, thereby aiding researchers and practitioners in selecting the most suitable tools for their computational needs.\",\"PeriodicalId\":275587,\"journal\":{\"name\":\"Advanced Information Systems\",\"volume\":\"47 20\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-02-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Information Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.20998/2522-9052.2024.1.05\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.20998/2522-9052.2024.1.05","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

主题性。随着各领域数据的快速增长,有必要开发用于数据处理和分析的高效工具和库。Python 作为一种流行的数据分析编程语言,提供了多个用于数值计算的库,如 NumPy 和 Numba。然而,目前还缺乏比较这些库在不同任务和数据规模下性能的全面研究。研究目的本研究旨在通过比较 Python、NumPy、Numba 和 Numba.Cuda 在不同任务和数据量下的性能,填补这一空白。此外,它还评估了多线程和 GPU 利用率对计算速度的影响。研究成果。研究结果表明,Numba 和 Numba.Cuda 显著优化了 Python 应用程序的性能,尤其是涉及循环和数组操作的函数。此外,Python 中的 GPU 和多线程能进一步提高计算速度,但也有一定的局限性和注意事项。结论本研究为该领域做出了贡献,提供了关于不同 Python 库的性能以及 Python 中 GPU 和多线程的有效性的宝贵见解,从而帮助研究人员和从业人员选择最适合其计算需求的工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
PERFORMANCE EVALUATION OF PYTHON LIBRARIES FOR MULTITHREADING DATA PROCESSING
Topicality. The rapid growth of data in various domains has necessitated the development of efficient tools and libraries for data processing and analysis. Python, a popular programming language for data analysis, offers several libraries, such as NumPy and Numba, for numerical computations. However, there is a lack of comprehensive studies comparing the performance of these libraries across different tasks and data sizes. The aim of the study. This study aims to fill this gap by comparing the performance of Python, NumPy, Numba, and Numba.Cuda across different tasks and data sizes. Additionally, it evaluates the impact of multithreading and GPU utilization on computation speed. Research results. The results indicate that Numba and Numba.Cuda significantly optimizes the performance of Python applications, especially for functions involving loops and array operations. Moreover, GPU and multithreading in Python further enhance computation speed, although with certain limitations and considerations. Conclusion. This study contributes to the field by providing valuable insights into the performance of different Python libraries and the effectiveness of GPU and multithreading in Python, thereby aiding researchers and practitioners in selecting the most suitable tools for their computational needs.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
MEDOIDS AS A PACKING OF ORB IMAGE DESCRIPTORS THE METHOD OF RANKING EFFECTIVE PROJECT SOLUTIONS IN CONDITIONS OF INCOMPLETE CERTAINTY ENSURING THE FUNCTIONAL STABILITY OF THE INFORMATION SYSTEM OF THE POWER PLANT ON THE BASIS OF MONITORING THE PARAMETERS OF THE WORKING CONDITION OF COMPUTER DEVICES COMPARATIVE ANALYSIS OF SPECTRAL ANOMALIES DETECTION METHODS ON IMAGES FROM ON-BOARD REMOTE SENSING SYSTEMS FPGA-BASED IMPLEMENTATION OF A GAUSSIAN SMOOTHING FILTER WITH POWERS-OF-TWO COEFFICIENTS
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1