Multicore distributed dictionary learning: A microarray gene expression biclustering case study

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2017-06-19 DOI:10.1109/ICASSP.2017.7952340

Stephen Laide, J. McAllister

引用次数: 1

Abstract

The increasing pervasion and scale of machine learning technologies is posing fundamental challenges for their realisation. In the main, current algorithms are centralised, with a large number of processing agents, distributed across parallel processing resources, accessing a single, very large data object. This creates bottlenecks as a result of limited memory access rates. Distributed learning has the potential to resolve this problem by employing networks of co-operating agents each operating on subsets of the data, but as yet their suitability for realisation on parallel architectures such as multicore are unknown. This paper presents the results of a case study deploying distributed dictionary learning for microarray gene expression bi-clustering on a 16-core Epiphany multicore. It shows that distributed learning approaches can enable near-linear speed-up with the number of processing resources and, via the use of DMA-based communication, a 50% increase in throughput can be enabled.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

多核分布式字典学习:微阵列基因表达双聚类案例研究

机器学习技术的日益普及和规模正在为它们的实现带来根本性的挑战。总的来说，当前的算法是集中式的，具有大量的处理代理，分布在并行处理资源上，访问单个非常大的数据对象。由于内存访问速率有限，这会造成瓶颈。分布式学习有可能通过使用协作代理网络来解决这个问题，每个代理都在数据的子集上操作，但到目前为止，它们在并行架构(如多核)上实现的适用性尚不清楚。本文介绍了在16核Epiphany多核上部署用于微阵列基因表达双聚类的分布式字典学习的案例研究结果。它表明，分布式学习方法可以实现处理资源数量的近线性加速，并且通过使用基于dma的通信，可以使吞吐量增加50%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

自引率

0.00%

发文量