
2010 13th IEEE International Conference on Computational Science and Engineering: Latest Articles

A Locality-Aware Publish/Subscribe Scheme for High Level Architecture on Structured Peer-to-Peer Networks
Wei-Chao Chang, Shih-Hsiang Lo, Kuan-Chou Lai, Kuan-Ching Li, Yeh-Ching Chung
High Level Architecture (HLA) is a distributed simulation architecture used in many simulation environments. In most of these environments, the simulation entities (called federates in HLA) communicate with each other over interconnected local area networks (LANs). Because communication among federates in the same LAN has shorter latency and higher bandwidth, this paper proposes a message publish/subscribe scheme for HLA, based on a structured peer-to-peer overlay and the principle of locality, that increases the number of federates and the size of the workload that the same hardware environment can support. Moreover, a run-time infrastructure (RTI) is implemented in accordance with HLA for performance evaluation. Experimental results show that the proposed scheme improves the performance of simulations.
DOI: https://doi.org/10.1109/CSE.2010.69. Published: 2010-12-11.
Citations: 0
An Algorithm on the Mining of Batch Processing Process
Yiping Wen, Jianxun Liu, Zhigang Chen
A batch processing process is a process used in workflow management systems (WfMSs) that support dynamic batch processing. During the run time of concurrent process instances, multiple activity cases of the same type may be vertically combined according to some batch logic and submitted for execution as a whole. Therefore, it is necessary to study the actual way process instances are executed. In this paper, we propose an algorithm to mine batch processing processes from system logs. A case study illustrates its effectiveness.
DOI: https://doi.org/10.1109/CSE.2010.58. Published: 2010-12-11.
Citations: 1
A Semi-structured Overlay for Multi-attribute Range Queries in Cloud Computing
You-Fu Yu, Kuan-Chou Lai
In recent years, cloud computing has come to be considered the next-generation computing paradigm. Cloud services can be broadly classified as software as a service (SaaS), platform as a service (PaaS) and infrastructure as a service (IaaS). However, most cloud providers focus on only certain services. This leaves users unsure which cloud provider to choose when they need different services. Therefore, integrating these computing resources to satisfy users' various requirements is an important issue. In this study, we present a Peer-to-Peer (P2P) resource search mechanism which adopts P2P networking technologies to orchestrate all the computing resources. Through this P2P resource search mechanism, cloud users can discover the computing resources of different cloud providers.
DOI: https://doi.org/10.1109/CSE.2010.67. Published: 2010-12-11.
Citations: 4
Fast Fuzzy C-Means Clustering Based on Low-Cost High-Performance VLSI Architecture in Reconfigurable Hardware
Y. Yeh, Hui-Ya Li, C. Yang, Wen-Jyi Hwang
This paper presents a novel low-cost and high-performance VLSI architecture for fuzzy c-means clustering. In the architecture, the operations at both the centroid and data levels are pipelined to attain high computational speed while consuming few hardware resources. In addition, the usual iterative operations for updating the membership matrix and cluster centroids are merged into a single updating process to avoid the large storage requirement. Experimental results show that the proposed solution is an effective alternative for cluster analysis with low computational cost and high performance.
DOI: https://doi.org/10.1109/CSE.2010.22. Published: 2010-12-11.
Citations: 2
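The fuzzy c-means abstract above mentions merging the membership-matrix update and the centroid update into a single pass so that the full membership matrix never has to be stored. The following is a minimal software sketch of that idea using the standard fuzzy c-means formulas; it is not the paper's VLSI design, and the function name `fcm_step`, the fuzzifier default `m=2.0`, and the guard `eps` are assumptions made for illustration.

```python
def fcm_step(points, centroids, m=2.0, eps=1e-12):
    """One fused fuzzy c-means iteration (illustrative sketch).

    Memberships are computed per point and folded straight into running
    sums, so no (clusters x points) membership matrix is kept.
    """
    c = len(centroids)
    dim = len(centroids[0])
    num = [[0.0] * dim for _ in range(c)]  # running sum of u^m * x
    den = [0.0] * c                        # running sum of u^m
    for x in points:
        # Squared distance from this point to every centroid (eps avoids 0).
        d2 = [max(sum((xi - vi) ** 2 for xi, vi in zip(x, v)), eps)
              for v in centroids]
        # Standard membership: u_i = d_i^(-2/(m-1)) / sum_j d_j^(-2/(m-1)).
        inv = [d ** (-1.0 / (m - 1.0)) for d in d2]
        total = sum(inv)
        for i in range(c):
            w = (inv[i] / total) ** m
            den[i] += w
            for k in range(dim):
                num[i][k] += w * x[k]
    # New centroid: weighted mean of the points with weights u^m.
    return [[n / den[i] for n in num[i]] for i in range(c)]


if __name__ == "__main__":
    data = [[0.0], [0.2], [10.0], [10.2]]
    cents = [[1.0], [9.0]]
    for _ in range(20):
        cents = fcm_step(data, cents)
    print(cents)
```

Each call needs only per-cluster accumulators, which is the storage saving the abstract alludes to; how the hardware pipelines these operations is not described in the abstract and is not modeled here.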
Performance Evaluation of an Adaptive Congestion Avoidance Algorithm for IEEE 802.15.4
Ki-Chul Noh, Seung-Yeon Lee, Youn-Soon Shin, Kang-Woo Lee, Jong-Suk Ahn
Like most wireless network protocols, IEEE 802.15.4 adopts a CA (Congestion Avoidance) algorithm to avoid early collisions of new frames by randomizing their first transmission delay instead of transmitting them immediately. The traditional CA scheme of IEEE 802.15.4 selects its random access time from a predetermined range without considering the degree of underlying network congestion. This static behavior often incurs a long delay before settling in the range suitable for the current network load, or causes frequent collisions, especially when heavy traffic lasts long. This paper proposes an ACA (Adaptive Congestion Avoidance) algorithm which dynamically adjusts the initial backoff range according to the measured average collision rate. It also introduces an analytical model to predict the performance spectrum in which the ACA algorithm's throughput falls. With only small deviations from corresponding simulations, our model shows that the ACA scheme can improve the throughput of IEEE 802.15.4 by up to 41% while shortening the frame delays.
DOI: https://doi.org/10.1109/CSE.2010.12. Published: 2010-12-11.
Citations: 6
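The adaptive congestion avoidance abstract above describes adjusting the initial backoff range from a measured average collision rate. The sketch below shows one plausible way such a controller could look. It is not the paper's ACA rule: the class name `AdaptiveBackoff`, the smoothing factor `alpha`, and the thresholds `high` and `low` are all hypothetical, and only the general shape (a random delay drawn from 0 to 2^BE - 1 backoff periods, with the exponent kept between a minimum and a maximum) follows the usual IEEE 802.15.4 backoff convention.

```python
import random


class AdaptiveBackoff:
    """Hypothetical collision-rate-driven initial backoff exponent."""

    def __init__(self, min_be=3, max_be=8, alpha=0.2, high=0.3, low=0.1):
        self.min_be, self.max_be = min_be, max_be
        self.alpha, self.high, self.low = alpha, high, low
        self.initial_be = min_be      # exponent used for a frame's first try
        self.collision_rate = 0.0     # exponentially weighted moving average

    def record(self, collided):
        """Fold one transmission outcome into the average, then adapt."""
        sample = 1.0 if collided else 0.0
        self.collision_rate += self.alpha * (sample - self.collision_rate)
        if self.collision_rate > self.high and self.initial_be < self.max_be:
            self.initial_be += 1      # congested: widen the initial range
        elif self.collision_rate < self.low and self.initial_be > self.min_be:
            self.initial_be -= 1      # quiet: narrow it again

    def first_delay(self, rng=random):
        """Random delay in unit backoff periods from [0, 2^BE - 1]."""
        return rng.randint(0, 2 ** self.initial_be - 1)


if __name__ == "__main__":
    ab = AdaptiveBackoff()
    for _ in range(20):
        ab.record(collided=True)
    print(ab.initial_be, ab.first_delay())
```

A static scheme corresponds to never calling `record`, which leaves `initial_be` at its minimum regardless of load.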
Research on Stage Classification of Flight Parameter Based on PTSVM
Hui Lu, Kefei Mao
Flight parameter stage classification is the premise of fault diagnosis and trend forecasting based on flight parameters. Analysis of the flight data shows that stage classification is a classification optimization problem over multi-attribute data. This paper studies two-class classification based on the semi-supervised learning method PTSVM (Progressive Transductive Support Vector Machine) and improves the PTSVM algorithm, extending its application to multi-class classification problems. The research and simulation work used real flight parameters, and a comparison between the flight parameter stage criteria and the simulation results confirmed the validity of the approach for flight parameter stage classification.
DOI: https://doi.org/10.1109/CSE.2010.17. Published: 2010-12-11.
Citations: 0
A Hybrid Harmony Search Method Based on OBL
X. Gao, Xiaolei Wang, S. Ovaska
The Harmony Search (HS) method is an emerging meta-heuristic optimization algorithm. However, like most of the evolutionary computation techniques, it sometimes suffers from a rather slow search speed, and fails to find the global optima in an efficient way. In this paper, we propose and study a hybrid optimization approach, in which the HS is merged together with the Opposition-Based Learning (OBL). Our modified HS, namely HS-OBL, has an improved convergence property. Simulations of 23 typical benchmark problems demonstrate that the HS-OBL can indeed yield a superior optimization performance over the regular HS method.
DOI: https://doi.org/10.1109/CSE.2010.26. Published: 2010-12-11.
Citations: 10
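The Harmony Search entry above combines HS with Opposition-Based Learning (OBL). As a rough illustration, the sketch below uses the common OBL definition (the opposite of x in [lo, hi] is lo + hi - x) inside a basic harmony search loop. The abstract does not say how the two are merged, so the choices here (opposition-based initialization plus checking the opposite of each new harmony) are assumptions, as are the function names and all parameter defaults.

```python
import random


def opposite(x, lo, hi):
    """Opposite point under the usual OBL definition: lo + hi - x."""
    return [l + h - xi for xi, l, h in zip(x, lo, hi)]


def hs_obl(f, lo, hi, hms=10, hmcr=0.9, par=0.3, bw=0.05, iters=2000, seed=0):
    """Minimize f over the box [lo, hi] (illustrative HS + OBL sketch)."""
    rng = random.Random(seed)
    dim = len(lo)
    # Opposition-based initialization: random points plus their opposites,
    # keeping the best `hms` of them as the harmony memory.
    pool = [[rng.uniform(l, h) for l, h in zip(lo, hi)] for _ in range(hms)]
    pool += [opposite(x, lo, hi) for x in pool]
    memory = sorted(pool, key=f)[:hms]
    for _ in range(iters):
        new = []
        for d in range(dim):
            if rng.random() < hmcr:            # memory consideration
                v = rng.choice(memory)[d]
                if rng.random() < par:         # pitch adjustment
                    v += rng.uniform(-1, 1) * bw * (hi[d] - lo[d])
            else:                              # random selection
                v = rng.uniform(lo[d], hi[d])
            new.append(min(max(v, lo[d]), hi[d]))
        # OBL step: keep whichever of the harmony and its opposite is better.
        cand = min(new, opposite(new, lo, hi), key=f)
        worst = max(range(hms), key=lambda i: f(memory[i]))
        if f(cand) < f(memory[worst]):
            memory[worst] = cand
    return min(memory, key=f)


if __name__ == "__main__":
    sphere = lambda x: sum(v * v for v in x)
    print(hs_obl(sphere, [-5.0, -5.0], [5.0, 5.0]))
```

On a domain that is symmetric around the optimum, such as the sphere function in the usage example, a point and its opposite score the same, so the OBL step only helps on asymmetric problems.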
Approaches and Issues in Location-Aware Continuous Authentication
S. Kurkovsky, Ewa Syta
Convergence of technologies enabling physical and information security makes it possible to combine the features of location-aware and continuous authentication in a single system. We discuss the design of a location-aware continuous authentication system and different implementation approaches that would strike a balance between the usability and security of such a system. Issues of location privacy arising from using such systems are also discussed.
DOI: https://doi.org/10.1109/CSE.2010.42. Published: 2010-12-11.
Citations: 7
Design of Dynamically Reconfigurable Processor for the H.264/AVC Image Prediction and De-blocking Filter
Yukihiko Hayakawa, A. Kanasugi
H.264/AVC provides high video quality at substantially low bit rates. However, the computational complexity of H.264/AVC is very high. A high-speed general-purpose processor is necessary to process H.264/AVC, but it is difficult to use such a processor in a portable device. Therefore, an application-specific processor is necessary. Dynamic reconfiguration can virtually expand the circuit area within a limited chip area. Therefore, this article proposes a dynamically reconfigurable processor for H.264/AVC image prediction. H.264/AVC contains inter-prediction processes and a de-blocking filter, which are not used at the same time. The proposed processor, which dynamically reconfigures those circuits, was designed and synthesized. As a result, the number of look-up tables (LUTs) was reduced by 10%, the number of flip-flops was about the same, and the maximum delay increased by 10%.
DOI: https://doi.org/10.1109/CSE.2010.24. Published: 2010-12-11.
Citations: 0
Implementing Parallel LU Factorization with Pipelining on a MultiCore Using OpenMP
Panagiotis D. Michailidis, K. Margaritis
Recent developments in high performance computer architecture have a significant effect on all fields of scientific computing. Linear algebra, and especially the solution of linear systems of equations, lies at the heart of many applications in scientific computing. This paper describes and analyzes three parallel versions of the dense LU factorization method used in linear system solving on a multicore system using the OpenMP interface. More specifically, we present two naive parallel algorithms based on row block and row cyclic data distribution, and we put special emphasis on presenting a third parallel algorithm based on the pipeline technique. Further, we propose an implementation of the pipelining technique in OpenMP. Experimental results on a multicore CPU show that the proposed OpenMP pipeline implementation achieves good overall performance compared to the other two naive parallel methods. Finally, in this work we propose a simple, fast and reasonable analytical model to predict the performance of the LU decomposition method with the pipelining technique.
DOI: https://doi.org/10.1109/CSE.2010.39. Published: 2010-12-11.
Citations: 16
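The LU factorization entry above contrasts row block and row cyclic data distribution. The sketch below shows a plain sequential dense LU kernel (Doolittle form, no pivoting) together with two small helpers that map a row index to an owning thread under each distribution. It is written in Python for illustration only and does not reproduce the paper's OpenMP code or its pipelined variant; the function names are assumptions.

```python
def lu_in_place(a):
    """Overwrite square matrix `a` with its LU factors (no pivoting).

    After the call, the strict lower triangle holds L (unit diagonal
    implied) and the upper triangle holds U. Assumes nonzero pivots.
    """
    n = len(a)
    for k in range(n - 1):
        # Rows below the pivot row are independent of each other, which is
        # the loop a parallel version would split among threads.
        for i in range(k + 1, n):
            a[i][k] /= a[k][k]
            for j in range(k + 1, n):
                a[i][j] -= a[i][k] * a[k][j]
    return a


def block_owner(row, n, threads):
    """Row block distribution: contiguous chunks of ceil(n/threads) rows."""
    size = -(-n // threads)  # ceiling division
    return row // size


def cyclic_owner(row, threads):
    """Row cyclic distribution: rows dealt out round-robin."""
    return row % threads


if __name__ == "__main__":
    print(lu_in_place([[4.0, 3.0], [6.0, 3.0]]))
    print([block_owner(r, 8, 4) for r in range(8)])
    print([cyclic_owner(r, 4) for r in range(8)])
```

Because the active rows shrink from the top as `k` grows, a block distribution leaves the owners of the first rows idle early, while a cyclic distribution keeps the remaining work spread across threads; this is a general property of the two mappings, not a result taken from the paper.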