首页 > 最新文献

1999 IEEE International Performance, Computing and Communications Conference (Cat. No.99CH36305)最新文献

英文 中文
A new synchronization algorithm for hitless protection switching in ATM networks ATM网络中一种新的无命中保护交换同步算法
A. Iselt
Redundant transmission on disjoint paths is a common precaution against the effects of failures in communication networks. To allow hitless switching from one path to the other, synchronization algorithms are required. Most of the current algorithms are based on supplementary synchronization information (e.g. synchronization cells). A novel algorithm for the synchronization and selection of redundant data streams is proposed, which allows hitless switching without supplementary information. The algorithm achieves synchronization based on the correlation of the user information. Further, it provides a fast synchronization detection time and is tolerant against loss and errors of single cells in one data path. Analytical analysis as well as simulation show the correctness and efficiency of the novel algorithm.
在不相交的路径上进行冗余传输是防止通信网络故障影响的常用预防措施。为了允许从一条路径到另一条路径的无命中切换,需要同步算法。目前大多数算法都是基于补充同步信息(如同步单元)。提出了一种新的冗余数据流同步和选择算法,该算法可以在不需要补充信息的情况下实现无命中切换。该算法基于用户信息的相关性实现同步。此外,它提供了快速的同步检测时间,并且可以容忍单个数据路径中的单个单元的丢失和错误。分析和仿真结果表明了该算法的正确性和有效性。
{"title":"A new synchronization algorithm for hitless protection switching in ATM networks","authors":"A. Iselt","doi":"10.1109/PCCC.1999.749461","DOIUrl":"https://doi.org/10.1109/PCCC.1999.749461","url":null,"abstract":"Redundant transmission on disjoint paths is a common precaution against the effects of failures in communication networks. To allow hitless switching from one path to the other, synchronization algorithms are required. Most of the current algorithms are based on supplementary synchronization information (e.g. synchronization cells). A novel algorithm for the synchronization and selection of redundant data streams is proposed, which allows hitless switching without supplementary information. The algorithm achieves synchronization based on the correlation of the user information. Further, it provides a fast synchronization detection time and is tolerant against loss and errors of single cells in one data path. Analytical analysis as well as simulation show the correctness and efficiency of the novel algorithm.","PeriodicalId":211210,"journal":{"name":"1999 IEEE International Performance, Computing and Communications Conference (Cat. No.99CH36305)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1999-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124818147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Equivalent permutation capabilities between time division optical omega network and non-optical extra stage omega network 时分光欧米伽网络与非光额外级欧米伽网络的等效置换能力
Xiaojun Shen, Fan Yang, Yi Pan
Since optical networks must support crosstalk-free (CF) connections, regular N/spl times/N multistage interconnection networks (MINs) cannot be directly used to provide connections between N inputs and N outputs. A solution is to use a 2N/spl times/2N Cube-Type MIN to provide the N/spl times/N connections. C. Qiao (1993) proposed a time domain approach that divides the N optical inputs into several groups such that CF connections can be provided by an N/spl times/N regular MIN in several time slots, one for each group. The set of N-permutations realizable in two slots is defined as class /spl theta/. Qiao proved that the size of /spl theta/ is larger than the size of class /spl Omega/, where /spl Omega/ consists of all N-permutations passible through a regular N/spl times/N Omega network. The paper first presents an optimal O(NlgN) time algorithm for identifying whether a given permutation belongs to class /spl theta/ or not. From this algorithm, the paper proves an interesting fact that the class /spl theta/ is identical to the class /spl Omega/+1 which represents the set of N-permutations admissible by a non-optical N/spl times/N one-extra stage Omega network.
由于光网络必须支持无串扰(CF)连接,常规的N/spl倍/N多级互连网络(min)不能直接用于提供N个输入和N个输出之间的连接。一种解决方案是使用2N/spl times/2N Cube-Type MIN来提供N/spl times/N连接。C. Qiao(1993)提出了一种时域方法,将N个光输入分成几组,这样CF连接可以在多个时隙中由N/spl次/N次正则MIN提供,每组一个。定义两个槽中可实现的n个排列的集合为class /spl theta/。Qiao证明了/spl theta/的大小大于/spl Omega/类的大小,其中/spl Omega/由一个规则的N/spl乘以/N ω网络中可能存在的所有N个排列组成。本文首先提出了一种最优的O(NlgN)时间算法来识别给定的排列是否属于/spl theta/类。本文从该算法中证明了一个有趣的事实,即类/spl θ /与类/spl ω /+1是相同的,该类/spl ω /+1表示非光学N/spl乘以/N一阶ω网络所允许的N个排列的集合。
{"title":"Equivalent permutation capabilities between time division optical omega network and non-optical extra stage omega network","authors":"Xiaojun Shen, Fan Yang, Yi Pan","doi":"10.1109/PCCC.1999.749459","DOIUrl":"https://doi.org/10.1109/PCCC.1999.749459","url":null,"abstract":"Since optical networks must support crosstalk-free (CF) connections, regular N/spl times/N multistage interconnection networks (MINs) cannot be directly used to provide connections between N inputs and N outputs. A solution is to use a 2N/spl times/2N Cube-Type MIN to provide the N/spl times/N connections. C. Qiao (1993) proposed a time domain approach that divides the N optical inputs into several groups such that CF connections can be provided by an N/spl times/N regular MIN in several time slots, one for each group. The set of N-permutations realizable in two slots is defined as class /spl theta/. Qiao proved that the size of /spl theta/ is larger than the size of class /spl Omega/, where /spl Omega/ consists of all N-permutations passible through a regular N/spl times/N Omega network. The paper first presents an optimal O(NlgN) time algorithm for identifying whether a given permutation belongs to class /spl theta/ or not. From this algorithm, the paper proves an interesting fact that the class /spl theta/ is identical to the class /spl Omega/+1 which represents the set of N-permutations admissible by a non-optical N/spl times/N one-extra stage Omega network.","PeriodicalId":211210,"journal":{"name":"1999 IEEE International Performance, Computing and Communications Conference (Cat. No.99CH36305)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1999-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126823281","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A simulation performance study of TCP Vegas and Random Early Detection TCP Vegas和随机早期检测的仿真性能研究
A. Raghavendra, R. Kinicki
This paper presents results from a series of simulation experiments designed to study network performance when TCP Vegas and the Random Early Detection (RED) algorithm are used together. Using the x-Sim network simulator, simulation tests were conducted with a variety of traffic scenarios using both TCP Reno and TCP Vegas in combination with First-Come-First-Served (FCFS) and RED routers. Analysis of the performance results for three configurations shows that TCP Vegas host-dominated configurations perform better than TCP Reno host-dominated configurations and that the RED algorithm provides better fairness than FCFS. The RED improvement is higher when the configuration includes a mix of Reno and Vegas hosts. Furthermore, when the buffer allocation in a FCFS router is inadequate, the router performance becomes unstable under heavy traffic. When fragile and robust TCP sources send traffic into a congested router, the TCP Vegas/RED router combination provides the highest throughput and best fairness for the fragile hosts.
本文介绍了一系列仿真实验的结果,这些实验旨在研究TCP Vegas和随机早期检测(RED)算法一起使用时的网络性能。使用x-Sim网络模拟器,使用TCP Reno和TCP Vegas结合先到先得(FCFS)和RED路由器对各种流量场景进行了模拟测试。对三种配置的性能结果分析表明,TCP Vegas主机主导配置的性能优于TCP Reno主机主导配置,RED算法的公平性优于FCFS。当配置包括雷诺和拉斯维加斯主机的混合时,RED改进更高。此外,当FCFS路由器中的缓冲区分配不足时,在大流量下,路由器的性能会变得不稳定。当脆弱和健壮的TCP源向拥塞的路由器发送流量时,TCP Vegas/RED路由器组合为脆弱的主机提供最高的吞吐量和最佳的公平性。
{"title":"A simulation performance study of TCP Vegas and Random Early Detection","authors":"A. Raghavendra, R. Kinicki","doi":"10.1109/PCCC.1999.749435","DOIUrl":"https://doi.org/10.1109/PCCC.1999.749435","url":null,"abstract":"This paper presents results from a series of simulation experiments designed to study network performance when TCP Vegas and the Random Early Detection (RED) algorithm are used together. Using the x-Sim network simulator, simulation tests were conducted with a variety of traffic scenarios using both TCP Reno and TCP Vegas in combination with First-Come-First-Served (FCFS) and RED routers. Analysis of the performance results for three configurations shows that TCP Vegas host-dominated configurations perform better than TCP Reno host-dominated configurations and that the RED algorithm provides better fairness than FCFS. The RED improvement is higher when the configuration includes a mix of Reno and Vegas hosts. Furthermore, when the buffer allocation in a FCFS router is inadequate, the router performance becomes unstable under heavy traffic. When fragile and robust TCP sources send traffic into a congested router, the TCP Vegas/RED router combination provides the highest throughput and best fairness for the fragile hosts.","PeriodicalId":211210,"journal":{"name":"1999 IEEE International Performance, Computing and Communications Conference (Cat. No.99CH36305)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1999-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128423817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Session traces: an enhancement to network simulator 会话跟踪:对网络模拟器的增强
S. Cooper, T. Bowman, A. Karshmer
This paper discusses a methodology for lifting a sessions trace from a packet trace. The session trace is created from a packet trace by removing many of the physical and network layer control mechanisms from the packet trace. The session trace embodies the true client/server transaction induced on the network link. We show that the session trace is superior to a packet trace for network simulation workloads in Network Simulator.
本文讨论了一种从数据包跟踪中提取会话跟踪的方法。会话跟踪是通过从数据包跟踪中删除许多物理层和网络层控制机制,从数据包跟踪中创建的。会话跟踪体现了在网络链路上产生的真实的客户机/服务器事务。我们展示了在network Simulator中对网络模拟工作负载进行会话跟踪优于数据包跟踪。
{"title":"Session traces: an enhancement to network simulator","authors":"S. Cooper, T. Bowman, A. Karshmer","doi":"10.1109/PCCC.1999.749442","DOIUrl":"https://doi.org/10.1109/PCCC.1999.749442","url":null,"abstract":"This paper discusses a methodology for lifting a sessions trace from a packet trace. The session trace is created from a packet trace by removing many of the physical and network layer control mechanisms from the packet trace. The session trace embodies the true client/server transaction induced on the network link. We show that the session trace is superior to a packet trace for network simulation workloads in Network Simulator.","PeriodicalId":211210,"journal":{"name":"1999 IEEE International Performance, Computing and Communications Conference (Cat. No.99CH36305)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1999-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130787302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance analysis of intelligent mobile ATM networks 智能移动ATM网络性能分析
C. B. Ahmed, N. Boudriga, M. Obaidat
An intelligent network (IN) is characterized by the distribution of network intelligence and capabilities wherever required within the telecommunications network. IN is also an architectural concept that can be applied to a variety of telecommunication networks including the public switched networks, mobile networks, and the integrated services digital networks (ISDNs). In this paper, we analyze and evaluate the performance of wireless ATM network equipped with intelligent services. The mathematical analysis uses the concept of virtual resource, while the simulation uses the object-oriented (O-O) scheme.
智能网络(IN)的特点是在电信网络中需要的任何地方分布网络智能和功能。IN也是一种架构概念,可以应用于各种电信网络,包括公共交换网络、移动网络和综合业务数字网络(isdn)。本文对具有智能业务的无线ATM网络的性能进行了分析和评价。数学分析采用虚拟资源的概念,仿真采用面向对象(O-O)方案。
{"title":"Performance analysis of intelligent mobile ATM networks","authors":"C. B. Ahmed, N. Boudriga, M. Obaidat","doi":"10.1109/PCCC.1999.749424","DOIUrl":"https://doi.org/10.1109/PCCC.1999.749424","url":null,"abstract":"An intelligent network (IN) is characterized by the distribution of network intelligence and capabilities wherever required within the telecommunications network. IN is also an architectural concept that can be applied to a variety of telecommunication networks including the public switched networks, mobile networks, and the integrated services digital networks (ISDNs). In this paper, we analyze and evaluate the performance of wireless ATM network equipped with intelligent services. The mathematical analysis uses the concept of virtual resource, while the simulation uses the object-oriented (O-O) scheme.","PeriodicalId":211210,"journal":{"name":"1999 IEEE International Performance, Computing and Communications Conference (Cat. No.99CH36305)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1999-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126567756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Measurement, analysis and performance improvement of the Apache Web server Apache Web服务器的测量、分析和性能改进
Yimin Hu, Ashwini K. Nanda, Qing Yang
Performance of Web servers is critical to the success of many corporations and organizations. However, very few results have been published that quantitatively study the server behavior and identify the performance bottlenecks. In this paper we measured and analyzed the behavior of the popular Apache Web server on a uniprocessor system and a 4-CPU SMP (Symmetric Multi-Processor) system running the IBM AIX operating system. Using the AIX built-in tracing facility, we obtained detailed information on kernel events and system activities while running Apache driven by the SPECweb96 and the WebStone benchmarks. After quantitatively identifying the performance bottlenecks, we proposed and implemented 6 techniques that improve the throughput of Apache by 61%. These techniques are general purpose and can be applied to other Web servers as well.
Web服务器的性能对许多公司和组织的成功至关重要。然而,很少有人发表定量研究服务器行为和确定性能瓶颈的结果。在本文中,我们测量并分析了流行的Apache Web服务器在一个单处理器系统和一个运行IBM AIX操作系统的4 cpu SMP(对称多处理器)系统上的行为。使用AIX内置跟踪工具,我们获得了运行由SPECweb96和WebStone基准测试驱动的Apache时的内核事件和系统活动的详细信息。在定量地确定性能瓶颈之后,我们提出并实现了6项技术,这些技术将Apache的吞吐量提高了61%。这些技术是通用的,也可以应用于其他Web服务器。
{"title":"Measurement, analysis and performance improvement of the Apache Web server","authors":"Yimin Hu, Ashwini K. Nanda, Qing Yang","doi":"10.1109/PCCC.1999.749447","DOIUrl":"https://doi.org/10.1109/PCCC.1999.749447","url":null,"abstract":"Performance of Web servers is critical to the success of many corporations and organizations. However, very few results have been published that quantitatively study the server behavior and identify the performance bottlenecks. In this paper we measured and analyzed the behavior of the popular Apache Web server on a uniprocessor system and a 4-CPU SMP (Symmetric Multi-Processor) system running the IBM AIX operating system. Using the AIX built-in tracing facility, we obtained detailed information on kernel events and system activities while running Apache driven by the SPECweb96 and the WebStone benchmarks. After quantitatively identifying the performance bottlenecks, we proposed and implemented 6 techniques that improve the throughput of Apache by 61%. These techniques are general purpose and can be applied to other Web servers as well.","PeriodicalId":211210,"journal":{"name":"1999 IEEE International Performance, Computing and Communications Conference (Cat. No.99CH36305)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1999-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115890682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 140
Design and performance of a general-purpose software cache 通用软件缓存的设计与性能
A. Iyengar
This paper describes a General-Purpose Software cache (GPS cache) which can improve the performance of many applications including Web servers and databases. It can service several hundred thousand cache hits per second on a uniprocessor. When used to cache data for a Web server accelerator, the overhead due to the GPS cache was an insignificant factor in the overall performance of the system. The GPS cache can store objects in memory, on disk, or both. The cache uses a new algorithm for managing expiration times of cached objects which is more efficient than previous ones. The GPS cache uses Data Update Propagation (DUP) to invalidate complex objects which is crucial for caching and maintaining updated copies of dynamic Web pages. Transactions can be logged using different buffering mechanisms in order to provide a balance between efficiency and currency of transaction log files. The GPS cache provides API functions which allow applications to directly manipulate its contents.
本文介绍了一种通用软件缓存(GPS cache),它可以提高Web服务器和数据库等应用程序的性能。它可以在单处理器上每秒提供数十万次缓存命中。当用于为Web服务器加速器缓存数据时,由于GPS缓存而产生的开销在系统的整体性能中是一个无关紧要的因素。GPS缓存可以将对象存储在内存、磁盘或两者中。该缓存使用了一种新的算法来管理缓存对象的过期时间,比以前的算法更有效。GPS缓存使用数据更新传播(Data Update Propagation, DUP)使复杂对象失效,这对于缓存和维护动态Web页面的更新副本至关重要。可以使用不同的缓冲机制记录事务,以便在事务日志文件的效率和流通之间取得平衡。GPS缓存提供API函数,允许应用程序直接操作其内容。
{"title":"Design and performance of a general-purpose software cache","authors":"A. Iyengar","doi":"10.1109/PCCC.1999.749456","DOIUrl":"https://doi.org/10.1109/PCCC.1999.749456","url":null,"abstract":"This paper describes a General-Purpose Software cache (GPS cache) which can improve the performance of many applications including Web servers and databases. It can service several hundred thousand cache hits per second on a uniprocessor. When used to cache data for a Web server accelerator, the overhead due to the GPS cache was an insignificant factor in the overall performance of the system. The GPS cache can store objects in memory, on disk, or both. The cache uses a new algorithm for managing expiration times of cached objects which is more efficient than previous ones. The GPS cache uses Data Update Propagation (DUP) to invalidate complex objects which is crucial for caching and maintaining updated copies of dynamic Web pages. Transactions can be logged using different buffering mechanisms in order to provide a balance between efficiency and currency of transaction log files. The GPS cache provides API functions which allow applications to directly manipulate its contents.","PeriodicalId":211210,"journal":{"name":"1999 IEEE International Performance, Computing and Communications Conference (Cat. No.99CH36305)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1999-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121992648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 29
On a virtual wavelength translation scheme for routing in all-optical networks 全光网络中路由的虚拟波长转换方案
S. Bandyopadhyay, A. Jaekel, A. Sengupta
Commercially available fiber-optic cables allow limited numbers of wavelengths on a single fiber. In this paper we have developed a pragmatic method of implementing all-optical wavelength routed networks using bundles of fibers to realize each communication link. We have explored some interesting features of this approach for dynamic lightpath assignment. We have shown that our approach allows virtual wavelength translation where we may derive some benefits of partial wavelength translation without actually having to change the carrier frequency. This design also allows us to have bundles of varying sizes to accommodate anticipated differences in traffic through different communication links of the network. Finally, we have presented a simple dynamic scheme, based on local information alone, for allocating lightpaths.
商用光纤电缆允许在一根光纤上传输有限的波长。在本文中,我们开发了一种实用的实现全光波长路由网络的方法,使用光纤束来实现每个通信链路。我们已经探索了这种动态光路分配方法的一些有趣的特性。我们已经证明,我们的方法允许虚拟波长平移,我们可以在不改变载波频率的情况下获得部分波长平移的一些好处。这种设计还允许我们拥有不同大小的数据包,以适应通过网络不同通信链路的预期流量差异。最后,我们提出了一个简单的动态方案,仅基于局部信息来分配光路。
{"title":"On a virtual wavelength translation scheme for routing in all-optical networks","authors":"S. Bandyopadhyay, A. Jaekel, A. Sengupta","doi":"10.1109/PCCC.1999.749465","DOIUrl":"https://doi.org/10.1109/PCCC.1999.749465","url":null,"abstract":"Commercially available fiber-optic cables allow limited numbers of wavelengths on a single fiber. In this paper we have developed a pragmatic method of implementing all-optical wavelength routed networks using bundles of fibers to realize each communication link. We have explored some interesting features of this approach for dynamic lightpath assignment. We have shown that our approach allows virtual wavelength translation where we may derive some benefits of partial wavelength translation without actually having to change the carrier frequency. This design also allows us to have bundles of varying sizes to accommodate anticipated differences in traffic through different communication links of the network. Finally, we have presented a simple dynamic scheme, based on local information alone, for allocating lightpaths.","PeriodicalId":211210,"journal":{"name":"1999 IEEE International Performance, Computing and Communications Conference (Cat. No.99CH36305)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1999-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128170319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Validation of Turandot, a fast processor model for microarchitecture exploration 验证图兰朵,一个快速处理器模型的微架构探索
M. Moudgill, P. Bose, J. Moreno
We describe the results in validating the performance projections from a parameterized trace-driven simulation model of a speculative out-of-order superscalar processor which has been developed with the objective of acting as a microarchitecture exploration tool. Because of its objective, the model-called Turandot-has been designed to deliver much higher simulation speed than what is achieved from detailed (RTL) processor models. We summarize the validation methodology used, and present experimental data gathered in the calibration of one processor organization modeled with Turandot against a detailed reference model. The results indicate that, on the average for SPECint95 sampled traces, Turandot is within 5% of the results reported by the reference model while exhibiting a speed-up factor of about 70.
我们描述了从一个推测的无序超标标量处理器的参数化跟踪驱动仿真模型验证性能预测的结果,该模型是为了作为微架构探索工具而开发的。因为它的目标,这个被称为图兰朵的模型被设计为提供比详细的(RTL)处理器模型更高的仿真速度。我们总结了所使用的验证方法,并根据详细的参考模型,在图兰朵建模的一个处理器组织的校准中收集了实验数据。结果表明,在SPECint95采样痕迹的平均值上,图兰朵在参考模型报告的结果的5%以内,同时显示出大约70的加速因子。
{"title":"Validation of Turandot, a fast processor model for microarchitecture exploration","authors":"M. Moudgill, P. Bose, J. Moreno","doi":"10.1109/PCCC.1999.749471","DOIUrl":"https://doi.org/10.1109/PCCC.1999.749471","url":null,"abstract":"We describe the results in validating the performance projections from a parameterized trace-driven simulation model of a speculative out-of-order superscalar processor which has been developed with the objective of acting as a microarchitecture exploration tool. Because of its objective, the model-called Turandot-has been designed to deliver much higher simulation speed than what is achieved from detailed (RTL) processor models. We summarize the validation methodology used, and present experimental data gathered in the calibration of one processor organization modeled with Turandot against a detailed reference model. The results indicate that, on the average for SPECint95 sampled traces, Turandot is within 5% of the results reported by the reference model while exhibiting a speed-up factor of about 70.","PeriodicalId":211210,"journal":{"name":"1999 IEEE International Performance, Computing and Communications Conference (Cat. No.99CH36305)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1999-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114612865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 72
Sidney and RDS: an evaluation of two persistent storage systems Sidney和RDS:两个持久存储系统的评估
S. Nettles
Comparative experimentation is increasingly important in computer science, but performing such experiments can be challenging. The paper presents a set of experiments that compare the performance of two persistent storage managers, and answer the question of whether the safer storage manager has performance comparable to the less safe one. This comparison was difficult for a number of reasons, among them: relatively few programs using either storage manager existed and no established benchmarks existed, and the two techniques are incompatible at the source code level, thus making a direct comparison impossible. In particular one storage manager used a malloc-and-free style of dynamic storage allocation, while the other used a high performance concurrent garbage collector. A number of approaches were used to overcome this difficulty. The most novel approach involved tracing the memory management of a production program that used the malloc-and-free based storage manager and then replaying the trace in an environment that allowed garbage collection and malloc-and-free to be compared. The study represents the most extensive study of a garbage collected persistent storage system to date.
比较实验在计算机科学中越来越重要,但进行这样的实验可能具有挑战性。本文提出了一组实验,比较了两种持久存储管理器的性能,并回答了更安全的存储管理器是否与不安全的存储管理器具有可比性的问题。由于许多原因,这种比较比较困难,其中包括:使用存储管理器的程序相对较少,并且没有建立的基准测试,并且这两种技术在源代码级别不兼容,因此不可能进行直接比较。具体来说,一个存储管理器使用malloc-and-free风格的动态存储分配,而另一个使用高性能并发垃圾收集器。为了克服这一困难,采用了许多方法。最新颖的方法涉及跟踪使用基于malloc和free存储管理器的生产程序的内存管理,然后在允许比较垃圾收集和malloc和free的环境中重播跟踪。该研究代表了迄今为止对垃圾收集持久性存储系统进行的最广泛的研究。
{"title":"Sidney and RDS: an evaluation of two persistent storage systems","authors":"S. Nettles","doi":"10.1109/PCCC.1999.749457","DOIUrl":"https://doi.org/10.1109/PCCC.1999.749457","url":null,"abstract":"Comparative experimentation is increasingly important in computer science, but performing such experiments can be challenging. The paper presents a set of experiments that compare the performance of two persistent storage managers, and answer the question of whether the safer storage manager has performance comparable to the less safe one. This comparison was difficult for a number of reasons, among them: relatively few programs using either storage manager existed and no established benchmarks existed, and the two techniques are incompatible at the source code level, thus making a direct comparison impossible. In particular one storage manager used a malloc-and-free style of dynamic storage allocation, while the other used a high performance concurrent garbage collector. A number of approaches were used to overcome this difficulty. The most novel approach involved tracing the memory management of a production program that used the malloc-and-free based storage manager and then replaying the trace in an environment that allowed garbage collection and malloc-and-free to be compared. The study represents the most extensive study of a garbage collected persistent storage system to date.","PeriodicalId":211210,"journal":{"name":"1999 IEEE International Performance, Computing and Communications Conference (Cat. No.99CH36305)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1999-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115422889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
1999 IEEE International Performance, Computing and Communications Conference (Cat. No.99CH36305)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1