{"title":"使用硬件加速器的快速memset()实现:扩展抽象","authors":"K. Pusukuri, R. Gardner, Jared C. Smolens","doi":"10.1145/3127479.3132573","DOIUrl":null,"url":null,"abstract":"Multicore systems with large caches and huge main memories have become ubiquitous. They provide an attractive opportunity to maximize performance of big-memory applications such as in-memory databases, key-value stores, and graph analytics. However, these big-memory applications require many virtual-to-physical address translations, which increase TLB miss rate and hurt performance. To address this problem, modern hardware and OSes introduced support for huge pages. For example, on SPARC M7, Linux supports 8MB, 2GB, and 16GB huge pages (in addition to the default 8KB). Likewise, Linux supports 2MB and 1GB huge pages on Intel Xeon (E5-2630) platforms.","PeriodicalId":20679,"journal":{"name":"Proceedings of the 2017 Symposium on Cloud Computing","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2017-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An implementation of fast memset() using hardware accelerators: extended abstract\",\"authors\":\"K. Pusukuri, R. Gardner, Jared C. Smolens\",\"doi\":\"10.1145/3127479.3132573\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multicore systems with large caches and huge main memories have become ubiquitous. They provide an attractive opportunity to maximize performance of big-memory applications such as in-memory databases, key-value stores, and graph analytics. However, these big-memory applications require many virtual-to-physical address translations, which increase TLB miss rate and hurt performance. To address this problem, modern hardware and OSes introduced support for huge pages. For example, on SPARC M7, Linux supports 8MB, 2GB, and 16GB huge pages (in addition to the default 8KB). Likewise, Linux supports 2MB and 1GB huge pages on Intel Xeon (E5-2630) platforms.\",\"PeriodicalId\":20679,\"journal\":{\"name\":\"Proceedings of the 2017 Symposium on Cloud Computing\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-09-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2017 Symposium on Cloud Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3127479.3132573\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2017 Symposium on Cloud Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3127479.3132573","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An implementation of fast memset() using hardware accelerators: extended abstract
Multicore systems with large caches and huge main memories have become ubiquitous. They provide an attractive opportunity to maximize performance of big-memory applications such as in-memory databases, key-value stores, and graph analytics. However, these big-memory applications require many virtual-to-physical address translations, which increase TLB miss rate and hurt performance. To address this problem, modern hardware and OSes introduced support for huge pages. For example, on SPARC M7, Linux supports 8MB, 2GB, and 16GB huge pages (in addition to the default 8KB). Likewise, Linux supports 2MB and 1GB huge pages on Intel Xeon (E5-2630) platforms.