H. Alissa, K. Nemati, Udaya L. N. Puvvadi, B. Sammakia, K. Ghose, M. Seymour, Russell Tipton, Ken Schneebeli
{"title":"Empirical analysis of blower cooling failure in containment: Effects on IT performance","authors":"H. Alissa, K. Nemati, Udaya L. N. Puvvadi, B. Sammakia, K. Ghose, M. Seymour, Russell Tipton, Ken Schneebeli","doi":"10.1109/ITHERM.2016.7517716","DOIUrl":null,"url":null,"abstract":"Data Centers are prone to power outages and cooling failures. During such events, complex transport interactions take place between the cooling system and the IT. Empirical data on this phenomenon is scarce in the current literature due to the complexity and size of such experiments. In this study, a facility level data center blowers cooling failure experiment is run and analyzed. Quantitative instrumentation includes pressure differentials, tile airflow, point air inlet temperature, contours air inlet temperature and IT IPMI data during failure-recovery. Qualitative measurements include IR imaging and airflow visualization via smoke trace. To our knowledge, this is the first experimental study in literature in which an actual multi aisle facility cooling failure is run with real IT (compute, Network and storage) load in the white space. This will enable a link between variations from the facility to the chip levels. Results show that by using external air inlet temperature sensors the containment configuration has a longer uptime during failure. However, the IPMI data shows the opposite. In fact, the RTT is reduced by ~70% when the external and internal sensors are compared. This occurs due external impedances formed by the containment during failure degrading IT airflow systems. The inconsistency between IT IPMI inlet sensors and externally placed IT or rack inlet sensors (based on best practices) are expected to increase as the airflow imbalances increase.","PeriodicalId":426908,"journal":{"name":"2016 15th IEEE Intersociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems (ITherm)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 15th IEEE Intersociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems (ITherm)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITHERM.2016.7517716","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Data Centers are prone to power outages and cooling failures. During such events, complex transport interactions take place between the cooling system and the IT. Empirical data on this phenomenon is scarce in the current literature due to the complexity and size of such experiments. In this study, a facility level data center blowers cooling failure experiment is run and analyzed. Quantitative instrumentation includes pressure differentials, tile airflow, point air inlet temperature, contours air inlet temperature and IT IPMI data during failure-recovery. Qualitative measurements include IR imaging and airflow visualization via smoke trace. To our knowledge, this is the first experimental study in literature in which an actual multi aisle facility cooling failure is run with real IT (compute, Network and storage) load in the white space. This will enable a link between variations from the facility to the chip levels. Results show that by using external air inlet temperature sensors the containment configuration has a longer uptime during failure. However, the IPMI data shows the opposite. In fact, the RTT is reduced by ~70% when the external and internal sensors are compared. This occurs due external impedances formed by the containment during failure degrading IT airflow systems. The inconsistency between IT IPMI inlet sensors and externally placed IT or rack inlet sensors (based on best practices) are expected to increase as the airflow imbalances increase.