{"title":"A Low Power Complete Charge-Recycling Bus Architecture for Ultra-High Data Rate Ulsi's","authors":"H. Yamauchi, H. Akamatsu, T. Fujita","doi":"10.1109/VLSIC.1994.586172","DOIUrl":null,"url":null,"abstract":"The ultra-high data rate of 25Gbls and beyond is one of the most important design requirements in realizing the future ULSI's for super highdefiition (HD) moving pictures and graphics applications in consumer electronics. The most effective means to achieve such a data rate is to employ a large number of the buses interconnecting the embedded memory, the graphics controller, efc, on a ULSI system chip. For example, even at an operating frequency of SOMHz, parallel buses of more than 512 are required. However, a drastic increase of the power dissipation is inevitable due to the increased bus-capacitance. Even if suppressing the bus-swing to less than 1V[1], there still remains a bus-power dissipation of a far above 5OOmW, which is intolerable for battery operation as shown in Fig.1. Hence, this paper proposes a complete Charge-Recycling Bus(CRB) architecture that can reduce the bus-power dissipation to less than 15% of the conventional suppressed bus-swing scheme while realizing the data rate of 25Gbts. t of New Bus ( As shown in Fig. 2, even when using suppressed bus-swing (Vcc/k) scheme of Conv.Bq(k=4) coupled with a down converter, the power cannot be reduced adequately for battery operation. This is because there still remain a contribution from the total bus-capacitance nCd, where n and Cd are the number of total bus pairs and the capacitance of the individual bus, respectively. The key concept of the CRB architecture is virtual stacking of the individual buscapacitance Cd into a series configuration to reduce not only the bus-swing but also the total equivalent bus-capacitance. When the practical data-path layouts in ULSrs are considered, data buses are classified into a local bus (path width = 2 bit ) and a global bus (path width 2 4 bit) depending on the hierarchy of data-path. Hence, we propose the Local and Global CRB schemes for the local and global buses, respectively. The Local CRB scheme is virtual connection of two bus-capacitance Cd and a smaller dummy capacitance Cs in series while the Global CRB scheme is virtual stacking of numerous buscapacitance Cd in series. Regarding the i-th complementary bus pair in the two CBR schemes, the input INi and the equalization signal EQ establish not only the output signals, Di and XDi, but also the bus-level signals at nodes H and L as follows: 1)in former half of the clock cycle, the EQ signal synchronized with the system clock equalize Di and XDi of the complementary bus pair ; 2)in latter half of the clock cycle, the input signal INi switches higher (or lower) level of the bus-output pair Di and XDi to the node H (or L) according to the m t h table in Fig.3. In addition, the clocked altemating dummy Cs pair inserted between two pairs of complementary bus is an important element, especially when the number of the buses running in parallel is short as the case of the Local CRB scheme, where the clocked signal CLH of a half of the system clock frequency altemates the connections of the A and E) nodes of the dummy capacitor pair Cs with the nodes H and L as shown in Fig. 3. The series connection of the numerous bus capacitance Cd and the dummy capacitor Cs is realized by joining the node L of the i-lth bus (or capacitor) pair to the node H of the i-th bus (or capacitor) pair as shown in Fig.3. For example, in the Local CRB scheme, when IN1 = \"1\" and CLH = \"0\", D1 and XD1 are connected to Vcc and the node B of the Cs, respectively. On the other hand, in the Global CRB scheme, when INi-1 = \"1\". INi = \"0\" and M i + l = \"0, Di and XDi are connected to XDitl of the lower adjacent bus pair and XDi-1 of the upper adjacent bus pair, respectively.","PeriodicalId":350730,"journal":{"name":"Proceedings of 1994 IEEE Symposium on VLSI Circuits","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of 1994 IEEE Symposium on VLSI Circuits","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VLSIC.1994.586172","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
The ultra-high data rate of 25Gbls and beyond is one of the most important design requirements in realizing the future ULSI's for super highdefiition (HD) moving pictures and graphics applications in consumer electronics. The most effective means to achieve such a data rate is to employ a large number of the buses interconnecting the embedded memory, the graphics controller, efc, on a ULSI system chip. For example, even at an operating frequency of SOMHz, parallel buses of more than 512 are required. However, a drastic increase of the power dissipation is inevitable due to the increased bus-capacitance. Even if suppressing the bus-swing to less than 1V[1], there still remains a bus-power dissipation of a far above 5OOmW, which is intolerable for battery operation as shown in Fig.1. Hence, this paper proposes a complete Charge-Recycling Bus(CRB) architecture that can reduce the bus-power dissipation to less than 15% of the conventional suppressed bus-swing scheme while realizing the data rate of 25Gbts. t of New Bus ( As shown in Fig. 2, even when using suppressed bus-swing (Vcc/k) scheme of Conv.Bq(k=4) coupled with a down converter, the power cannot be reduced adequately for battery operation. This is because there still remain a contribution from the total bus-capacitance nCd, where n and Cd are the number of total bus pairs and the capacitance of the individual bus, respectively. The key concept of the CRB architecture is virtual stacking of the individual buscapacitance Cd into a series configuration to reduce not only the bus-swing but also the total equivalent bus-capacitance. When the practical data-path layouts in ULSrs are considered, data buses are classified into a local bus (path width = 2 bit ) and a global bus (path width 2 4 bit) depending on the hierarchy of data-path. Hence, we propose the Local and Global CRB schemes for the local and global buses, respectively. The Local CRB scheme is virtual connection of two bus-capacitance Cd and a smaller dummy capacitance Cs in series while the Global CRB scheme is virtual stacking of numerous buscapacitance Cd in series. Regarding the i-th complementary bus pair in the two CBR schemes, the input INi and the equalization signal EQ establish not only the output signals, Di and XDi, but also the bus-level signals at nodes H and L as follows: 1)in former half of the clock cycle, the EQ signal synchronized with the system clock equalize Di and XDi of the complementary bus pair ; 2)in latter half of the clock cycle, the input signal INi switches higher (or lower) level of the bus-output pair Di and XDi to the node H (or L) according to the m t h table in Fig.3. In addition, the clocked altemating dummy Cs pair inserted between two pairs of complementary bus is an important element, especially when the number of the buses running in parallel is short as the case of the Local CRB scheme, where the clocked signal CLH of a half of the system clock frequency altemates the connections of the A and E) nodes of the dummy capacitor pair Cs with the nodes H and L as shown in Fig. 3. The series connection of the numerous bus capacitance Cd and the dummy capacitor Cs is realized by joining the node L of the i-lth bus (or capacitor) pair to the node H of the i-th bus (or capacitor) pair as shown in Fig.3. For example, in the Local CRB scheme, when IN1 = "1" and CLH = "0", D1 and XD1 are connected to Vcc and the node B of the Cs, respectively. On the other hand, in the Global CRB scheme, when INi-1 = "1". INi = "0" and M i + l = "0, Di and XDi are connected to XDitl of the lower adjacent bus pair and XDi-1 of the upper adjacent bus pair, respectively.