# Timing Analysis Considering IR Drop Waveforms in Power Gating Designs

Shih-Hung Weng, Yu-Min Kuo, Shih-Chieh Chang, Malgorzata Marek-Sadowska\*\* Department of CS, National Tsing Hua University, Hsinchu, Taiwan \*\*Department of ECE, University of California, Santa Barbara, CA {shweng, ymkuo, scchang}@cs.nthu.edu.tw, mms@ece.ucsb.edu

Abstract—IR drop noise has become a critical issue in advanced process technologies. Traditionally, timing analysis in which the IR drop noise is considered assumes a worst-case IR drop for each gate; however, using this assumption provides unduly pessimistic results. In this paper, we describe a timing analysis approach for power gating designs. To improve the accuracy of the gate delay calculation we determine the virtual voltage level by taking into account the IR drop waveforms across the sleep transistors. These can be obtained efficiently using a linear programming approach. Our experimental results are very promising.

#### I. INTRODUCTION

As the VLSI design process continues to scale down, the noise problem has become critical because it may degrade a circuit's performance and cause erroneous output values. Supply and reference voltage fluctuations are sources of IR drop noise. Traditionally, after the worst-case IR drop is determined, it is used by the timing analyzer to estimate a circuit's delay; however, using the worst-case IR drop can provide unnecessarily pessimistic results. Because IR drop depends on the applied input vectors, IR drop varies over time and is different for different nodes on the power grid. It is also well known that the worst cases of IR drop only occur occasionally.

There have been several previous works [1][3][5][11][14] that address timing analysis with IR drop noise. Researchers [1][14] consider the variations of both power and ground voltages. The authors of [5] calculate circuit delay with IR drop noise by modeling it as an effective output load. Researchers in [11] provide a pattern searching technique for producing the worst-case delay on the critical paths. The authors of [3] present a pattern-independent method to calculate a tight upper bound on the delay along the selected path with IR drop noise.

Unlike previous works that focus on the IR drop of a power grid network, we studied the IR drop effect in a power gating design [2][7][12][13][16][17] in which the IR

drop problem is even more severe. Power gating is used extensively in portable devices for leakage power reduction. Since the *on* resistances of the sleep transistors are relatively larger than the resistances of wires, currents may flow through the sleep transistors and cause larger IR drops across sleep transistors in the active mode. Hence, analyzing the timing with IR drop is very important in the power gating designs.

In this paper, we propose to an efficient method to use the *IR drop waveforms* to capture the time-varying voltage level of the local power grid that is connected to the global power grid through the sleep transistors in a power gating design. The IR drop waveform of a node on the power grid indicates the maximum possible IR drop value for every time instant. Although the number of nodes on a power grid can be large, the number of required IR drop waveforms across sleep transistors is limited by the number of sleep transistors, which is much less than the number of nodes on the global power grid. This is common in most power gated architectures, such as a Distributed Sleep Transistor Network (DSTN) [12].

Our contributions are as follows. First, we propose a linear programming solution to estimate the IR drop waveform without running massive numbers of SPICE-like simulations. Second, most of the previous works consider IR drop with *decoupling capacitances* (or *decaps*) only through circuit simulations. We show how to compute the upper bound of the IR drop considering the effects of decaps. Finally, we apply the concept of the sliced switching window [4][15] to the timing analysis that considers the IR drop noise. The result shows that using the IR drop waveform can provide 83% more accuracy than the traditional method

The remainder of this paper is organized as follows. Section II presents our method for IR drop waveform estimation. Section III discusses how to apply the IR drop waveform in the static timing analysis. Section IV shows the experimental results. Section V presents our conclusions.

## II. FAST IR DROP WAVEFORM ESTIMATION

In this section, we first review the basic architecture of power gating designs in Section II.A. Then, we present our efficient IR drop waveform estimation method for a power gating design without considering decaps in Section II.B. Next, in Section II.C, we extend this method to consider decaps. Finally, we propose an improved method with decaps in Section II.D.

# A. Basic Architecture of a Power Gating Design

In this paper, we use the DSTN architecture of a power gating design, as shown in Figure 1(a). In the DSTN architecture, a circuit is decomposed into several clusters, and each cluster is connected to a sleep transistor. Then, all sleep transistors are connected together by virtual power lines. Since sleep transistors operate in the linear region during the active mode, they can be modeled as resistors. Figure 1(b) shows a resistance network model of the power gating design in Figure 1(a).  $R_{STi}$  is the resistance of a sleep transistor STi;  $R_V$  is the resistance of a segment of the virtual power line;  $I_{Ci}$  is the current flowing through a cluster Ci; and  $I_{STi}$  denotes the current through a sleep transistor STi. Let us focus on the current through sleep transistor ST3,  $I_{ST3}$ , which is contributed by the currents  $I_{CI}$ ,  $I_{C2}$ , and  $I_{C3}$  from all clusters. Using Kirchhoff's Current Law and Ohm's Law, the relationship between  $I_{STi}$  and  $I_{Ci}$ can be stated in matrix form as follows.

$$\begin{bmatrix} I_{ST1} \\ I_{ST2} \\ I_{ST3} \end{bmatrix} = \begin{bmatrix} r_{11} & r_{12} & r_{13} \\ r_{21} & r_{22} & r_{23} \\ r_{31} & r_{32} & r_{33} \end{bmatrix} \cdot \begin{bmatrix} I_{C1} \\ I_{C2} \\ I_{C3} \end{bmatrix}$$
EQ(1)

where  $r_{ij}$ s are all constants. Let us take  $r_{11}$  as an example.

$$r_{11} = \frac{(R_{V} + R_{ST3}) \times R_{ST2} + R_{V} \times (R_{V} + R_{ST2} + R_{ST2})}{(R_{V} + R_{ST3}) \times R_{ST2} + (R_{V} + R_{ST1}) \times (R_{V} + R_{ST2} + R_{ST2})}$$

Using EQ(1), we can compute the current through a sleep transistor for a given pair of input vectors. First, for each pair of input patterns, we can obtain the current profile of clusters. With these, we can then derive the current profile through the sleep transistor by using EQ(1). Among all possible input patterns, we can find the *maximum current* through the sleep transistor STi, denoted as  $MC_{STi}(t)$  for a time instant *t*. Knowing the maximum current  $MC_{STi}(t)$ , we can then easily find the *maximum IR drop*  $MIR_{STi}(t)$  using EQ(2).

$$MIR_{STi}(t) = MC_{STi}(t) \times R_{STi} \qquad EQ(2)$$

However, the process of determining  $MC_{STi}(t)$  generally



Figure 1: A power gating design with its corresponding resistance network

requires a massive number of circuit-level simulations. The process becomes impractical even for a medium-sized design. Although  $MC_{STi}(t)$  is difficult to calculate, the **maximum current** of a cluster Ci, defined as  $MC_{Ci}(t)$ , is quite easy to obtain. Many efficient methods have been proposed in [6][8][9][10]. In addition, CAD tools such as PrimePower<sup>TM</sup> are available, which enable us to perform a massive number of gate-level simulations to obtain the maximum current of a cluster.

In this paper, we assume that the maximum currents for the clusters are available. One simple way of estimating the maximum current through the sleep transistors,  $MC_{STi}$ , is to plug the maximum cluster current,  $MC_{Ci}$ , into its corresponding variable  $I_{Ci}$  in EQ(1). However, this simple estimation is unduly pessimistic because the maximum currents of different clusters usually do not occur simultaneously.

# B. IR Drop Waveform Estimation without Decaps

Now, we formally formulate the problem of deriving IR drop waveforms for all nodes on the virtual power grid as shown in Figure 2. The input is a resistive network that models the sleep transistors and wire segments of the virtual power line. In addition, we assume that the waveforms of the clusters' maximum currents are available. Our objective is to find an accurate estimation of the sleep transistors' IR drop waveforms.



Figure 2: The IR drop waveform derivation for a resistance network

We have noted that the maximum currents of different clusters usually do not occur at the same time; however, the maximum current for a cluster,  $MC_{Ci}(t)$ , can only be used to constrain the value of the corresponding  $I_{Ci}$ . For the correlation between different clusters, we can use the maximum current of different combinations of clusters to constrain the current through the corresponding clusters. For example, we can obtain  $MC_{CLC2}(t)$ , which is the maximum current of the two clusters: C1 and C2. With  $MC_{Cl,C2}(t)$ , we must have  $I_{Cl} + I_{C2} \leq MC_{Cl,C2}(t)$ . Our idea is to use such maximum currents as linear constraints to consider the correlations between different clusters and also to obtain  $MIR_{STi}(t)$  by maximizing the value of  $IR_{STi}$ , which is the IR drop across the sleep transistor STi. Taking  $R_{STI}$  in Figure 2 as an example, the linear program to find the  $MIR_{STI}(t)$  for a time instant t can be stated as follows:

Maximize

$$IR_{ST1} = \left(\sum_{i=1}^{3} I_{Ci} \cdot r_{1i}\right) \cdot R_{ST1}$$
 EQ(3)

Subject to

$$I_{Ci} \leq MC_{Ci}(t) \quad \text{for } i = 1, 2, 3.$$

$$I_{C1} + I_{C2} \leq MC_{C1,C2}(t)$$

$$I_{C1} + I_{C3} \leq MC_{C1,C3}(t)$$

$$I_{C2} + I_{C3} \leq MC_{C2,C3}(t)$$

$$I_{C1} + I_{C2} + I_{C3} \leq MC_{C1,C2,C3}(t)$$

where  $r_{11}$ ,  $r_{12}$ , and  $r_{13}$  are constants. The linear programming solution can efficiently find the waveform of  $MIR_{STi}$  and take into account that the maximum currents for the clusters do not occur simultaneously.

## C. IR Drop Waveform Estimation with Decaps

Since decaps are commonly inserted between the ground and virtual power lines in a power gating design to prevent a sudden fluctuation of IR drops and to reduce the IR drop noise, it is important to consider their effect. In the



Figure 3: The IR drop waveform derivation for a resistance and capacitance network

previous section, we modeled the sleep transistor network as a linear system without considering decaps. In this section, we discuss how to modify the linear programming model to consider the effect of decaps. The circuit used for this problem is shown in Figure 3. Here,  $C_{Di}$  represents the capacitance of a decap Di. Again, we assume that the current profiles of all clusters are available.

With decaps, the value of the IR drop across a sleep transistor depends not only on the current flowing through the sleep transistor, but also on the previous voltages across decaps. Because of this dependency, the linear programming solution in EQ(3) cannot be directly applied. Since our objective is to find the maximum IR drop, we need to know which voltages on decaps at the previous time instant will cause the maximum IR drop at the current time instant. Luckily, we have the following lemma.

**Lemma 1:** In our linear programming model, the largest possible value of the IR drop occurs when all decaps at the previous time instant have the largest voltage values.

**Proof:** By using Kirchhoff's Current Law, the current through the sleep transistor *ST1* can be written as follows.

$$\frac{V_{ST1}(t)}{R_{ST1}} = I_{C1}(t) + \frac{V_{ST2}(t) - V_{ST1}(t)}{R_V} - V'_{ST1}(t) \cdot C_{D1}$$
 EQ(4)

Applying the Backward Euler technique with a fixed time step *h*, EQ(4) can be re-written as EQ(5), where  $V_{STI}(t-h)$  is the voltage across *ST1* at time instant *t*-*h*.

$$\frac{V_{ST1}(t)}{R_{ST1}} \cdot \left(\frac{h + R_{ST1} \cdot C_{D1}}{h}\right) = \left(I_{C1}(t) + \frac{V_{ST1}(t-h) \cdot C_{D1}}{h}\right) + \frac{V_{ST2}(t) - V_{ST1}(t)}{R_{V}} \quad EQ(5)$$

Similarly, we can obtain the EQ(5) for ST2 and ST3. Then we have the following matrix form.

$$\begin{bmatrix} \frac{V_{ST1}(t)}{R_{ST1}} \\ \frac{V_{ST2}(t)}{R_{ST2}} \\ \frac{V_{ST2}(t)}{R_{ST3}} \end{bmatrix} = \begin{bmatrix} \frac{h \cdot r_{11}}{h + R_{ST1} \cdot C_{D1}} & \frac{h \cdot r_{12}}{h + R_{ST1} \cdot C_{D1}} & \frac{h \cdot r_{33}}{h + R_{ST1} \cdot C_{D2}} \\ \frac{h \cdot r_{21}}{h + R_{ST2} \cdot C_{D2}} & \frac{h \cdot r_{22}}{h + R_{ST2} \cdot C_{D2}} & \frac{h \cdot r_{23}}{h + R_{ST2} \cdot C_{D2}} \\ \frac{h \cdot r_{31}}{h + R_{ST3} \cdot C_{D3}} & \frac{h \cdot r_{32}}{h + R_{ST3} \cdot C_{D3}} & \frac{h \cdot r_{33}}{h + R_{ST3} \cdot C_{D3}} \end{bmatrix} \begin{bmatrix} I_{C1}(t) + \frac{V_{ST1}(t - h) \cdot C_{D1}}{h} \\ I_{C2}(t) + \frac{V_{ST2}(t - h) \cdot C_{D2}}{h} \\ I_{C3}(t) + \frac{V_{ST3}(t - h) \cdot C_{D3}}{h} \end{bmatrix} EQ(6)$$

In EQ(6), the  $r_{ij}$ ,  $C_{Di}$ , and  $R_{STi}$  are all positive constants. Therefore, in order for  $V_{STi}(t)$  to have the maximum value, all  $V_{STi}(t-h)$  must have the maximum value. Since decap stores the previous values, all decaps at the previous time instant must have the largest voltage values. Q.E.D

According to Lemma 1, if we have the largest voltages for all decaps at time instant *t*-*h*,  $MIR_{STi}(t-h)$ , we can obtain  $MIR_{STi}(t)$  for all sleep transistors by using the linear programming method, but with many maximum currents of clusters as constraints. Using sleep transistor ST1 in Figure 3 as an example, we can obtain  $MIR_{STI}(t)$  from the following formula.

Maximize

$$IR_{ST1}(t) = \left(\frac{R_{ST1}h}{h + R_{ST1}C_{D1}}\right) \left(\sum_{i=1}^{3} r_{1i} \cdot \left(I_{Ci}(t) + MIR_{ST1}(t - h)\left(\frac{C_{Di}}{h}\right)\right)\right) EQ(7)$$

where the constraints are the same as in EQ(3). Furthermore, the voltage of decaps may not be zero at the initial time instant. To determine the value of  $MIR_{STi}(0)$ , we first consider  $MIR_{STi}(0)$  as having a zero voltage level. Then, after deriving the IR drop waveform, we can obtain the value of  $MIR_{STi}(T_{CYCLE})$  where  $T_{CYCLE}$  is the clock period. With the value of  $MIR_{STi}(T_{CYCLE})$ , we can assign  $MIR_{STi}(0) = MIR_{STi}(T_{CYCLE})$  and derive a new IR drop waveform. We continue this step iteratively until the value of  $MIR_{STi}(0)$  converges.

# D. IR Drop Waveform Estimation with Decaps Considering Vector Correlation

To improve the accuracy of our IR drop estimation, we consider the correlation of decap voltage vectors. First, our estimation in the previous section is still pessimistic because the vector that induces  $MIR_{STi}$  at the previous time instant may not be the same as the vector that induces  $MIR_{STi}$  at the current time instant. In other words, the values of  $MIR_{STi}(t-\Delta t)$  and  $MIR_{STi}(t)$  may be affected by different vectors. The dependences that exist among decaps' values result in a conservative IR drop waveform calculation.

The maximum current of a cluster is obtained in our model by running a large input set of gate level simulations using PrimePower<sup>TM</sup>. To reduce the correlation among the vectors, we partition all input vectors V into several partial sets such that  $V = \{V1, V2, V3 \dots, Vn\}$ . Thus, we can obtain the IR drop waveform for each set of vectors,  $MIR_{STI}(t, Vk)$ . Then, we obtain the IR drop waveform of the

sleep transistor by finding the maximum value among  $MIR_{STi}(t, Vk)$  at every time instant. Figure 4 shows the overall algorithm for the estimation of the IR drop waveform with decaps.

| Algorithm IR_Waveform( $MC_{Ci}(t), Ci, T_{CYCLE}, V$ )                   |
|---------------------------------------------------------------------------|
| 1. foreach $Vk$ in set $V$                                                |
| 2. <b>initial</b> $MIR_{STi}(0, Vk) = 0$ /*initial value of decaps as 0*/ |
| 3. do                                                                     |
| 4. <b>foreach</b> time instant <i>t</i> within $T_{CYCLE}$                |
| 5. find $MIR_{STi}(t, Vk)$ by linear programming                          |
| 6. end foreach                                                            |
| 7. $MIR_{STi}(0, Vk) = MIR_{STi}(T_{CYCLE}, Vk)$                          |
| 8. <b>until</b> $MIR_{STi}(0, Vk)$ equals $MIR_{STi}(T_{CYCLE}, Vk)$      |
| 9. end foreach                                                            |
| 10. $MIR_{STi}(t) = Max\{MIR_{STi}(t, Vk)\}$ for all $t, k$               |

| Figure 4: | Algorithm | for IR dron | ) waveform | with decaps |
|-----------|-----------|-------------|------------|-------------|
|           |           |             |            |             |

# III. TIMING ANALYSIS WITH IR DROP WAVEFORMS

Since IR drop will affect timing, in this section, we discuss how to perform the timing analysis to incorporate IR drop waveforms. We will use the sliced switching window. We define the gate's *switching window* as the time period between its earliest and latest arrival times. We divide the gate's switching windows into a set of small periods, called *slices*. The set of slices obtained for one gate is the *sliced switching window*.

With the IR drop waveform for a gate, we have information about the largest IR drop within each time slice. For example, in Figure 5, the largest IR drop within the slice [400, 500] is 0.082 V. For each slice in the switching window of a gate, we use its largest IR drop for computing the corresponding IR drop noise induced delay by applying the methods described in [1][14]. When the noise delays of all slices have been obtained, we can determine the gate's delay.

We use an example to illustrate the computation of the IR drop noise delay on a slice of a switching window. Suppose the IR drop waveform of a gate under consideration is as shown in Figure 5. Without considering the IR drop noise, the gate's switching window is [400, 700]. We divide the switching window of the gate into three slices {[400, 500], [500, 600], [600, 700]} of equal size: 100 ps. The largest IR drops of the gate are 0.082 V, 0.074 V, and 0.051 V within these three slices. For the first

slice [400, 500], we use the IR drop noise of 0.075 V to calculate the delay  $\Delta d_1$ . The first slice is then extended to be [400,  $500+\Delta d_1$ ] when considering the IR drop noise. Similarly, for the other two slices, we have [ $500, 600+\Delta d_2$ ] and [ $600, 700+\Delta d_3$ ]. In this example, suppose the maximum value among { $500+\Delta d_1$ ,  $600+\Delta d_2$ ,  $700+\Delta d_3$ } is  $600+\Delta d_2$ . Since the largest delay now is  $600+\Delta d_2$ , the gate's switching window can be extended to [ $400, 600+\Delta d_2$ ]. If we consider [400, 700] as a basic unit, the largest IR drop within the switching window is 0.082 V, which is less than the peak IR drop of 0.098V. Then, the switching window considering the IR drop noise will be [ $400, 700+\Delta d$ ];  $\Delta d$  is the noise delay under a 0.082 V IR



Figure 5: Noise delay computation with IR drop noise

drop. When we get the whole picture of the updated slices, we can easily obtain the delay for the entire circuit.

#### IV. EXPERIMENTAL RESULTS

We implemented the algorithms in Sections II and III and applied them to several benchmark circuits using the TSMC 90 nm CMOS technology process. These circuits were implemented using the power gating design style [2][7][12][13]. The implementation steps are shown in the following. First, a benchmark circuit is synthesized and placed. After that, we group the gates in the same row into a cluster that connects to a sleep transistor. The sleep transistors are inserted on the side of a row and underneath a power trunk. Decaps are placed near each sleep transistor and the value of a decap is assumed to be 0.5 pF. Then, we generated maximum current waveforms of clusters by using PrimePower<sup>™</sup> with 10,000 random vectors. In the TSMC 90 nm process, the on resistance of a  $1\mu$ m sleep transistor is 2499 Ohm but the resistance of a  $1\mu$ m Metal 1 wire is only 0.3 Ohm. The resistance of a wire is relatively small compared to the on resistance of a sleep transistor. Therefore, without losing much accuracy in the estimation, the IR drop of a node on the virtual power grid can be obtained from the IR drop of the node's corresponding sleep transistor.

Table 1 shows the results of our experiments. Column 1 shows the name of a benchmark circuit. Column 2 shows the number of sleep transistors. Column 3 shows a circuit's delay without IR drop noise. In an IR drop waveform, the peak IR drop denotes the largest value of the IR drop waveform. We use the peak IR drop as a constant for the whole cycle to pessimistically estimate the noise delay and report the noise delay, shown in Column 4. Note that the noise delay is the difference of a circuit's delay with noise minus a circuit's delay without noise. We then estimate a circuit's noise delay by using the IR drop waveform. The results are shown in Column 5. In columns 7 and 8, we show the circuit's noise delay considering decaps using the peak IR drop and using the IR drop waveform, respectively. Columns 6 and 9 show the runtimes for the IR drop timing analysis without and with decaps.

On average, compared to using the peak IR drop, noise delay over-estimation using IR drop waveforms can be reduced about 83%. The results show that our method, while still being pessimistic, can significantly increase the accuracy of the circuit delay estimation.

## V. CONCLUSIONS

In this paper, we have proposed a novel algorithm to derive IR drop waveforms considering the effect of decaps for the power-gating designs. We applied the sliced switching window to IR drop waveforms in order to obtain accurate noise delays. On average, our algorithm can reduce noise delay over-estimation by about 83%.

## REFERENCES

- R. Ahmadi and F. N. Najm, "Timing Analysis in Presence of Power Supply and Ground Voltage Variations," *Proc. of ICCAD*, pp. 176-183, 2003.
- [2] M. Anis, S. Areibi, and M. Elmasry, "Dynamic and Leakage Power Reduction in MTCMOS Circuits Using an Automated Efficient Gate Clustering Technique," *Proc. of DAC*, pp. 480-485, 2002.
- [3] G. Bai, S. Bobba, and I. N. Hajj, "Static Timing Analysis Including Power Supply Noise Effect on

| Name     | ST # | Circuit delay  | Noise delay without decaps (ns) |                 | Runtime | Noise delay  | with decaps (ns) | Runtime |
|----------|------|----------------|---------------------------------|-----------------|---------|--------------|------------------|---------|
| 1 (01110 | 0111 | w/o noise (ns) | Peak IR drop                    | IR drop wavefom | (Sec.)  | Peak IR drop | IR drop waveform | (Sec.)  |
| C880     | 15   | 0.815          | 0.145                           | 0.075           | 2.07    | 0.033        | 0.019            | 5.67    |
| C6288    | 48   | 1.102          | 0.169                           | 0.139           | 67.50   | 0.095        | 0.048            | 72.00   |
| C7552    | 43   | 1.807          | 0.185                           | 0.096           | 27.94   | 0.065        | 0.053            | 28.84   |
| C3540    | 26   | 1.098          | 0.138                           | 0.079           | 9.05    | 0.040        | 0.026            | 18.95   |
| C5315    | 33   | 0.776          | 0.175                           | 0.062           | 9.75    | 0.042        | 0.027            | 22.35   |
| C1355    | 18   | 0.849          | 0.158                           | 0.083           | 4.14    | 0.019        | 0.015            | 9.54    |
| C432     | 10   | 0.470          | 0.163                           | 0.069           | 1.10    | 0.020        | 0.014            | 3.80    |
| dalu     | 21   | 0.607          | 0.102                           | 0.047           | 2.50    | 0.017        | 0.010            | 7.00    |
| i4       | 11   | 0.505          | 0.116                           | 0.027           | 0.42    | 0.027        | 0.012            | 2.22    |
| i5       | 9    | 0.419          | 0.084                           | 0.032           | 0.35    | 0.021        | 0.014            | 1.25    |
| i8       | 19   | 0.474          | 0.108                           | 0.047           | 3.18    | 0.052        | 0.044            | 9.48    |
| alu4     | 21   | 0.937          | 0.174                           | 0.080           | 6.07    | 0.028        | 0.014            | 15.97   |
| AVG      |      |                | 1                               | 0.487           |         | 0.267        | 0.173            |         |

Table 1: The comparison between noise delays

Propagation Delay in VLSI Circuits," *Proc. of DAC*, pp. 259-300, 2001.

- [4] P. Chen, Y. Kukimoto, and K. Keutzer, "Refining Switching Window by Time Slots for Crosstalk Noise Calculation," *Proc. of ICCAD*, pp. 583-586, 2002.
- [5] M. Hashimoto, J. Yamaguchi, and H. Onodera, "Timing Analysis Considering Spatial Power/Ground Level Variation," *Proc. of ICCAD*, pp. 814-820, 2004.
- [6] C. T. Hsieh, J. C. Lin, and S. C. Chang, "A Vectorless Estimation of Maximum Instantaneous Current for Sequential Circuit," *Proc. of ICCAD*, pp. 537-540, 2004.
- [7] H. Jiang, M. Marek-Sadowska, and S. R. Nassif, "Benefits and Costs of Power-Gating Technique," *Proc. of ICCD*, pp. 559-566, 2005.
- [8] Y. M. Jiang, K. T. Cheng, and A. Krstic, "Estimation of Maximum Power and Instantaneous Current using a Genetic Algorithm," *Proc. of CICC*, pp. 135-138, 1997.
- [9] H. Kriplani, F. N. Najm, and I. N. Hajj, "Pattern Independent Maximum Current Estimation in Power and Ground Buses of CMOS VLSI Circuits: Algorithms, Signal Correlations, and Their Resolution," *IEEE Trans. on CAD*, Vol. 14, No. 8, pp. 998-1012, Aug. 1995.
- [10] A. Krstic and K. T. Cheng, "Vector Generation for Maximum Instantaneous Current through Supply Lines for CMOS Circuits," *Proc. of DAC*, pp. 383-388, 1997.
- [11] A. Krstic, Y. M. Jiang, and K. T. Cheng, "Pattern Generation for Delay Testing and Dynamic Timing Analysis Considering Power-Supply Noise Effects,"

*IEEE Trans. on CAD*, Vol. 20, No. 3, pp. 416-425, Mar. 2001.

- [12] C. Long and L. He, "Distributed Sleep Transistor Network for Power Reduction," *IEEE Trans. on VLSI*, Vol. 12, No. 9, pp. 937-946, Sep. 2004.
- [13] S. Mutoh, T. Douseki, Y. Matsuya, T. Aoki, S. Shigematsu, and J. Yamada, "1-v Power Supply Highspeed Digital Circuit Technology with Multithresholdvoltage CMOS," *IEEE J. Solid-State Circuits*, pp. 847-854, Aug. 1995.
- [14] S. Pant and D. Blaauw, "Static Timing Analysis Considering Power Supply Variations," Proc. of ICCAD, pp. 365-371, 2005.
- [15] K. Tseng and M. Horowitz, "False Coupling Exploration in Timing Analysis," *IEEE Trans. on CAD*, Vol. 24, No. 11, pp. 1795-1805, Nov. 2005.
- [16] D. S. Chiou, D. C. Juan, Y. T. Chen, and S. C. Chang, "Fine-Grained Sleep Transistor Sizing Algorithm for Leakage Power Minimization," *Proc. of DAC*, pp. 81-86, 2007.
- [17] D. S. Chiou, S. H. Chen, S. C. Chang and C. W. Yeh, "Timing Driven Power Gating," *Proc. of DAC*, pp. 121-124, 2006.