24.2

 

QuickYield: An Efficient Global-Search Based Parametric Yield Estimation with Performance Constraints Fang Gong1 , Hao Yu2 , Yiyu Shi1 , Daesoo Kim1 , Junyan Ren3 , and Lei He1 1

University of California, Los Angeles, Los Angeles, US 2

3

Nanyang Technological University, Singapore

State Key Lab of ASIC, Fudan University, Shanghai, China

ABSTRACT

In general, the parametric yield can be estimated in either the performance domain or the parameter domain, as shown in Fig.(1). Monte Carlo simulation is commonly used for yield estimation in the performance domain. With the probability distribution of variable parameters, it first generates tens of thousands of random-sampling points in the parameter space, and then performs circuit simulations at each sampling point to evaluate the performance metric of interest. Within the obtained performance, Monte Carlo method screens out those missing the given performance constraints, and identify the acceptable or successful samplings. In this way, by dividing the number of successful samplings by that of the total sampling points, one can obtain the yield rate. The advantage of Monte Carlo simulation is its simplicity and generality. However, the Monte Carlo method is timeconsuming. The lengthy simulation at each sampling point limits its application for the yield estimation and hence a fast analysis is required.

With technology scaling down to 90nm and below, many yield-driven design and optimization methodologies have been proposed to cope with the prominent process variation and to increase the yield. A critical issue that affects the efficiency of those methods is to estimate the yield when given design parameters under variations. Existing methods either use Monte Carlo method in performance domain where thousands of simulations are required, or use local search in parameter domain where a number of simulations are required to characterize the point on the yield boundary defined by performance constraints. To improve efficiency, in this paper we propose QuickYield, a yield surface boundary determination by surface-point finding and globalsearch. Experiments on a number of different circuits show that for the same accuracy, QuickYield is up to 519X faster compared with the Monte Carlo approach, and up to 4.7X faster compared with YENSS, the fastest approach reported in literature.

Categories and Subject Descriptors: B.7.[Hardware]: Integrated Circuits-Design Aids General Terms: Algorithms, Performance Keywords: Parametric yield, Circuit simulation

1.

Yield body Yield body

INTRODUCTION

Simulation

As integrated circuits enter into the nanometer era, process variation has become a major challenge for both design and fabrication [1, 2, 3, 4, 5, 6]. Circuit parameters such as effective channel length and threshold voltage of transistors can deviate significantly from their nominal values specified by designers, due to many uncertainties from the manufacturing process such as lithography, chemical mechanical polishing (CMP), etching and so on. The circuit performance such as delay and output swing may differ from the design specification under the nominal condition. Parametric yield, defined as the percentage of circuits that can work under a performance metric within the acceptable region, is a common measure to evaluate the design robustness in the presence of process variations.

Mapping

(a) Parameter Domain

Figure 1: Yield estimation in (a) parameter domain and (b) performance domain. Several works [1, 2, 3] have been proposed to utilize the surface boundary, which separates the success and failure regions to estimate the yield rate. These methods avoid the massive sampling simulations by only using points on the surface boundary. The approach in [1] uses linear constraints to approximate the yield surface. It has two limitations: first, the linear functions cannot approximate the yield body with a good accuracy, especially for yields defined by nonlinear performance constraints. Second, this method is based on the assumption that the yield body needs to be convex, which does not hold in general. To address these issues, approaches in [2] and [3] derive a nonlinear

 

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. DAC'10, June 13-18, 2010, Anaheim, California, USA Copyright 2010 ACM 978-1-4503-0002-5 /10/06...$10.00

 

(b) Performance Domain

392

24.2

procedure, called YENSS, which can handle nonlinear performance constraints as well as non-convex yield body. It starts from the nominal performance space, and searches along the tangent of the surface boundary to approach the surface boundary point, which is called local search in this paper. This method can provide the good accuracy without using the Monte Carlo method. However, expensive simulations are still required to locate each point on the surface boundary due to local searching. In this paper, we propose an efficient parametric yield calculation method, called QuickYield. QuickYield applies a surface-point finding together with a global-search to locate points on the surface boundary. We first include performance constraints into the differential algebra equation (DAE) that describes the circuit and build an augmented equation system. The points on the surface boundary in the parameter domain can be determined by solving the augmented system equation. Then yield can be estimated from the obtained surface boundary. QuickYield is able to benefit problems where simulation is extremely expensive. Experimental results show that for the same accuracy, QuickYield is up to 519X faster compared with the Monte Carlo approach, and up to 4.7X faster compared with YENSS, the fastest approach reported in the literature. The rest of the paper is organized in the following manner. We first review the background of the yield estimation in Section 2. In Section 3, we present the details about yield estimation in QuickYield. We present experimental results in Section 4 and conclude the paper in Section 5.

2.

be swing, gate delay, oscillator frequency and so on. They can be obtained by SPICE simulations. fworst is the worstcase performance that can be accepted. Unfortunately, in most cases it is either prohibitively expensive or technically impossible to obtain an explicit expression for fm (γp ). P2 P2max P2norm P2min

P1min P1norm P1max

Figure 2: Performance Constraints defines hypervolume in parameter space There are some alternative ways proposed to obtain this surface. As shown in Fig.(3(a)), existing methods [1, 2, 3] start with an initial γp , and perform SPICE simulation with γp to get performance fm (γp ). Then, the performance constraint H(γp ; fm ) is checked with the obtained fm . If γp cannot satisfy the constraints, a new γp is selected and the above procedure is repeated until the constraint is satisfied. In this way, the final calculated parameter lies on the surface boundary. However, those methods all require quite a number of simulations to identify points on the surface. The efficiency should be further improved.

PRELIMINARIES

Generally, the variational parameters with performance constraint that defines the acceptable region can form a hypervolume with respect to the nominal point in the parameter space, as shown in the Fig(2). The shaded region is the “safe” region where parameters can result in the acceptable performance. The parametric yield estimation, accordingly, is to calculate the ratio of the volume of the safe region to that of the entire space bounded by the min and max of all parameters. Here, all parameters are assumed independent after decomposing the correlation. One direct approach to find the yield is to determine all the points in the hyper-volume. However, it is expensive and unnecessary, because the points on the hyper-surface boundary can provide enough information to calculate the volume of the yield body and thus the yield rate [3]. Therefore, it is desirable to directly identify the hyper-surface boundary (or called surface boundary). The hyper-surface that encloses the acceptable or safe region can be mathematically described by the performance constraint as 1 H(γp ; fm ) = fm (γp ) − fworst = 0,

P1

Jp Change

fm (J p )

SPICE simulations

Jp H (J p ; fm )

H (J p ; fm ) z 0

Return

fm (J p )  f worest

H (J p ; fm ) 0

Jp

(a) Existing methods ­d ° q ( x (t ))  f ( x (t ))  b (t ) ® dt ° H (J p ; f m ) 0 ¯

0

SPICE-style simulations

x (t ), J p

(b) QuickYield Figure 3: Comparison of boundary surface finding methods by existing methods and QuickYield

(1)

where γp denotes the parameters subject to the process variations. Depending on the specific circuit problem under study, γp can be different parameters, such as the threshold voltage, width, length, load capacitance and etc. fm (γp ) is the performance metric corresponding to γp , which can

3. YIELD ESTIMATION In this section, we present details for the yield estimation in QuickYield. Without loss of generality, we assume that all the parameters are uniformly distributed. Non-uniform distribution and correlated parameters can be mapped into uniform parameter domain with pre-scale and PCA techniques[2, 3].

1

For simplicity of our presentation and similar to [3], from now on we restrict our discussion to the simple case with only one performance constraint. It is understood, however, that our method can also be applied to multiple constraints scenario by handling one constraint at a time.

393

24.2

3

Performance Surface Performance

In order to improve the efficiency when locating the yield surface boundary, the “surface-point finding” is developed to locate surface boundary points with aid of a global search, which is in contrast to the approach in YENSS [3]. YENSS performs the search of surface point mainly based on a local search, where the tangent direction is calculated during each step of searching. The determination of the tangent direction requires to calculate the sensitivity ∂fm /∂ p, which is expensive for each step of searching. Our approach switches the role of performance constraint H(γp ; fm ) and variable parameters γp , which is shown in Fig.(3(b)). We treat the γp as an unknown, while introducing H(γp ; fm ) as a extra equation into the simulation environment. The objective of the simulation is to find the value of γp that can satisfy the performance constraint H(γp ; fm ) exactly. Therefore, the simulator needs to solve one augmented nonlinear system. With the obtained parameter values, the points on the surface boundary can be located in the parameter domain, and the parametric yield can be evaluated accordingly. Moreover, during the searching, we develop a strategy by the global-search, which avoids the expensive local search used in [3] yet with a high accuracy. In this way, we can develop an efficient yield estimation algorithm, called QuickYield. The flow is presented below.

Constraint Plane

10 5

2.5 Parameter 2

3.1 Algorithm Overview

2 1.5

0

0 4

1

2

2 Parameter 1

0 4

Parameter 2

0.5 0

(a) Rotated View

1 2 Parameter 1

3

(b) Topview

Figure 4: (a)rotated view and (b) topview of the intersection boundary of performance surface and constraint plane.

tion system determines a performance surface in the parameter domain. The performance constraint H(γp ; fm ) further locates a constraint plane fm (γp ) = fworst 2 . It is clear that the yield surface boundary in the parameter domain, shown in Fig.(4(b)), is the intersection boundary of these two surfaces. Therefore, by solving the equation system together with the performance constraint, one can obtain their intersection boundary, which in fact separates the success and failure regions in the parameter domain. For this sake, we can first locate boundary points on the nonlinear yield surface boundary as many as possible, and then the exact surface boundary can be approximated by connecting those identified surface points with the further refinement beginning from the centroid point of those exsisiting surface points. Such a surface-point finding strategy is called global-search in this paper, which is different from the local-search in YENSS by computing tangent direction at each point.

Algorithm 1 Flow of QuickYield 1: /* Setup */ 2: Input circuit constructs, nominal parameters and performance constraints; 3: Build the augmented equation system with performance constraint equation; 4: 5: /* Initial Yield Approximation */ 6: Find the intersection of surface boundary with each axis using Surface-point Finding Strategy. 7: Initial yield approximation can be calculated with the yield volume defined by these intersections. 8: 9: while Yield estimation can be refined do 10: /* Surface-point Finding */ 11: Solve the augmented system to obtain the parameter values that fix an exact point on the surface boundary. 12: /* Yield Refinement */ 13: Refine the yield estimate from the additional points on the surface boundary. 14: end while

P2

P2max

P02

P04

P12

P02

P10

P03

P05

P13

P11

P2norm P1norm P3norm

P01

P1

P03

P2norm

P3

P1norm

P01

(a) 2D example

P1max

(b) 3D example

Figure 5: Surface-point Finding Strategy Let’s illustrate the global-search by an example as follows. In order to find surface points, QuickYield first calculates the intersection points at each parameter axis: P01 , P02 shown in Fig.(5(a)). Then, it fixes one coordinate of P01 or P02 at its nominal values, while introducing the other coordinate as the extra unknown γp as shown in Fig.(3(b)). By connecting these points, an initial approximation to the yield body can be generated. To have an accurate approximation of the yield body, additional surface points such as P04 and P05 in between can be further found by fixing one parameter of the middle point (P03 ) of (P01 , P02 ) and solving for the other parameter. This procedure is repeated to find more surface points to refine the yield estimation. Note that

3.2 Surface-point Finding Strategy For the purpose of illustration, we first consider two uncorrelated and uniform-distributed parameters p1 and p2 as shown in Fig.(2). Assuming that the performance constraint can be successfully added at one pair of nominal points (p1 norm ,p2 norm ), there exists a “safe” region inside the region bounded by (p1 min ,p1 max ) and (p2 min ,p2 max ). The “safe” region thereby can be viewed as a nonlinear surface boundary separating the success and failure regions in the parameter domain. As such, the yield can be approximated by the area ratio of shaded region to the entire region bounded by min and max values of two parameters. We elaborate the need to find surface points further from a geometrical perspective as shown in Fig.(4(a)). The equa-

2

For the simplicity of illustration, a linear performance constraint is used in Fig.(4(a)). But QuickYield can handle fully nonlinear performance constraints in general.

394

24.2

this approach can be generalized to the case when there are multiple parameters. As shown in Fig.(5(b)), there exists a hyper-surface by connecting all intersections on axes, and hence the centroid point of the hyper-surface can be used to find additional surface points with the similar procedure in 2D problem.

Constraint Plane

P0

3

P1 P3 P4

2

Perfomnce

Perfomnce

3 P2

0

1 10

5 5

P1 2

P0

P2 0

1 10

5 5

0 10 Parameter 1

0 10

Parameter 2

Parameter 2

Parameter 1

(a) YENSS[3]. Figure 6: Strategy

Note that for the periodical circuits such as ring oscillators, the periodical steady state analysis (PSS) is required to find their periods T [9, 7, 8]. Similarly, one augmented system for PSS analysis can be built in discrete time domain with the finite difference method as [8, 7]: 3 3 2 2 1 (q − qn ) + f1 + b 0 h 1 1 7 6 0 7 6 h (q2 − q1 ) + f2 + b 7 7 6 6 7 6 . 7 6 .. (5) 7 = 6 .. 7 6 . 7 7 6 6 4 1 (qn − qn−1 ) + fn + b 5 4 0 5 h 0 H(xT ; γp )

Performance Surface

Performance Surface Constraint Plane

unknowns. For DC analysis, the Jacobian matrix becomes simple as: – » ∂f /∂x ∂f /∂γp + b (4) J(X) = ∂H(γp ; x)/∂x ∂H(γp ; x)/∂γp

(b) QuickYield

Accordingly, the Jacobian matrix Jf d can be expressed as equation (6) at the top of the next page, where C = ∂q/∂x, and G = ∂f /∂x. When Newton-Raphson iteration converges, it returns the state-variable x(t) and the variational parameter γp which resides on the boundary of the yield body in the parameter domain.

Comparison of Surface-point Finding

It is worthwhile to further illustrate that our QuickYield uses the global-search, different from YENSS [3]. As shown in Fig.(6(b))), P0 is the initial middle point of the line connecting two axis intersection points. Obviously, its determination needs no simulation. The identified surface points, such as P1 and P2 , only require one simulation in QuickYield. Shown in Fig.(6(a))), YENSS involves more runs of simulations than QuickYield to locate a single surface point. This is because YENSS starts from the nominal design point and locally searches along the tangent direction of performance surface. The determination of the tangent direction is requires to calculate the sensitivity ∂fm /∂ p, which is expensive. In contrast, the surface-point finding strategy in QuickYield method is more efficient since it searches in global fashion by starting from the middle of the identified points. The efficiency of QuickYield is also demonstrated in experiments.

3.4 Surface-point Based Yield Estimation To estimate the yield rate, we first employ an analytical formula [3] to compute the volume of the parallelotope and the simplex defined by the intersection points of the boundary surface with each parameter axis. We then use it as the initial approximation to the yield. When additional points on the yield boundary identified, QuickYield will refine the yield volume approximation by adding or subtracting the hyper-volume increments. Accordingly, the yield can be simply obtained if one divides the calculated volume by that of the parameter space.

4. EXPERIMENTAL RESULTS We have implemented QuickYield in a Matlab-based circuit simulator, and all experiments are carried out on a Linux server with a 2.4GHz Xeon processor and 4GB memory. We use a Schmitt Trigger and a three-stage ring oscillator to compare the accuracy and efficiency of QuickYield with Monte Carlo and YENSS [3]. As an illustration, we use the widths of the MOSFETs as parameters γp subject to process variation, but QuickYield can handle other variations, such as threshold voltage, load capacitance and channel length as well.

3.3 Surface-point Finding Method To determine the surface point numerically, we apply the parameter finding used for device optimization [7, 8] in the content of yield estimation, i.e., the surface-point finding. We start from differential algebra equation (DAE) in (2) d q(x(t)) + f (x(t)) + b = 0 (2) dt where x(t) are unknown state variables. First, QuickYield leverages the DAE with one performance constraint H(γp ; fm ) in the augmented system equation (3). As such, we have a new system with x(t) and γp as the unknowns. j d q(x(t), γp ) + f (x(t), γp ) + b(t) = 0 dt (3) H(x(t), γp ) = fm (γp ) − fworst = 0

4.1 Schmitt Trigger We first use a Schimitt Trigger (shown in Fig.(7) to verify the accuracy and efficiency of QuickYield combined with DC analysis. In this case, we consider the lower swiching threshold VT L as the performance metric. We consider the channel widths of NMOS Wn1 and PMOS Wp2 as variational parameters γp , which have 30% variations from their nominal values. As such, VT L can differ from nominal value, which change the lower switching threshold VT L . The performance constraint is given as: when the input VT L is at 0.4V and the output is initially set to Vdd, the output VOU H should be greater than 1.7V. First, 30% random variations are introduced to Wp2 and Wn1 separately. The result from 6000 Monte Carlo DC simulations is shown as Fig.(8(b)) in the parameter space

We can use F (x(t), γp ) to denote the left hand side of equation (3), and F : Rm × R to rewrite the equations as F (x(t), γp ) = 0. The system consists of a number of nonlinear equations, and can be solved with Newton-Raphson iterations. The Newton-Raphson iteration3ˆ requires ˜ the Jacobian matrix J(X) = ∂F /∂X, where X = xT ; γp is the vector of the 3

For the simplicity of illustration, from now on we omit the iteration index k in our discussion about Newton-Raphson iteration.

395

24.2

2 6 6 Jf d = 6 6 4

1 C h 1

..

1 ∂q1 ( h ∂γp

− h1 Cn

+ G1 ..

.

. − h1 Cn−1 ···

∂H ∂x1

1 C + h n ∂H ∂xn

1 ∂qn ( h ∂γp

Gn

Vdd



∂qn ∂γp

∂f1 ∂b1 ) + ∂γ + ∂γ p p .. . ∂qn−1 n n − ∂γ ) + ∂f + ∂b ∂γp ∂γp p

3 7 7 7 7 5

(6)

∂H ∂γp

4.2 3-Stage Ring Oscillator We further consider a 3-stage ring oscillator as shown in Fig.(9). The oscillator period is chosen to be the performance metric of interest, which is determined by the delay of the inverters. The nominal period of the oscillator Tnorm is 7.2028ns calculated via periodical steady state (PSS) simulation. The design specification requires that the variation in period δT should be within ±2.5% of Tnorm . We consider the effect of random variations in the width of MOSFET in the first stage with 40% perturbation range from their nominal values. The nominal width of M p1 is 3um and that of M n1 is 2um.

Mp1 Mp3

Mp2

Vin

Vout

Vdd Mn1

Mn3

Mn2 GND

Figure 7: Schmitt Trigger design

10

9

9

8 7

Figure 9: 3-Stage Ring Oscillator Actually, there are two performance constraints for the oscillator as shown in equation (7), so there exist two boundary curves in the parameter space but they correspond to the Tmax and Tmin , respectively. We treat them individually in this case. j H1 (γp ; T ) = T − Tmin = 0 (7) H2 (γp ; T ) = T − Tmax = 0

success failure

8 7

6

6 30

35 40 Wp(um)

45

30

(a) QuickYield

35 40 Wp(um)

45

First, the Monte-Carlo method is used to calculate the period T of the ring oscillator with width variations, as shown in Fig.(10(b)). The blue or success region is constitued of these samplings that lead to acceptable T , while the rest is the failure region. It is obvious that the success region is bounded by two nonlinear boundary curves. As such, the parametric yield can be estimated by the percentage of the samplings that locate in the success region.

(b) Monte Carlo

Figure 8: Comparison of simulation results from QuickYield and Monte Carlo Moreover, we compare the accuracy as well as runtime for QuickYield and Monte Carlo in the Table(1). Since we do not have results from [3] for this example, QuickYield can not compare with YENSS here. From the table we can see that QuickYield can not only achieve 0.4% accuracy, but also gain 349X speedup over Monte Carlo method.

2.8

Wn(um)

2.4

T

2.4

min

2.2

2.2 2 max

1.8

1.6

1.6

method

yield

time (second)

speedup

Monte Carlo (2000) Monte Carlo (6000) QuickYield (6 points)

0.7014 0.70185 0.70159

66.2 197.1 0.564

2.97x 1x 349x

1.4

2 1.8

T

Table 1: Accuracy and Runtime Comparison

success failure

2.6

2.6

Wn

10

Wn(um)

Wn(um)

(Wp , Wn ). The success and fail regions are separated by one boundary curve, and the hyper-volume can be estimated to calculate the yield. Then we perform the QuickYield on the Schmitt Trigger example to find the boundaries, and result is shown in Fig.(8(a)). By comparing with the Monte Carlo results in Fig.(8(b)), we can observe that QuickYield can obtain the same boundary curve as that in Monte Carlo result.

1.4 2

2.5

3 Wp(um)

3.5

(a) QuickYield

4

1.2

2

2.5

3 Wp(um)

3.5

4

(b) Monte Carlo

Figure 10: Simulation results from QuickYield and Monte Carlo on Ring Oscillator

396

24.2

700

0.65

600

0.645

500

Parametric Yield

Runtime(second)

We further validate QuickYield against Monte Carlo simulation as shown in Fig.(10(a)). There are also two curves: upper bound for Tmin and low bound for Tmax , and they are identical to the result from Monte Carlo in Fig.(10(b)). Moreover, we compare the accuracy and runtime of MonteCarlo, YENSS obtained from [3] by normalizing with respect to its Monte Carlo runtime, and QuickYield in Table.(2). From this table, the Monte Carlo with 5000 simulations can generate a more accurate result, while it needs much more computation time. According to [3], YENSS can achieve 3.4% accuracy with 139X speedup over Monte Carlo. But QuickYield can obtain 519X speedup over Monte Carlo at a similar accuracy. Note that QuickYield and YENSS only use 10 points to locate two boundary curves. The accuracy can be improved with additional points. We also study how the accuracy and runtime of QuickYield scales with the number of points, and the results is shown in Fig. 11. From the figure we can see that the runtime increases linearly, while the yield quickly converges when the point is over 60.

400 300 200

20

30

40 50 60 Number of Points

70

80

0.038

0.038 0.036

0.036 4

0.034

0.034

4 2.5

3

3 2 Wn(um)

1.5

2

2.5

Wp(um)

(a) QuickYield

2

1.5

2 Wp(um)

Wn(um)

(b) Monte Carlo

Figure 12: Simulation results from QuickYield and Monte Carlo on Ring Oscillator Table 3: Accuracy and Time Comparison for Ring Oscillator with three variational parameters method

yield

time (second)

speedup

Monte Carlo (1000) Monte Carlo (5000) YENSS (20 points) QuickYield (20 points)

0.648 0.617 0.623 0.621

11840 63128 236.9

5.33X 1X 57X 267X

0.64 0.635

mance constraint. QuickYield leverages the DAE equation with the performance constraint together to build an augmented system. By solving it, QuickYield locates the yield boundary points in the parameter space with the globalsearch, and further calculates the yield rate efficiently. Experimental results show that for the same accuracy, QuickYield is up to 519X faster compared with the Monte Carlo approach, and up to 4.7X faster compared with YENSS [2, 3]. Future work is to extend QuickYield in general to handle more variables under a problem formulation in the highdimensional parameter space.

0.63

0.62 10

(a) Runtime

20

30 40 50 60 Number of Points

70

80

(b) Yield

Figure 11: Runtime and accuracy scalability w.r.t. the number of points. Table 2: Accuracy and Time Comparison for Ring Oscillator with two variational parameters method

yield

time (second)

speedup

Monte Carlo (1000) Monte Carlo (5000) YENSS (10 points) QuickYield (10 points)

0.67465 0.62658 0.6482 0.6463

8514.7 44073.8 84.9

5.17x 1x 139X 519x

To demonstrate the ability to handle many variational parameters, we increase the complexity of the problem by introducing random variation to the load capacitance C1 in the first stage. First, Monte Carlo samples the 3-D parameter space, and simulation results are plotted in Fig.(12(b)) (blue domain denotes the success region). Next, QuickYield is applied to find the boundary surfaces separating the success/fail regions shown in Fig.(12(a)), which is identical to the hyper-volume from Monte Carlo. Note that we remove the front and back surfaces to exhibit the surfaces for Tmin and Tmax . Similarly, the accuracy and time are summarized in Table(3). With 20 points on each boundary surface, QuickYield can not only achieve up to 0.6% accuracy, but also obtain 267X speedup over Monte Carlo and 4.6X over YENSS.

5.

0.04

0.04

Cap(pF)

Cap(pF)

0.042

0.625

100 0 10

success failure

0.044

CONCLUSIONS AND FUTURE WORK

In this paper, we have proposed a fast algorithm, QuickYield, to calculate the parametric yield with the perfor-

397

6. REFERENCES

[1] P. Cox, P. Yang, and P. Chatterjee, “Statistical modeling for efficient parametric yield estimation of MOS VLSI circuits,” in IEDM’83, 1983. [2] S. Srivastava and J. Roychowdhury, “Rapid estimation of the probability of SRAM failure due to MOS threshold variations,” in CICC ’07., 2007. [3] C. Gu and J. Roychowdhury, “An efficient, fully nonlinear, variability-aware non-monte-carlo yield estimation procedure with applications to sram cells and ring oscillators,” in ASP-DAC ’08, 2008. [4] S. C. G. Lucas and D. Chen, “Fastyield: Variation-aware, layout-driven simultaneous binding and module selection for performance yield optimization,” in ASP-DAC ’09, 2009. [5] F. Gong, H. Yu, and L. He, “Picap: a parallel and incremental full-chip capacitance extraction considering random process variation,” in DAC ’09, 2009. [6] H. Yu, X. Liu, H. Wang, and S. Tan, “A fast analog mismatch analysis by an incremental and stochastic trajectory piecewise linear macromodel,” in ASP-DAC ’10, 2010. [7] I. Vytyaz, D. C. Lee, S. Lu, A. Mehrotra, U.-K. Moon, and K. Mayaram, “Parameter finding methods for oscillators with a specified oscillation frequency,” in DAC ’07, 2007. [8] I. Vytyaz, P. K. Hanumolu, U.-K. Moon, and K. Mayaram, “Periodic steady-state analysis augmented with design equality constraints,” in DATE ’08, 2008. [9] J. W. K. Kundert and A. Sangiovanni-Vincentelli, “Steady-state methods for simulating analog and microwave circuits,” Kluwer Academic Publishers, 1990.

QuickYield: An Efficient Global-Search Based ...

3. , and Lei He. 1. 1. University of California, Los Angeles, Los Angeles, US ... ABSTRACT. With technology scaling down to 90nm and below, many yield-driven.

2KB Sizes 1 Downloads 173 Views

Recommend Documents

An Efficient Genetic Algorithm Based Optimal Route Selection ... - IJRIT
Wireless sensor Network (WSN) is getting popular especially for applications where installation of the network infrastructure is not possible, such as.

An Efficient Genetic Algorithm Based Optimal Route Selection ... - IJRIT
infrastructure, but imposes some drawbacks and limitations (mainly on .... Networks”, http://www.monarch.cs.rice.edu/monarch-papers/dsr-chapter00.pdf.

An efficient load balancing strategy for grid-based ...
Feb 20, 2007 - 4 – the importance of the cost of the communications make load balancing more difficult. Thus, the load bal- ..... This guarantees the low cost of the unfold operator. As (13) .... RENATER4 national network, by a 2.5 Gigabit connecti

An efficient hybrid algorithm based on harmony search ...
different methods of structural optimization have been introduced which can be ... There are several papers utilizing heuristic methods in structural optimization field, but ... Corresponding author: Tel: +98-21-44202710; fax: +98-21-77240398.

An Efficient Induction Motor Rotor Flux Estimator Based ...
(rad/s). 0.0 0.1 0.2 0.3 0.4 0.5 0.6. -0.6. -0.4. -0.2. 0.0. 0.2. 0.4. 0.6. Time (sec) ? ar. (Wb). Actual. RTRL. Modified Adaptive Integration. (c). (a). (b). Figure 7.

An Efficient Ontology-Based Expert Peering System
Our objective is to develop a real-time search engine for an online com- .... an expert on quantum mechanics and computational physics might be able to help the .... such degree of accuracy nor required for the application areas considered.

SQLGraph: An Efficient Relational-Based ... - Research at Google
mark Council [2], where a Social Network Benchmark is ... social graph query workloads. It was ... schema layout outlined for RDF as in [5] or whether a back-.

An Efficient Mutual "nformation Based Feature Delection ... - GitHub
Besides feature selection, the other main class of di- .... idea that features that exhibit a high degree of mutual information con- ...... [5 ] H . L iu and R . Setiono.

Efficient Histogram-Based Sliding Window
this is computationally very intensive. .... ern CPU. {mb}B b=1 is either a model histogram or a support vector to which the feature histogram h is ..... Applications.

Efficient Histogram-Based Sliding Window - CiteSeerX
a sliding window is common in solving computer vision problems such as object ..... 2006 dataset [15] and test four object classes, i.e., person, motorbike, bicycle ...

Efficient Hierarchical Graph-Based Video Segmentation
els into regions and is a fundamental problem in computer vision. Video .... shift approach to a cluster of 10 frames as a larger set of ..... on a laptop. We can ...

Efficient Backprojection-based Synthetic Aperture ...
Nov 16, 2012 - on, and is thus constrained to, mathematical equivalence, ..... tation of backprojection stage, and (b) partitioning and blocking of it. The 3D ...

Computationally Efficient Template-Based Face ... - IEEE Xplore
head poses, illuminations, ages and facial expressions. Template images could come from still images or video frames. Therefore, measuring the similarity ...

An Efficient Auction
first or second price) cannot achieve an efficient outcome because the bids submitted by bidders 1 and 2 .... Call this strengthened version of A3, A3". ...... (1999): “An Ex-Post Efficient Auction," Discussion Paper *200, Center for Rationality an

An Efficient Synchronization Technique for ...
Weak consistency model. Memory read/write sequential ordering only for synchronization data. All the data can be cached without needing coherence protocol, while synchronization variables are managed by the. SB. Cache invalidation required for shared

EWAVES: AN EFFICIENT DECODING ... - Semantic Scholar
We call it inheritance since each node must inherit its score from its par- ents. This is unnatural ... activate a child of a node in ซดุต, we need to check whether.

An Efficient SIFT-Based Mode-Seeking Algorithm For Sub-Pixel ieee ...
and multisensor images is a fundamental building block in. a variety of remote sensing .... single thread style and tested it on scores of remotely sensed. image pairs [11]. ... PC (Intel Q8200 with 3-GB RAM and Vista OS). Note that the ... An Effici

An Efficient Packet Scheduler
DiffServ clouds. This means that in the ... address or traffic type and assigned to a specific traffic class. Traffic classifiers may ... RR can be applied to the data packet scheduling problems. The CPU ..... Computing, and Comm. Conf. (IPCCC '02) .

Importance Weighting Without Importance Weights: An Efficient ...
best known regret bounds for FPL in online combinatorial optimization with full feedback, closing ... Importance weighting is a crucially important tool used in many areas of ...... Regret bounds and minimax policies under partial monitoring.

AN EFFICIENT INTEGRATED GENDER DETECTION ...
phone was determined using the Bayesian Information Criterion as described in [6]. ..... accuracy speech recognition system for mobile applications,”. IEEE Transactions on ... matrices by basis expansion,” in ICASSP, Orlando, Florida,. 2002 ...

An efficient synthesis of tetrahydropyrazolopyridine ... - Arkivoc
generate a product, where all or most of the starting material atoms exist in the final .... withdrawing and electron-donating groups led to the formation of products ...

An Efficient Synchronization Technique for ...
Low-cost and low-power. Programmed with ad ... and low-power optimization of busy-wait synchronization ... Using ad hoc sync. engine is often a must for embedded systems ... Critical operation is the Associative Search (AS) phase. On lock ...

Froctomap: An efficient spatio-temporal environment representation ...
Nov 2, 2017 - Full-text (PDF) | Froctomap: An efficient spatio-temporal environment representation. ... spatio-temporal mapping framework is available as an open-source C++ library. and a ROS module [3] which allows its easy integration in robotics p

EWAVES: AN EFFICIENT DECODING ... - Semantic Scholar
The algorithm traverses the list of active nodes in a way that maximizes speed. For that ... cal items. We build a static lexical tree based on states. That is, the.