Investigating the Local-Meta-Model CMA-ES for Large ...

Viewer
Transcript

Investigating the Local-Meta-Model CMA-ES for Large Population Sizes Zyed Bouzarkouna1,2 1 IFP 2 TAO

Anne Auger2

Didier Yu Ding1

(Institut Fran¸cais du P´ etrole)

Team, INRIA Saclay-Ile-de-France, LRI

April 07, 2010

Statement of the Problem

Objective To solve a real-world optimization problem formulated in a black-box scenario with an objective function f : Rn 7→ R. multimodal noisy f may be: non-separable ...

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

non-smooth non-convex computationally expensive

page 2 of 15

A Real-World Problem in Petroleum Engineering History Matching The act of adjusting a reservoir model until it closely reproduces the past behavior of a production history.

A fluid flow simulation takes several minutes to several hours !!

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 3 of 15

Statement of the Problem (Cont’d) Difficulties Evolutionary Algorithms (EAs) are usually able to cope with noise, multiple optima . . . Computational cost build a model of f , based on true evaluations ; use this model during the optimization to save evaluations.

⇒

How to decide whether: the quality of the model is good enough to continue exploiting this model ?

or new evaluations on the “true” objective function should be performed ? Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 4 of 15

Table of Contents

1

CMA-ES with Local-Meta-Models Covariance Matrix Adaptation-ES Locally Weighted Regression Approximate Ranking Procedure

2

A New Variant of lmm-CMA A New Meta-Model Acceptance Criterion nlmm-CMA Performance

3

Conclusions

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 5 of 15

Covariance Matrix Adaptation-ES CMA-ES (Hansen & Ostermeier 2001) Initialize distribution parameters m, σ and C, set population size λ ∈ N. while not terminate Sample xi = m + σNi (0, C), for i = 1 . . . λ according to a multivariate normal distribution Evaluate x1 , . . . , xλ on f Update distribution parameters (m, σ, C) ← (m, σ, C, x1 , . . . , xλ , f (x1 ), . . . , f (xλ )) where m ∈ Rn : the mean of the multivariate normal distribution σ ∈ R+ : the step-size C ∈ Rn×n : the covariance matrix. Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 6 of 15

Covariance Matrix Adaptation-ES (Cont’d) Moving the mean µ(= λ2 )

m=

P

ωi xi:λ .

i=1 where xi:λ is the i th ranked individual: f (x1:λ ) ≤ . . . f (xµ:λ ) ≤ . . . f (xλ:λ ) , µ P ω1 ≥ . . . ≥ ωµ > 0, ωi = 1. i=1

Other updates Adapting the Covariance Matrix Step-Size Control

⇒

Updates rely on the ranking of individuals according to f and not on their exact values on f .

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 7 of 15

Locally Weighted Regression

q ∈ Rn : A point to evaluate

⇒

fˆ(q) : a full quadratic meta-model on q.

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 8 of 15

Locally Weighted Regression

A training set containing m points with their objective function values (xj , yj = f (xj )) , j = 1..m

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 8 of 15

Locally Weighted Regression

We select the k nearest neighbor data points to q according to Mahalanobis distance with respect to the current covariance matrix C.

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 8 of 15

Locally Weighted Regression

h is the bandwidth defined by the distance of the k th nearest neighbor data point to q.

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 8 of 15

Locally Weighted Regression

Building the meta-model fˆ on q 2 k n(n+3) P ˆ min f (xj , β) − yj ωj , w.r.t β ∈ R 2 +1 . j=1

T fˆ (q) = β T q12 , · · · , qn2 , · · · , q1 q2 , · · · , qn−1 qn , q1 , · · · , qn , 1 . Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 8 of 15

Approximate Ranking Procedure Every generation g , CMA-ES has λ points to evaluate.

⇒

Which are the points that must be evaluated with: the true objective function f ? the meta-model fˆ ?

Approximate ranking procedure (Kern et al. 2006) 1 2 3 4 5 6 7 8 9 10 11

approximate fˆ and rank the µ best individuals evaluate f on the ninit best individuals “ λ−n ” init for nic := 1 to do n b

approximate fˆ and rank the µ best individuals if (the exact ranking of the µ best individuals changes) then evaluate f on the nb best unevaluated individuals else break fi od adapt ninit depending on nic

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 9 of 15

A New Meta-Model Acceptance Criterion

Requiring the preservation of the exact ranking of the µ best individuals is a too conservative criterion to measure the quality of the meta-model. New acceptance criteria (nlmm-CMA) The meta-model is accepted if it succeeds in keeping: the best individual and the ensemble of the µ best individuals unchanged or the best individual unchanged, if more than one fourth of the population is evaluated.

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 10 of 15

nlmm-CMA Performance Success Performance (SP1): SP1 =

mean (number of function evaluations for successful runs) . ratio of successful runs

Speedup (algo) =

SP1(algo) SP1(CMA−ES) .

8

Speedup

6

4

2

0

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

(Dimension, Population Size)

page 11 of 15

nlmm-CMA Performance 4 nlmm-CMA fSchwefel1/4 8

6

6

6

4

Speedup

8

2

4

2

0 (2, 6) (4, 8) (8, 10) (Dimension, Population Size)

(16, 12)

0 (2, 6)

fRosenbrock

4

2

(4, 8) (5, 8) (Dimension, Population Size)

(8, 10)

0 (2, 6) (4, 8) (8, 10) (Dimension, Population Size)

fAckley 8

6

6

6

2

Speedup

8

4

4

2

0 (2, 6)

(4, 8) (5, 8) (Dimension, Population Size)

⇒

(8, 10)

0 (2, 5)

(16, 12)

fRastrigin

8

Speedup

Speedup

fNoisySphere

8

Speedup

Speedup

fSchwefel

lmm-CMA

4

2

(5, 7) (Dimension, Population Size)

(10, 10)

0 (2, 50)

(5, 140) (Dimension, Population Size)

nlmm-CMA outperforms lmm-CMA, on the test functions investigated with a speedup between 1.5 and 7.

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 11 of 15

nlmm-CMA Performance for Increasing Population Sizes 4 nlmm-CMA

lmm-CMA Dimension n = 5 fRosenbrock

fRastrigin 5

4

4

4

3 2 1 0 8

Speedup

5

Speedup

Speedup

fSchwefel1/4 5

3 2 1

16 24 32

⇒

48 Population Size

96

0 8

3 2 1

16 24 32

48 Population Size

96

0 70

140 Population Size

280

nlmm-CMA maintains a significant speedup,between 2.5 and 4, when increasing λ while the speedup of lmm-CMA drops to one.

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 12 of 15

Impact of the Recombination Type

nlmm-CMA a default weighted recombination type ωi =

ln(µ+1)−ln(i) µ ln(µ+1)−ln(µ!) ,

for i = 1 . . . µ.

nlmm-CMAI an intermediate recombination type ωi = µ1 , for i = 1 . . . µ.

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 13 of 15

Impact of the Recombination Type (Cont’d) nlmm-CMAI (with equal RT) fSchwefel1/4 fNoisySphere 8

8

6

6

6

4

2

4

2

0 (2, 6) (4, 8) (8, 10) (Dimension, Population Size)

(16, 12)

0 (2, 6)

fRosenbrock

4

2

(4, 8) (Dimension, Population Size)

(8, 10)

0 (2, 6) (4, 8) (8, 10) (Dimension, Population Size)

fAckley 8

6

6

6

2

0 (2, 6)

⇒ ⇒

Speedup

8

4

4

2

(4, 8) (Dimension, Population Size)

(8, 10)

0 (2, 5)

(16, 12)

fRastrigin

8

Speedup

Speedup

Speedup

8

Speedup

Speedup

4 nlmm-CMA fSchwefel

4

2

(5, 7) (Dimension, Population Size)

(10, 10)

0 (2, 50)

(5, 140) (Dimension, Population Size)

nlmm-CMA outperforms nlmm-CMAI . The ranking obtained with the new acceptance criterion still has an amount of information to guide CMA-ES.

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 14 of 15

Summary CMA-ES with meta-models The speedup of lmm-CMA with respect to CMA-ES drops to one when the population size λ increases.

⇒

The meta-model acceptance criterion is too conservative.

New variant of CMA-ES with meta-models A new meta-model acceptance criterion: It must keep: the best individual and the ensemble of the µ best individuals unchanged the best individual unchanged, if more than one fourth of the population is evaluated. nlmm-CMA outperforms lmm-CMA on the test functions investigated with a speedup in between 1.5 and 7. nlmm-CMA maintains a significant speedup, between 2.5 and 4, when increasing the population size on tested functions.

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 15 of 15

Thank You For Your Attention

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 16 of 15

Investigating the Local-Meta-Model CMA-ES for Large Population Sizes Zyed Bouzarkouna1,2 1 IFP 2 TAO

Anne Auger2

Didier Yu Ding1

(Institut Fran¸cais du P´ etrole)

Team, INRIA Saclay-Ile-de-France, LRI

April 07, 2010