Investigating the Local-Meta-Model CMA-ES for Large Population Sizes Zyed Bouzarkouna1,2 1 IFP 2 TAO

Anne Auger2

Didier Yu Ding1

(Institut Fran¸cais du P´ etrole)

Team, INRIA Saclay-Ile-de-France, LRI

April 07, 2010

Statement of the Problem

Objective To solve a real-world optimization problem formulated in a black-box scenario with an objective function f : Rn 7→ R. multimodal noisy f may be: non-separable ...

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

non-smooth non-convex computationally expensive

page 2 of 15

A Real-World Problem in Petroleum Engineering History Matching The act of adjusting a reservoir model until it closely reproduces the past behavior of a production history.

A fluid flow simulation takes several minutes to several hours !!

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 3 of 15

Statement of the Problem (Cont’d) Difficulties Evolutionary Algorithms (EAs) are usually able to cope with noise, multiple optima . . . Computational cost build a model of f , based on true evaluations ; use this model during the optimization to save evaluations.



How to decide whether: the quality of the model is good enough to continue exploiting this model ?

or new evaluations on the “true” objective function should be performed ? Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 4 of 15

Table of Contents

1

CMA-ES with Local-Meta-Models Covariance Matrix Adaptation-ES Locally Weighted Regression Approximate Ranking Procedure

2

A New Variant of lmm-CMA A New Meta-Model Acceptance Criterion nlmm-CMA Performance

3

Conclusions

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 5 of 15

Covariance Matrix Adaptation-ES CMA-ES (Hansen & Ostermeier 2001) Initialize distribution parameters m, σ and C, set population size λ ∈ N. while not terminate Sample xi = m + σNi (0, C), for i = 1 . . . λ according to a multivariate normal distribution Evaluate x1 , . . . , xλ on f Update distribution parameters (m, σ, C) ← (m, σ, C, x1 , . . . , xλ , f (x1 ), . . . , f (xλ )) where m ∈ Rn : the mean of the multivariate normal distribution σ ∈ R+ : the step-size C ∈ Rn×n : the covariance matrix. Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 6 of 15

Covariance Matrix Adaptation-ES (Cont’d) Moving the mean µ(= λ2 )

m=

P

ωi xi:λ .

i=1 where xi:λ is the i th ranked individual: f (x1:λ ) ≤ . . . f (xµ:λ ) ≤ . . . f (xλ:λ ) , µ P ω1 ≥ . . . ≥ ωµ > 0, ωi = 1. i=1

Other updates Adapting the Covariance Matrix Step-Size Control



Updates rely on the ranking of individuals according to f and not on their exact values on f .

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 7 of 15

Locally Weighted Regression

q ∈ Rn : A point to evaluate



fˆ(q) : a full quadratic meta-model on q.

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 8 of 15

Locally Weighted Regression

A training set containing m points with their objective function values (xj , yj = f (xj )) , j = 1..m

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 8 of 15

Locally Weighted Regression

We select the k nearest neighbor data points to q according to Mahalanobis distance with respect to the current covariance matrix C.

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 8 of 15

Locally Weighted Regression

h is the bandwidth defined by the distance of the k th nearest neighbor data point to q.

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 8 of 15

Locally Weighted Regression

Building the meta-model fˆ on q  2  k n(n+3) P ˆ min f (xj , β) − yj ωj , w.r.t β ∈ R 2 +1 . j=1

T fˆ (q) = β T q12 , · · · , qn2 , · · · , q1 q2 , · · · , qn−1 qn , q1 , · · · , qn , 1 . Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 8 of 15

Approximate Ranking Procedure Every generation g , CMA-ES has λ points to evaluate.



Which are the points that must be evaluated with: the true objective function f ? the meta-model fˆ ?

Approximate ranking procedure (Kern et al. 2006) 1 2 3 4 5 6 7 8 9 10 11

approximate fˆ and rank the µ best individuals evaluate f on the ninit best individuals “ λ−n ” init for nic := 1 to do n b

approximate fˆ and rank the µ best individuals if (the exact ranking of the µ best individuals changes) then evaluate f on the nb best unevaluated individuals else break fi od adapt ninit depending on nic

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 9 of 15

A New Meta-Model Acceptance Criterion

Requiring the preservation of the exact ranking of the µ best individuals is a too conservative criterion to measure the quality of the meta-model. New acceptance criteria (nlmm-CMA) The meta-model is accepted if it succeeds in keeping: the best individual and the ensemble of the µ best individuals unchanged or the best individual unchanged, if more than one fourth of the population is evaluated.

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 10 of 15

nlmm-CMA Performance Success Performance (SP1): SP1 =

mean (number of function evaluations for successful runs) . ratio of successful runs

Speedup (algo) =

SP1(algo) SP1(CMA−ES) .

8

Speedup

6

4

2

0

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

(Dimension, Population Size)

page 11 of 15

nlmm-CMA Performance 4 nlmm-CMA fSchwefel1/4 8

6

6

6

4

Speedup

8

2

4

2

0 (2, 6) (4, 8) (8, 10) (Dimension, Population Size)

(16, 12)

0 (2, 6)

fRosenbrock

4

2

(4, 8) (5, 8) (Dimension, Population Size)

(8, 10)

0 (2, 6) (4, 8) (8, 10) (Dimension, Population Size)

fAckley 8

6

6

6

2

Speedup

8

4

4

2

0 (2, 6)

(4, 8) (5, 8) (Dimension, Population Size)



(8, 10)

0 (2, 5)

(16, 12)

fRastrigin

8

Speedup

Speedup

fNoisySphere

8

Speedup

Speedup

fSchwefel

 lmm-CMA

4

2

(5, 7) (Dimension, Population Size)

(10, 10)

0 (2, 50)

(5, 140) (Dimension, Population Size)

nlmm-CMA outperforms lmm-CMA, on the test functions investigated with a speedup between 1.5 and 7.

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 11 of 15

nlmm-CMA Performance for Increasing Population Sizes 4 nlmm-CMA

 lmm-CMA Dimension n = 5 fRosenbrock

fRastrigin 5

4

4

4

3 2 1 0 8

Speedup

5

Speedup

Speedup

fSchwefel1/4 5

3 2 1

16 24 32



48 Population Size

96

0 8

3 2 1

16 24 32

48 Population Size

96

0 70

140 Population Size

280

nlmm-CMA maintains a significant speedup,between 2.5 and 4, when increasing λ while the speedup of lmm-CMA drops to one.

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 12 of 15

Impact of the Recombination Type

nlmm-CMA a default weighted recombination type ωi =

ln(µ+1)−ln(i) µ ln(µ+1)−ln(µ!) ,

for i = 1 . . . µ.

nlmm-CMAI an intermediate recombination type ωi = µ1 , for i = 1 . . . µ.

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 13 of 15

Impact of the Recombination Type (Cont’d)  nlmm-CMAI (with equal RT) fSchwefel1/4 fNoisySphere 8

8

6

6

6

4

2

4

2

0 (2, 6) (4, 8) (8, 10) (Dimension, Population Size)

(16, 12)

0 (2, 6)

fRosenbrock

4

2

(4, 8) (Dimension, Population Size)

(8, 10)

0 (2, 6) (4, 8) (8, 10) (Dimension, Population Size)

fAckley 8

6

6

6

2

0 (2, 6)

⇒ ⇒

Speedup

8

4

4

2

(4, 8) (Dimension, Population Size)

(8, 10)

0 (2, 5)

(16, 12)

fRastrigin

8

Speedup

Speedup

Speedup

8

Speedup

Speedup

4 nlmm-CMA fSchwefel

4

2

(5, 7) (Dimension, Population Size)

(10, 10)

0 (2, 50)

(5, 140) (Dimension, Population Size)

nlmm-CMA outperforms nlmm-CMAI . The ranking obtained with the new acceptance criterion still has an amount of information to guide CMA-ES.

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 14 of 15

Summary CMA-ES with meta-models The speedup of lmm-CMA with respect to CMA-ES drops to one when the population size λ increases.



The meta-model acceptance criterion is too conservative.

New variant of CMA-ES with meta-models A new meta-model acceptance criterion: It must keep: the best individual and the ensemble of the µ best individuals unchanged the best individual unchanged, if more than one fourth of the population is evaluated. nlmm-CMA outperforms lmm-CMA on the test functions investigated with a speedup in between 1.5 and 7. nlmm-CMA maintains a significant speedup, between 2.5 and 4, when increasing the population size on tested functions.

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 15 of 15

Thank You For Your Attention

Zyed Bouzarkouna, Anne Auger, Didier Yu Ding

page 16 of 15

Investigating the Local-Meta-Model CMA-ES for Large Population Sizes Zyed Bouzarkouna1,2 1 IFP 2 TAO

Anne Auger2

Didier Yu Ding1

(Institut Fran¸cais du P´ etrole)

Team, INRIA Saclay-Ile-de-France, LRI

April 07, 2010

Investigating the Local-Meta-Model CMA-ES for Large ...

Apr 7, 2010 - Approximate Ranking Procedure. 2 A New Variant of lmm-CMA. A New Meta-Model Acceptance Criterion nlmm-CMA Performance.

3MB Sizes 4 Downloads 126 Views

Recommend Documents

Investigating Threads.pdf
end sub. call thread2. writeln("In main"). do. loop. end. Notes: 1. The “as thread” construct marks the subroutine executable as a thread. 2. You may wish to save the above code in a file or paste it in Notebook. Briefly explain what the above co

Investigating Threads.pdf
... to modify the above program statements for the next set of exercises. You. can do this in two ways: 1) Modify the existing source, 2) Copy the code and paste it into a. new editor window (you can use the NEW... button for this in the compiler win

Investigating the Receptor-independent ...
Linert, W., Bridge, M. H., Huber, M., Bjugstad, K. B., Grossman, S., and Arendash,. G. W. (1999) Biochim. Biophys. Acta 1454, 143–152. 10. Liu, Q., Tao, Y., and ...

Investigating the Orientation Process
Eastland Fairfield Connections: Graphic Design, Performing Arts ... Fairfield Connections: Financial Services, International Business, Marketing and Logistics.

Investigating, Part 2
1) Move the frequency slider to 20 (Note: this does NOT mean the frequency is 20 Hz). Use the timer and ruler to make the measurements requested below. Use the pause and step buttons if you need to stop the simulation for ease of observation. a) Meas

Data Retention as a Tool for Investigating Internet Child ...
Data Retention as a Tool for Investigating Internet Child Pornography and Other Internet Crimes.pdf. Data Retention as a Tool for Investigating Internet Child ...

Investigating an Invariant Item Ordering for ...
Jan 21, 2010 - discrimination parameter ai; then, the ISRF for score x on item i is defined as. PًXi ≥ xjyق ¼ exp½aiًy ہ bixقٹ. 1 exp½aiًy ہ bixقٹ. , x ¼ 1, ... , m.

Investigating LSTMs for Joint Extraction of Opinion Entities and Relations
first such attempt using a deep learning approach. Perhaps surprisingly, we find that standard LSTMs are not competitive with a state-of-the-art CRF+ILP joint in- ference approach (Yang and Cardie, 2013) to opinion entities extraction, perform- ing b

Semantic Integration as a method for investigating ...
relevance. Participants in one condition were told that the descriptive information presented to .... for a sentence that expresses the integrated representation.

Large body size for winning and large swords for ...
period of 64 h without social interaction to mitigate the behavioural effects of prior ..... Social control of adult size in males of Xipho- phorus variatus. Nature, 245 ...

If Sentences Could See: Investigating Visual Information for Semantic ...
human similarity scores, directly employ full-blown MT systems and next ..... the translation matrices stochastically with the Adam algorithm (Kingma and Ba,.

An experimental approach for investigating consumers ... - Springer Link
Feb 10, 2000 - service encounter satisfaction, overall service quality, ... and monitoring [5,7,8]. ...... Regimens," in Applications of Social Science to Clinical.