Data based modelling for rapid process understanding and design under uncertainty Tao Chen, Yanhui Yang School of Chemical and Biomedical Engineering, Nanyang Technological University, 62 Nanyang Drive, Singapore 637459, Singapore (e-mail: [email protected]). Abstract: We consider an off-line process design problem where the response variable is affected by several factors. We present a data-based modelling approach that iteratively allocates new experimental points, update the model, and search for the optimal process factors. A flexible non-linear modelling technique, Gaussian process (GP), forms the cornerstone of this approach. GP model is capable of providing accurate predictive mean and variance, the latter being a quantification of its prediction uncertainty. We present a strategy for utilizing this prediction uncertainty through application to the design of a catalytic reaction system. Finally, we describe a sensitivity analysis approach to model interpretation, which is similar to analysis of variance (ANOVA) in traditional response surface methodology. Keywords: Design of experiments, Gaussian process model, model interpretation, process optimization, response surface methodology, uncertainty. 1. INTRODUCTION Mathematical models are the foundation of the systems approach to the design of chemical and other processes (Klatt and Marquardt, 2009). Models can be developed through the representation of fundamental principles that govern the process, and thus they are termed firstprinciples or mechanistic models. Alternatively, models may be purely based on experimental data and are called data-based or empirical. Although data-based models are typically reliable only within the operating region where the data are collected, they have seen wide applications due to the simplicity of model development and implementation. This is especially true if the process is still in its early design stage, whereby the time and resources needed for mechanistic modelling are hardly justifiable. This study is further restricted to batch-wise (as opposed to time-dependent) modelling that relates the process response (y, e.g. product yield) to the operating factors (x = [x1 , . . . , xd ]T , e.g. reaction temperature and pressure), where d is the number of factors. These models are typically used in off-line design tasks to facilitate the understanding and optimization of the process. This is in contrast to dynamic process models which are mainly used for on-line control and optimization purposes. Such a data-based model is the central component of the so-called response surface methodology (RSM) for rational process design (Myers and Montgomery, 1995). The traditional method in RSM is to fit a polynomial function (typically linear, quadratic or cubic polynomial) to the experimental data, followed by identifying the process factors that optimize the objective function. However, the prediction accuracy of the polynomial regression is usually

unsatisfactory due to the restrictive functional form, and consequently the model-based process understanding and optimization may be unreliable. To address this issue, flexible non-linear models have been applied to provide a more accurate approximation of the process behaviour, such as artificial neural network (ANN) (Shao et al., 2007), support vector machine (SVM) (Hadjmohammadi and Kamel, 2008), and Gaussian process (GP) regression (Yuan et al., 2008). GP models are particularly attractive since it not only attains high prediction accuracy, but also quantifies the uncertainty of the prediction. As a consequence, the model-based optimization problem can be formulated to account for the uncertainty, and the identified optimal process factors are expected to be more robust against modelling uncertainty. This paper extends the previous GP-based process optimization (Yuan et al., 2008) to an iterative approach, whereby the GP model is used to help search for the best process performance incrementally. We adopt a worst-case approach to handle the prediction uncertainty. We further presents a statistical procedure, based on sensitivity analysis, to interpret how the process response is affected by the factors. The proposed method is successfully applied to a catalytic reaction system to convert trans-stilbene to epoxides, which is an important intermediate in chemical and pharmaceutical industry. 2. ITERATIVE MODELLING AND OPTIMIZATION The overall approach is illustrated in Fig. 1. Initially, statistical design of experiments (DoE) is applied to give the initial design points for conducting experiments. The experimental data are used to develop a GP model to relate the process response to the factors. Based on the

C(x(i) , x(j) ) = a0 + a1

d X

(i) (j)

xk xk

k=1

+ v0 exp −

d X

k=1

Fig. 1. The flowchart of the proposed algorithm.

(i)

model, we allocate new design points by maximizing a certain objective function, and new experiments will be conducted to update the model. We discuss the various components of this algorithm in this section. 2.1 Initial experimental design Initial DoE is required to obtain the data for empirical modelling. The classical fractional factorial and central composite designs were proposed to investigate the interactions of process factors based on polynomial models (Myers and Montgomery, 1995). These classical designs typically assign two or three pre-determined levels for each process factor, and experiments are conducted at the combination of the levels of different factors. Using a small number of levels is especially appealing if the factors’ values are difficult to change in practice. However, this strategy may not have an optimal coverage of the design space due to limited levels of the factors being studied, and thus it may result in a less reliable empirical model. The recognition of this disadvantage of classical DoE methods has motivated the concept of “space-filling” designs that allocate design points to be uniformly distributed within the range of each factor (McKay et al., 1979). Among this class of designs, the Latin hypercube sampling (LHS) (McKay et al., 1979) has received wide application as a result of its simple implementation and good performance. LHS is adopted in this study for initial DoE. 2.2 Overview of Gaussian process model GP, also termed kriging model in the literature with slightly different formulation (Sacks et al., 1989), is a flexible modelling technique that can be used for both regression and classification purposes (Rasmussen and Williams, 2006). The GP regression model can be derived from the perspectives of ANN and Bayesian non-parametric regression; see (Rasmussen and Williams, 2006) for details. In this subsection a brief overview of Gaussian process regression model is given, including the formulation and implementation of the model. Consider a data set with n data points, {x(i) , y (i) ; i = 1, . . . , n}, where we use super-script to index data points and sub-script to index dimensions. A GP for regression is defined such that the regression function y(x) has a Gaussian prior distribution with zero mean, or in discrete form: y = (y (1) , . . . , y (n) )T ∼ G(0, C)

(1)

where C is an n × n covariance matrix of which the ij-th element is defined by the covariance function: C(x(i) , x(j) ). An example of such a covariance function is:

(i) wk (xk



(j) xk )2

!

+ δij σ 2 (2)

(i)

where x(i) = (x1 , . . . , xd ); δij = 1 if i = j, otherwise it is equal to zero. We denote θ = (a0 , a1 , v0 , w1 , · · · , wd , σ 2 ) as “hyper-parameters” defining the covariance function. The hyper-parameters must be non-negative to ensure that the covariance matrix is non-negative definite. The term “hyper-parameter” is used to differentiate Gaussian processes from parametric regression, where the parameter is required to be estimated. For the covariance function depicted in eq. (2), the first two terms represent a constant bias (offset) and a linear correlation term, respectively. The exponential term is similar to the form of a radial basis function, and it takes into account the potentially strong correlation between the outputs for nearby inputs. The term σ 2 captures the random error effect. By combining both linear and non-linear terms in the covariance function, Gaussian process is capable of handling both linear and non-linear data structures. Other forms of covariance functions are discussed in (Rasmussen and Williams, 2006). The prediction at a new data point x∗ is also Gaussian distributed: y ∗ ∼ G(ˆ y ∗ , σy2ˆ∗ ) with yˆ∗ = kT (x∗ ) C−1 y σy2ˆ∗ = C(x∗ , x∗ ) − kT (x∗ ) C−1 k(x∗ )

(3) (4)

where k(x∗ ) = [C(x∗ , x(1) ), . . . , C(x∗ , x(n) )]T . The capability to providing the prediction uncertainty in terms of the variance is an important feature of Gaussian process for robust process design (Yuan et al., 2008). The hyper-parameters θ can be estimated by maximizing the following log-likelihood function using optimization algorithms: 1 1 n L = − log det C − yT C−1 y − log 2π 2 2 2

(5)

A conjugate gradient method is usually used to find the hyper-parameters that maximize the above likelihood (Rasmussen and Williams, 2006). A Matlab implementation of the GP model is publicly available from http:// www.gaussianprocess.org/gpml/, and it was used to produce the results in this study. 2.3 Model-based region-search and optimization The GP model that relates the process response to the factors provides the basis to guide the search for more promising process factors. The salient feature of GP model is the availability of predictive variance, which should be accounted for in the optimization framework. In this study, we adopt the method of worst-case scenario by maximizing the lower-bound of the response predicted by the GP

Table 1. Process factors considered to study the trans-stilbene epoxidation process. Range 60 − 120 0.2 − 0.8 1−5 200 − 1250 30 − 240

model. (Suppose the objective is to maximize the response variable.) That is, xnew = arg max (ˆ y ∗ − ασyˆ∗ ) , ∗

s.t. x∗ ∈ S

(6)

x

where S denotes the factors’ range, and α is a weighting coefficient, e.g. α = 1.645 corresponds to 95% lowerbound of the prediction. The above optimzation problem can be solved using sequential quadratic programming or other methods, depending on the property of the specific problems. An alternative approach is to allocate multiple experimental points for the new iteration. This is relevant in practical situations where the scheduling of equipments/instruments favours conducting several experiments within a certain period of time, or high-throughput facility is available to run multiple experiments simultaneously. To suit such situations, we allocate new design points to the region defined as follows {x∗ : yˆ∗ − ασyˆ∗ > b AND σyˆ∗ > c AND x∗ ∈ S}

(7)

Essentially, this is to search for the factors such that the 95% lower-bound of the response is greater than b. Furthermore, we add a constraint σyˆ∗ > c to avoid allocating design points to well-explored region, where the GP model is quite certain about its prediction (i.e. with small σyˆ∗ ) and thus further experiments in this region are not necessary. The proper value of b and c should be adjusted at each iteration based on the information from experimental data.

Iter 1 Iter 2

No. data 20 40

GP RMSE R2 5.42 0.85 3.77 0.96

Quadratic RMSE R2 7.66 0.70 5.31 0.92

60 50 Conversion (%)

Process factor Temperature, x1 (◦ C) Partial pressure of oxygen, x2 (Bar) Initial stilbene concentration, x3 (mmol/15 mL) Stirring rate, x4 (rpm) Reaction time, x5 (min)

Table 2. LOOCV results.

40 30 20 10 0 1

2 Iteration

Fig. 2. Box-plot of conversion rate at two iterations. 3.1 Results In the initial iteration, the knowledge about the process is relatively limited, and the LHS algorithm is used to obtain 20 design points within the whole range of five factors for experiments. After experimentation, the corresponding conversion rates are obtained for GP modelling. A distribution of these 20 conversion values is illustrated in Fig. 2 using a box-plot. Clearly, the initial results are not satisfactory. Although two runs gave conversion rate as high as 55%, the median conversion is only 6.2%. Subsequently, we will demonstrate the use of GP model to guide the new experiments in order to find better process performance.

3. CASE STUDY

Before the model is applied for subsequent region-searching or optimization purpose, its predictive capability should be assessed by the well-known cross-validation procedure. In this study, we adopt the method of leave-one-out crossvalidation (LOOCV) to validate the GP regression model, and the results are given in Table 2 in terms of root mean squared error (RMSE) and coefficient of determination (R2 ). For the purpose of comparison, a conventional multiple quadratic polynomial regression model with stepwise variable selection is also developed. Clearly, the GP model has attained significantly higher prediction accuracy than the quadratic regression. A GP model is then developed from all the available 20 data points, and this model is used to search for better process performance in the next iteration.

We consider to optimize a lab-scale catalytic reaction to achieve the maximal conversion rate in the epoxidation of trans-stilbene over Co2+ -NaX (cobalt ion-exchanged faujasite zeolite) catalyst. The catalyst was synthesized inhouse using ion-exchange techniques. Five process factors are considered: reaction temperature, partial pressure of oxygen, initial trans-stilbene concentration, stirring rate and reaction time. The range of these factors to be explored is listed in Table 1.

Due to the scheduling arrangement of our lab, we chose to allocate 20 new experimental points for the next iteration according to eq. 7. Since in the first iteration, only four experiments attained conversion rates higher than 15%, we set b = 15% in eq. (7), in the hope to explore the factors’ region with conversion higher than 15%. In addition, we set c to be the average standard deviation of predictions in the LOOCV procedure, so that to generate design points that are not well predicted by the current model. Based on these choices, the incremental LHS algorithm generates a

Clearly there are an infinite number of design points that satisfy eq. (7). Hence the region-searching essentially becomes a constrained DoE problem, where the factors’ values are selected to be uniformly distributed in the constrained space that is defined by eq. (7). This constrained DoE problem can be solved by using incremental LHS as detailed in (Tang et al., 2010).

new set of 20 design points, and the obtained conversion rates are illustrated in Fig. 2. The figure confirms that the proposed framework has successfully identified more promising region of the process factors. The median and maximal conversion rates at iteration 2 have been improved to be 31.1% and 61.1%, respectively. In addition, recall that the objective of the region-searching in iteration 1 is to find the process factors with conversion higher than 15%. This objective has been fulfilled for 18 out of 20 experiments at iteration 2. Therefore, it appears that the GP model is reasonably reliable for predicting the process response variable. In principle, the proposed method can be iterated multiple times as required, and the number of iterations should be decided by the experienced experimenters after careful examination of the results. In this study, we restrict the number of iterations to two. Indeed, the specialists in catalytic reactions also feel that the identified factors so far may be close to the optimal condition achievable given the current experimental environment. To enable the optimization in the final iteration, a new GP regression model is required to approximate the response surface, using all the 40 data points available. Again, the results of LOOCV from GP are significantly better than traditional quadratic regression models (Table 2). Also note that adding more data typically has positive effect on reducing the prediction error in the models. Based on the finally developed GP model from all the 40 experimental data, the optimization problem defined in eq. (6) is solved. The optimal process condition is found to be: x1 = 120 ◦ C (temperature), x2 = 0.63 bar (partial pressure of oxygen), x3 = 1.00 mmol/15mL (initial stilbene concentration), x4 = 1250 rpm (string rate), and x5 = 120 min (reaction time), and the GP model predicts the conversion rate to be 94.5%. The actual experiment at this claimed optimal condition attains a conversion rate of 93.5%, which is reasonably close to the predicted value and is regarded as satisfactory under the current constraints of experiments. 4. MODEL INTERPRETATION In this work, model interpretation is referred to identifying how the process factors influence the response variable. When traditional polynomial functions are used in RSM, the models are self-explanatory: the regression coefficients clearly indicate the sign and magnitude of impact of process factors and factor interactions. Furthermore, a powerful technique, analysis of variance (ANOVA), can be employed to identify whether the impact is statistically significant. Unfortunately, when more accurate and complex models (such as GP) are used, such a clear interpretation is lost (Brown, 2007). In this work, we suggest to apply global sensitivity analysis (SA) to address this issue. 4.1 Global sensitivity analysis Global SA is based on a decomposition of the model into main effects and interactions (Saltelli et al., 2008):

y = f (x) = E(y) +

d X

zi (xi ) +

X

zi,j (xi,j )

1≤i
i=1

+

X

zi,j,k (xi,j,k ) + . . . + z1,2,...,d (x)

(8)

1≤i
where E(·) denotes expectation. The main effects are zi (xi ) = E(y|xi ) − E(y), and the two-factor interactions are zi,j (xi,j ) = E(y|xi,j ) − zi (xi ) − zj (xj ) − E(y). Similar expressions can be derived for multi-factor interactions. Global SA is also termed probabilistic SA, because the expectation operations are carried out with respect to a certain probability density function (pdf ) of process factors (Oakley and O’Hagan, 2004). In the context of interpreting data-based process models, process factors are instead given a uniform distribution within a certain range. Note that the main effects and interactions are all functions of process factors. Therefore, computing and plotting the main effects and/or interactions against the factors is a powerful graphic tool to depict how the process responds to the factors and their interactions. For example, if xi is an important factor to influence the response, then the conditional expectation E(y|xi ) will have a large variation across xi values. This observation motivated the following variance-based measure to quantify the importance of factor xi : Vi = var{zi (xi )} = var{E(y|xi )}

(9)

and interactions Vi,j = var{zi,j (xi,j )} = var{E(y|xi,j )} − Vi − Vj

(10)

In the context of process design, the data are usually obtained from designed experiments. Typical DoEs, be it factorial design or Latin hypercube sampling, will ensure that process factors are independent. Under this assumption, the total variance of y, V = var(y), can be decomposed as

V =

d X i=1

Vi +

X

1≤i
Vi,j +

X

Vi,j,k + . . . + V1,2,...,d

1≤i
(11)

and thus the sensitivity indices are normalized as Si = Vi /V , Si,j = Vi,j /V , and so on. Another useful measure is the variance of total effect of the i-th factor defined by VT i = V − var{E(y|x−i )}

(12)

where x−i denotes the sub-vector of x containing all elements except xi . VT i measures the variance of y that remains if the true values of x−i can be determined. The corresponding total sensitivity index is then ST i = VT i /V . The total effect index accounts for the total contribution to the response variation due to factor xi , including its first-order effect Si plus all higher-order effects due to interactions. As such, ST i ≥ Si , where equality holds when xi is not interacting with any other factors.

Table 3. Global SA for the GP model of the trans-stilbene epoxidation process. The sum of sensitivity indices Si is 0.752.

Next, the computational strategy for global SA will be discussed, including the calculation of main effects and interactions as in eq. (8) and the sensitivity indices Si and ST i . The number of terms that need to be computed increases exponentially with the number of process factors. To alleviate the computational cost, the usual approach is to assess the main effects first. If two-factor interactions are deemed to be significant, then they should also be calculated. Finally, higher-order effects typically have small magnitude and thus are usually not considered.

Process factor Temperature Partial pressure of oxygen Initial stilbene concentration Stirring rate Reaction time

The computation for global SA mainly involves the evaluation of the following integral:

Temperature Pressure Concentration Stirring Rate Time

25

(13)

and its variance with respect to xi : var{E(y|xi )}, which is also an integral by definition. Here p(x−i |xi ) is the condition pdf of x−i given xi , which can be derived from the overall pdf p(x). The usual approach to computation is by using Monte Carlo methods. The basic idea is to draw random samples from the pdf, followed by evaluating the GP model at the samples for x. Then, the various integrals can be approximated based on the principle of Monte Carlo integration; see (Saltelli et al., 2008) for more details. The major computation in the MC method is to evaluate the GP model for many times. Although the model has a “complex” non-linear form, its evaluation is usually very quick in comparison with evaluating a first-principles model. For example, evaluating the GP model developed for the catalytic process for 10,000 times (i.e. 10,000 random samples) took 43.2 s, which is acceptable in most practical applications. (Computation was conducted under Matlab environment on a Pentium 3.4 GHz computer running Windows XP.) 4.2 The catalytic process revisited We demonstrate the use of global SA for model interpretation based on the catalytic trans-stilbene epoxidation process described previously. A GP model was developed using all available data (41 points) to approximate the response-factor relationship. See Table 1 for details about the process factors. The global sensitivity indices for the five factors are calculated based on 10,000 MC samples and are listed in Table 3. The main effects are also graphically illustrated in Fig. 3. The trans-stilbene epoxidation is an endothermic reaction, and thus higher temperature is preferred for higher conversion. This is confirmed by both a large sensitivity index S1 = 0.445 and the dominant impact of temperature on conversion. The reaction time x5 has the second highest impact, and the graph of E(y|x5 ) favours a longer reaction time. For this reaction, it appears that the oxygen pressure (x2 ) and stirring rate (x4 ) do not significantly affect conversion, and thus they may not need to be considered in further studies. Finally, the initial concentration of raw material, trans-stilbene, has an negative impact on conversion. The results from global SA also indicates that the impact from interaction terms is not negligible. In Table 3, there is appreciable difference between the main (Si )

20 i

y(x)p(x−i |xi )dx−i

ST i 0.683 0.030 0.192 0.021 0.357

30

E(y|x )

E(y|xi ) =

Z

Si 0.445 0.013 0.125 0.010 0.159

15

10

5

0 0

0.2

0.4 0.6 Scaled Process Factors (x )

0.8

1

i

Fig. 3. The main effect E(y|xi ) against xi for each process factor: the trans-stilbene epoxidation process. and total (ST i ) indices, in particular for temperature and reaction time. In addition, the sum of five main indices is 0.752, suggesting that the interactions would account for 1 − 0.752 = 24.8% of the total variation in the response variable. Therefore, further analysis is required to assess the contribution from interaction terms. Table 4 gives the two-factor interaction indices. Except for the interaction between reaction temperature and time (S1,5 = 0.167), the other items are insignificant. The summation of main effect and two-factor interaction indices is 0.992, which implies that investigation on higher-order interactions is not necessary. To further illustrate the interacting effect of reaction temperature and time, Fig. 4 depicts E(y|x1 , x5 ) as a function of these two factors (range scaled to [0, 1]). For comparison, the same graph for temperature and stirring rate where the interaction is small (S1,4 = 0.006) is given. In Fig. 4, when temperature is low, varying reaction time does not significantly affect conversion; when temperature is high, increasing reaction time is clearly advantageous. The graph indicates a clear positive interaction between these two factors. In contrast, Fig. 4 shows that temperature has large impact on conversion no matter what stirring rate is, and varying stirring rate has low influence no matter what temperature is. Hence, it can be concluded from the figure that the interaction between temperature and stirring rate is negligible. 5. CONCLUDING REMARKS In this work, we have demonstrated an iterative approach to data-based process modelling and optimization. We adopted the GP model that is capable of providing both

jectives functions. Furthermore, in real industrial applications, the process factors and catalysts may not be as closely controlled as in the laboratories, and thus process variability may become significant. The combination of robust design and optimization methodology within the GP-based RSM framework is also under study.

Table 4. Two-factor interaction indices for the GP model of the trans-stilbene epoxidation process. The summation of main effect and two-factor interaction indices is 0.992. x1 0.013 0.028 0.006 0.167

x2

x3

0.011 0.000 0.004

0.000 0.009

x4

ACKNOWLEDGEMENTS 0.002

The experimental work was conducted by the researchers and students in our group, notably Qinghu Tang, Ying Bin Lau, Shuangquan Hu and Wenjin Yan.

50

REFERENCES

40

Brown, P. (2007). Discussion of the paper by Chen, Morris and Martin. Chemometrics and Intelligent Laboratory Systems, 87(1), 94–95. Hadjmohammadi, M. and Kamel, K. (2008). Response surface methodology and support vector machine for the optimization of separation in linear gradient elution. Journal of Separation Science, 31, 3864–3870. Klatt, K. and Marquardt, W. (2009). Perspectives for process systems engineeringPersonal views from academia and industry. Computers and Chemical Engineering, 33(3), 536–550. McKay, M.D., Beckman, B.J., and Conover, W.J. (1979). A comparison of three methods for selecting values for input variables in the analysis of output from a computer code. Technometrics, 21, 239–245. Myers, R.H. and Montgomery, D.C. (1995). Response Surface Methodology. Wiley. Oakley, J. and O’Hagan, A. (2004). Probabilistic sensitivity analysis of complex models: a Bayesian approach. Journal of the Royal Statistical Society. Series B (Statistical Methodology), 66(3), 751–769. Rasmussen, C.E. and Williams, C.K.I. (2006). Gaussian Processes for Machine Learning. MIT Press. Sacks, J., Welch, W., Mitchell, T., and Wynn, H. (1989). Design and analysis of computer experiments. Statistical Science, 4, 409–423. Saltelli, A., Ratto, M., Andres, T., Campolongo, F., Cariboni, J., Gatelli, D., Saisana, M., and Tarantola, S. (2008). Global Sensitivity Analysis: The Primer. WileyInterscience. Shao, P., Jiang, S.T., and Ying, Y.J. (2007). Optimization of molecular distillation for recovery of tocopherol from rapeseed oil deodorizer distillate using response surface and artificial neural network models. Food and Bioproducts Processing, 85, 85–92. Tang, Q., Lau, Y., Hu, S., Yan, W., Yang, Y., and Chen, T. (2010). Response surface methodology using Gaussian processes: towards optimizing the trans-stilbene epoxidation over Co2+ -NaX catalysts. Chemical Engineering Journal, in press. Yuan, J., Wang, K., Yu, T., and Fang, M. (2008). Reliable multi-objective optimization of high-speed WEDM process based on Gaussian process regression. International Journal of Machine Tools and Manufacture, 48, 47–60.

30

1

5

E(y|x ,x )

Si,j x2 x3 x4 x5

20 10 0 1 1 0.5 Time, x

0.5 0

5

0

Temperature, x

1

40

1

4

E(y|x ,x )

30 20 10 0 1 1 0.5 Stirring Rate (x ) 4

0.5 0

0

Temperature (x ) 1

Fig. 4. The expectation, E(y|xi , xj ), against reaction temperature (x1 ), stirring rate (x4 ), and reaction time (x5 ) for the trans-stilbene epoxidation process. Process factors are scaled. accurate mean prediction and reliable prediction variance. The desirable properties of GP model form the basis for model-based range-searching and optimization in the iterative framework. To aid the interpretation of the developed model, a global sensitivity analysis approach is adopted. The proposed methodology has been successfully applied to the optimization of a trans-stilbene epoxidation process. In principle, the presented framework is applicable to general “processes” in diverse fields of science, engineering, management, among others, where empirical models are developed from designed experiments to facilitate the rational design and optimization of the processes. Currently, we are investigating improved formulations of the objective function for optimization, and the extension of the methodology to simultaneously optimize multiple ob-

Data based modelling for rapid process understanding ...

We present a data-based modelling approach that iteratively allocates new ..... running Windows XP.) .... of molecular distillation for recovery of tocopherol from.

144KB Sizes 1 Downloads 148 Views

Recommend Documents

Iterative Data-based Modelling and Optimization for ...
factors that optimize the objective function. However, the .... ing the following likelihood function .... pre-selected basis functions (e.g. orthogonal polynomials.

Data-based mechanistic modelling of three ...
tion rate 200 m3 hА1 as illustration for calculation. In this figure the airspace temperature .... Jia, C.C., Sun, D.W., Cao, C.W., 2000a. Mathematical simulation of.

data based mechanistic modelling of three dimensional temperature ...
Temperature distribution in a Ventilated Room Filled with Obstacles ................................... 75 ...... of ventilation rate through material in a big chamber (Fig. 2.14).

A new approach for modelling and understanding ...
index and output -, while the goal of monetary policy is to define the optimal .... vector of dimension (1 × 2), and B a bi-dimensional Brownian motion defined as.

Designing tabletop-based systems for user modelling of ...
when these theories establish the ideal aims, the technology tradeoffs .... Analyzing Student- Tutor Interaction Logs to Improve Educational Outcomes, (2004).

Metamodelling for Agent-Based Modelling: An ...
Agent-Based Modelling (ABM) is a form of computational simulation that has become over the past years a widely used technique for research in very different ...

Process-Data-Warehousing-Based Operator Support ...
365 terms of semantic representation. 366. The data model plays the role of a guideline, .... of many computer-aided intelligent decision support systems 472.

Sketch-Based Interfaces for Parametric Modelling
Pedro Company ... current CAD tools are not well suited to the initial design stages of product ... GEGROSS is a CASk application than performs an online.

Designing tabletop-based systems for user modelling of ...
improving collaboration, particularly by capturing data that can be used to model the nature of the interactions ... a careful analysis of the nature of tabletop datasets and collaboration. Keywords: .... There are also software based solutions that 

StretchEBand: Enabling Fabric-Based Interactions through Rapid ...
May 11, 2017 - factor and the resulting interaction techniques as the main contributions of this paper, we also present technical novelty in how we leverage stretch sensing for multimodal interac- tion in different contexts. Building on previous work

Ensemble methods for environmental data modelling ...
Institute of Geomatics and Analysis of Risk, University of Lausanne ... The choice of a good statistical model for environmental data modelling is usually very ...

Content-based retrieval for human motion data
In this study, we propose a novel framework for constructing a content-based human mo- tion retrieval system. Two major components, including indexing and matching, are discussed and their corresponding algorithms are presented. In indexing, we intro

A Process-Theoretic State-Based Framework for Live ...
(3) the supervised system, where in order to ensure safety, the synthesis procedure ... using a process theory that uses signal emission [6] to specify the state-based ... the prominent (state-based) model checker UPPAAL [10]. To couple both ...

Model-based DRC for design and process integration
Enhanced checks, however, for non-lithographic failure classes (metal slotting rules, for example) .... *See Fig 4 (frames E-G) for an illustration. 5.2. Extracting ...

Read Online Guidelines for Risk Based Process Safety ...
Current structural design construction support inspection and maintenance ... is to opt for US based fish and shellfish says Grattan “In the US there are strict regulator ... relatively lower hazard activities - throughout the life-cycle of a c

A MDA-based Development Process for Collaborative ...
into account in a MDA approach for collaborative processes, in order to guar- antee that the ... standards based on Web Services Composition, such as BPEL (Business Process ..... esses, to express the global view and the parallel processing of the pa

Model-based DRC for design and process integration
Model-based DRC for design and process integration. Chi-Yuan Hung*a, .... provides the DBclassify function to perform this data reduction. It allows users to ...

Towards a Visualisation Process for Ontology-Based Conceptual ...
Towards a Visualisation Process for Ontology-Based Conceptual Modelling.pdf. Towards a Visualisation Process for Ontology-Based Conceptual Modelling.pdf.

Optimal Training Data Selection for Rule-based Data ...
affair employing domain experts, and hence only small .... free rules. A diverse set of difficult textual records are given to set of people making sure that each record is given to a ..... writes when presented with the 100 chosen patterns. A.

A Semantic-Based Ontology Matching Process for PDMS
and Ana Carolina Salgado1. 1 Federal University of ..... In: International Conference on Management of Data (SIGMOD), Software. Demonstration (2005). 7.