A MODEL SELECTION PROCEDURE IN MIXTURE-PROCESS EXPERIMENTS FOR INDUSTRIAL
               PROCESS OPTIMIZATION

Leão, Márcio Nascimento de Souza; Vieira, Antonio Fernando de Castro; Dal Bello, Luiz Henrique Abreu

doi:10.1590/0101-7438.2015.035.02.0377

Abstract

We present a model selection procedure for use in Mixture and Mixture-Process Experiments. Certain combinations of restrictions on the proportions of the mixture components can result in a very constrained experimental region. This results in collinearity among the covariates of the model, which can make it difficult to fit the model using the traditional method based on the significance of the coefficients. For this reason, a model selection methodology based on information criteria will be proposed for process optimization. Two examples are presented to illustrate this model selection procedure.

mixture experiments; process optimization; information criterion; multicollinearity

1 INTRODUCTION

Formulations obtained from Mixture Experiments (ME) are commonly found in the chemical, pharmaceutical, and food industries, as well as in other industrial segments. In those experiments, the decision variables are the proportions of the components in a mixture and the response is a variable that characterizes the quality of the product, assumed as a function of component proportion. In these experiments, the sum of component proportions is always equal to one. In certain industrial processes, there may be other variables, in addition to the mixture components, that affect the characteristics of the process and must be included in the experiment as factorial designs. Such experiments are called Mixture-Process Experiments (MPEs). Therefore, we intend to determine not only the optimal proportions of the mixture components but also the optimal levels of the process variables.

In MEs, it might be necessary to limit the proportion of one or more components that, for technical or practical reasons, cannot be present in all possible proportions. Those limitations of the components, which are very common in industrial cases, may be upper, lower, or a combination of both. Certain combinations of limitations on the proportions of the components may result in a very limited experimental region, which results in collinearity among the covariates of the model, making it difficult to fit the model using the traditional method based on the significance of the coefficients. Consequently, a model selection methodology based on information criteria will be proposed. In order to illustrate this methodology, two examples are used. Matlab^(r) routines were then written for the model selection and the process optimization.

^{Cornell (2002)}9 CORNELL JA . 2002. Experiments with Mixtures: Designs, Models and the Analysis of Mixture Data. Third edition, John Wiley and Sons, New York. is the main reference on ME, being the Chapter 7 dedicated to MPE cases. In it, a comprehensive and detailed exposition can be found. ^{Myers & Montgomery (2002)}23 MYERS RH & MONTGOMERY DC . 2002. Response Surface Methodology: Process and Product Optimization Using Designed Experiments. Second edition. John Wiley and Sons, New York. dedicate Chapters 12 and 13 to ME and MPE, thus comprising a good introduction to the topic. ^{Piepel (2004)}24 PIEPEL GF. 2004. 50 Years of mixture experiment research: 1955-2004, in KHURI AI . (Editor).Response Surface Methodology and Related Topics. World Scientific Publishing, Singapore, 283-327. summarizes a survey related to mixture experiments for a period of 50 years, ranging from 1955 to 2004. Prescott et al. (2002) propose a quadratic model as an alternative to the models traditionally used in ME (Scheffe models). ^{Cornell (2000)}8 CORNELL JA . 2000. Fitting a slack-variable model to mixture data: some questions raised. Journal of Quality Technology, 32(2): 133-147., (^{Cornell 2002}8 CORNELL JA . 2000. Fitting a slack-variable model to mixture data: some questions raised. Journal of Quality Technology, 32(2): 133-147., Chapter 6), ^{Cornell & Gorman (2003)}10 CORNELL JA & GORMAN JW. 2003. Two New Mixture Models: Living With Collinearity but Removing its Influence. Journal of Quality Technology, 35: 78-88. and ^{Khuri (2005)}21 KHURI AI. 2005. Slack Variable Models Versus Scheffés Mixture Models. Journal of Applied Statistics, 32: 887-908. carried out comparative studies between models that they named as slack-variable models and Scheff´e models. ^{Piepel
(2007)}25 PIEPEL GF . 2007. A Component Slope Linear Model for Mixture Experiments. Quality Technology & Quantitative Management, 4(3): 331-343. compares the CSLM (Component Slope Linear Model) with the SLM (Scheffé Linear Model) and the CLM (Cox Linear Model). They conclude that the models SLM, CLM and CSLM are mathematically equivalent and provide the same statistics for a given ME. The differences lie in the interpretations of their coefficients. ^{Dal Bello & Vieira (2011b)}12 DAL BELLO LHA & VIEIRA AFC. 2011a. Optimization of a product performance using mixture experiments. Journal of Applied Statistics, 38(8): 1701-1715. present a tutorial on mixture-process experiments.

^{Goos & Donev (2006)}17 GOOS P & DONEV AN. 2006. The D-Optimal Design of Blocked Experiments with MixtureComponents. Journal of Quality Technology, 38: 319-332. describe an algorithm to plan experiments in blocks involving mixtures. They show that, for restricted and unrestricted experimental regions, the resulting design of experiments is statistically more efficient than the options of experiments in blocks presented in the literature. ^{Goos & Donev (2007)}18 GOOS P & DONEV AN . 2007. Tailor-Made Split-Plot Designs for Mixture and Process Variables. Journal of Quality Technology, 39: 326-339. describe an algorithm to plan split-plot experiments in cases involving mixture and process variables. They use an optimization criterion for the choice of experimental points and show that it is preferable to spread the replications all over the experiment region, instead of concentrating them in central points.

^{Kowalski et al. (2002)}22 KOWALSKI SM, CORNELL JA& VINING GG. 2002. Split-Plot Designs and Estimation Methods for Mixture Experiments with Process Variables. Technometrics, 44: 72-79., ^{Prescott (2004)}26 PRESCOTT P. 2004. Modeling in Mixture Experiments Including Interactions with Process Variables. Quality Technology & Quantitative Management, 1(1): 87-103. and ^{Sahni et al.
(2009)}27 SAHNI NS, PIEPEL GF& NÆS T. 2009. Product and Process Improvement Using Mixture-Process Variable Methods and Optimization Techniques. Journal of Quality Technology, 41(2): 181-197. analyzed the MPE modeling. ^{Goldfarb et
al. (2004a)}15 GOLDFARB HB, BORROR CM, MONTGOMERY DC& ANDERSON-COOK CM . 2004a. Three-Dimensional Variance Dispersion Graphics for Mixture-Process Experiments. Journal of Quality Technology, 36: 109-124. propose the use of a plot method (variance dispersion plot) for MPE planning. The variance dispersion plot presents a visual way of assessing the variance properties of an MPE within the joint mixture and process area. That information may be used to select experiments with an acceptable variance profile.

^{Goldfarb et al. (2003)}14 GOLDFARB HB, BORROR CM& MONTGOMERY DC . 2003. Mixture-Process Variable Experiments with Noise Variables. Journal of Quality Technology, 35: 393-405., ^{Goldfarb et al. (2004b)}16 GOLDFARB HB, BORROR CM, MONTGOMERY DC& ANDERSON-COOK CM . 2004b. Evaluating Mixture-Process with Control and Noise Variables. Journal of Quality Technology, 36: 245-262. and ^{Chung et
al. (2007)}5 CHUNG PJ, GOLDFARB HB& MONTGOMERY DC . 2007. Optimal Designs for Mixture-Process Experiments with Control and Noise Variables. Journal of Quality Technology, 39: 179-190. consider the case where, in addition to the mixture components and process variables (controlled factors), there are uncontrolled factors in the productive process (noise variables), although they may be controlled in laboratory experiments. The authors address models that allow them to choose the controllable variable values (mixture and process) that make the process more robust in relation to the noise variables.

^{Dal Bello (2010)}11 DAL BELLO LHA. 2010. Modelagem em Experimentos Mistura-Processo para Otimização de Processos Industriais. Doctoral Thesis, PUC, Rio de Janeiro.^{Dal Bello & Vieira (2011a)}12 DAL BELLO LHA & VIEIRA AFC. 2011a. Optimization of a product performance using mixture experiments. Journal of Applied Statistics, 38(8): 1701-1715. present a methodology close to the spirit of this article.

A brief introduction to ME and MPE is presented in Sections 2 and 3. In Section 4, the information criteria used in this work are described and the models chosen according to those criteria are presented. In Section 5, we present a model selection methodology with two examples, and we apply this methodology in two other examples. The conclusions are in Section 6.

2 MIXTURE EXPERIMENTS

Consider x _i, the variables that represent the proportions of the q mixture components. Then:

In many MEs there are limitations on the component proportions, making the experimentalspace a sub-region of the original space. Therefore, upper and/or lower limits on the proportions are established, and are represented as follows:

where L _i is the lower limit and U _i is the upper limit of the component proportion i.

When the upper and lower limits on the proportions of one mixture are established, the experimental region is reduced to a sub-region of the original region. In these cases, the coordinates of the sub-regions may be redefined in terms of "pseudo"-components.

The models which are traditionally used in MEs are Scheffé's canonical polynomials (^{Scheffé, 1958}28 SCHEFFÉ H. 1958. Experiments with mixture. Journal of the Royal Statistical Society, B20: 344-366.). Scheffé's cubic model is as follows:

where the βs are the model's parameter coefficients. Note that this model does not have the intercept, as it is eliminated by a simplification originating from the basic limitation presented in Eq. (1).

3 MIXTURE-PROCESS EXPERIMENTS

An adequate model for r process variables z ₁, z ₂, ..., z _r involving second-order terms is:

where the δs are the model's parameter coefficients for process variables. The experiment for the process variables may be a factorial design with two or more levels. In order to include terms with the variable in the model, an experiment with at least three levels of each process variable and a total number of points sufficient to fit and test the model is required. In order to fit a model without the variable , considering only the main effects of the process variables and the interactions among them, only two levels of each variable are necessary.

We use the form of the simultaneous additive and multiplicative combined model, which includes Scheffé's cubic model for the mixture and the reduced quadratic model, considering only the main effects of the process variables and the interactions among them:

where the γ s are the parameters for the mixture's combined model including process variables and the δ s are the parameters for the process variables. The lower indexes of γ refer to mixture variables, whereas the upper ones refer to process variables. The lower indexes of δ refer to process variables.

4 INFORMATION CRITERIA AND MODEL SELECTION

An information criterion that has been widely used in model selection is Akaike's criterion (AIC) (^{Akaike,
1973}1 AKAIKE H. 1973. Information theory and an extension of the maximum likelihood principle, in Mehra RK & Csaki F. (editors), Second International Symposium on Information Theory. Akademiai Kiado, Budapest.).

where y _i is the i _th value of the response and is the estimate of y _i when a model of p parameters is fitted through maximization of the Log-Likelihood Function (LLF). The term added to LLF, called the penalty function, aims at correcting a bias originating from the comparison of models with different numbers of parameters. Among the several candidate models, the one with the lowest AIC value must be chosen. AIC was developed from Kullback-Leibler distance, which is a distance between the true model and the candidate model. ^{Burnham & Anderson (2002)}4 BURNHAM KP & ANDERSON DR. 2002. Model Selection and Multimodel Inference: A Practical Information-Theoretical Approach. Second edition, Springer, New York. recommend the use of AIC only when n/p ≥ 40. Considering a case of responses with normal distribution, the AIC expression may be simplified to give the following:

where is the maximum likelihood estimator of the error variance.

Considering responses with normal distribution and small samples (n/p<40), ^{Hurvich & Tsai (1989)}20 HURVICH CM & TSAI C-L 1989. Regression and time series model selection in small samples. Biometrika, 76: 297-307. developed the AICc criterion:

^{Burnham & Anderson (2002)}4 BURNHAM KP & ANDERSON DR. 2002. Model Selection and Multimodel Inference: A Practical Information-Theoretical Approach. Second edition, Springer, New York. recommend the calculation of AIC differences between the candidate models and the model with the lowest AlC _c value AlC _{c_min}.

The calculation methodology for AIC differences may also be used for AlC _c differences. Δ_i values can be interpreted easily and allow a quick comparison of candidate models. The higher the Δ_i, the less likely it is that the fitted model is the best model according to Kullback-Leibler distance. ^{Burnham & Anderson
(2002)}4 BURNHAM KP & ANDERSON DR. 2002. Model Selection and Multimodel Inference: A Practical Information-Theoretical Approach. Second edition, Springer, New York. affirm that models with Δ_i > 10 may be omitted in future considerations and models with Δ_i between 0 and 2 may be regarded as non-different. The calculation of AlC _c differences is used in the proposed methodology for model selection which will be used in Section 5.

5 PROPOSED METHODOLOGY

In the first stage of this methodology we use the full Scheffé's canonical polynomials for a ME and a combined model for a full MPE. Thus, we obtain all the candidate terms for the model under study. Then, we use the AlC _c criterion to select the model with the lowest AlC _c according to the number of parameters. Afterwards, we calculate the AlC _c differences between the candidate models and the model that has the lowest AlC _c and we select the non-different models.

Analyzing the non-different models, we choose the model, now named the Base Model, which has the lowest mean-squared error (MSE) and prediction error sum of squares (PRESS).

In the second stage of the methodology we obtain a better model, taking into account the Base Model terms and all terms which are equivalent to the terms of the Base Model. Such Equivalent Terms are created considering Eq. (1), which is the basic restriction of MEs. For example, the term x ₁ x ₂ is equivalent to term x ₁(1 - x ₁ - x ₃) or term (1 - x ₂ - x ₃)x ₂. After determination of all the candidate terms (terms of the Base Model and Equivalent Terms), we use the AlC _c criterion again in order to select the model with the lowest AlC _c. Afterwards, we calculate the AlC _c differences between the candidate models and model that has the lowest AlC _c and we select the non-different models.

Analyzing the non-different models, we choose the model with the lowest PRESS and MSE as the Final Model. The proposed methodology is illustrated through two examples.

5.1 Example 1

The problem of Example 1 was presented by ^{Myers &
Montgomery (2002)}23 MYERS RH & MONTGOMERY DC . 2002. Response Surface Methodology: Process and Product Optimization Using Designed Experiments. Second edition. John Wiley and Sons, New York.. An adhesive is being formulated for use in an aerospace application. The adhesive consists of a resin x ₁ and two crosslinkers, x ₂ and x ₃. The mixture constraints for these variables are x ₁ + x ₂ + x ₃ = 1; 0.70 ≤ x _i ≤ 0.90; 0.05 ≤ x ₂ ≤ 0.10; and 0.05 ≤ x ₂ ≤ 0.20.

The adhesive is applied to the components and then the entire assembly is cured for 12 h at controlled temperature and humidity. The temperature z ₁ and relative humidity z ₂ are process variables that can be controlled by the experimenter. The ranges of theses process variables that experimenters think are appropriate are 40ºF ≤ temperature ≤ 100ºF and 15% ≤ relative humidity ≤ 85%. The response variable of interest is the pulloff force required to separate the components after curing. It should exceed 40 pounds.

The authors use L-pseudocomponents according to the relation υ_i = ; i = 1,2, ..., q, where L = L _i and Table 1 presents the experiment.

Thumbnail

Table 1
The experiment of Example 1 with L-pseudocomponents for mixture components and coded for process variables.

Where

ŷ₁: Response obtained by ^{Myers &
Montgomery (2002)}23 MYERS RH & MONTGOMERY DC . 2002. Response Surface Methodology: Process and Product Optimization Using Designed Experiments. Second edition. John Wiley and Sons, New York. model.

ŷ₂: Response obtained by Final Model 1 in this article.

The model selected by ^{Myers & Montgomery
(2002)}23 MYERS RH & MONTGOMERY DC . 2002. Response Surface Methodology: Process and Product Optimization Using Designed Experiments. Second edition. John Wiley and Sons, New York. presented PRESS, MSE and AlC _c equal to 903.20, 15.45 and 122.57, respectively, and is shown in Eq. (11).

5.1.1 Model Selection Methodology

All the candidate terms for the MPE are the terms in Eq. (5). The model selected according to the AlC _c criterion was the following:

A Matlab^(r) routine was then written for the calculation and storage of AlC _c values and selection of non-different models considered, that is, those that presented AlC _c differences (Δ_i) between 0 and 2, as presented in Section 4. According to the AlC _c criterion, 23 models are considered non-different. The Model that presents the lowest PRESS (536.67) and MSE (10.44) was selected and it's now named as the Base Model.

The Base Model is shown in Eq. (13):

In this step of the methodology we will consider other models using the Base Model. For this, additional terms are generated from the terms of the Base Model. Table 2 presents the terms equivalent to the Base Model terms.

Thumbnail

Table 2
Equivalent Terms.

Once all the candidate terms (Base Model terms and Equivalent Terms) for the MPE model are known, we may then use the AlCc criterion again. The model selected is shown in Eq. (14). This model presents PRESS and MSE equal to 586.66 and 11.12, respectively.

This model presents higher MSE and PRESS than the model of Eq. (13). However, we will analyze models considered non-different to the model of Eq. (14), as described at the start in Section 5.1.1. The Model that presents the lowest PRESS (515.43) and MSE (10.13) was selected and now, it should be Final Model 1. The Final Model 1 is shown in Eq. (15):

Examining the model obtained by ^{Myers &
Montgomery (2002)}23 MYERS RH & MONTGOMERY DC . 2002. Response Surface Methodology: Process and Product Optimization Using Designed Experiments. Second edition. John Wiley and Sons, New York. and Final Model 1 obtained in this article, we observe that the application of the methodology led to a decrease of 43.93% in PRESS (from 903.20 to 515.43) and a decrease of 34.43% in the MSE (from 15.45 to 10.13) and kept the number of parameters of the Final Model 1 (12 parameters).

Table 3 shows the t-Student test for the Final Model 1.

Thumbnail

Table 3
Final Model 1 Test.

5.1.2 Response Optimization

In the Example 1, a response exceeding 40 pounds is desirable. Several formulations may result in a future response prediction greater than 40 pounds. Consequently, a desirable objective is to maximize the expected value for a future response.

The estimation vector for the coefficients is = ( ( W ' W ) -1 W ' y, the variance-covariance matrix is var( ) = σ 2 ( W ' W ) -1, where W is a matrix (n × p) whose elements are the mixture components proportion (x _i), the levels of the process variables (z _i) and functions of x _i and z _i (such as interactions), where p is the number of parameters and n the number of observations.

The general combined model with the inclusion of process variables is represented in a matrix form as

For n observations, y is the vector (n × 1) of observations, β is the vector (p × 1) of coefficients and ε is the vector (n × 1) of random errors. In the classical linear model, ε is considered with multivariate normal distribution, i.e. ε ~ N (0, Iσ 2 ). The estimated mean response at point w (w ' is a matrix line W) is and its variance is

and its variance is

The problem may then be formulated as follows:

max E[ŷ(w)] = w '

subject to:

υ₁ + υ₂ + υ₃ =1; 0 ≤ υ₁ ≤ 1; 0 ≤ υ₂ ≤ 0.25; 0 ≤ υ₃ ≤ 0.75; -1 ≤ z1 ≤ 1; -1 ≤ z2 ≤ 1.

Using a search routine in Matlab^(r), the solution for the problem formulated above was found, considering the model obtained by ^{Myers & Montgomery (2002)}23 MYERS RH & MONTGOMERY DC . 2002. Response Surface Methodology: Process and Product Optimization Using Designed Experiments. Second edition. John Wiley and Sons, New York. and Final Model 1 obtained in this article. Table 4 presents the optimal values for the components proportions, in L-pseudo components (v _i) and in actual values (x _i) and the optimal values of the process variables, in coded variables (z _i) and in actual values (ºF and RH, respectively).

Thumbnail

Table 4
Solution for the maximization problem of example 1.

Table 5 compares the PRESS, MSE, AlC _c, the response prediction and the variance of a new response for both models. Analyzing the Table 4, we observe that the model obtained in this article presents lower value for PRESS, MSE and AlC _c, emphasizing that was obtained a higher response prediction with a lower variance of a new response.

Thumbnail

Table 5
Comparison of two models.

5.1.3 Model Adequacy

The use of studentized residuals to check the normality is recommended by ^{Myers & Montgomery (2002)}23 MYERS RH & MONTGOMERY DC . 2002. Response Surface Methodology: Process and Product Optimization Using Designed Experiments. Second edition. John Wiley and Sons, New York.. The studentized residuals (r _i) are defined as follows:

where e _i = y _i - ŷ _i and h _ii are elements of the hat matrix diagonals H = W ( W ' W ) -1 W '.

Figure 1 and 2 present the diagnosis plots used to check the adequacy of the Final Model 1 (a) and the model obtained by Myers & Montgomery (b).

Figure 1
Normal probability plot of the studentized residuals of Example 1.

Figure 2
Plot of studentized residuals versus fitted values of Example 1.

In the normal probability plots of the studentized residuals shown in Figure 1, we may observe that there isn't indication that the normality assumption should not be accepted, as there aren't points way off the alignment.

In order to check the additivity of the models regarding the linear model, there are the plots of studentized residuals versus fitted values, shown in Figure 2.

The residuals shown in the plot from Figure 2 are randomly distributed around zero. Therefore, the adequacy of Final Model 1 (a) and the model obtained by Myers and Montgomery (b) were checked.

The fitted values shown in the plot from Figure 3 are randomly distributed around actual values. Therefore, the adequacy of Final Model 1 (a) and the model obtained by Myers & Montgomery (b) were checked.

Figure 3
Plot of fitted values versus actual values of Example 1.

5.2 Example 2

The problem of Example 2 was presented by ^{Cornell
(2000)}8 CORNELL JA . 2000. Fitting a slack-variable model to mixture data: some questions raised. Journal of Quality Technology, 32(2): 133-147. and ^{Myers & Montgomery
(2002)}23 MYERS RH & MONTGOMERY DC . 2002. Response Surface Methodology: Process and Product Optimization Using Designed Experiments. Second edition. John Wiley and Sons, New York.. Lauryl sulfate (A), cocamide (B), and lauramide (C) were ingredients in a shampoo whose proportionate values were varied in an experiment that was designed to study how shampoo foam height was functionally related to composition. The three ingredients made up 50% of the shampoo, while the other constituents, which were held fixed in all blends, were water, perfume, and coloring agents.

Upper and lower bound constraints were placed on the ingredient or component proportions in the form 0.20 ≤ A ≤ 0.30, 0.07 ≤ B ≤ 0.10, and 0.13 ≤ C ≤ 0.20, where A + B + C = 0.5. The lower and upper bound constraints, when converted to the mixture components constraints in Eq. (1), are rescaled as 0.40 ≤ xi ≤ 0.60, 0.14 ≤ x2 ≤ 0.20, and 0.26 ≤ x3 ≤ 0.40. The experimenter's objective was to formulate a product with foam height in excess of 170 mm. The authors use L-pseudocomponents and Table 6 presents the experiment.

Thumbnail

Table 6
The experiment of Example 2 with L-pseudocomponents for mixture components.

The models selected by ^{Cornell (2000)}8 CORNELL JA . 2000. Fitting a slack-variable model to mixture data: some questions raised. Journal of Quality Technology, 32(2): 133-147. and ^{Myers & Montgomery (2002)}23 MYERS RH & MONTGOMERY DC . 2002. Response Surface Methodology: Process and Product Optimization Using Designed Experiments. Second edition. John Wiley and Sons, New York. both presented PRESS, MSE and AlCc equal to 657.08, 25.14 and 83.87, respectively, and the model by ^{Cornell (2000)}8 CORNELL JA . 2000. Fitting a slack-variable model to mixture data: some questions raised. Journal of Quality Technology, 32(2): 133-147. is shown in Eq. (20), while that by ^{Myers & Montgomery (2002)}23 MYERS RH & MONTGOMERY DC . 2002. Response Surface Methodology: Process and Product Optimization Using Designed Experiments. Second edition. John Wiley and Sons, New York. is shown in Eq. (21).

Where

ŷ1: Response obtained by ^{Cornell (2000)}8 CORNELL JA . 2000. Fitting a slack-variable model to mixture data: some questions raised. Journal of Quality Technology, 32(2): 133-147. and ^{Myers & Montgomery (2002)}23 MYERS RH & MONTGOMERY DC . 2002. Response Surface Methodology: Process and Product Optimization Using Designed Experiments. Second edition. John Wiley and Sons, New York. models.

ŷ2: Response obtained by Final Model 2 in this article.

5.2.1 Model Selection

All the candidate terms for the ME are the terms in Eq. (3). The model selected according to the AlCc criterion was the following:

Then, four models are considered non-different. The Model that presents the lowest PRESS (413.22) and MSE (17.09) was selected and named as the Base Model. The Base Model is shown in Eq. (22).

In this step of the methodology we will consider other models using the Base Model. For this, additional terms are generated from the terms of the Base Model. Table 7 presents the equivalent terms to the Base Model terms.

Thumbnail

Table 7
Equivalent Terms.

Once all the candidate terms (Base Model terms and Equivalent Terms) for the ME model are known, we may then use the AlCc criterion again. The model selected is shown in Eq. (23). This model presents PRESS and MSE equal to 348.73 and 16.61, respectively.

We will analyze models considered not different to the model of Eq. (23), as described at the start in Section 5.1.1. The Model that presents the lowest PRESS (348.73) and MSE (16.61) was selected and now, it should be Final Model 2.

Table 8 shows the t-Student test for the Final Model 2.

Thumbnail

Table 8
Final Model 2 Test.

Comparing the models obtained by ^{Cornell (2000)}8 CORNELL JA . 2000. Fitting a slack-variable model to mixture data: some questions raised. Journal of Quality Technology, 32(2): 133-147. and ^{Myers & Montgomery (2002)}23 MYERS RH & MONTGOMERY DC . 2002. Response Surface Methodology: Process and Product Optimization Using Designed Experiments. Second edition. John Wiley and Sons, New York. and Final Model 2 obtained in this article, we observed that the application of the methodology led to a decrease of 46.93% in PRESS (from 657.08 to 348.73) and decrease of 33.93% in the MSE (from 25.14 to 16.61). It should be emphasized that Final Model 2 has five parameters and the models presented by ^{Cornell
(2000)}8 CORNELL JA . 2000. Fitting a slack-variable model to mixture data: some questions raised. Journal of Quality Technology, 32(2): 133-147. and ^{Myers & Montgomery
(2002)}23 MYERS RH & MONTGOMERY DC . 2002. Response Surface Methodology: Process and Product Optimization Using Designed Experiments. Second edition. John Wiley and Sons, New York. have seven parameters.

5.2.2 Response Optimization

In the Example 2, a response of exceeding 170 mm is desirable. Several formulations may result in future a response prediction greater than 170 mm. Consequently, a desirable objective is to maximize the expected value for a future response.

The problem may then be formulated as follows:

max E[ŷ(w)] = w'

subject to:

υ₁ + υ₂ + υ₃ =1; 0 ≤ υ₁ ≤ 1; 0 ≤ υ₂ ≤ 0.3; 0 ≤ υ₃ ≤ 0.7;

Table 9 presents the optimal values for the components proportions, in L-pseudo components (υ_i) and in actual values (A, B and C). Table 10 compares the PRESS, MSE, AlC _c, the response prediction and the variance of a new response for three models. Analyzing the Table 9, we observe that the model obtained in this article presents lower value for PRESS, MSE and AlC _c, emphasizing that was obtained a higher response prediction with a lower variance of a new response.

Thumbnail

Table 9
Solution for the maximization problem of Example 2.

Thumbnail

Table 10
Comparison of three models.

5.2.3 Model Adequacy

Figures 4 and 5 present the diagnosis plots used to check the adequacy of the Final Model 2 (a) and the models obtained by Cornell and, Myers & Montgomery (b).

Figure 4
Normal probability plot of the studentized residuals of Example 2.

Figure 5
Plot of studentized residuals versus fitted values of Example 2.

In the normal probability plot of the studentized residuals shown in Figure 4, we may observe that there isn't indication that the normality assumption should not be accepted, as there aren't points way off the alignment.

In order to check the additivity of the model regarding the linear model, there is the plot of studentized residuals versus fitted values, shown in Figure 5.

The residuals shown in the plot from Figure 5 are randomly distributed around zero. Therefore, the adequacy of Final Model 2 (a) and the models obtained by Cornell and, Myers & Montgomery (b) were checked.

The fitted values shown in the plot from Figure 6 are randomly distributed around actual values. Therefore, the adequacy of Final Model 2 (a) and the model obtained by Cornell and, Myers & Montgomery (b) were checked.

Figure 6
Plot of fitted values versus actual values of Example 2.

Below we present the steps of the methodology proposed in this article:

Step 1: Choose a full model based on Scheffé's canonical polynomials for cases of ME, shown in Eq. (3), or choose a full combined model for cases of MPE, shown in Eq. (5), to obtain all the candidate terms.

Step 2: Use the AlC _c criterion and select the model that provides the lowest AlC _c, accordingto the number of parameters.

Step 3: Calculate the AlC _c differences between the candidate models and the model thatprovides the lowest AlC _c and select the non-different models.

Step 4: Analyze the non-different models, and choose the model that provides the lowest MSE and PRESS. This model is now named as the Base Model.

Step 5: Determine all terms that are equivalent to the Base Model terms, and create all the candidate terms.

Step 6: Use the AlC _c criterion again, and select the model that provides the lowest AlC _c according to the number of parameters.

Step 7: Calculate the AlC _c differences between the candidate models and the model that provides the lowest AlC _c and select the non-different models again.

Step 8: Analyze the non-different models, and choose the Final Model that provides the lowest MSE and PRESS.

6 SIMULATION STUDY

A small simulation study is now presented once we know the true model leading to the responses. Considering the experiment of the Table 11 and the true model shown in Eq. (24), we developed a routine in Matlab^(r) to generate the normal responses for this experiment and to select the model using the proposed methodology.

where ε is the random errors with normal distribution, i.e. ε ~ N(0, σ²⁾.

Thumbnail

Table 11
The experiment of the Simulation Study.

In the simulation study we generated 1,000 experiments with the model shown in Eq. (24), considering σ equal to 0.5 up to 1.5. Using the proposed methodology, the Table 12 shows the identified models with the same terms of the model shown in Eq. (24).

Thumbnail

Table 12
σ versus Identified Models.

7 CONCLUSIONS

In this article, the statistical techniques necessary for the planning and analysis of mixtureexperiments with or without process variables were gathered and a methodology for selecting models in MPE and ME was presented with two examples.

The use of Information Theory constituted an evolution in ME and MPE. Multicollinearity may cause the estimators of model coefficients to be instable and very inflated. Therefore, certain terms of the model may be significant in the presence of some terms and not significant in the presence of other terms. In this context, stepwise forward and backward selection may result in arbitrary selection of variables that belong to the model (^{Harrell, 2001}19 HARRELL FE JR. 2001. Regression Modeling Strategies with Applications to Linear Models, Logistic Regression, and Survival Analysis. Springer-Verlag, New York.). An alternative technique was to consider all possible combinations of terms in the full model and the number of parameters and to use selection criteria for models based on Information Theory. From the results obtained in this article, we concluded that the use of the AICc information criterion may result in lower PRESS and MSE.

Finally, a model selection methodology in ME and MPE was presented. In the first stage of the methodology, a Base Model was fitted using the AICc criterion. In the following stage, a better model was obtained, taking into account, besides the Base Model terms, all the Equivalent Terms of the Base Model, also using the AICc criterion for the selection of the proposed model terms. We may then conclude that the second stage of the methodology provided models that were better than the Base Model and also better than the models obtained previously.

¹
AKAIKE H. 1973. Information theory and an extension of the maximum likelihood principle, in Mehra RK & Csaki F. (editors), Second International Symposium on Information Theory. Akademiai Kiado, Budapest.
²
ANDERSON-COOK CM, GOLDFARB HB, BORROR CM, MONTGOMERY DC, CANTER KG & TWIST JN. 2004. Mixture and mixture-process variable experiments for pharmaceutical applications. Pharmaceutical Statistics, 3(4): 247-260.
³
BORROR CM, MONTGOMERY DC& MYERS RH. 2002. Evaluation of statistical designs for experiments involving noise variables. Journal of Quality Technology, 34(1): 54-70.
⁴
BURNHAM KP & ANDERSON DR. 2002. Model Selection and Multimodel Inference: A Practical Information-Theoretical Approach. Second edition, Springer, New York.
⁵
CHUNG PJ, GOLDFARB HB& MONTGOMERY DC . 2007. Optimal Designs for Mixture-Process Experiments with Control and Noise Variables. Journal of Quality Technology, 39: 179-190.
⁶
CHUNG PJ, GOLDFARB HB, MONTGOMERY DC& BORROR CM . 2009. Optimal Designs for Mixture-Process Experiments Involving Continuous and Categorical Noise Variables. Quality Technology & Quantitative Management, 6(4): 451-470.
⁷
CORNELL JA. 1995. Fitting models to data from mixture experiments containing other factors. Journal of Quality Technology, 27(1): 13-33.
⁸
CORNELL JA . 2000. Fitting a slack-variable model to mixture data: some questions raised. Journal of Quality Technology, 32(2): 133-147.
⁹
CORNELL JA . 2002. Experiments with Mixtures: Designs, Models and the Analysis of Mixture Data. Third edition, John Wiley and Sons, New York.
¹⁰
CORNELL JA & GORMAN JW. 2003. Two New Mixture Models: Living With Collinearity but Removing its Influence. Journal of Quality Technology, 35: 78-88.
¹¹
DAL BELLO LHA. 2010. Modelagem em Experimentos Mistura-Processo para Otimização de Processos Industriais. Doctoral Thesis, PUC, Rio de Janeiro.
¹²
DAL BELLO LHA & VIEIRA AFC. 2011a. Optimization of a product performance using mixture experiments. Journal of Applied Statistics, 38(8): 1701-1715.
¹³
DAL BELLO LHA & VIEIRA AFC . 2011b. Tutorial for mixture-process experiments with an industrial application. Pesquisa Operacional, 31(3): 1-21.
¹⁴
GOLDFARB HB, BORROR CM& MONTGOMERY DC . 2003. Mixture-Process Variable Experiments with Noise Variables. Journal of Quality Technology, 35: 393-405.
¹⁵
GOLDFARB HB, BORROR CM, MONTGOMERY DC& ANDERSON-COOK CM . 2004a. Three-Dimensional Variance Dispersion Graphics for Mixture-Process Experiments. Journal of Quality Technology, 36: 109-124.
¹⁶
GOLDFARB HB, BORROR CM, MONTGOMERY DC& ANDERSON-COOK CM . 2004b. Evaluating Mixture-Process with Control and Noise Variables. Journal of Quality Technology, 36: 245-262.
¹⁷
GOOS P & DONEV AN. 2006. The D-Optimal Design of Blocked Experiments with MixtureComponents. Journal of Quality Technology, 38: 319-332.
¹⁸
GOOS P & DONEV AN . 2007. Tailor-Made Split-Plot Designs for Mixture and Process Variables. Journal of Quality Technology, 39: 326-339.
¹⁹
HARRELL FE JR. 2001. Regression Modeling Strategies with Applications to Linear Models, Logistic Regression, and Survival Analysis. Springer-Verlag, New York.
²⁰
HURVICH CM & TSAI C-L 1989. Regression and time series model selection in small samples. Biometrika, 76: 297-307.
²¹
KHURI AI. 2005. Slack Variable Models Versus Scheffés Mixture Models. Journal of Applied Statistics, 32: 887-908.
²²
KOWALSKI SM, CORNELL JA& VINING GG. 2002. Split-Plot Designs and Estimation Methods for Mixture Experiments with Process Variables. Technometrics, 44: 72-79.
²³
MYERS RH & MONTGOMERY DC . 2002. Response Surface Methodology: Process and Product Optimization Using Designed Experiments. Second edition. John Wiley and Sons, New York.
²⁴
PIEPEL GF. 2004. 50 Years of mixture experiment research: 1955-2004, in KHURI AI . (Editor).Response Surface Methodology and Related Topics. World Scientific Publishing, Singapore, 283-327.
²⁵
PIEPEL GF . 2007. A Component Slope Linear Model for Mixture Experiments. Quality Technology & Quantitative Management, 4(3): 331-343.
²⁶
PRESCOTT P. 2004. Modeling in Mixture Experiments Including Interactions with Process Variables. Quality Technology & Quantitative Management, 1(1): 87-103.
²⁷
SAHNI NS, PIEPEL GF& NÆS T. 2009. Product and Process Improvement Using Mixture-Process Variable Methods and Optimization Techniques. Journal of Quality Technology, 41(2): 181-197.
²⁸
SCHEFFÉ H. 1958. Experiments with mixture. Journal of the Royal Statistical Society, B20: 344-366.

Publication Dates

Publication in this collection
May-Aug 2015

History

Received
19 Sept 2012
Accepted
27 Sept 2014

This is an open-access article distributed under the terms of the Creative Commons Attribution License

[1] ¹
AKAIKE H. 1973. Information theory and an extension of the maximum likelihood principle, in Mehra RK & Csaki F. (editors), Second International Symposium on Information Theory. Akademiai Kiado, Budapest.

[2] ²
ANDERSON-COOK CM, GOLDFARB HB, BORROR CM, MONTGOMERY DC, CANTER KG & TWIST JN. 2004. Mixture and mixture-process variable experiments for pharmaceutical applications. Pharmaceutical Statistics, 3(4): 247-260.

[3] ³
BORROR CM, MONTGOMERY DC& MYERS RH. 2002. Evaluation of statistical designs for experiments involving noise variables. Journal of Quality Technology, 34(1): 54-70.

[4] ⁴
BURNHAM KP & ANDERSON DR. 2002. Model Selection and Multimodel Inference: A Practical Information-Theoretical Approach. Second edition, Springer, New York.

[5] ⁵
CHUNG PJ, GOLDFARB HB& MONTGOMERY DC . 2007. Optimal Designs for Mixture-Process Experiments with Control and Noise Variables. Journal of Quality Technology, 39: 179-190.

[6] ⁶
CHUNG PJ, GOLDFARB HB, MONTGOMERY DC& BORROR CM . 2009. Optimal Designs for Mixture-Process Experiments Involving Continuous and Categorical Noise Variables. Quality Technology & Quantitative Management, 6(4): 451-470.

[7] ⁷
CORNELL JA. 1995. Fitting models to data from mixture experiments containing other factors. Journal of Quality Technology, 27(1): 13-33.

[8] ⁸
CORNELL JA . 2000. Fitting a slack-variable model to mixture data: some questions raised. Journal of Quality Technology, 32(2): 133-147.

[9] ⁹
CORNELL JA . 2002. Experiments with Mixtures: Designs, Models and the Analysis of Mixture Data. Third edition, John Wiley and Sons, New York.

[10] ¹⁰
CORNELL JA & GORMAN JW. 2003. Two New Mixture Models: Living With Collinearity but Removing its Influence. Journal of Quality Technology, 35: 78-88.

[11] ¹¹
DAL BELLO LHA. 2010. Modelagem em Experimentos Mistura-Processo para Otimização de Processos Industriais. Doctoral Thesis, PUC, Rio de Janeiro.

[12] ¹²
DAL BELLO LHA & VIEIRA AFC. 2011a. Optimization of a product performance using mixture experiments. Journal of Applied Statistics, 38(8): 1701-1715.

[13] ¹³
DAL BELLO LHA & VIEIRA AFC . 2011b. Tutorial for mixture-process experiments with an industrial application. Pesquisa Operacional, 31(3): 1-21.

[14] ¹⁴
GOLDFARB HB, BORROR CM& MONTGOMERY DC . 2003. Mixture-Process Variable Experiments with Noise Variables. Journal of Quality Technology, 35: 393-405.

[15] ¹⁵
GOLDFARB HB, BORROR CM, MONTGOMERY DC& ANDERSON-COOK CM . 2004a. Three-Dimensional Variance Dispersion Graphics for Mixture-Process Experiments. Journal of Quality Technology, 36: 109-124.

[16] ¹⁶
GOLDFARB HB, BORROR CM, MONTGOMERY DC& ANDERSON-COOK CM . 2004b. Evaluating Mixture-Process with Control and Noise Variables. Journal of Quality Technology, 36: 245-262.

[17] ¹⁷
GOOS P & DONEV AN. 2006. The D-Optimal Design of Blocked Experiments with MixtureComponents. Journal of Quality Technology, 38: 319-332.

[18] ¹⁸
GOOS P & DONEV AN . 2007. Tailor-Made Split-Plot Designs for Mixture and Process Variables. Journal of Quality Technology, 39: 326-339.

[19] ¹⁹
HARRELL FE JR. 2001. Regression Modeling Strategies with Applications to Linear Models, Logistic Regression, and Survival Analysis. Springer-Verlag, New York.

[20] ²⁰
HURVICH CM & TSAI C-L 1989. Regression and time series model selection in small samples. Biometrika, 76: 297-307.

[21] ²¹
KHURI AI. 2005. Slack Variable Models Versus Scheffés Mixture Models. Journal of Applied Statistics, 32: 887-908.

[22] ²²
KOWALSKI SM, CORNELL JA& VINING GG. 2002. Split-Plot Designs and Estimation Methods for Mixture Experiments with Process Variables. Technometrics, 44: 72-79.

[23] ²³
MYERS RH & MONTGOMERY DC . 2002. Response Surface Methodology: Process and Product Optimization Using Designed Experiments. Second edition. John Wiley and Sons, New York.

[24] ²⁴
PIEPEL GF. 2004. 50 Years of mixture experiment research: 1955-2004, in KHURI AI . (Editor).Response Surface Methodology and Related Topics. World Scientific Publishing, Singapore, 283-327.

[25] ²⁵
PIEPEL GF . 2007. A Component Slope Linear Model for Mixture Experiments. Quality Technology & Quantitative Management, 4(3): 331-343.

[26] ²⁶
PRESCOTT P. 2004. Modeling in Mixture Experiments Including Interactions with Process Variables. Quality Technology & Quantitative Management, 1(1): 87-103.

[27] ²⁷
SAHNI NS, PIEPEL GF& NÆS T. 2009. Product and Process Improvement Using Mixture-Process Variable Methods and Optimization Techniques. Journal of Quality Technology, 41(2): 181-197.

[28] ²⁸
SCHEFFÉ H. 1958. Experiments with mixture. Journal of the Royal Statistical Society, B20: 344-366.

Std	Run	υ₁	υ₂	υ₃	z₁	z₂	Force (lb)	ŷ₁	ŷ₂
1	26	1.000	0.000	0.000	-1.000	-1.000	44	46.830	42.410
2	6	1.000	0.000	0.000	1.000	-1.000	70	65.470	71.010
3	21	1.000	0.000	0.000	-1.000	1.000	19	15.850	17.250
4	8	1.000	0.000	0.000	1.000	1.000	33	34.490	31.490
5	5	0.000	0.250	0.750	-1.000	-1.000	48	45.958	48.273
6	2	0.000	0.250	0.750	1.000	-1.000	72	75.963	74.638
7	30	0.000	0.250	0.750	-1.000	1.000	32	33.153	32.358
8	24	0.000	0.250	0.750	1.000	1.000	59	55.358	58.723
9	17	0.250	0.000	0.750	-1.000	-1.000	32	35.743	33.245
10	23	0.250	0.000	0.750	1.000	-1.000	58	59.719	56.460
11	33	0.250	0.000	0.750	-1.000	1.000	21	22.283	19.125
12	9	0.250	0.000	0.750	1.000	1.000	38	41.459	38.750
13	11	0.750	0.250	0.000	-1.000	-1.000	51	50.675	51.434
14	20	0.750	0.250	0.000	1.000	-1.000	76	79.615	74.956
15	19	0.750	0.250	0.000	-1.000	1.000	22	17.350	16.251
16	22	0.750	0.250	0.000	1.000	1.000	49	46.290	45.459
17	10	0.500	0.125	0.375	-1.000	1.000	17	15.132	16.087
18	16	0.125	0.125	0.750	1.000	-1.000	69	67.841	67.978
19	32	0.125	0.125	0.750	-1.000	-1.000	40	39.350	38.329
20	3	0.375	0.250	0.375	-1.000	-1.000	37	41.289	36.113
21	7	0.500	0.125	0.375	1.000	1.000	46	45.826	43.705
22	29	0.375	0.250	0.375	-1.000	1.000	21	18.224	20.338
23	27	0.375	0.250	0.375	1.000	-1.000	82	76.215	80.948
24	34	0.375	0.250	0.375	1.000	1.000	43	49.250	48.469
25	15	0.625	0.000	0.375	-1.000	-1.000	32	32.760	36.563
26	1	0.625	0.000	0.375	1.000	-1.000	60	61.021	62.470
27	28	0.750	0.063	0.187	-1.000	1.000	14	13.160	15.602
28	4	0.750	0.63	0.187	1.000	1.000	38	38.155	39.956
29	14	0.626	0.187	0.187	-1.000	-1.000	45	42.680	43.160
30	13	0.375	0.250	0.375	-1.000	1.000	18	18.224	20.338
31	18	0.125	0.125	0.750	1.000	-1.000	70	67.841	67.978
32	25	0.750	0.250	0.000	-1.000	1.000	10	28.350	16.251
33	12	0.375	0.250	0.375	1.000	1.000	52	49.250	48.469
34	31	0.750	0.250	0.000	-1.000	1.000	42	46.290	45.459

Model	υ₁ (x1)	υ₂ (x2)	υ₃ (x3)	z₁ (T(ºF))	z₂ (RH(%))
Myers and Montgomery	0.7400 (08480)	0.2600 (0.1020)	0.000 (0.500)	1.0 (100.0)	-1.0 (15.0)
Final Model 1	0.4638 (0.7928)	0.2600 (0.1020)	0.2762 (0.1052)	1.0 (100.0)	-1.0 (15.0)

Std	Run	υ1	υ2	υ3	Height (mm)	ŷ1	ŷ2
1	11	1.000	0.000	0.000	125.0	146.740	146.670
2	12	1.000	0.000	0.000	140.0	146.740	146.670
3	3	0.700	0.300	0.000	150.0	148.162	147.140
4	6	0.700	0.300	0.000	145.0	148.162	147.140.
5	5	0.000	0.300	0.700	141.0	139.428	138.600
6	2	0.000	0.300	0.700	138.0	139.428	138.600
7	10	0.300	0.000	0.700	153.0	148.926	149.484
8	4	0.300	0.000	0.700	147.0	148.926	149.484
9	8	0.850	0.150	0.000	165.0	164.223	164.107
10	7	0.650	0.000	0.350	170.0	170.276	169.052
11	1	0.350	0.300	0.350	148.0	146.945	150.297
12	13	0.750	0.075	0.175	175.0	171.203	173.145
13	9	0.400	0.075	0.525	163.0	167.831	166.273

Model	υ₁ (A)	υ₂ (B)	υ₃ (C)
Cornell, Myers & Montgomery	0.6040 (0.2604)	0.0855 (0.0786)	0.3105 (0.1611)
Final Model 2	0.6103 (0.2610)	0.1397 (0.0840)	0.2500 (0.1550)

Std	υ₁	υ₂	υ₃
1	1.000	0.000	0.000
2	1.000	0.000	0.000
3	0.700	0.300	0.000
4	0.700	0.300	0.000
5	0.000	0.300	0.700
6	0.000	0.300	0.700
7	0.300	0.000	0.700
8	0.300	0.000	0.700
9	0.850	0.150	0.000
10	0.650	0.000	0.350
11	0.350	0.300	0.350
12	0.750	0.075	0.175
13	0.400	0.075	0.525

Brasil

Brasil

A MODEL SELECTION PROCEDURE IN MIXTURE-PROCESS EXPERIMENTS FOR INDUSTRIAL PROCESS OPTIMIZATION

Abstract

1 INTRODUCTION

2 MIXTURE EXPERIMENTS

3 MIXTURE-PROCESS EXPERIMENTS

4 INFORMATION CRITERIA AND MODEL SELECTION

5 PROPOSED METHODOLOGY

5.1 Example 1

5.1.1 Model Selection Methodology

5.1.2 Response Optimization

5.1.3 Model Adequacy

5.2 Example 2

5.2.1 Model Selection

5.2.2 Response Optimization

5.2.3 Model Adequacy

6 SIMULATION STUDY

7 CONCLUSIONS

Publication Dates

History

Model	PRESS	MSE	AlCc	var[ŷ(w)]	E[ŷ(w)]
Myers & Montgomery	903.20	15.45	122.57	21.5172	80.1773
Final Model 1	515.43	10.13	108.13	16.0093	82.3177

σ	Identified Models
0.5	1.000
0.6	1.000
0.7	1.000
0.8	1.000
0.9	998
1.0	994
1.1	980
1.2	962
1.3	952
1.4	932
1.5	926