A New Approach for Multivariate Data Analysis in Interlaboratory Comparisons Based on Multidimensional Scaling and Robust Confidence Ellipse

Leal, Luiz Henrique C.; Rocha, Werickson F. C.

doi:10.21577/0103-5053.20220121

Abstract

Interlaboratory comparisons (IC) present a challenge related to multivariate data analysis. ISO 13528:2015 is a reference document for interlaboratory comparisons. This standard does not describe statistical methods for multivariate analysis and, to the best of our knowledge, no practical guidance is available for organizing and evaluating multivariate data in interlaboratory comparisons. For this reason, some researchers have developed methodologies that make it possible to analyze multivariate data in IC. Generally, these approaches are based on dimensionality-reduction methods such as principal component analysis. This paper proposes a new approach to reduce the dimensionality of large data sets and check the performance of laboratories based on multidimensional scaling (MDS) and a robust confidence ellipse/ellipsoid (RCE). MDS is a multivariate analysis technique that allows laboratories to be grouped according to their similarity in a Euclidean space. RCE, in turn, is a statistical method for outlier detection in a multivariate data set. In this work, MDS and RCE are combined to evaluate laboratory proficiency in interlaboratory comparisons. The methodology was compared with the multivariate z-score, and both methodologies identified the same outlying laboratories. This preliminary result indicates that MDS/RCE is promising for classifying IC results.

Keywords:
multidimensional scaling; robust confidence ellipse; interlaboratory comparisons; multivariate analysis

Introduction

Interlaboratory comparison (IC) seeks to assess the performance of a given measurement method. To organize an IC, the same or similar test items are sent to each participant under predetermined conditions.¹ [1. ISO 13528: Statistical Methods for Use in Proficiency Testing by Interlaboratory Comparisons, ISO: Geneva, 2015.] In some cases, the results reported by participants are multivariate, and univariate techniques are not suitable for analyzing this kind of data.² [2. Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10.] In this situation, multivariate analysis techniques can be used to investigate the inherent structure of the data without losing valuable measurement information.

Multivariate analysis, in general, refers to all statistical methods that simultaneously analyze multiple measurements on each individual or object (in this case, each laboratory).³ [3. Hair Jr., J. F.; Black, W. C.; Babin, B. J.; Anderson, R. E.; Multivariate Data Analysis, 7th ed.; Prentice Hall: New Jersey, USA, 2009.] The official document for interlaboratory comparison, ISO 13528:2015,¹ does not describe methods for multivariate analysis. Because of this gap, researchers have proposed several ways to perform multivariate data analysis in interlaboratory comparisons. Sheen et al.² proposed a multivariate z-score to identify outlying laboratory results.

In the methodology proposed by Sheen et al.,² the spectra are grouped so that each cluster consists of the multiple nuclear magnetic resonance (NMR) spectra of the k-th sample provided by the participants. From these clusters, the interspectral distance matrix D_k is calculated, whose elements are the distances D_ij,k = d(s_i,k, s_j,k), where s_i,k is the spectrum of laboratory i, s_j,k is the spectrum of laboratory j (both belonging to the conglomerate S_k) and d(·) is a multivariate distance measure. The authors suggest the Euclidean, Mahalanobis, Hellinger, Kullback-Leibler, Jensen-Shannon and Jeffreys distances.²

Based on the values D_ij,k, the average distance $\hat{D}_{i, k} = (1 / n) \sum_{j} D_{i j, k}$ is computed. The $\hat{D}_{i, k}$ values, one for each laboratory i, are then fitted to a given probability distribution. After this step, the matrix Z is obtained, where Z_i is the z-score vector of the i-th laboratory. The elements of Z_i are obtained by $Z_{i, k} = C^{* - 1} (C_{k} (\hat{D}_{i, k}))$, where C_k is the cumulative distribution function fitted to the conglomerate k and C* is the corresponding standard distribution function.²
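The two quantities above can be illustrated with a minimal sketch. It assumes a lognormal fit (the distribution reported later in the text for Sheen et al.) with parameters estimated from the mean and standard deviation of the log-distances, and it takes C* to be the standard lognormal distribution; the function names and the small distance matrix are hypothetical, and this is not the authors' code.

```python
import math
from statistics import NormalDist, fmean, stdev

def average_distances(D):
    """Row-wise average of a square distance matrix: D_hat_i = (1/n) * sum_j D_ij."""
    n = len(D)
    return [sum(row) / n for row in D]

def lognormal_z_scores(d_hat):
    """Scores Z_i = C*^{-1}(C_k(D_hat_i)).

    Assumption: C_k is a lognormal CDF fitted through the mean and sample
    standard deviation of log(D_hat); C* is the standard lognormal CDF.
    """
    logs = [math.log(d) for d in d_hat]
    fitted = NormalDist(fmean(logs), stdev(logs))   # distribution of log(D_hat)
    standard = NormalDist()                         # standard normal
    scores = []
    for value in logs:
        p = fitted.cdf(value)                            # C_k(D_hat_i)
        scores.append(math.exp(standard.inv_cdf(p)))     # C*^{-1}(p)
    return scores

# Hypothetical distance matrix for three laboratories (symmetric, zero diagonal).
D = [[0.0, 1.0, 4.0],
     [1.0, 0.0, 5.0],
     [4.0, 5.0, 0.0]]
d_hat = average_distances(D)
z = lognormal_z_scores(d_hat)
```

In this toy example the third laboratory is the farthest from the others on average, so it receives the largest score.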

In this approach, the next step is to perform principal component analysis (PCA) on the Z matrix. In the PCA model, T = ZP^t, where T is the matrix of principal component scores and P is the matrix of loadings. The L most significant principal components are retained, giving T_L = ZP_L^t. For each participant, the Euclidean norm ||T_i,L|| is calculated, and these statistical distances are fitted to a new probability distribution with distribution function $\hat{C}$.²

This new distribution has an associated z-score, which the authors called the projected statistical score ($\hat{Z}_{i}$). This score is calculated using $\hat{Z}_{i} = \hat{C}^{* - 1} (\hat{C} (∥ T_{i, L} ∥))$. If any $\hat{Z}_{i}$ value falls outside the 95% confidence interval, the corresponding laboratory is considered an outlier and removed from the data set. The process is repeated until no data set falls outside the 95% confidence interval.² In that study, 10 participants each provided a one-dimensional ¹H NMR spectrum, and three of them were identified as outlying by the proposed methodology.

Another approach to dealing with multivariate data was proposed by Viant et al.⁴ [4. Viant, M. R.; Bearden, D. W.; Bundy, J. G.; Burton, I. W.; Collette, T. W.; Ekman, D. R.; Ezernieks, V.; Karakach, T. K.; Lin, C. Y.; Rochfort, S.; De Ropp, J. S.; Teng, Q.; Tjeerdema, R. S.; Walter, J. A.; Wu, H.; Environ. Sci. Technol. 2009, 43, 219.] They proposed principal component analysis (PCA) to cluster the participants' results. The test items sent to the laboratories were synthetic metabolite mixtures and European flounder liver extracts (biological) from clean and contaminated sites. The goal of the intercomparison exercise was to evaluate the effectiveness of ¹H NMR metabolomics in generating comparable data sets from environmentally derived samples, and each participant provided a one-dimensional spectrum. The associated PCA scores plots were used to visually assess whether individual laboratories could reveal which metabolites discriminated the synthetic mixtures and the biological samples. In both cases, the PCA approach led to the conclusion that there was a high degree of similarity across all laboratories. The comparability and precision of the participating laboratories were good, reflecting good results from the NMR spectra.⁴

Gallo et al.⁵ [5. Gallo, V.; Intini, N.; Mastrorilli, P.; Latronico, M.; Scapicchio, P.; Triggiani, M.; Bevilacqua, V.; Fanizzi, P.; Acquotti, D.; Airoldi, C.; Arnesano, F.; Assfalg, M.; Benevelli, F.; Bertelli, D.; Cagliani, L. R.; Casadei, L.; Marincola, F. C.; Colafemmina, G.; Consonni, R.; Cosentino, C.; Davalli, S.; De Pascali, S. A.; D’aiuto, V.; Faccini, A.; Gobetto, R.; Lamanna, R.; Liguori, F.; Longobardi, F.; Mallamace, D.; Mazzei, P.; Menegazzo, I.; Milone, S.; Mucci, A.; Napoli, C.; Pertinhez, T.; Rizzuti, A.; Rocchigiani, L.; Schievano, E.; Sciubba, F.; Sobolev, A.; Tenori, L.; Valerio, M.; Anal. Chem. 2015, 87, 6709.] proposed a new performance index, named the Q_p-score, to assess laboratory performance in multi-component analyses. Eight nuclear magnetic resonance signals (3 for aldicarb, 1 for methamidophos, 2 for oxadixyl, and 2 for pirimicarb) were obtained by 36 participants in an interlaboratory comparison. The following parameter was proposed: $Q_{p} = (a_{i} - \bar{a}) / σ_{slope}$, where a_i is the slope of the calibration line determined by the i-th participant, $\bar{a}$ is the consensus slope value, and σ_slope is the interlaboratory standard deviation of the slopes. The Q_p-score is considered satisfactory when |Q_p| ≤ 2, questionable when 2 < |Q_p| < 3 and unsatisfactory when |Q_p| ≥ 3. The proposed methodology allows the results reported by participants to be classified for each NMR signal. Overall, 9 laboratories were classified as unsatisfactory, 2 as questionable and 25 as satisfactory.⁵
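The classification rule for the Q_p-score can be written as a short sketch. It assumes that the score is the standardized difference between a participant's slope and the consensus slope, which is what the thresholds of 2 and 3 imply; the function names and the numerical slopes are hypothetical.

```python
def qp_score(slope, consensus_slope, sigma_slope):
    """Standardized deviation of a participant's calibration slope (assumed form)."""
    return (slope - consensus_slope) / sigma_slope

def classify_qp(qp):
    """Apply the thresholds quoted in the text to |Q_p|."""
    magnitude = abs(qp)
    if magnitude <= 2.0:
        return "satisfactory"
    if magnitude < 3.0:
        return "questionable"
    return "unsatisfactory"

# Hypothetical slopes: participant 1.25, consensus 1.00, standard deviation 0.10.
example = classify_qp(qp_score(1.25, 1.00, 0.10))
```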

Other authors have reported multivariate data analysis by PCA in interlaboratory comparisons. Danzer et al.,⁶ [6. Danzer, K.; Wank, U.; Wienke, D.; Chemom. Intell. Lab. Syst. 1991, 12, 69.] Henrion⁷ [7. Henrion, R.; Chemom. Intell. Lab. Syst. 1994, 25, 1.] and Škrbić et al.⁸ [8. Škrbić, B.; Héberger, K.; Ðurišić-Mladenović, N.; Anal. Bioanal. Chem. 2013, 405, 8363.] suggested PCA to identify outlying laboratories in interlaboratory comparisons. Minkkinen⁹ [9. Minkkinen, P.; Chemom. Intell. Lab. Syst. 1995, 29, 263.] proposed the principal component score plot to visualize the variation of the results between and within laboratories. Heininger et al.¹⁰ [10. Heininger, P.; Pelzer, J.; Henrion, R.; Henrion, G.; Fresenius’ J. Anal. Chem. 1998, 360, 344.] proposed PCA to group the laboratories and identify the type of method used by each participant to analyze the samples. Aoki et al.¹¹ [11. Aoki, R.; Leão, D.; Bustamante, J. P. M.; Vilca, F.; J. Appl. Stat. 2021, 1.] proposed multiple hypothesis testing to assess the equivalence between the measurements of each laboratory and those of a reference laboratory. The authors suggest building a confidence region between each participating laboratory and the reference laboratory based on the Wald statistic.¹¹

This paper proposes a new procedure to assess performance in interlaboratory comparisons that differs from the methodologies mentioned above. The new methodology is derived from the concepts of outlier detection in two- and three-dimensional Euclidean space and is based on the following steps.

Initially, the data (Figure 1a) are arranged as a matrix X_n×p, where n represents the number of variables, that is, the chemical shifts of the spectrum provided by each laboratory, and p represents the number of participants in the interlaboratory comparison (Figure 1b).

Figure 1
The proposed methodology: (a) multivariate spectrum reported by each participant; (b) multivariate data set organized in a matrix where the first column contains the chemical shift and the remaining columns contain the spectra reported by each participant; (c) multidimensional scaling combined with robust confidence ellipse (MDS/RCE 2D); (d) multidimensional scaling combined with robust confidence ellipsoid (MDS/RCE 3D).

The second step is to perform multidimensional scaling (MDS) on the multivariate data, transforming them into a low-dimensional representation in which the results of the laboratories can be visualized in two or three dimensions. After that, a confidence limit based on a robust confidence ellipse/ellipsoid (RCE) is plotted to identify outlying laboratory results (Figures 1c and 1d). A point outside the robust confidence ellipse/ellipsoid provides evidence that the corresponding laboratory result is an outlier at a specific confidence level (95%, for example). Points inside the RCE, on the other hand, are not considered outlying results at the same confidence level.

Multidimensional scaling is a multivariate technique that reveals “hidden” structures in a multivariate data set.¹² [12. Leeuw, J.; Mair, P.; J. Stat. Software 2009, 31, 1.] In other words, it is a method for visualizing the similarity/dissimilarity among the laboratories participating in an interlaboratory comparison, which are represented as points in a two- or three-dimensional space.¹³ [13. Groenen, P. J. F.; van de Velden, M.; J. Stat. Software 2016, 73, 1.] The proximity/distance between the points represents the similarity/dissimilarity among laboratories.¹³

The robust confidence ellipse/ellipsoid is a multivariate analysis technique for outlier detection. A confidence region is built from the variance-covariance matrix of the original data set, which makes it possible to identify whether some laboratories differ statistically from the others at a specific significance level.¹⁴ [14. ISO 13528: Statistical Methods for Use in Proficiency Testing by Interlaboratory Comparisons, ISO: Geneva, 2005.]

The combination of these two techniques (Figure 1) constitutes the proposal for evaluating the performance of the results reported by the participants.

Methodology

Data set

The data used in this study come from the available data set described by Sheen et al.² In this data set, an interlaboratory comparison was carried out to assess the effectiveness of ¹H NMR metabolomics. Seven laboratories from the United States, one from Canada, one from the United Kingdom, and one from Australia participated in the intercomparison. Mixtures of synthetic metabolites and samples of biological origin from liver extracts of European flounder from clean and contaminated sites were analyzed.⁴

The data described by Sheen et al.² refer to adult female European flounder collected from the Tyne in the United Kingdom. This is a biological sample from a polluted site. The fish were sacrificed, and the liver tissues were dissected, snap-frozen in liquid nitrogen, and stored at -80 °C until extraction by the IC participants.⁴ The samples were extracted using the methanol:chloroform:water method and a Precellys-24 bead-based homogenizer (Stretton Scientific Ltd., U.K.).⁴

Each participating laboratory obtained a one-dimensional ¹H NMR spectrum. The spectra are reported as chemical shift frequencies with a range from 10.0 to 0.2 ppm. The region from 4.7 to 5.2 ppm was excluded due to water solvent suppression artifacts and the NMR spectra were renormalized. The spectra were binned with a bin width of 0.005 ppm, for a total of 1860 variables in each spectrum.²
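The number of variables quoted above can be checked with simple arithmetic. The sketch assumes that the bins tile the spectral range exactly and that the excluded water region removes whole bins; the helper name is hypothetical.

```python
def bin_count(high_ppm, low_ppm, width_ppm):
    """Number of bins of a given width between two chemical shifts."""
    return round((high_ppm - low_ppm) / width_ppm)

total_bins = bin_count(10.0, 0.2, 0.005)      # full range from 10.0 to 0.2 ppm
excluded_bins = bin_count(5.2, 4.7, 0.005)    # water region from 4.7 to 5.2 ppm
variables = total_bins - excluded_bins        # bins kept in each spectrum
```

Under these assumptions the full range gives 1960 bins, the excluded region removes 100, and 1860 variables remain, in agreement with the text.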

In this work, the multivariate techniques of multidimensional scaling and robust confidence ellipse/ellipsoid (Figures 1c and 1d) were explored to assess the performance of laboratories participating in an interlaboratory comparison, classifying each of them as an outlier or not. The basis for the development of this methodology is described below.

Multidimensional scaling

Let p be the number of different laboratories (Figure 1b) and δ_ij the dissimilarity between laboratories i and j. The coordinates are gathered in the matrix X_n×p, where n is the dimensionality of the solution, which must be specified. Thus, column i of X_n×p provides the coordinates of laboratory i (Figure 1b). Let d_ij(X) be the Euclidean distance (the most commonly used)¹⁵ [15. Agrafiotis, D. K.; Rassokhin D. N.; Lobanov, V. S.; J. Comput. Chem. 2001, 22, 488.],¹⁶ [16. Morrison, A.; Ross, G.; Chalmers, M.; Inf. Vis. 2003, 2, 68.] between columns i and j, defined as:

(1)

d_{i j} (X) = {(\sum_{s = 1}^{n} {(x_{i s} - x_{j s})}^{2})}^{1 / 2}

which is the shortest distance between laboratories i and j. In equation 1, x_is is the s-th element of the column of X for laboratory i (its spectrum in Figure 1b) and x_js is the corresponding element for laboratory j.

The Euclidean distance is more favorable in visual representations because it gives a more isotropic display.¹⁷ [17. Asan, U.; Ercan, S. In Computational Intelligence Systems in Industrial Engineering; Kahraman, C., ed.; Atlantis Press: Paris, France, 2012, ch. 14.] The purpose of multidimensional scaling is to find a matrix X_n×p such that d_ij(X) matches δ_ij as closely as possible.¹³
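Equation 1 can be illustrated with a minimal sketch that computes the distance between every pair of columns of a matrix stored as a list of rows. The function name and the toy data are hypothetical.

```python
import math

def column_distances(X):
    """Euclidean distance between every pair of columns of X.

    X is a list of n rows (variables), each holding p values (one per laboratory).
    """
    n_labs = len(X[0])
    D = [[0.0] * n_labs for _ in range(n_labs)]
    for i in range(n_labs):
        for j in range(i + 1, n_labs):
            # Sum of squared differences over the n variables, as in equation 1.
            d = math.sqrt(sum((row[i] - row[j]) ** 2 for row in X))
            D[i][j] = D[j][i] = d
    return D

# Hypothetical data: 2 variables (rows) and 3 laboratories (columns).
X = [[0.0, 3.0, 0.0],
     [0.0, 4.0, 0.0]]
D = column_distances(X)
```

Here laboratories 1 and 3 reported identical values, so their distance is zero, while laboratory 2 is at distance 5 from both.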

To obtain this matrix, the least squares MDS model is used, which consists of minimizing the equation:

(2)

σ^{2} (X) = \sum_{i = 2}^{p} \sum_{j = 1}^{i - 1} w_{i j} {(δ_{i j} - d_{i j} (X))}^{2}

where w_ij is a user-defined weight, which must be nonnegative. The minimization of σ²(X) is quite complex, and iterative algorithms are needed to find the matrix X_n×p that minimizes σ²(X). The most widely used is the SMACOF algorithm.¹²,¹³,¹⁸ [18. Bates, D.; Chambers, J.; Dalgaard, P.; Gentleman, R.; Hornik, K.; Ihaka, R.; Kalibera, T.; Lawrence, M.; Leisch, F.; Ligges, U.; Lumley, T.; Maechler, M.; Meyer, S.; Murrell, P.; Plummer, M.; Ripley, B.; Sarkar, D.; Lang, D. T.; Tierney, L.; Urbanek, S.; R Project, version 4.2.0; R Foundation, Austria, 2022.]
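The minimization of equation 2 can be sketched for the special case of unit weights (w_ij = 1), where the SMACOF update reduces to the Guttman transform. This is an illustrative sketch rather than the implementation used by the authors: the function names, the random starting configuration, the stopping rule and the toy dissimilarities are all assumptions, and each laboratory is stored as a row of coordinates rather than as a column.

```python
import math
import random

def pairwise_distances(points):
    """Euclidean distances d_ij(X) between all pairs of points."""
    return [[math.dist(a, b) for b in points] for a in points]

def raw_stress(delta, points):
    """Equation 2 with unit weights: sum over i > j of (delta_ij - d_ij)^2."""
    d = pairwise_distances(points)
    return sum((delta[i][j] - d[i][j]) ** 2
               for i in range(1, len(points)) for j in range(i))

def guttman_transform(delta, points):
    """One update: x_i <- (1/p) * sum_{j != i} (delta_ij / d_ij) * (x_i - x_j)."""
    p, dims = len(points), len(points[0])
    d = pairwise_distances(points)
    updated = []
    for i in range(p):
        coords = [0.0] * dims
        for j in range(p):
            if j != i and d[i][j] > 0.0:   # coincident points contribute nothing
                ratio = delta[i][j] / d[i][j]
                for s in range(dims):
                    coords[s] += ratio * (points[i][s] - points[j][s])
        updated.append([c / p for c in coords])
    return updated

def smacof(delta, dims=2, max_iter=300, tol=1e-10, seed=0):
    """Iterate the Guttman transform until the stress stops decreasing."""
    rng = random.Random(seed)
    points = [[rng.uniform(-1.0, 1.0) for _ in range(dims)] for _ in delta]
    history = [raw_stress(delta, points)]
    for _ in range(max_iter):
        points = guttman_transform(delta, points)
        history.append(raw_stress(delta, points))
        if history[-2] - history[-1] < tol:
            break
    return points, history

# Hypothetical dissimilarities: distances between the corners of a unit square.
corners = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
delta = pairwise_distances(corners)
config, stress_history = smacof(delta)
```

Each update cannot increase the stress, which is the property that makes the majorization approach attractive; the configuration found is only determined up to rotation, reflection and translation, and it may be a local minimum.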

Robust confidence ellipse

After the MDS technique is applied, the dimensionality of the data is reduced to two or three dimensions. The next step is to identify the outlying laboratories using the robust confidence ellipse (Figure 1c).

The robust confidence ellipse is built from the matrix equation:

(3)

{\hat{μ}}_{r o b} + (\sqrt{2 \cdot F_{2; (n - 1); (1 - α)}}) \cdot U \cdot Q

where ${\hat{μ}}_{r o b} = [\begin{array}{ll} {\bar{x}}_{r o b} & {\bar{y}}_{r o b} \end{array}]$ is the vector of robust means; F_{2;(n-1);(1-α)} is the quantile of the Fisher-Snedecor distribution with 2 and (n - 1) degrees of freedom at the confidence level 100(1 - α)%, with n the number of points (laboratories), as in equation 5; Q is the Cholesky factor of the robust variance-covariance matrix S_rob.¹⁸,¹⁹ [19. Kent, J. T.; Tyler, D. E.; Vard, Y.; Commun. Stat. Simul. Comput. 1994, 23, 441.] Both ${\hat{μ}}_{r o b}$ and S_rob are estimated by the iterative process described in the next section.

The m × 2 matrix U contains points on the unit circle and is defined by

(4)

U = [\begin{array}{ll} \cos (a) & \sin (a) \end{array}]

where a = [a₁ … a_m] is a vector of size m (0 ≤ a ≤ 2π).
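Equations 3 and 4 can be sketched as follows for given estimates of the center and the covariance matrix. The sketch assumes that Q is the upper-triangular Cholesky factor (S = QᵀQ, the convention of R's chol) and that n is the number of points; it uses the closed-form quantile of the F distribution that exists when the numerator has 2 degrees of freedom. The function names and numerical values are hypothetical.

```python
import math

def f_quantile_2(level, dfd):
    """Quantile of the F distribution with 2 and dfd degrees of freedom (closed form)."""
    return (dfd / 2.0) * ((1.0 - level) ** (-2.0 / dfd) - 1.0)

def cholesky_upper_2x2(S):
    """Upper-triangular Q with S = Q^T Q for a symmetric positive definite 2 x 2 matrix."""
    q11 = math.sqrt(S[0][0])
    q12 = S[0][1] / q11
    q22 = math.sqrt(S[1][1] - q12 * q12)
    return [[q11, q12], [0.0, q22]]

def ellipse_points(mu, S, n_points, level=0.95, m=100):
    """Boundary of the ellipse: mu + sqrt(2 F) * U * Q, with U on the unit circle."""
    radius = math.sqrt(2.0 * f_quantile_2(level, n_points - 1))
    Q = cholesky_upper_2x2(S)
    boundary = []
    for t in range(m):
        a = 2.0 * math.pi * t / m
        u = (math.cos(a), math.sin(a))                  # one row of U
        x = mu[0] + radius * (u[0] * Q[0][0] + u[1] * Q[1][0])
        y = mu[1] + radius * (u[0] * Q[0][1] + u[1] * Q[1][1])
        boundary.append((x, y))
    return boundary

def mahalanobis_sq(point, mu, S):
    """Squared Mahalanobis distance using the explicit inverse of the 2 x 2 matrix S."""
    dx, dy = point[0] - mu[0], point[1] - mu[1]
    det = S[0][0] * S[1][1] - S[0][1] * S[1][0]
    return (S[1][1] * dx * dx - 2.0 * S[0][1] * dx * dy + S[0][0] * dy * dy) / det

def is_outside(point, mu, S, n_points, level=0.95):
    """A point lies outside the ellipse when its squared distance exceeds 2 F."""
    return mahalanobis_sq(point, mu, S) > 2.0 * f_quantile_2(level, n_points - 1)

# Hypothetical center, covariance matrix and number of laboratories.
mu, S, n_labs = (0.0, 0.0), [[4.0, 1.0], [1.0, 2.0]], 10
boundary = ellipse_points(mu, S, n_labs)
```

Testing the squared Mahalanobis distance against 2F is equivalent to checking on which side of the plotted boundary a point falls, so outliers can be flagged without any graphical inspection.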

Robust means ( ${\hat{μ}}_{r o b}$ ) and variance-covariance matrix (S_rob)

The vector of robust means ${\hat{μ}}_{r o b}$ and the variance-covariance matrix S_rob, for the bi-dimensional data set mentioned in the previous sub-section, are estimated by the following iterative process.¹⁹ Let (x; y) be a bi-dimensional data set:

(5)

(x; y) = [\begin{array}{cc} x_{1} & y_{1} \\ x_{2} & y_{2} \\ ⋮ & ⋮ \\ x_{n} & y_{n} \end{array}]

Step (i): set $\hat{μ} = [\begin{array}{ll} \bar{x} & \bar{y} \end{array}]$ and w_i = 1 + p/υ ∀ i = 1, …, n, where p is the number of variables and υ is the number of degrees of freedom of the multivariate t distribution.

Step (ii): compute the matrix:

(6)

X = [\begin{array}{cc} (x_{1} - \bar{x}) & (y_{1} - \bar{y}) \\ ⋮ & ⋮ \\ (x_{n} - \bar{x}) & (y_{n} - \bar{y}) \end{array}]

Step (iii): compute the singular value decomposition of matrix A, svd(A) = USV^T, where:

(7)

A = [\begin{array}{cc} \sqrt{\frac{w_{1}}{\sum_{i = 1}^{n} w_{i}}} (x_{1} - \bar{x}) & \sqrt{\frac{w_{1}}{\sum_{i = 1}^{n} w_{i}}} (y_{1} - \bar{y}) \\ ⋮ & ⋮ \\ \sqrt{\frac{w_{n}}{\sum_{i = 1}^{n} w_{i}}} (x_{n} - \bar{x}) & \sqrt{\frac{w_{n}}{\sum_{i = 1}^{n} w_{i}}} (y_{n} - \bar{y}) \end{array}]

Step (iv): compute the matrix:

(8)

w^{'} = X V S^{'} = [\begin{array}{cc} w_{11}^{'} & w_{12}^{'} \\ ⋮ & ⋮ \\ w_{n 1}^{'} & w_{n 2}^{'} \end{array}]

where s₁ and s₂ are the singular values of A (the diagonal elements of S) and

(9)

S^{'} = [\begin{array}{cc} 1 / s_{1} & 0 \\ 0 & 1 / s_{2} \end{array}]

Step (v): compute the vector Q = [Q₁ … Q_n] = [(w’₁₁)² + (w’₁₂)² … (w’_n1)² + (w’_n2)²]

Step (vi): compute the new weights:

(10)

w_{i}^{*} = \frac{(υ + p)}{(υ + Q_{i})} \forall i = 1, \dots, n

Step (vii): compute the robust vector of means:

(11)

{\hat{μ}}_{rob} = [\begin{array}{ll} {\bar{x}}_{rob} & {\bar{y}}_{rob} \end{array}] = [\begin{array}{ll} \frac{\sum_{i = 1}^{n} w_{i}^{*} x_{i}}{\sum_{i = 1}^{n} w_{i}^{*}} & \frac{\sum_{i = 1}^{n} w_{i}^{*} y_{i}}{\sum_{i = 1}^{n} w_{i}^{*}} \end{array}]

Step (viii): the newly fitted vector ${\hat{μ}}_{r o b}$ and the weights w_i^* are used as the new values in step (i) ( $\hat{μ} = {\hat{μ}}_{r o b}$ and w_i = w_i^* ). This procedure is repeated until the values of w_i^* converge, that is, until |w_i - w_i^*| < ε for all i.

Step (ix): compute the robust variance-covariance matrix S_rob:¹⁹

(12)

S_{r o b} = [\begin{array}{cc} \frac{\sum_{i = 1}^{n} w_{i}^{*} {(x_{i} - {\bar{x}}_{r o b})}^{2}}{n} & \frac{\sum_{i = 1}^{n} w_{i}^{*} (x_{i} - {\bar{x}}_{r o b}) (y_{i} - {\bar{y}}_{r o b})}{n} \\ \frac{\sum_{i = 1}^{n} w_{i}^{*} (x_{i} - {\bar{x}}_{r o b}) (y_{i} - {\bar{y}}_{r o b})}{n} & \frac{\sum_{i = 1}^{n} w_{i}^{*} {(y_{i} - {\bar{y}}_{r o b})}^{2}}{n} \end{array}]
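Steps (i) to (ix) can be sketched for the bi-dimensional case as follows. Instead of the singular value decomposition of step (iii), the sketch inverts the 2 × 2 weighted scatter matrix AᵀA directly, which gives the same values Q_i; this substitution, the function name, the default values of υ and ε, and the toy data are assumptions made for illustration only.

```python
def robust_mean_cov(points, nu=5.0, eps=1e-6, max_iter=200):
    """Iteratively reweighted mean and covariance for bi-dimensional data."""
    n, p = len(points), 2
    w = [1.0 + p / nu] * n                                          # step (i)
    mu = (sum(x for x, _ in points) / n, sum(y for _, y in points) / n)
    for _ in range(max_iter):
        dev = [(x - mu[0], y - mu[1]) for x, y in points]           # step (ii)
        total = sum(w)
        # Weighted scatter matrix A^T A built from equation 7 (symmetric 2 x 2).
        c11 = sum(wi * dx * dx for wi, (dx, dy) in zip(w, dev)) / total
        c12 = sum(wi * dx * dy for wi, (dx, dy) in zip(w, dev)) / total
        c22 = sum(wi * dy * dy for wi, (dx, dy) in zip(w, dev)) / total
        det = c11 * c22 - c12 * c12
        # Steps (iii)-(v): Q_i = dev_i (A^T A)^{-1} dev_i^T, via the explicit inverse.
        q = [(c22 * dx * dx - 2.0 * c12 * dx * dy + c11 * dy * dy) / det
             for dx, dy in dev]
        w_new = [(nu + p) / (nu + qi) for qi in q]                  # step (vi)
        total_new = sum(w_new)
        mu = (sum(wi * x for wi, (x, _) in zip(w_new, points)) / total_new,
              sum(wi * y for wi, (_, y) in zip(w_new, points)) / total_new)  # step (vii)
        converged = all(abs(a - b) < eps for a, b in zip(w, w_new))
        w = w_new                                                   # step (viii)
        if converged:
            break
    dev = [(x - mu[0], y - mu[1]) for x, y in points]               # step (ix)
    s11 = sum(wi * dx * dx for wi, (dx, dy) in zip(w, dev)) / n
    s12 = sum(wi * dx * dy for wi, (dx, dy) in zip(w, dev)) / n
    s22 = sum(wi * dy * dy for wi, (dx, dy) in zip(w, dev)) / n
    return mu, [[s11, s12], [s12, s22]], w

# Hypothetical configuration: eight nearby points and one distant point.
points = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (0.5, 0.5),
          (0.2, 0.8), (0.8, 0.3), (0.4, 0.1), (10.0, 10.0)]
mu_rob, S_rob, weights = robust_mean_cov(points)
```

In this toy configuration the distant point receives the smallest weight, so it has little influence on the estimated center and covariance matrix; this is what makes the resulting ellipse robust.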

Robust confidence ellipsoid

It is recommended to use the robust confidence ellipsoid (Figure 1d) to investigate the possible presence of outliers that were not identified in the 2D analysis described in the previous sub-section.

The robust confidence ellipsoid is built from an equation analogous to the robust confidence ellipse:

(13)

{\hat{μ}}_{r o b} + (\sqrt{3 \cdot F_{3; (n - 1); (1 - α)}}) \cdot U \cdot Q

where ${\hat{μ}}_{r o b} = [\begin{array}{lll} {\bar{x}}_{r o b} & {\bar{y}}_{r o b} & {\bar{z}}_{r o b} \end{array}]$ is the vector of robust means; F_{3;(n-1);(1-α)} is the quantile of the Fisher-Snedecor distribution with 3 and (n - 1) degrees of freedom at the confidence level 100(1 - α)%; Q is the Cholesky factor of the robust variance-covariance matrix. Furthermore, ${\hat{μ}}_{r o b}$ and S_rob are estimated analogously to the iterative process described in the previous sub-section.

The m × 3 matrix U is the sphere of radius 1 defined by:

(14)

U = [\begin{array}{lll} \cos (θ) \sin (φ) & \sin (θ) \sin (φ) & \cos (φ) \end{array}]

where θ = [θ₁ … θ_m] is a vector of size m (0 ≤ θ ≤ 2π) and φ = [φ₁ … φ_m] is a vector of size m (0 ≤ φ ≤ π).¹⁸
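Equations 13 and 14 can be sketched in the same way as the bi-dimensional case. Because the quantile of the F distribution with 3 numerator degrees of freedom has no simple closed form, the sketch takes it as an input (it could be obtained, for instance, from R's qf function). The lower-triangular Cholesky factor L used here is the transpose of the upper-triangular factor Q; the function names and numerical values, including the quantile, are hypothetical.

```python
import math

def cholesky_lower(S):
    """Lower-triangular L with S = L L^T for a symmetric positive definite matrix."""
    d = len(S)
    L = [[0.0] * d for _ in range(d)]
    for i in range(d):
        for j in range(i + 1):
            acc = S[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            L[i][j] = math.sqrt(acc) if i == j else acc / L[j][j]
    return L

def ellipsoid_surface_point(mu, L, radius, theta, phi):
    """One surface point: mu + radius * u * Q, with u on the unit sphere and Q = L^T."""
    u = (math.cos(theta) * math.sin(phi),
         math.sin(theta) * math.sin(phi),
         math.cos(phi))
    return tuple(mu[j] + radius * sum(u[k] * L[j][k] for k in range(3))
                 for j in range(3))

def mahalanobis_sq(point, mu, L):
    """Squared Mahalanobis distance by forward substitution: solve L y = point - mu."""
    y = []
    for i in range(len(L)):
        rhs = point[i] - mu[i] - sum(L[i][k] * y[k] for k in range(i))
        y.append(rhs / L[i][i])
    return sum(v * v for v in y)

def is_outside(point, mu, S, f_quantile):
    """Outside the ellipsoid when the squared distance exceeds (dimension * F)."""
    return mahalanobis_sq(point, mu, cholesky_lower(S)) > len(mu) * f_quantile

# Hypothetical center and covariance matrix; f_quantile = 4.0 is a placeholder value.
mu = (0.0, 0.0, 0.0)
S = [[4.0, 2.0, 0.0], [2.0, 3.0, 1.0], [0.0, 1.0, 2.0]]
f_quantile = 4.0
L = cholesky_lower(S)
surface = ellipsoid_surface_point(mu, L, math.sqrt(3.0 * f_quantile), 0.7, 1.1)
```

Every point generated on the surface has a squared Mahalanobis distance equal to 3F, so comparing a laboratory's squared distance with 3F decides whether it lies inside or outside the ellipsoid.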

Software

All statistical analyses were performed using the R statistical software, a free, open-source environment for statistical computing and graphics.¹⁸ The 2D plot (Figure 1c) was built using the R packages car and stats.¹⁸ In addition, R code was written to determine which points obtained by multidimensional scaling were outside the robust confidence ellipse. The MASS R package¹⁸ was required for this purpose.

For the 3D analysis (Figure 1d), the plot was built using the plotly R package,¹⁸ a graphing library for interactive plots. To build the robust confidence ellipsoid, it was necessary to develop R code based on the dataEllipse function from the car R package.¹⁸ Moreover, R code was written to identify outlying results. This code also depends on the MASS R package.¹⁸

Results and Discussion

The results of the participants (Figure 1a) were arranged in a matrix with 1860 rows and 10 columns (Figure 1b). Each column contains the spectrum reported by one participant. The rows represent the variables (chemical shift in ppm).

It should be noted that this is the first study combining multidimensional scaling and robust confidence ellipse/ellipsoid (MDS/RCE) to evaluate laboratory proficiency in interlaboratory comparisons. To validate the methodology suggested in this article, the MDS/RCE results are compared with the multivariate z-score results obtained by Sheen et al.²

In the proposed methodology, two-dimensional multidimensional scaling locates each participating laboratory in Euclidean space according to the similarity/dissimilarity between ¹H NMR spectra. Additionally, the two-dimensional robust confidence ellipse makes it possible to identify whether there is an outlying spectrum (Figure 1c). Figure 2 presents the results obtained by the two-dimensional approach.

Figure 2
Multidimensional scaling of the ¹H NMR spectra reported by the participants and 95% robust confidence ellipse (2D plot).

It can be seen in Figure 2 that laboratory 7042 is located outside the ellipse. This provides evidence that its result differs statistically from those of the other participants. At the 5% significance level (Figure 2), the spectrum reported by laboratory 7042 is classified as an outlier when compared to the other participants. On the other hand, laboratories located within the robust confidence ellipse do not differ statistically from each other. In this case, there is no evidence that the results reported by these participants are outliers.

The next step is to analyze the three-dimensional limit according to Figure 1d. The 3D plot can reveal an outlying result that is “hidden” in the 2D plot. In other words, the three-dimensional approach provides more information about the participants' performance.

It can be seen from Figure 3 that more laboratories show results that differ from the others (at the 5% significance level) than in the bi-dimensional plot shown in Figure 2. Altogether, three laboratories (0714, 7042, and 9541) lie outside the ellipsoid, which means that their results may be considered outliers, that is, statistically different from the others. The remaining laboratories, inside the ellipsoid, do not differ statistically from each other. By reasoning analogous to that of the univariate z-score suggested by ISO 13528:2015,¹ performance assessed by MDS/RCE is considered acceptable when results are inside the ellipse/ellipsoid and unacceptable when they are outside.

Figure 3
Multidimensional scaling of the ¹H NMR spectra reported by the participants and 95% robust confidence ellipsoid (3D plot).

The confidence level adopted to consider the reported result as an outlier was 95% in 2D (Figure 2) and 3D (Figure 3) plot analysis.

In some cases, two (Figure 1c) and three (Figure 1d) dimensional analysis can provide the same conclusions, however, there are situations where this does not occur (such as Figures 2 and 3). In this situation, the two-dimensional analysis differs from the three-dimensional analysis, thus the latter must be adopted because it offers a more comprehensive (and reliable) analysis of the participants’ results.

In the multivariate z-score approach suggested by Sheen et al.,² the interspectral distance matrix D_k was obtained from the Kullback-Leibler, Mahalanobis, Hellinger and Jensen-Shannon distances. The values were fitted to a lognormal distribution. The choice of this probability distribution was based on the Q-Q plot. Additionally, according to the authors, the lognormal is the maximum entropy distribution for a specified mean and standard deviation.²

Each Z_i,k value indicates where the spectrum s_i,k lies in relation to the others in the cluster S_k. For the lognormal distribution, Z_i,k(1/2) = 1 and Z_i,k(0.95) is ca. 5. In this context, Z_i,k = 1 indicates that s_i,k is close to the center of S_k, while Z_i,k greater than 5 indicates that s_i,k lies outside the 95% confidence range of the cluster S_k.²
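The two reference values quoted above can be checked numerically. The sketch below assumes a score of the form Z = exp((ln d − μ)/σ), where d is a distance and μ and σ are the fitted log-mean and log-standard deviation; this form is an assumption chosen because it reproduces both stated values, and it is not quoted from Sheen et al. The parameter values are hypothetical.

```python
import math
from statistics import NormalDist

def lognormal_z(distance, mu, sigma):
    """Assumed score: exponential of the standardized log-distance."""
    return math.exp((math.log(distance) - mu) / sigma)

# Hypothetical fitted lognormal parameters (log-mean and log-standard deviation).
mu, sigma = 0.0, 0.5

# Median and 0.95 quantile of the fitted lognormal distribution.
median = math.exp(mu)
q95 = math.exp(mu + sigma * NormalDist().inv_cdf(0.95))

z_median = lognormal_z(median, mu, sigma)  # score at the median
z_95 = lognormal_z(q95, mu, sigma)         # score at the 0.95 quantile
```

Under this assumed form the score at the median is exactly 1 and the score at the 0.95 quantile is exp(1.645), about 5.18, whatever values μ and σ take, which matches the "ca. 5" threshold in the text.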

The ||T_i,L|| values were fitted to a lognormal distribution and the projected statistical score was calculated for each data set. The scores calculated from the Kullback-Leibler, Mahalanobis, Hellinger, and Jensen-Shannon distances showed evidence that the results (spectra) reported by participants 0714, 7042 and 9541 are outliers.²

The methodology based on multidimensional scaling combined with the robust confidence ellipse/ellipsoid (Figure 1) identified the same outlying participant results as the multivariate z-score method proposed by Sheen et al.² This agreement provides evidence that MDS/RCE has the potential to be considered as a performance evaluation method in interlaboratory comparisons. The main advantage of the methodology proposed in this article is that it does not depend on choosing a probability distribution.

Conclusions

In this paper, a new approach was proposed to analyze multivariate data from interlaboratory comparisons. The methodology combines multidimensional scaling and the robust confidence ellipse/ellipsoid to identify outlying laboratory results. The results obtained with this methodology were compared with those obtained by the multivariate z-score method described by Sheen et al.² The MDS/RCE method found the same three outlying laboratories (0714, 7042, and 9541) identified by the multivariate z-score method. Compared with the latter, the proposed methodology has the advantage of not depending on the choice, sometimes subjective, of a probability distribution.

This approach proved to be promising as a performance evaluation method for multivariate data analysis in interlaboratory comparisons. It should be noted that this methodology can be used, in a similar way, to evaluate the performance of laboratories participating in proficiency testing schemes. In this approach, it is suggested that participating laboratory results located outside the robust confidence ellipse/ellipsoid be classified as unacceptable and those inside as acceptable, by analogy with the univariate z-score described in ISO 13528:2015.

The proposed method therefore constitutes a valuable tool that contributes to filling a gap in the literature regarding multivariate data analysis in interlaboratory comparisons and proficiency testing. Other multivariate techniques, such as factor analysis and Kohonen’s self-organizing map, can be considered in future work as performance assessment tools in interlaboratory comparisons.

References

¹
ISO 13528: Statistical Methods for Use in Proficiency Testing by Interlaboratory Comparisons, ISO: Geneva, 2015.
²
Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref]
³
Hair Jr., J. F.; Black, W. C.; Babin, B. J.; Anderson, R. E.; Multivariate Data Analysis, 7th ed.; Prentice Hall: New Jersey, USA, 2009.
⁴
Viant, M. R.; Bearden, D. W.; Bundy, J. G.; Burton, I. W.; Collette, T. W.; Ekman, D. R.; Ezernieks, V.; Karakach, T. K.; Lin, C. Y.; Rochfort, S.; De Ropp, J. S.; Teng, Q.; Tjeerdema, R. S.; Walter, J. A.; Wu, H.; Environ. Sci. Technol. 2009, 43, 219. [Crossref]
⁵
Gallo, V.; Intini, N.; Mastrorilli, P.; Latronico, M.; Scapicchio, P.; Triggiani, M.; Bevilacqua, V.; Fanizzi, P.; Acquotti, D.; Airoldi, C.; Arnesano, F.; Assfalg, M.; Benevelli, F.; Bertelli, D.; Cagliani, L. R.; Casadei, L.; Marincola, F. C.; Colafemmina, G.; Consonni, R.; Cosentino, C.; Davalli, S.; De Pascali, S. A.; D’aiuto, V.; Faccini, A.; Gobetto, R.; Lamanna, R.; Liguori, F.; Longobardi, F.; Mallamace, D.; Mazzei, P.; Menegazzo, I.; Milone, S.; Mucci, A.; Napoli, C.; Pertinhez, T.; Rizzuti, A.; Rocchigiani, L.; Schievano, E.; Sciubba, F.; Sobolev, A.; Tenori, L.; Valerio, M.; Anal. Chem. 2015, 87, 6709. [Crossref]
⁶
Danzer, K.; Wank, U.; Wienke, D.; Chemom. Intell. Lab. Syst. 1991, 12, 69. [Crossref]
⁷
Henrion, R.; Chemom. Intell. Lab. Syst. 1994, 25, 1. [Crossref]
⁸
Škrbić, B.; Héberger, K.; Ðurišić-Mladenović, N.; Anal. Bioanal. Chem. 2013, 405, 8363. [Crossref]
⁹
Minkkinen, P.; Chemom. Intell. Lab. Syst. 1995, 29, 263. [Crossref]
¹⁰
Heininger, P.; Pelzer, J.; Henrion, R.; Henrion, G.; Fresenius’ J. Anal. Chem. 1998, 360, 344. [Crossref]
¹¹
Aoki, R.; Leão, D.; Bustamante, J. P. M.; Vilca, F.; J. Appl. Stat. 2021, 1. [Crossref]
¹²
Leeuw, J.; Mair, P.; J. Stat. Software 2009, 31, 1. [Crossref]
¹³
Groenen, P. J. F.; van de Velden, M.; J. Stat. Software 2016, 73, 1. [Crossref]
¹⁴
ISO 13528: Statistical Methods for Use in Proficiency Testing by Interlaboratory Comparisons, ISO: Geneva, 2005.
¹⁵
Agrafiotis, D. K.; Rassokhin, D. N.; Lobanov, V. S.; J. Comput. Chem. 2001, 22, 488. [Crossref]
¹⁶
Morrison, A.; Ross, G.; Chalmers, M.; Inf. Vis. 2003, 2, 68. [Crossref]
¹⁷
Asan, U.; Ercan, S. In Computational Intelligence Systems in Industrial Engineering; Kahraman, C., ed.; Atlantis Press: Paris, France, 2012, ch. 14. [Link]
¹⁸
Bates, D.; Chambers, J.; Dalgaard, P.; Gentleman, R.; Hornik, K.; Ihaka, R.; Kalibera, T.; Lawrence, M.; Leisch, F.; Ligges, U.; Lumley, T.; Maechler, M.; Meyer, S.; Murrell, P.; Plummer, M.; Ripley, B.; Sarkar, D.; Lang, D. T.; Tierney, L.; Urbanek, S.; R Project, version 4.2.0; R Foundation, Austria, 2022.
¹⁹
Kent, J. T.; Tyler, D. E.; Vardi, Y.; Commun. Stat. Simul. Comput. 1994, 23, 441. [Crossref]

Edited by

Editor who handled this article: Maria Cristina Canela (Associate)

Publication Dates

Publication in this collection
10 Mar 2023
Date of issue
Mar 2023

History

Received
02 May 2022
Published
16 Sept 2022

This is an Open Access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

[1] ¹
ISO 13528: Statistical Methods for Use in Proficiency Testing by Interlaboratory Comparisons, ISO: Geneva, 2015.

[2] ²
Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref]

[3] ³
Hair Jr., J. F.; Black, W. C.; Babin, B. J.; Anderson, R. E.; Multivariate Data Analysis, 7^th ed.; Prentice Hall: New Jersey, USA, 2009.

[4] ⁴
Viant, M. R.; Bearden, D. W.; Bundy, J. G.; Burton, I. W.; Collette, T. W.; Ekman, D. R.; Ezernieks, V.; Karakach, T. K.; Lin, C. Y.; Rochfort, S.; De Ropp, J. S.; Teng, Q.; Tjeerdema, R.S.; Walter, J. A.; Wu, H.; Environ. Sci. Technol. 2009, 43, 219. [Crossref]

[5] ⁵
Gallo, V.; Intini, N.; Mastrorilli, P.; Latronico, M.; Scapicchio, P.; Triggiani, M.; Bevilacqua, V.; Fanizzi, P.; Acquotti, D.; Airoldi, C.; Arnesano, F.; Assfalg, M.; Benevelli, F.; Bertelli, D.; Cagliani, L. R.; Casadei, L.; Marincola, F. C.; Colafemmina, G.; Consonni, R.; Cosentino, C.; Davalli, S.; De Pascali, S. A.; D’aiuto, V.; Faccini, A.; Gobetto, R.; Lamanna, R.; Liguori, F.; Longobardi, F.; Mallamace, D.; Mazzei, P.; Menegazzo, I.; Milone, S.; Mucci, A.; Napoli, C.; Pertinhez, T.; Rizzuti, A.; Rocchigiani, L.; Schievano, E.; Sciubba, F.; Sobolev, A.; Tenori, L.; Valerio, M.; Anal. Chem. 2015, 87, 6709. [Crossref]

[6] ⁶
Danzer, K.; Wank, U.; Wienke, D.; Chemom. Intell. Lab. Syst. 1991, 12, 69. [Crossref]

[7] ⁷
Henrion, R.; Chemom. Intell. Lab. Syst. 1994, 25, 1. [Crossref]

[8] ⁸
Škrbić, B.; Héberger, K.; Ðurišić-Mladenović, N.; Anal. Bioanal. Chem. 2013, 405, 8363. [Crossref]

[9] ⁹
Minkkinen, P.; Chemom. Intell. Lab. Syst. 1995, 29, 263. [Crossref]

[10] ¹⁰
Heininger, P.; Pelzer, J.; Henrion, R.; Henrion, G.; Fresenius’ J. Anal. Chem. 1998, 360, 344. [Crossref`]

[11] ¹¹
Aoki, R.; Leão, D.; Bustamante, J. P. M.; Vilca, F.; J. Appl. Stat. 2021, 1. [Crossref]

[12] ¹²
Leeuw, J.; Mair, P.; J. Stat. Software 2009, 31, 1. [Crossref]

[13] ¹³
Groenen, P. J. F.; van de Velden, M.; J. Stat. Software 2016, 73, 1. [Crossref]

[14] ¹⁴
ISO 13528: Statistical Methods for Use in Proficiency Testing by Interlaboratory Comparisons, ISO: Geneva, 2005.

[15] ¹⁵
Agrafiotis, D. K.; Rassokhin D. N.; Lobanov, V. S.; J. Comput. Chem. 2001, 22, 488. [Crossref]

[16] ¹⁶
Morrison, A.; Ross, G.; Chalmers, M.; Inf. Vis. 2003, 2, 68. [Crossref]

[17] ¹⁷
Asan, U.; Ercan, S. In Computational Intelligence Systems in Industrial Engineering; Kahraman, C., ed.; Atlantis Press: Paris, France, 2012, ch. 14. [Link]

[18] ¹⁸
Bates, D.; Chambers, J.; Dalgaard, P.; Gentleman, R.; Hornik, K.; Ihaka, R.; Kalibera, T.; Lawrence, M.; Leisch, F.; Ligges, U.; Lumley, T.; Maechler, M.; Meyer, S.; Murrell, P.; Plummer, M.; Ripley, B.; Sarkar, D.; Lang, D. T.; Tierney, L.; Urbanek, S.; R Project, version 4.2.0; R Foundation, Austria, 2022.

[19] ¹⁹
Kent, J. T.; Tyler, D. E.; Vard, Y.; Commun. Stat. Simul. Comput. 1994, 23, 441. [Crossref]

Brasil

Brasil

A New Approach for Multivariate Data Analysis in Interlaboratory Comparisons Based on Multidimensional Scaling and Robust Confidence Ellipse

Abstract

Introduction

Methodology

Data set

Multidimensional scaling

Robust confidence ellipse

Robust means ( ${\hat{μ}}_{r o b}$ ) and variance-covariance matrix (S_rob)

Robust confidence ellipsoid

Software

Results and Discussion

Conclusions

References

Edited by

Publication Dates

History

Brasil

Brasil

A New Approach for Multivariate Data Analysis in Interlaboratory Comparisons Based on Multidimensional Scaling and Robust Confidence Ellipse

Abstract

Introduction

Methodology

Data set

Multidimensional scaling

Robust confidence ellipse

Robust means (μ^rob) and variance-covariance matrix (Srob)

Robust confidence ellipsoid

Software

Results and Discussion

Conclusions

References

Edited by

Publication Dates

History

Robust means ( ${\hat{μ}}_{r o b}$ ) and variance-covariance matrix (S_rob)