Acessibilidade / Reportar erro

A New Approach for Multivariate Data Analysis in Interlaboratory Comparisons Based on Multidimensional Scaling and Robust Confidence Ellipse

Abstract

Interlaboratory comparisons (IC) present a challenge related to multivariate data analysis. ISO 13528:2015 is a reference document for interlaboratory comparisons. This standard does not provide descriptions of statistical methods for multivariate analysis and, according to our best knowledge, there is no practical guidance for the organizing and evaluation of multivariate data analysis for interlaboratory comparisons available. Due to this reason, some researchers have made efforts to develop methodologies that make it possible to analyze multivariate data in IC. Generally, these approaches are based on dimensionality-reduction methods like principal component analysis. This paper proposes a new approach to reduce the dimensionality of large data set and check the performance of laboratories based on multidimensional scaling (MDS) and robust confidence ellipse/ellipsoid (RCE). MDS is a multivariate analysis technique that allows grouping laboratories according to their similarity in a Euclidean space. On the other hand, RCE is a statistical method for outlier detection in a multivariate data set. In this work, it is proposed to combine MDS and RCE to evaluate laboratory proficiency in interlaboratory comparison. This methodology was compared with the multivariate z-score and both methodologies identified the same outlying laboratories. This preliminary result indicates that MDS/RCE is promising for classifying IC results.

Keywords:
multidimensional scaling; robust confidence ellipse; interlaboratory comparisons; multivariate analysis


Introduction

Interlaboratory comparison (IC) seeks to assess the performance of a given measurement method. To organize an IC, the same or similar items test are sent to each participating under predetermined conditions.11 ISO 13528: Statistical Methods for Use in Proficiency Testing by Interlaboratory Comparisons, ISO: Geneva, 2015. In some cases, the reported results by participants are multivariate data and univariate techniques are not suitable to analyze this kind of data.22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref] For this situation, multivariate analysis techniques can be used for investigating the inherent structure of data without losing valuable information of measurement.

Multivariate analysis, in a general way, refers to all statistical methods that simultaneously analyze multiple measures in each individual or object (in this case laboratory).33 Hair Jr., J. F.; Black, W. C.; Babin, B. J.; Anderson, R. E.; Multivariate Data Analysis, 7th ed.; Prentice Hall: New Jersey, USA, 2009. The official document for interlaboratory comparison, ISO 13528:2015,11 ISO 13528: Statistical Methods for Use in Proficiency Testing by Interlaboratory Comparisons, ISO: Geneva, 2015. does not provide descriptions for multivariate analysis. Due to this gap left by the official document, some researchers have been proposed many ways to realize multivariate data analysis in interlaboratory comparison. Sheen et al.22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref] have proposed multivariate z-score to identify outlying laboratory results.

In the proposed methodology by Sheen et al.,22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref] the spectra should be grouped so that the cluster consists of multiple nuclear magnetic resonance (NMR) spectra of the k-th sample provided by the participants. From these clusters, the interspectral distance matrix Dk is calculated, whose elements are the distances Dij,k = d(si,k, sj,k) where si,k is the spectrum of laboratory i, sj,k is the spectrum of laboratory j (both belonging to the conglomerate Sk) and d(.) is a multivariate distance measure. The authors suggest Euclidean, Mahalanobis, Hellinger, Kullback-Leibler, Jensen-Shannon and Jeffreys distances.22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref]

Based on the values Dij,k, the average distance D^i,k=(1/n)jDij,k is computed. The D^i,k values need to be fitted to a given probability distribution for each laboratory i. After this step, the matrix Z is obtained, where Zi is the z-score vector of the i-th laboratory. The vector Zi is obtained by Zi,k=C1(Ck(D^i,k)) where Ck is the cumulative distribution function after being fitted to the conglomerate k and C* is the corresponding standard distribution function.22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref]

In this approach, the next step is to perform the principal component analysis (PCA) on the Z matrix. In the PCA model, T = ZPt where T is the matrix of the scores of the principal components and P is the matrix of the loadings. The most significant L principal components are identified by obtaining TL = ZPLt. For each participant, the Euclidean norm ||Ti,L|| is calculated and these statistical distances are adjusted to a new probability distribution with distribution function C^.22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref]

This new distribution has a z-score associated with it. The authors called the projected statistical score (Z^i). This score is calculated using Z^k=C^1(C^(Ti,L)). If any Z^i value falls outside the 95% confidence interval, the corresponding laboratory is considered an outlier and removed from the data set. The process is repeated until no dataset falls outside the 95% confidence interval.22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref] In this study, 10 participants provide a one-dimensional 1H NMR spectrum and three of them were identified as outlying by the proposed methodology.

Another approach to deal with multivariate data was proposed by Viant et al.44 Viant, M. R.; Bearden, D. W.; Bundy, J. G.; Burton, I. W.; Collette, T. W.; Ekman, D. R.; Ezernieks, V.; Karakach, T. K.; Lin, C. Y.; Rochfort, S.; De Ropp, J. S.; Teng, Q.; Tjeerdema, R.S.; Walter, J. A.; Wu, H.; Environ. Sci. Technol. 2009, 43, 219. [Crossref] It was proposed principal component analysis (PCA) to clustering participating results. The items test sent to laboratories were synthetic metabolite mixtures and European flounder liver extracts (biological) from clean and contaminated sites. The goal of the intercomparison exercise was to evaluate the effectiveness of 1H NMR metabolomics to generate comparable data sets from environmentally derived samples and each participant provides a one-dimensional spectrum. The associated PCA scores plots were used to visually assess if individual laboratories could reveal which metabolites discriminated the synthetic mixtures and biological samples. In both cases, the PCA approach allows concluding that there was a high degree of similarity across all laboratories. The comparability and precision of the laboratories participating were good, reflecting good results from NMR spectra.44 Viant, M. R.; Bearden, D. W.; Bundy, J. G.; Burton, I. W.; Collette, T. W.; Ekman, D. R.; Ezernieks, V.; Karakach, T. K.; Lin, C. Y.; Rochfort, S.; De Ropp, J. S.; Teng, Q.; Tjeerdema, R.S.; Walter, J. A.; Wu, H.; Environ. Sci. Technol. 2009, 43, 219. [Crossref]

Gallo et al.55 Gallo, V.; Intini, N.; Mastrorilli, P.; Latronico, M.; Scapicchio, P.; Triggiani, M.; Bevilacqua, V.; Fanizzi, P.; Acquotti, D.; Airoldi, C.; Arnesano, F.; Assfalg, M.; Benevelli, F.; Bertelli, D.; Cagliani, L. R.; Casadei, L.; Marincola, F. C.; Colafemmina, G.; Consonni, R.; Cosentino, C.; Davalli, S.; De Pascali, S. A.; D’aiuto, V.; Faccini, A.; Gobetto, R.; Lamanna, R.; Liguori, F.; Longobardi, F.; Mallamace, D.; Mazzei, P.; Menegazzo, I.; Milone, S.; Mucci, A.; Napoli, C.; Pertinhez, T.; Rizzuti, A.; Rocchigiani, L.; Schievano, E.; Sciubba, F.; Sobolev, A.; Tenori, L.; Valerio, M.; Anal. Chem. 2015, 87, 6709. [Crossref] have proposed a new performance index, named Qp-score, to assess the laboratory performance in multi-component analyses. Eight nuclear magnetic resonance signals (3 for aldicarb, 1 for methamidophos, 2 for oxadixyl, and 2 for pirimicarb) were obtained by 36 participants in interlaboratory comparison. It was proposed the following parameter: Qp=(aia¯)/σslope where ai is the slope of the calibration line determined by the i-th participant, a¯ is the consensus slope value, and σslope is the inter-laboratory standard deviation on slopes. The Qp -score is considered satisfactory when |Qp| ≤ 2, questionable when 2 < |Qp| < 3 and unsatisfactory when |Qp| ≥ 3. The proposed methodology allows classifying the reported results by participants in each NMR signal. In general, 9 laboratories were classified as unsatisfactory, 2 were classified as questionable and 25 were classified as satisfactory.55 Gallo, V.; Intini, N.; Mastrorilli, P.; Latronico, M.; Scapicchio, P.; Triggiani, M.; Bevilacqua, V.; Fanizzi, P.; Acquotti, D.; Airoldi, C.; Arnesano, F.; Assfalg, M.; Benevelli, F.; Bertelli, D.; Cagliani, L. R.; Casadei, L.; Marincola, F. C.; Colafemmina, G.; Consonni, R.; Cosentino, C.; Davalli, S.; De Pascali, S. A.; D’aiuto, V.; Faccini, A.; Gobetto, R.; Lamanna, R.; Liguori, F.; Longobardi, F.; Mallamace, D.; Mazzei, P.; Menegazzo, I.; Milone, S.; Mucci, A.; Napoli, C.; Pertinhez, T.; Rizzuti, A.; Rocchigiani, L.; Schievano, E.; Sciubba, F.; Sobolev, A.; Tenori, L.; Valerio, M.; Anal. Chem. 2015, 87, 6709. [Crossref]

Other authors reported multivariate data analysis by PCA in interlaboratory comparisons. Danzer et al.66 Danzer, K.; Wank, U.; Wienke, D.; Chemom. Intell. Lab. Syst. 1991, 12, 69. [Crossref], Henrion77 Henrion, R.; Chemom. Intell. Lab. Syst. 1994, 25, 1. [Crossref] and, Škrbić et al.88 Škrbić, B.; Héberger, K.; Ðurišić-Mladenović, N.; Anal. Bioanal. Chem. 2013, 405, 8363. [Crossref] suggested PCA to identify outlying laboratories in interlaboratory comparison. Minkkinen99 Minkkinen, P.; Chemom. Intell. Lab. Syst. 1995, 29, 263. [Crossref] proposes principal components score plot to visualize the variation of the results between and within laboratories. Heininger et al.1010 Heininger, P.; Pelzer, J.; Henrion, R.; Henrion, G.; Fresenius’ J. Anal. Chem. 1998, 360, 344. [Crossref`] have proposed PCA to group the laboratories and identify the type of method used to analyze the samples by each participants. Aoki et al.1111 Aoki, R.; Leão, D.; Bustamante, J. P. M.; Vilca, F.; J. Appl. Stat. 2021, 1. [Crossref] proposed multiple hypothesis testing to assess the equivalence among the laboratories measurements with respect to the reference laboratory. The authors suggest build a confidence regions between each participating and reference laboratory based on Wald statistic.1111 Aoki, R.; Leão, D.; Bustamante, J. P. M.; Vilca, F.; J. Appl. Stat. 2021, 1. [Crossref]

This paper proposes a new procedure to assess the performance of interlaboratory comparisons that are different from the methodologies mentioned above. A new methodology is derived from the concepts of outlying detection in two- and three-dimensional Euclidean space. The methodology is based on following steps.

Initially the data (Figure 1a) is arranged as matrix Xn×p where n represents the number of variables, that is, the chemical shift of the spectrum provided by i-th laboratory and p represents the number of participants in the interlaboratory comparison (Figure 1b).

Figure 1
The proposed methodology: (a) multivariate reported spectrum by each participating; (b) multivariate data set organized in matrix where the first column contains the chemical shift and the remaining columns contain the spectra reported by each participant; (c) multidimensional scaling combined with robust confidence ellipse (MDS/RCE 2D); (d) multidimensional scaling combined with robust confidence ellipsoid (MDS/RCE 3D).

The second step refers to performing a multidimensional scaling (MDS) on the multivariate data to transform them into a dimension that can be used to visualize the results of the laboratories in two or three dimensions. After that, a confidence limit based on a robust confidence ellipse/ellipsoid (RCE) is plotted to identify outlying laboratory result (Figures 1c and 1d). Points outside of robust confidence ellipse/ellipsoid provide evidence that laboratory result is an outlier with a specific confidence level (95%, for example). On the other hand, points inside of RCE are not considered outlying result with same confidence level.

Multidimensional scaling is a multivariate technique that allows revealing “hidden” structures in a multivariate data set.1212 Leeuw, J.; Mair, P.; J. Stat. Software 2009, 31, 1. [Crossref] In other words, it is a method that allows visualizing the similarity/dissimilarity among laboratories participating in an interlaboratory comparison which are represented as points in a two- or three-dimensional space.1313 Groenen, P. J. F.; van de Velden, M.; J. Stat. Software 2016, 73, 1. [Crossref] The proximity/distance between the points represents the similarity/dissimilarity among laboratories.1313 Groenen, P. J. F.; van de Velden, M.; J. Stat. Software 2016, 73, 1. [Crossref]

Robust confidence ellipse/ellipsoid is a multivariate analysis technique for outlier detection. A confidence region is built from the variance-covariance matrix of original data set, which allows identifying if there are laboratories that differ statistically from the others at some specific significance level.1414 ISO 13528: Statistical Methods for Use in Proficiency Testing by Interlaboratory Comparisons, ISO: Geneva, 2005.

The combination of these two techniques (Figure 1) constitutes the proposal for evaluating the performance of the results reported by the participants.

Methodology

Data set

The data used in this study can be obtained from the available data set described by Sheen et al.22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref] In this data set, an interlaboratory comparison was carried out to assess the effectiveness of 1H NMR metabolomics. Seven laboratories from the United States, one from Canada, one from the United Kingdom, and one from Australia participated in the intercomparison. Mixtures of synthetic metabolites and samples of biological origin from liver extracts of European flounder from clean and contaminated sites were analyzed.44 Viant, M. R.; Bearden, D. W.; Bundy, J. G.; Burton, I. W.; Collette, T. W.; Ekman, D. R.; Ezernieks, V.; Karakach, T. K.; Lin, C. Y.; Rochfort, S.; De Ropp, J. S.; Teng, Q.; Tjeerdema, R.S.; Walter, J. A.; Wu, H.; Environ. Sci. Technol. 2009, 43, 219. [Crossref]

The data described by Sheen et al.22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref] refers to adult female European flounder collected from Tyne in the United Kingdom. This is a biological sample selected from a polluted site. The fish were sacrificed, liver tissues were dissected, snap-frozen in liquid nitrogen, and stored at -80 °C until extraction by IC participating.44 Viant, M. R.; Bearden, D. W.; Bundy, J. G.; Burton, I. W.; Collette, T. W.; Ekman, D. R.; Ezernieks, V.; Karakach, T. K.; Lin, C. Y.; Rochfort, S.; De Ropp, J. S.; Teng, Q.; Tjeerdema, R.S.; Walter, J. A.; Wu, H.; Environ. Sci. Technol. 2009, 43, 219. [Crossref] The samples were extracted using methanol:chloroform:water method and Precellys-24 bead-based homogenizer (Stretton Scientific Ltd., U.K.).44 Viant, M. R.; Bearden, D. W.; Bundy, J. G.; Burton, I. W.; Collette, T. W.; Ekman, D. R.; Ezernieks, V.; Karakach, T. K.; Lin, C. Y.; Rochfort, S.; De Ropp, J. S.; Teng, Q.; Tjeerdema, R.S.; Walter, J. A.; Wu, H.; Environ. Sci. Technol. 2009, 43, 219. [Crossref]

Each participating laboratory obtained a one-dimensional 1H NMR spectrum. The spectra are reported as chemical shift frequencies with a range from 10.0 to 0.2 ppm. The region from 4.7 to 5.2 ppm was excluded due to water solvent suppression artifacts and the NMR spectra were renormalized. The spectra were binned with a bin width of 0.005 ppm, for a total of 1860 variables in each spectrum.22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref]

In this work, the multivariate techniques of multi­dimensional scaling and robust confidence ellipse/ellipsoid (Figures 1c and 1d) were explored to assess the performance of laboratories participating in a laboratory intercomparison classifying them as outlier or not. The basis for the development of this methodology is described below.

Multidimensional scaling

Let p be the number of different laboratories (Figure 1b) and δij the dissimilarity between laboratories i and j. The coordinates are gathered in the matrix Xn×p where n is the dimensionality of the solution to be specified. Thus, the column i of Xn×p provides the coordinates of the laboratory i (Figure 1b). Let dij(X) be the Euclidean distance (most used)1515 Agrafiotis, D. K.; Rassokhin D. N.; Lobanov, V. S.; J. Comput. Chem. 2001, 22, 488. [Crossref],1616 Morrison, A.; Ross, G.; Chalmers, M.; Inf. Vis. 2003, 2, 68. [Crossref] between columns i and j defined as:

(1) d i j ( X ) = ( s = 1 n ( x i s x j s ) 2 ) 1 / 2

which is the shortest distance between laboratories i and j. In the equation 1, xis is the spectrum of laboratory i and xjs is the spectrum of laboratory j.

The Euclidean distance is more favorable in visual representations because a more isotropic display is obtained using it.1717 Asan, U.; Ercan, S. In Computational Intelligence Systems in Industrial Engineering; Kahraman, C., ed.; Atlantis Press: Paris, France, 2012, ch. 14. [Link] The purpose of multidimensional scaling is to find an Xn×p matrix that dij(X) is equal to δij as much as possible.1313 Groenen, P. J. F.; van de Velden, M.; J. Stat. Software 2016, 73, 1. [Crossref]

To obtain this matrix, the least squares MDS model is used, which consists of minimizing the equation:

(2) σ 2 ( X ) = i = 2 p j = 1 i 1 w i j ( δ i j d i j ( X ) ) 2

where wij is a user defined weight which must be nonnegative. The minimization problem of σ22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref](X) is quite complex and it is necessary to use interactive algorithms to find the matrix Xn×p that minimizes σ22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref](X). The most used for the solution of this is the SMACOF algorithm.1212 Leeuw, J.; Mair, P.; J. Stat. Software 2009, 31, 1. [Crossref],1313 Groenen, P. J. F.; van de Velden, M.; J. Stat. Software 2016, 73, 1. [Crossref],1818 Bates, D.; Chambers, J.; Dalgaard, P.; Gentleman, R.; Hornik, K.; Ihaka, R.; Kalibera, T.; Lawrence, M.; Leisch, F.; Ligges, U.; Lumley, T.; Maechler, M.; Meyer, S.; Murrell, P.; Plummer, M.; Ripley, B.; Sarkar, D.; Lang, D. T.; Tierney, L.; Urbanek, S.; R Project, version 4.2.0; R Foundation, Austria, 2022.

Robust confidence ellipse

After applied MDS technique, the dimensionality data was reduced for two or three dimensions. The following step is identifying the outlying laboratories by robust confidence ellipse (Figure 1c).

The robust confidence ellipse is built from the matrix equation:

(3) μ ^ r o b + ( 2 F 2 ; ( n 1 ) ; ( 1 α ) ) U Q

where μ^rob=[X¯roby¯rob] is the vector of robust means; F2;(n-1);(1-α) is the quantile of the Fisher-Snedecor distribution with 2 and (n - 1) degrees of freedom and confidence level of (1 - α)%; Q is the Cholesky decomposition of the robust variance-covariance matrix Srob.1818 Bates, D.; Chambers, J.; Dalgaard, P.; Gentleman, R.; Hornik, K.; Ihaka, R.; Kalibera, T.; Lawrence, M.; Leisch, F.; Ligges, U.; Lumley, T.; Maechler, M.; Meyer, S.; Murrell, P.; Plummer, M.; Ripley, B.; Sarkar, D.; Lang, D. T.; Tierney, L.; Urbanek, S.; R Project, version 4.2.0; R Foundation, Austria, 2022.,1919 Kent, J. T.; Tyler, D. E.; Vard, Y.; Commun. Stat. Simul. Comput. 1994, 23, 441. [Crossref] Both μ^rob and Srob are estimated by the iterative process described in the next section.

The matrix U is the unit circle defined by

(4) U = [ cos ( a ) sin ( a ) ]

where a = [a1 … am] is a vector of size m (0 ≤ a ≤ 2π).

Robust means (μ^rob) and variance-covariance matrix (Srob)

The vector of robust means μ^rob and the variance-covariance matrix Srob, for bi-dimensional data set, mentioned in the previous sub-section, are fitted from the following iterative process.1919 Kent, J. T.; Tyler, D. E.; Vard, Y.; Commun. Stat. Simul. Comput. 1994, 23, 441. [Crossref] Let (x; y) a bi-dimensional data set.

(5) ( x ; y ) = [ x 1 y 1 x 2 y 2 x n y n ]

Step (i): μ^=[x¯y¯] and wi = 1 + p/υ ∀ i = 1, …, n where p is the number of variables and υ is degree of freedom of multivariate t distribution.

Step (ii): compute the matrix:

(6) X = [ ( x 1 x ¯ ) ( y 1 y ¯ ) ( x n x ¯ ) ( y n y ¯ ) ]

Step (iii): compute the singular value decomposition of matrix A, svd(A) = USVT, where:

(7) A = [ w 1 i = 1 n w i ( x 1 x ¯ ) w 1 i = 1 n w i ( y 1 y ¯ ) w n i = 1 n w i ( x n x ¯ ) w n i = 1 n w i ( y n y ¯ ) ]

Step (iv): compute the matrix:

(8) w = X V S = [ W 11 W 12 W n 1 w n 2 ]

where,

(9) S = [ 1 / s 1 1 / s 2 ]

Step (v): compute the vector Q = [Q1 … Qn] = [(w’11)22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref] + w’12)22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref] … (w’n1)22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref] + w’n2)]

Step (vi): compute the new weights:

(10) w i = ( v + p ) ( v + Q i ) i = 1 , , n

Step (vii): compute the robust vector of means:

(11) μ ^ rob = [ x ¯ rob y ¯ rob ] = [ i = 1 n w i x i i = 1 n w i i = 1 n w i y i i = 1 n w i ]

Step (viii): the new fitted vector μ^rob and the weights wi* are used to obtain new values in step (i) (μ^=μ^rob and wi = wi* ). This procedure is repeated until the values of wi* converge, that is, |wi - wi*| < ε.

Step (ix): compute the robust variance-covariance matrix Srob.1919 Kent, J. T.; Tyler, D. E.; Vard, Y.; Commun. Stat. Simul. Comput. 1994, 23, 441. [Crossref]

(12) S r o b = [ i = 1 n w i ( x i x ¯ r o b ) 2 n i = 1 n w i ( x i x ¯ r o b ) ( y i y ¯ r o b ) n i = 1 n w i ( x i x ¯ r o b ) ( y i y ¯ r o b ) n i = 1 n w i ( y i y ¯ r o b ) 2 n ]

Robust confidence ellipsoid

It is recommended to investigate through the robust confidence ellipsoid (Figure 1d) the possible presence of outliers that were not identified in the 2D analysis mentioned in previous sub-section.

The robust confidence ellipsoid is built from an equation analogous to the robust confidence ellipse:

(13) μ ^ r o b + ( 3 F 3 ; ( n 1 ) ; ( 1 α ) ) U Q

where μ^rob=[x¯roby¯robz¯rob] is the vector of robust means; F3;(n-1);(1-α) is the quantile of the Fisher-Snedecor distribution with 3 and (n - 1) degrees of freedom and confidence level of (1 - α)%; Q is the Cholesky decomposition of the robust variance-covariance matrix. Furthermore, μ^rob and Srob are estimated analogous to iterative process described in previous sub-section.

The m × 3 matrix U is the sphere of radius 1 defined by:

(14) U = [ cos ( θ ) sin ( φ ) sin ( θ ) sin ( φ ) cos ( φ ) ]

where θ = [θ1 … θm] is a vector of size m (0 ≤ θ ≤ 2π) and φ = [ϕ1 … ϕm] is a vector of size m (0 ≤ φ ≤ π).1818 Bates, D.; Chambers, J.; Dalgaard, P.; Gentleman, R.; Hornik, K.; Ihaka, R.; Kalibera, T.; Lawrence, M.; Leisch, F.; Ligges, U.; Lumley, T.; Maechler, M.; Meyer, S.; Murrell, P.; Plummer, M.; Ripley, B.; Sarkar, D.; Lang, D. T.; Tierney, L.; Urbanek, S.; R Project, version 4.2.0; R Foundation, Austria, 2022.

Software

All statistical analyses were performed using the R statistical software, an open-source free environment for statistical computing and graph creation.1818 Bates, D.; Chambers, J.; Dalgaard, P.; Gentleman, R.; Hornik, K.; Ihaka, R.; Kalibera, T.; Lawrence, M.; Leisch, F.; Ligges, U.; Lumley, T.; Maechler, M.; Meyer, S.; Murrell, P.; Plummer, M.; Ripley, B.; Sarkar, D.; Lang, D. T.; Tierney, L.; Urbanek, S.; R Project, version 4.2.0; R Foundation, Austria, 2022. The 2D plot analysis (Figure 1c) was built using the R packages (CAR and STATS).1818 Bates, D.; Chambers, J.; Dalgaard, P.; Gentleman, R.; Hornik, K.; Ihaka, R.; Kalibera, T.; Lawrence, M.; Leisch, F.; Ligges, U.; Lumley, T.; Maechler, M.; Meyer, S.; Murrell, P.; Plummer, M.; Ripley, B.; Sarkar, D.; Lang, D. T.; Tierney, L.; Urbanek, S.; R Project, version 4.2.0; R Foundation, Austria, 2022. In addition, an R code was built to define which points obtained in the multidimensional scaling technique were outside the robust confidence ellipse. The MASS R package1818 Bates, D.; Chambers, J.; Dalgaard, P.; Gentleman, R.; Hornik, K.; Ihaka, R.; Kalibera, T.; Lawrence, M.; Leisch, F.; Ligges, U.; Lumley, T.; Maechler, M.; Meyer, S.; Murrell, P.; Plummer, M.; Ripley, B.; Sarkar, D.; Lang, D. T.; Tierney, L.; Urbanek, S.; R Project, version 4.2.0; R Foundation, Austria, 2022. was necessary for this purpose.

Regarding the 3D analysis (Figure 1d), the plot was built using PLOTLY R package1818 Bates, D.; Chambers, J.; Dalgaard, P.; Gentleman, R.; Hornik, K.; Ihaka, R.; Kalibera, T.; Lawrence, M.; Leisch, F.; Ligges, U.; Lumley, T.; Maechler, M.; Meyer, S.; Murrell, P.; Plummer, M.; Ripley, B.; Sarkar, D.; Lang, D. T.; Tierney, L.; Urbanek, S.; R Project, version 4.2.0; R Foundation, Austria, 2022. which is a graphing library that makes interactive plots. It is worth pointing out that to build the robust confidence ellipsoid it was necessary to develop an R code based on the dataEllipse R function from CAR R package.1818 Bates, D.; Chambers, J.; Dalgaard, P.; Gentleman, R.; Hornik, K.; Ihaka, R.; Kalibera, T.; Lawrence, M.; Leisch, F.; Ligges, U.; Lumley, T.; Maechler, M.; Meyer, S.; Murrell, P.; Plummer, M.; Ripley, B.; Sarkar, D.; Lang, D. T.; Tierney, L.; Urbanek, S.; R Project, version 4.2.0; R Foundation, Austria, 2022. Moreover, an R code was built to identify outlying results. This code also depends on the MASS R package.1818 Bates, D.; Chambers, J.; Dalgaard, P.; Gentleman, R.; Hornik, K.; Ihaka, R.; Kalibera, T.; Lawrence, M.; Leisch, F.; Ligges, U.; Lumley, T.; Maechler, M.; Meyer, S.; Murrell, P.; Plummer, M.; Ripley, B.; Sarkar, D.; Lang, D. T.; Tierney, L.; Urbanek, S.; R Project, version 4.2.0; R Foundation, Austria, 2022.

Results and Discussion

The results of the participants (Figure 1a) were arranged in a matrix with 1860 rows and 10 columns (Figure 1b). The columns contain the results of the participants in which each column is the spectrum reported by the i-th participant. The lines represent the variables (chemical shift in ppm).

It should be noted that this is the first study combining multidimensional scaling and robust confidence ellipse/ellipsoid (MDS/RCE) to evaluate laboratory proficiency in interlaboratory comparison. In order to validate the methodology suggested in this article, the MDS/RCE results will be compared with multivariate z-score results obtained by Sheen et al.22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref]

In the proposed methodology, the two-dimensional multidimensional scaling provides a location in Euclidean space of each laboratory participating according to similarity/dissimilarity between 1H NMR spectra. Additionally, the two-dimensional robust confidence ellipse allows identifying if there is an outlier spectrum (Figure 1c). The Figure 2 presents the results obtained by two-dimensional approach.

Figure 2
Multidimensional scaling from 1H NMR spectra reported by participating and 95% robust confidence ellipse (2D plot).

It can be seen in Figure 2 that the laboratory 7042 is located out of ellipse. This provides evidence that its result differs statistically from other participants. At 5% significance level (Figure 2) the spectrum reported by laboratory 7042 is classified as an outlier when compared to other participants. On the other hand, laboratories that are located within the robust confidence ellipse do not differ statistically from each other. In this case, there is no evidence that the results reported by these participants are outliers.

The following step is analyzing the three-dimensional limit according to Figure 1d. The 3D plot allows seeing an outlier result that is “hidden” in the 2D plot. In other words, the three-dimensional approach provides more information about participating performances.

It can be seen from Figure 3 that more laboratories show results that differ from the others (at the 5% significance level) when compared to the bi-dimensional plot showed in Figure 2. Altogether there are three laboratories (0714, 7042, and 9541) out of ellipsoid which means that their results may be considered outliers, that is, statistically different from the others. The rest, inside the ellipsoid, do not differ statistically from each other. By an analogous reasoning followed for the univariate z-score suggested by ISO 13528:2015,11 ISO 13528: Statistical Methods for Use in Proficiency Testing by Interlaboratory Comparisons, ISO: Geneva, 2015. performance assessment by the MDS/RCE is considered acceptable when results are inside of ellipse/ellipsoid and unacceptable when results are outside.

Figure 3
Multidimensional scaling from 1H NMR spectra reported by participating and 95% robust confidence ellipsoid (3D plot).

The confidence level adopted to consider the reported result as an outlier was 95% in 2D (Figure 2) and 3D (Figure 3) plot analysis.

In some cases, two (Figure 1c) and three (Figure 1d) dimensional analysis can provide the same conclusions, however, there are situations where this does not occur (such as Figures 2 and 3). In this situation, the two-dimensional analysis differs from the three-dimensional analysis, thus the latter must be adopted because it offers a more comprehensive (and reliable) analysis of the participants’ results.

In the multivariate z-score approach suggested by Sheen et al.,22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref] the interspectral distance matrix Dk was obtained from the Kullback-Leibler, Mahalanobis, Hellinger and Jensen-Shannon distances. The values were fitted to a lognormal distribution. The choice of this probability distribution was based on the Q-Q plot. Additionally, according to the authors, the lognormal is the maximum entropy distribution for a specified mean and standard deviation.22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref]

Each Zi,k value is an indication of where the si,k spectrum is in relation to the others in the Sk cluster. In the case of the lognormal distribution, Zi,k(1/2) = 1 and Zi,k(0.95) ca. 5. In this context, Zi,k = 1 indicates that si,k is closer to the center of Sk while Zi,k greater than 5 indicates that si,k is outside the range 95% confidence in the conglomerate Sk.22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref]

The ||Ti,L|| values were fitted to a lognormal distribution and the projected statistical score was calculated for each data set. The scores calculated from the Kullback-Leibler, Mahalanobis, Hellinger, and Jensen-Shannon distances showed evidence that participants 0714, 7042 and 9541 reported results (spectra) are outliers.22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref]

The methodology based on multidimensional scaling combined with the robust confidence ellipse/ellipsoid (Figure 1) identified the same outlying participant results as the multivariate z-score method proposed by Sheen et al.22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref] The main advantage of the methodology proposed in this article is that it does not depend on choosing a probability distribution. This provides evidence that MDS/RCE has potential to be considered as performance evaluation method in interlaboratory comparisons.

Conclusions

In this paper, it was proposed a new approach to analyze multivariate data from interlaboratory comparisons. The methodology combines multidimensional scaling and robust confidence ellipse/ellipsoid to identify an outlying laboratory result. The results obtained by the methodology suggested in this work were compared with the results obtained by the multivariate z-score method described by Sheen et al.22 Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref] The MDS/RCE method found the same three outlying laboratories (0714, 7042, and 9541) identified in the multivariate z-score method. The proposed methodology has the advantage in relation to the latter of not depending on the choice, sometimes subjective, of a probability distribution.

This approach proved to be promising as a performance evaluation method for multivariate data analysis in interlaboratory comparison. It should be noted that this methodology can be used, in a similar way, to evaluate the performance of laboratories participating in proficiency testing scheme. In this approach, it is suggested classifying the participating laboratory results located outside of robust confidence ellipse/ellipsoid as unacceptable and inside as acceptable like univariate z-score showed in ISO 13528:2015.

The proposed method therefore constitutes a valuable tool that contribute to filling a gap in the literature regarding the multivariate data analysis in interlaboratory comparisons and proficiency trials. Other multivariate data analysis techniques like factor analysis and Kohonen’s self-organizing map, for example, are multivariate techniques that can be considered in future works for performance assessment tool in interlaboratory comparisons.

References

  • 1
    ISO 13528: Statistical Methods for Use in Proficiency Testing by Interlaboratory Comparisons, ISO: Geneva, 2015.
  • 2
    Sheen, D. A.; Rocha, W. F. C.; Lippa, K. A.; Bearden, D. W.; Chemom. Intell. Lab. Syst. 2017, 162, 10. [Crossref]
  • 3
    Hair Jr., J. F.; Black, W. C.; Babin, B. J.; Anderson, R. E.; Multivariate Data Analysis, 7th ed.; Prentice Hall: New Jersey, USA, 2009.
  • 4
    Viant, M. R.; Bearden, D. W.; Bundy, J. G.; Burton, I. W.; Collette, T. W.; Ekman, D. R.; Ezernieks, V.; Karakach, T. K.; Lin, C. Y.; Rochfort, S.; De Ropp, J. S.; Teng, Q.; Tjeerdema, R.S.; Walter, J. A.; Wu, H.; Environ. Sci. Technol. 2009, 43, 219. [Crossref]
  • 5
    Gallo, V.; Intini, N.; Mastrorilli, P.; Latronico, M.; Scapicchio, P.; Triggiani, M.; Bevilacqua, V.; Fanizzi, P.; Acquotti, D.; Airoldi, C.; Arnesano, F.; Assfalg, M.; Benevelli, F.; Bertelli, D.; Cagliani, L. R.; Casadei, L.; Marincola, F. C.; Colafemmina, G.; Consonni, R.; Cosentino, C.; Davalli, S.; De Pascali, S. A.; D’aiuto, V.; Faccini, A.; Gobetto, R.; Lamanna, R.; Liguori, F.; Longobardi, F.; Mallamace, D.; Mazzei, P.; Menegazzo, I.; Milone, S.; Mucci, A.; Napoli, C.; Pertinhez, T.; Rizzuti, A.; Rocchigiani, L.; Schievano, E.; Sciubba, F.; Sobolev, A.; Tenori, L.; Valerio, M.; Anal. Chem. 2015, 87, 6709. [Crossref]
  • 6
    Danzer, K.; Wank, U.; Wienke, D.; Chemom. Intell. Lab. Syst. 1991, 12, 69. [Crossref]
  • 7
    Henrion, R.; Chemom. Intell. Lab. Syst. 1994, 25, 1. [Crossref]
  • 8
    Škrbić, B.; Héberger, K.; Ðurišić-Mladenović, N.; Anal. Bioanal. Chem. 2013, 405, 8363. [Crossref]
  • 9
    Minkkinen, P.; Chemom. Intell. Lab. Syst. 1995, 29, 263. [Crossref]
  • 10
    Heininger, P.; Pelzer, J.; Henrion, R.; Henrion, G.; Fresenius’ J. Anal. Chem. 1998, 360, 344. [Crossref`]
  • 11
    Aoki, R.; Leão, D.; Bustamante, J. P. M.; Vilca, F.; J. Appl. Stat. 2021, 1. [Crossref]
  • 12
    Leeuw, J.; Mair, P.; J. Stat. Software 2009, 31, 1. [Crossref]
  • 13
    Groenen, P. J. F.; van de Velden, M.; J. Stat. Software 2016, 73, 1. [Crossref]
  • 14
    ISO 13528: Statistical Methods for Use in Proficiency Testing by Interlaboratory Comparisons, ISO: Geneva, 2005.
  • 15
    Agrafiotis, D. K.; Rassokhin D. N.; Lobanov, V. S.; J. Comput. Chem. 2001, 22, 488. [Crossref]
  • 16
    Morrison, A.; Ross, G.; Chalmers, M.; Inf. Vis. 2003, 2, 68. [Crossref]
  • 17
    Asan, U.; Ercan, S. In Computational Intelligence Systems in Industrial Engineering; Kahraman, C., ed.; Atlantis Press: Paris, France, 2012, ch. 14. [Link]
  • 18
    Bates, D.; Chambers, J.; Dalgaard, P.; Gentleman, R.; Hornik, K.; Ihaka, R.; Kalibera, T.; Lawrence, M.; Leisch, F.; Ligges, U.; Lumley, T.; Maechler, M.; Meyer, S.; Murrell, P.; Plummer, M.; Ripley, B.; Sarkar, D.; Lang, D. T.; Tierney, L.; Urbanek, S.; R Project, version 4.2.0; R Foundation, Austria, 2022.
  • 19
    Kent, J. T.; Tyler, D. E.; Vard, Y.; Commun. Stat. Simul. Comput. 1994, 23, 441. [Crossref]

Edited by

Editor handled this article: Maria Cristina Canela (Associate)

Publication Dates

  • Publication in this collection
    10 Mar 2023
  • Date of issue
    Mar 2023

History

  • Received
    02 May 2022
  • Published
    16 Sept 2022
Sociedade Brasileira de Química Instituto de Química - UNICAMP, Caixa Postal 6154, 13083-970 Campinas SP - Brazil, Tel./FAX.: +55 19 3521-3151 - São Paulo - SP - Brazil
E-mail: office@jbcs.sbq.org.br