Acessibilidade / Reportar erro

Inter-rater concordance study of the PASI (Psoriasis Area and Severity Index)

Abstracts

BACKGROUND: Developed in 1978, the PASI (Psoriasis Area and Severity Index) is a method to rate psoriasis severity. Since then, it has been criticized for being extremely long, complex and for yielding significantly divergent interpretations. OBJECTIVE: To determine the PASI inter-rater reliability among three independent evaluators examining the same patients. METHODS:Cross-sectional study conducted in 2007 at a psoriasis treatment center, where 20 patients were selected. The patients were evaluated by three postgraduate students of Dermatology who independently determined the PASI of each patient. The inter-rater reliability coefficient was determined by employing intraclass correlation coefficients (ICC) and Bland & Altman plots. RESULTS: An ICC of 0.729 (IC 95%: 0.440 - 0.882) showed a significant degree of concordance among evaluators. By graphic means, the Bland & Altman plot method presents concordance gap ranges. As shown in this study, narrow gap ranges indicate a good concordance degree. It was also observed that for high PASI values, wider concordance gap ranges predominate. CONCLUSION: Based on this analysis, we may conclude that the PASI is a reliable indicator of psoriasis severity because it shows significant concordance when independent evaluations are performed.

Cross-sectional studies; Psoriasis; Reproducibility of results


FUNDAMENTO: O índice da área e gravidade da psoríase foi método desenvolvido em 1978, para quantificar a gravidade da psoríase. Desde então, têm recebido algumas críticas, por ser longo, complexo e apresentar divergências na sua interpretação. OBJETIVO: Demonstrar o grau de concordância do índice da área e gravidade da psoríase aplicado em pacientes de psoríase por três avaliadores isoladamente. MATERIAL E MÉTODO: Estudo observacional, transversal, realizado no ambulatório de psoríase, no ano de 2007, onde foram selecionados 20 pacientes. Os pacientes foram avaliados por três pós-graduandos de dermatologia, que determinaram o índice da área e gravidade da psoríase, isoladamente. A fidedignidade interobservador foi avaliada pelo coeficiente de correlação intraclasses (ICC) e os gráficos de Bland & Altman. RESULTADOS: O ICC de 0,729 (IC 95%: 0,440 - 0,882) demonstrou que existe boa concordância entre os observadores. Pela análise dos gráficos de Bland e Altman, no máximo, duas diferenças estão fora dos limites de concordância, expressando boa concordância. Observou-se também que existe comportamento sistemático nas diferenças do índice da área e gravidade da psoríase em relação a sua média, segundo o qual mostrou para valores altos do índice da área e gravidade da psoríase, ou seja, um predomínio de diferenças maiores. CONCLUSÃO: Notamos, através deste estudo, que o índice da área e gravidade da psoríase apresenta bons critérios de confiabilidade quando houve a avaliação da gravidade da psoríase, mesmo que realizado por diferentes avaliadores.

Estudos transversais; Psoríase; Reprodutibilidade dos testes


INVESTIGATION

Inter-rater concordance study of the PASI (Psoriasis Area and Severity Index)*

Joana Ribeiro Costa de FariaI; Aline Rezende AarãoII; Luiz Miguel Zabaleta JimenezIII; Oscar Hernández SilvaIV; João Carlos Regazzi AvelleiraV

IMD, Postgraduate student of Dermatology, Prof. Rubem David Azulay Institute of Dermatology, Santa Casa de Misericórdia do Rio de Janeiro - Rio de Janeiro (RJ), Brazil

IIMD, Postgraduate student of Dermatology, Prof. Rubem David Azulay Institute of Dermatology, Santa Casa de Misericórdia do Rio de Janeiro - Rio de Janeiro (RJ), Brazil

IIIMD, Postgraduate student of Dermatology, Prof. Rubem David Azulay Institute of Dermatology, Santa Casa de Misericórdia do Rio de Janeiro - Rio de Janeiro (RJ), Brazil

IVMD, Postgraduate student of Dermatology, Prof. Rubem David Azulay Institute of Dermatology, Santa Casa de Misericórdia do Rio de Janeiro - Rio de Janeiro (RJ), Brazil

VPh.D. in Medicine; Associate Professor at Prof. Rubem David Azulay Institute of Dermatology, Santa Casa de Misericórdia do Rio de Janeiro - Rio de Janeiro (RJ), Brazil

Mailing address

ABSTRACT

BACKGROUND: Developed in 1978, the PASI (Psoriasis Area and Severity Index) is a method to rate psoriasis severity. Since then, it has been criticized for being extremely long, complex and for yielding significantly divergent interpretations.

OBJECTIVE: To determine the PASI inter-rater reliability among three independent evaluators examining the same patients.

METHODS: Cross-sectional study conducted in 2007 at a psoriasis treatment center, where 20 patients were selected. The patients were evaluated by three postgraduate students of Dermatology who independently determined the PASI of each patient. The inter-rater reliability coefficient was determined by employing intraclass correlation coefficients (ICC) and Bland & Altman plots.

RESULTS: An ICC of 0.729 (IC 95%: 0.440 - 0.882) showed a significant degree of concordance among evaluators. By graphic means, the Bland & Altman plot method presents concordance gap ranges. As shown in this study, narrow gap ranges indicate a good concordance degree. It was also observed that for high PASI values, wider concordance gap ranges predominate.

CONCLUSION: Based on this analysis, we may conclude that the PASI is a reliable indicator of psoriasis severity because it shows significant concordance when independent evaluations are performed.

Keywords: Cross-sectional studies; Psoriasis; Reproducibility of results

INTRODUCTION

Psoriasis is a chronic inflammatory disease that involves the skin and articulations. It is universally distributed and affects from 1 to 3% of the population. 1 Most cases are benign and recurrent, but the disease can be physically or socially debilitating. 2

The Psoriasis Area Severity Index (PASI) is a tool developed in 1978 by Fredricksson and Pettersson to evaluate the severity of psoriasis. This index analyzes the four regions of the body (head, trunk, upper and lower limbs) in relation to erythema, induration (thickness), desquamation (scaling) of the plaques, and body surface area (BSA) affected. 3 Scores from 0 to 4 are attributed to erythema, induration and scaling, based on their intensity (0-absent, 1mild, 2-moderate, 3-intense, 4-severe). The BSA affected is expressed in percentage (1 to 100%), with corresponding scores that range from 1 to 6. Therefore, the score of 1 means less than 10% of area involved; 2 means from 10-29%; 3 from 30-49%; 4 from 50-69%; 5 from 70-89%; 6 from 90-100%.

The PASI is the most used tool in clinical trials, but it has been criticized for its non-linear scale, complexity, and length. 2

The PASI scores that classify psoriasis as severe vary constantly. To some authors, the severity of the PASI has been defined as > 18, while others suggest > 12. Recently, the clinical guidelines of the British Association of Dermatologists have indicated that eligible patients for the use of biological agents should have a PASI greater than 10 and a DLQI (Dermatologic Life Quality Index) greater than 15. Other authors argue that a DLQI > 10 suggests moderate/severe disease in the "rule of tens". 4

The result of the PASI may vary from 0 to 72, although most patients fall between 0-153 and very few individuals are classified above this value range.

OBJECTIVE

To investigate the inter-rater reliability of the PASI used in clinical trials for assessing disease severity; three researchers with similar knowledge and experience separately evaluated the patients.

MATERIAL AND METHODS

This was an observational, clinical-epidemiological, transversal study conducted at a psoriasis ambulatory, from August to October 2007. Twenty patients were randomly selected, aged from 15 to 70 years, with mild, moderate, and severe psoriasis.

Three first-year postgraduate students of Dermatology evaluated the patients and determined their PASI separately. Next, results were obtained and the indexes attributed to each patient were compared, according to each rater.

The inter-rater reliability of the PASI was evaluated based on two different aspects. First, by the intraclass correlation coefficient (ICC),5 which allowed the assessment of the consistency or reproducibility of the measurements; then, by Bland & Altman plots, 5 which made possible a graphic analysis of the differences observed among the raters. The level of significance adopted was 0.05. The statistical analysis was done with the software package Statistical Product and Service Solutions (version 13.0 for windows, SPSS, Inc.).

RESULTS

First, we outlined the general profile of the case study, showing the mean value of the PASI for the 20 patients studied, according to each rater (Table 1).

Analysis of inter-rater reliability was done by intraclass correlation coefficient (ICC), which verified if there is significant agreement among the three raters in the PASI. Table 2 presents the ICC, the confidence interval (CI 95%) and its respective significance level (p value) for each pair of raters.

We should note that the closer to (1) the ICC is, the stronger (or more perfect) the agreement among the raters is; the closer to (0), the greater the discrepancy.

Therefore:5

ICC £ 0.20 no agreement

0.20 < ICC £ 0.40 weak agreement

0.40 < ICC £ 0.60 moderate agreement

0.60 < ICC £ 0.80 good agreement

ICC > 0.80 excellent agreement

Our results show that there is good agreement among the three raters; raters 2 and 3 showed excellent agreement, with an ICC of 0.817 (CI 95%: 0.601-0.923).

Agreement among the raters verified by Bland & Altman plots assessed the dispersion of the differences among the raters against its respective mean value. The amplitude of the agreement intervals indicated the quality of the concordance among the raters. In other words, relatively wide intervals express weak agreement and narrow intervals indicate good agreement among them. Moreover, we can identify a systematic behavior of the differences that may suggest an under or overestimation in relation to the second observer. The limits (upper and lower) correspond to the limits of agreement (Table 3, Graphs 1-3).


By perceptively analyzing the graphs, we may note:

The mean of the differences among the three raters is very close to zero (less than two points), suggesting that the raters are replicating one another.

Only two differences are beyond the limits of agreement, expressing good quality of concordance, even though the limits are not very narrow.

There is a systematic behavior of the differences of the PASI in relation to its mean, which showed a predominance of greater differences for higher values of the PASI and of smaller ones for lower values. In other words, the raters showed greater differences for more severe patients. This is a negative aspect because the expected behavior is a random distribution of the differences around the general mean.

The best agreement, based on Bland & Altman plots, was between raters 2 and 3, consistent with the ICC analysis.

DISCUSSION

Despite the importance of evaluating the evolution of psoriasis, there is no consensus about the more appropriate indexes and, unfortunately, there are very few randomized clinical trials that prove the reliability of the methods employed. 6

The PASI is the most commonly used tool; however, it is not fully satisfactory in terms of validity and reliability. 6

"Validity" may be defined as the capacity of the scale to assess what it is supposed to measure. "Reliability" is the extension to which the same results are obtained under the same conditions. Important reliability measurements for severity scales include inter and intrarater reliability. 4

Inter-rater reliability represents the level of agreement among different raters applying the scale to assess the same patient. 4

Intra-rater reliability is the degree of agreement among multiple repetitions of a test performed by a single rater. 4

In this study we noticed that there is agreement among the raters regarding the application of the method, but greater differences predominate for higher PASI scores. Nonetheless, since the PASI scores that indicate severity are between 10 and 20, the decision about the need for a more efficient therapy for severe patients would not be affected.

It is possible that studies that test the therapeutic efficiency of new drugs to treat patients with very high PASI scores show differences when conducted by different raters.

Even though the PASI is one of the main tools to evaluate the severity of psoriasis, the index has its limitations, especially when used as a single parameter to orient the medical decision.

The PASI does not efficiently translate a significant reduction of the erythema, scaling and infiltration of lesions when there is no modification of the body surface area (BSA) affected. Other symptoms and signs of the severity of psoriasis should be considered in the evaluation: pruritus, pain or burning on the site of lesions, duration of remission, and presence of articular disease.

Last, it is important to mention that the PASI identifies only the physical characteristics of the lesions and the area affected by psoriasis, but it does not take into account the perception that the patients have of the disease and the impact it has on their quality of life. 7

CONCLUSION

The PASI shows good reliability criteria in the evaluation of the severity of psoriasis, despite the fact that it may present some differences of interpretation for patients with higher PASI scores.

We emphasize the need for other variables to be assessed, such as the presence of psoriatic arthritis, the psychosocial aspect of the disease and its impact on the individual's quality of life. These variables, together with the PASI, would influence the therapeutic decision of the assistant physician.

REFERENCES

  • 1. Azulay RD, Azulay DR. Dermatologia. 4 ed. Rio de Janeiro: Guanabara Koogan; 2006.
  • 2. Carlin CS, Feldman SR, Krueger JG, Menter A, Krueger GG. A 50% reduction in the Psoriasis Area and Severity Index (PASI 50) is a clinically significant endpoint in the assessment of psoriasis. J Am Acad Dermatol. 2004; 50:859-66.
  • 3. Jacobson CC, Kimball AB. Rethinking the psoriasis area and severity index: the impact of area should be increased. Br J Dermatol. 2004; 151:381-7.
  • 4. Berth-Jones J, Grotzinger K, Rainville C, Pham B, Huang J, Daly S, et al. A study examining inter- and intrarrater reliability of three scales for measuring severity of psoriasis: Psoriasis Area and Severity Index, Physician's Global Assessment and Lattice System Physician's Global Assessment. Br J Dermatol. 2006; 155:707-13.
  • 5. Bartko JJ, Carpenter WT Jr. On the methods and theory of reliability. J Nerv Ment Dis. 1976;163:307-17.
  • 6. Langley RG, Ellis CN. Evaluating psoriasis with Psoriasis Area and Severity Index, Psoriasis Global Assessment, and Lattice System Physician's Global Assessment. J Am Acad Dermatol. 2004; 51:563-9.
  • 7. Grob JJ. Why are quality of life instruments not recognized as reference measures in therapeutic trials of chronic skin disorders? J Invest Dermatol. 2007; 127:2299-301.
  • Endereço para correspondência:
    Joana Ribeiro Costa de Faria
    Rua Belfort Roxo, 407 - aptº 601 - Copacabana
    22020-010 Rio de Janeiro - RJ
    Tel: 021 2275 7031 Cel: 021 8159 5206 Fax: 021) 2220 1928(IDPRDA)
    E-mail:
  • *
    Trabalho realizado no Instituto de Dermatologia Prof. Rubem David Azulay da Santa Casa de Misericórdia do Rio de Janeiro - Rio de Janeiro (RJ), Brasil.
  • Publication Dates

    • Publication in this collection
      02 Dec 2010
    • Date of issue
      Oct 2010

    History

    • Accepted
      18 Dec 2008
    • Received
      01 Sept 2008
    Sociedade Brasileira de Dermatologia Av. Rio Branco, 39 18. and., 20090-003 Rio de Janeiro RJ, Tel./Fax: +55 21 2253-6747 - Rio de Janeiro - RJ - Brazil
    E-mail: revista@sbd.org.br