Acessibilidade / Reportar erro

Inter- and intra-rater reliability of swimming teachers with different skill levels, in different conditions, evaluating front crawl arm movement in non-expert swimmers

Abstract

Aim:

This study aimed (i) to test the inter-rater reliability of swimming teachers, (ii) to test the swimming teacher's discussion effect on inter-rater reliability and (iii) to verify the intra-rater swimming teacher's reliability.

Method:

Twenty-one learning swimmers (14.1 ± 5.1 years old) performed two 25-m front crawl courses at a comfortable speed without breathing between the sixth and 20-m, and had their displacements captured on film. Three swimming teachers with different academic backgrounds and skills evaluated the swimmer right upper limb using a 20-items checklist. In the 1st-step, teachers assessed 20-items and in 2nd-step discussed their particular evaluating criteria - selecting five items considered as the most relevant. The inter- and intra-rater reliability were tested using the Fleiss Kappa Coefficient.

Results:

In the 1st-step substantial reliability was found for item three and in movement descriptor for items three and 20. Nearly perfect reliability was found in the movement descriptor for item 13. In 2nd-step, moderate reliability was found only in the movement descriptor for item 20. Only the most experienced evaluator showed substantial intra-rater reliability for items four and 10 and moderate for item 20.

Conclusion:

The proposed discussion method did not cause the expected effect on inter-rater reliability. The swimming teacher with a higher degree and swimming skills showed better intra-rater reliability. Some items and movement descriptors proposed at the 20-items checklist can be used in practical settings.

Keywords:
swimming; sport; evaluation; pedagogy; teaching

Introduction

At any swimming level learning is affected by the interaction of several components, including teacher/coach action11. Lees A. Technique analysis in sports: a critical review. J Sports Sci. 2002; 20(10): 813-28.2. Gonzalez DH, Chiviacowsky S. Relatedness support enhances motor learning. Psychol Res. 2018; 82(3): 439-47.-33. Wulf G, Shea C, Lewthwaite R. Motor skill learning and performance: a review of influential factors. Medical Education. 2010; 44(1): 75-84.. Teachers, coaches, and scientists use different observational methods to evaluate and implement technique11. Lees A. Technique analysis in sports: a critical review. J Sports Sci. 2002; 20(10): 813-28.,44. Federolf P, Reid R, Gilgien M, Haugen P, Smith G. The application of principal component analysis to quantify technique in sports. Scand J Med Sci Sports. 2014; 24(3): 491-9.,55. Potdevin F, Vors O, Huchez A, Lamour M, Davids K, Schnitzler C. How can video feedback be used in physical education to support novice learning in gymnastics? Effects on motor learning, self-assessment, and motivation. Phys Educ Sport Pedagogy. 2018; 23(6): 559-74.. During swimming teaching, this evaluation usually takes a qualitative and subjective nature11. Lees A. Technique analysis in sports: a critical review. J Sports Sci. 2002; 20(10): 813-28.,66. Wizer R, Franken M, Castro FAdS. Concordância intra e inter-observador de protocolo de avaliação de habilidades aquáticas de crianças. R. Bras. Ci. e Mov. 2016; 24(1): 101-7.,77. Madureira F, Gollegã DG, Rodrigues HF, de Oliveira TAC, Dubas JP, Freudenheim AM. Validação de um instrumento para avaliação qualitativa do nado “Crawl”. Rev Bras Educ Fís Esporte. 2008; 22(4): 273-84., where teachers/coaches with more knowledge and experience in the observational process tend to be more competent77. Madureira F, Gollegã DG, Rodrigues HF, de Oliveira TAC, Dubas JP, Freudenheim AM. Validação de um instrumento para avaliação qualitativa do nado “Crawl”. Rev Bras Educ Fís Esporte. 2008; 22(4): 273-84.8. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12.-99. Rosado A, Virtuoso L, Mesquita I. Relação entre as competências de diagnóstico de erros das habilidades técnicas e a prescrição pedagógica no voleibol. Rev Bras Educ Fís Esporte. 2004; 18(2): 151-7.. In an attempt to carry out a proper evaluation process, the literature suggests the analysis of the main components of movement44. Federolf P, Reid R, Gilgien M, Haugen P, Smith G. The application of principal component analysis to quantify technique in sports. Scand J Med Sci Sports. 2014; 24(3): 491-9. using checklists66. Wizer R, Franken M, Castro FAdS. Concordância intra e inter-observador de protocolo de avaliação de habilidades aquáticas de crianças. R. Bras. Ci. e Mov. 2016; 24(1): 101-7.,77. Madureira F, Gollegã DG, Rodrigues HF, de Oliveira TAC, Dubas JP, Freudenheim AM. Validação de um instrumento para avaliação qualitativa do nado “Crawl”. Rev Bras Educ Fís Esporte. 2008; 22(4): 273-84.,1010. Soares S, Fernandes R, Carmo C, Santos Silva J, Vilas-Boas J. Avaliação qualitativa da técnica em natação: apreciação da consistência de resultados produzidos por avaliadores com experiência e formação similares. Rev. Port. Ciênc. Desporto. 2001; 1(3): 22-32..

The inter and intra-rater reliability tests are common approaches that enable the satisfactory use of a checklist by different teachers and for the same teacher over time66. Wizer R, Franken M, Castro FAdS. Concordância intra e inter-observador de protocolo de avaliação de habilidades aquáticas de crianças. R. Bras. Ci. e Mov. 2016; 24(1): 101-7.,1010. Soares S, Fernandes R, Carmo C, Santos Silva J, Vilas-Boas J. Avaliação qualitativa da técnica em natação: apreciação da consistência de resultados produzidos por avaliadores com experiência e formação similares. Rev. Port. Ciênc. Desporto. 2001; 1(3): 22-32.,1111. Nadeau L, Richard J-F, Godbout P. The validity and reliability of a performance assessment procedure in ice hockey. Phys Educ Sport Pedagogy. 2008; 13(1): 65-83.. This reliability refers to the condition of reproducing similar measures on different occasions1111. Nadeau L, Richard J-F, Godbout P. The validity and reliability of a performance assessment procedure in ice hockey. Phys Educ Sport Pedagogy. 2008; 13(1): 65-83.,1212. Currell K, Jeukendrup AE. Validity, reliability, and sensitivity of measures of sporting performance. Sports Med. 2008; 38(4): 297-316., being an important procedure in ensuring that the results do not have significant differences66. Wizer R, Franken M, Castro FAdS. Concordância intra e inter-observador de protocolo de avaliação de habilidades aquáticas de crianças. R. Bras. Ci. e Mov. 2016; 24(1): 101-7.,88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12.,1010. Soares S, Fernandes R, Carmo C, Santos Silva J, Vilas-Boas J. Avaliação qualitativa da técnica em natação: apreciação da consistência de resultados produzidos por avaliadores com experiência e formação similares. Rev. Port. Ciênc. Desporto. 2001; 1(3): 22-32.. Furthermore, the inter-rater experiences should not influence the results, although the literature indicates that more experienced evaluators have better competences88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12.,1010. Soares S, Fernandes R, Carmo C, Santos Silva J, Vilas-Boas J. Avaliação qualitativa da técnica em natação: apreciação da consistência de resultados produzidos por avaliadores com experiência e formação similares. Rev. Port. Ciênc. Desporto. 2001; 1(3): 22-32.. This is important because from this assessment that learning exercises will be proposed55. Potdevin F, Vors O, Huchez A, Lamour M, Davids K, Schnitzler C. How can video feedback be used in physical education to support novice learning in gymnastics? Effects on motor learning, self-assessment, and motivation. Phys Educ Sport Pedagogy. 2018; 23(6): 559-74.,1010. Soares S, Fernandes R, Carmo C, Santos Silva J, Vilas-Boas J. Avaliação qualitativa da técnica em natação: apreciação da consistência de resultados produzidos por avaliadores com experiência e formação similares. Rev. Port. Ciênc. Desporto. 2001; 1(3): 22-32.,1111. Nadeau L, Richard J-F, Godbout P. The validity and reliability of a performance assessment procedure in ice hockey. Phys Educ Sport Pedagogy. 2008; 13(1): 65-83..

We recognize the importance of the results reported in the literature, but some checklists emphasize only movement errors, suggesting that there is only one rigid way to swim77. Madureira F, Gollegã DG, Rodrigues HF, de Oliveira TAC, Dubas JP, Freudenheim AM. Validação de um instrumento para avaliação qualitativa do nado “Crawl”. Rev Bras Educ Fís Esporte. 2008; 22(4): 273-84.. Indeed, the checklists do not (i) include the range of movements that can be performed by learners77. Madureira F, Gollegã DG, Rodrigues HF, de Oliveira TAC, Dubas JP, Freudenheim AM. Validação de um instrumento para avaliação qualitativa do nado “Crawl”. Rev Bras Educ Fís Esporte. 2008; 22(4): 273-84.,1010. Soares S, Fernandes R, Carmo C, Santos Silva J, Vilas-Boas J. Avaliação qualitativa da técnica em natação: apreciação da consistência de resultados produzidos por avaliadores com experiência e formação similares. Rev. Port. Ciênc. Desporto. 2001; 1(3): 22-32., and (ii) evaluators did not discuss their observation and evaluation criteria. Therefore, it is crucial to a movement checklist whose focus is based on comprehensive and accurate descriptors44. Federolf P, Reid R, Gilgien M, Haugen P, Smith G. The application of principal component analysis to quantify technique in sports. Scand J Med Sci Sports. 2014; 24(3): 491-9.,1313. Mendes R, Clemente F, Rocha R, Damásio A. Observação como instrumento no processo de avaliação em educação física. Exedra Journal. 2012; 6(57-69).,1414. Knudson D, Morrison CS. Análise qualitativa do movimento humano. 1 ed. São Paulo, SP: Manole, 2001.. Little comprehensive criteria may not contemplate all the possibilities of movement, especially in the beginners. These two conditions make inter- and intra-rater reliability difficult and make the use of checklist unviable as a pedagogical tool1010. Soares S, Fernandes R, Carmo C, Santos Silva J, Vilas-Boas J. Avaliação qualitativa da técnica em natação: apreciação da consistência de resultados produzidos por avaliadores com experiência e formação similares. Rev. Port. Ciênc. Desporto. 2001; 1(3): 22-32.. Additionally, teachers use particular and subjective assessment criteria that differ for observation and evaluation moment88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12.,1313. Mendes R, Clemente F, Rocha R, Damásio A. Observação como instrumento no processo de avaliação em educação física. Exedra Journal. 2012; 6(57-69).. Then, the discussion before evaluation can match observation points among evaluators and as far as we know, this has never been tested.

Accordingly, for the present study, three objectives were determined: (i) to test inter-rater reliability for a 20-item checklist and its movement descriptors; (ii) to test the effect of a discussion among teachers on selecting the most relevant items to assess and the own observation/evaluation criteria used, on the inter-rater reliability; and (iii) to check intra-rater reliability for the pre and post-intervention moments. We hypothesize that the inter-rater reliability would increase due to the discussion among teachers.

Materials and Methods

Three swimming teachers with different academic backgrounds and experience time (Table 1) were responsible for evaluating 21 swimming learners (average age 14.1 ± 5.1 years) from diverse swimming technical level. They did not have previous experiences with the checklist but were informed about how to use it. Non-expert swimmers were randomly selected from two swim clubs that have been training for at least 1 year. They could swim short distances of front crawl (up to 50-m) and some of them could swim backstroke. All participants received oral and written instructions about the study and participated voluntarily. Participants and their parents, when necessary, signed the Informed Consent Form. The Human Research Ethics Committee approved this study under number 2.486.694.

Table 1
Degree, years of swimming experience, and expertise area of the evaluators.

The right upper limb movement during the front crawl was assessed from a checklist and it was chosen because it is considered the most propulsive action88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12.,1515. Silveira RP, de Souza Castro FA, Figueiredo P, Vilas-Boas JP, Zamparo P. The effects of leg kick on swimming speed and arm-stroke efficiency in the front crawl. Int J Sports Physiol Perform. 2017; 12(6): 728-35.. The complete checklist had 20-items (upper limb phases and sub-phases)1616. Chollet D, Chalies S, Chatard J. A new index of coordination for the crawl: description and usefulness. Int J Sports Med. 2000; 21(1): 54-9.,1717. Maglischo EW. Nadando o mais rápido possível. 3 ed. Barueri, SP: Manole, 2010. and different movement descriptors (MD), as shown in Chart 1. Due to the particularity of the movement, not all items had the same amount of MD. The MD is the possibility of the right upper limb movement during the front crawl.

Chart 1
Items and movement descriptors (MD) of the right upper limb movement proposed for swimmer's assessment when performing the front crawl swim.

Each swimmer performed two 25-m front crawl courses at a comfortable velocity without breathing between the sixth and 20-m to maintain standardization of the swimming technique and avoiding possible technical changes1818. Lerda R, Cardelli C. Breathing, and propelling in crawl as a function of skill and swim velocity. Int J Sports Med. 2003; 24(01): 75-80.,1919. Psycharakis SG, McCabe C. Shoulder and hip roll differences between breathing and non-breathing conditions in front crawl swimming. J Biomech. 2011; 44(9): 1752-6.. The swimming movements were captured on film in the sagittal (submerged and out of water) and frontal (submerged and out of water) planes in the central 10-m of the swimming pool, with a sports camera (Garmin, Virb Elite HD; frequency: 60Hz) and, when necessary, using a watertight box (Garmin). For recording out of the water the camera was stabilized by a tripod at 0.80 m high. To obtain underwater images the camera was kept at a depth of 0.45 m. The sagittal plane recording was performed about 5-m away from the swimmer, allowing the visualization of two-stroke cycles. The videos were later digitized using video editing software (Final Cut Pro).

Step 1 - Inter-rater evaluation

The evaluator received an e-mail link containing the swimmers' videos identified by letters and for three weeks evaluated them using the checklist presented in Chart 1. Each evaluator used his equipment, resources, criteria, and method for movement evaluation. The swimmers' videos could be viewed as many times as needed and without contact with the other evaluators.

Step 2 - Inter-rater evaluation after intervention meeting

Thirty days after the end of the first step, the evaluators and one of the researchers met for about 60 min to discuss the swimmers' evaluation process. Initially all items and MD were discussed.

After that, items 4, 10, 11, 15, and 20, were considered by evaluators as the most relevant for the assessment of upper limb movement in front crawl. Subsequently, these five items were again discussed among evaluators in order to align the observation and evaluation criteria. Movement interpretations from different observational planes and the possibility of slow-motion speed views were also discussed.

After the meeting, the evaluators received another e-mail link containing the swimmers' videos identified by letters in a different order from step 1. This time, they had two weeks to perform the new evaluation with five selected items. Again, the swimmers' videos could be viewed as many times as needed and without contact with the other evaluators.

Step 3 - Intra-rater evaluation

This step consisted of the evaluation of intra-rater reliability and was performed only for the five items selected in the second step of the study, considering the results of the first and second steps.

Statistical analysis

The MD used in the present study can be classified as nominal. The general level (item) of inter-rater (1st and 2nd step) and intra-rater (3rd step) reliability was tested using the Fleiss' kappa reliability coefficient (k). Using Fleiss' Kappa it was also possible to obtain the inter-rater reliability level for each of the MD. Fleiss' Kappa ranges from +1 (perfect reliability) to 0 (reliability equal to chance) and goes to -1 (complete unreliability)2020. Field A. Descobrindo a estatística usando o SPSS. 1 ed. Porto Alegre: Artmed, 2009.. It is suggested that results between 0-0.2 show very little reliability, between 0.21-0.40 poor reliability, between 0.41-0.60 moderate reliability, between 0.61-0.80 substantial reliability and value above 0.80 as a nearly perfect reliability2121. Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977: 159-74.. In all cases, a significance level of 95% was considered, and a reliability level above 0.41 (moderate) was considered relevant2020. Field A. Descobrindo a estatística usando o SPSS. 1 ed. Porto Alegre: Artmed, 2009..

Results

Step 1 - Inter-rater evaluation

The inter-rater reliability for the 20-items indicates that there was moderate reliability for MD1 (The hand enters the water at the shoulder line, but very close to the head) and MD3 (The hand enters the water at the shoulder line and in front of the head.) of item 1 (how does hand enter in the water?); for MD1 (the hand is directed straight downward after touching the water) of item 4 (which does hand direction when it enters in the water?); on both MD and general item 5 (How is the arm positioned after the hand touches the water?); for MD1 (it performs lateral movements), MD3 (it keeps unmoved - when there is no breathing) and general item 8 (Which kind of movement the head performs throughout the entry and the glide of the hand in the water?); and for general item 13 (how is the hand trajectory throughout the pull and push phase?).

Substantial reliability has also been found for general item 3 (Which part of the hand touches the water first?) and their MD1 (the fingertips are not the first part to touch the water) and MD2 (the fingertips are the first part to touch the water); and for MD4 (it is flexed, elevated, away from water and above hand level) of item 20 (how is elbow angle throughout arm recovery?).

Nearly perfect reliability was found in MD1 (the hand crosses the midline of the trunk to the opposite side) of item 13 (how is the hand trajectory throughout the pull and push phase?); and MD3 (it is at full or almost complete extension, and far from the water level) of item 20 (how is elbow angle throughout arm recovery?). The complete result for the first step of the study is presented in Table 2.

Table 2
Inter-rater reliability from the initial 20-items checklist and its movement descriptors (MD) of the high upper limb in front crawl proposed in the first step.

Step 2 - Inter-rater evaluation after intervention meeting

The five items proposed in the second step of the study (after the discussion of observation and evaluation criteria) by evaluators showed moderate reliability only for MD2 (the elbow is at full or almost complete extension near the water level) and MD3 (the elbow is at full or almost complete extension, and far from the water level) of item 20 (how is elbow angle throughout arm recovery?). The complete result of the second step of the study is presented in Table 3.

Table 3
Inter-rater reliability from the five items checklist and its movement descriptors (MD) of the high upper limb in front crawl proposed in the second step.

Step 3 - Intra-rater evaluation

Evaluator A showed moderate internal reliability only in the evaluation of item 20 (how is elbow angle throughout arm recovery?). Evaluator B did not present at least one item with moderate reliability. On the other hand, evaluator C showed substantial reliability for items 4 (What is the direction of the hand when it enters the water?) and 10 (How is elbow positioned at the catch phase?), and moderate reliability for item 20 (how is elbow angle throughout arm recovery?). The result of intra-rater reliability is presented in Table 4.

Table 4
Intra-rater reliability from the five items checklist and its movement descriptors (MD) of the high upper limb in front crawl proposed in the third step.

Discussion

This study tested inter and intra-rater reliability of the upper limb technique in front crawl using a checklist. In this sense, the inter-rater reliability was initially evaluated for a 20-items checklist. After the intervention, to align the observation and evaluation criteria, the evaluators chose five items they considered most relevant and the inter-rater reliability was tested again. Finally, intra-rater reliability was tested for both evaluations. The study showed that only four items proposed on the initial checklist and 12 movement descriptors showed at least moderate reliability (Step 1). Moreover, none of the five items chosen, and only two of their movement descriptors had adequate inter-rater reliability (Step 2). Thus, the proposed intervention did not have the expected effect, indicating that teachers use different observation and evaluation criteria, certainly influenced by former experience. The most experienced evaluator was the one with the highest internal reliability. Even so, some items showed insufficient reliability.

The initial checklist was made up of 20-items. However, only three of them had moderate general inter-rater reliability and one substantial. This difficulty of reliability can happen because each evaluator has particular and subjective assessment criteria that differ for observation and evaluation moment88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12.,1313. Mendes R, Clemente F, Rocha R, Damásio A. Observação como instrumento no processo de avaliação em educação física. Exedra Journal. 2012; 6(57-69).. Besides that, the way evaluators interpret the movement may influence their assessment - for example, does the evaluator refer to a more general (global body movement) or specific (concerned with a limb or segment) movement? Of these four items, three are easily observable outside the pool, where a swimming teacher usually makes his observations (items 3, 5, and 8) during class. Although item 13 refers to a movement made underwater, which could lead to difficulties in reliability, it is generally the focus of a teacher's observation because it refers to the underwater trajectory of the hand88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12.. Therefore, these four items showed adequate inter-rater reliability and have practical implications. This finding may due to these items are commonly observed by teachers in swimming lessons.

Most items and MD that did not show at least moderate reliability are predominantly underwater movements and/or in combination with other more complex movements. Additionally, they can be less important during the learning process. Thus, the lack of experience in observing these movements daily may difficult their evaluation and, consequently, inter and intra-rater reliability levels88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12.,1313. Mendes R, Clemente F, Rocha R, Damásio A. Observação como instrumento no processo de avaliação em educação física. Exedra Journal. 2012; 6(57-69).. Some movements are difficult to observe, either by the turbulence generated in the water or by the teacher's positioning (on the ledge or inside the water). For this reason, the teacher can evaluate these items by the consequence of movement and not by movement itself11. Lees A. Technique analysis in sports: a critical review. J Sports Sci. 2002; 20(10): 813-28.,2222. Madureira F, Bastos FH, Corrêa UC, Rogel T, Freudenheim AM. Assessment of beginners' front-crawl stroke efficiency. Percept Mot Skills. 2012; 115(1): 300-8., with marked difference according to the evaluator's experience88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12.. Nevertheless, despite the notion that a swimmer who presents a good quality of movement along with good performance must have a superior technique11. Lees A. Technique analysis in sports: a critical review. J Sports Sci. 2002; 20(10): 813-28.,88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12., sports performance is dependent on several variables1515. Silveira RP, de Souza Castro FA, Figueiredo P, Vilas-Boas JP, Zamparo P. The effects of leg kick on swimming speed and arm-stroke efficiency in the front crawl. Int J Sports Physiol Perform. 2017; 12(6): 728-35.,2323. Costa MJ, Bragada JA, Mejias JE, Louro H, Marinho DA, Silva AJ, et al. Tracking the performance, energetics, and biomechanics of international versus national level swimmers during a competitive season. Eur J Appl Physiol. 2012; 112(3): 811-20.. Thus, a proper technique may lead to superior performance, but the opposite is not necessarily true11. Lees A. Technique analysis in sports: a critical review. J Sports Sci. 2002; 20(10): 813-28.. Also, the focus of technical observation was on the right arm stroke and its associated movements. However, the overall movement of the swimming also depends on the action of the other upper and lower limbs1515. Silveira RP, de Souza Castro FA, Figueiredo P, Vilas-Boas JP, Zamparo P. The effects of leg kick on swimming speed and arm-stroke efficiency in the front crawl. Int J Sports Physiol Perform. 2017; 12(6): 728-35.,2222. Madureira F, Bastos FH, Corrêa UC, Rogel T, Freudenheim AM. Assessment of beginners' front-crawl stroke efficiency. Percept Mot Skills. 2012; 115(1): 300-8..

Although some items did not show adequate inter-rater general reliability, some of their MD showed different behavior. While item 1 (How does the hand enter the water) showed poor general reliability (0.38), two of its MD showed moderate reliability (0.55 and 0.41 - respectively for MD1 and MD3). Therefore, in this case, the evaluators agreed to perceive the short arm stroke (MD1) and the arm that enters at the shoulder line and in the front of the head (MD3). Since observation and evaluation criteria differ between evaluators88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12.,1313. Mendes R, Clemente F, Rocha R, Damásio A. Observação como instrumento no processo de avaliação em educação física. Exedra Journal. 2012; 6(57-69)., the MD that showed the worst reliability was precisely the one in which comparison criteria became more subjective and particular. Consequently, the MD2 of item 1 (The hand enters the water in or out of the shoulder line.) showed poor reliability. This is because extreme cases of arm stroke crossing the midline of the body or very wide are easy to notice. Nevertheless, what is the limit between one arm stroke on the shoulder line and one stroke that slightly crosses the midline of the body? The teacher working at the learning level focuses his observation on gross errors and may not stick to the finer aspects of movement1010. Soares S, Fernandes R, Carmo C, Santos Silva J, Vilas-Boas J. Avaliação qualitativa da técnica em natação: apreciação da consistência de resultados produzidos por avaliadores com experiência e formação similares. Rev. Port. Ciênc. Desporto. 2001; 1(3): 22-32..

A similar understanding can be found for MD1 of item 4, MD1 of item 13 and MD3 and MD4 of item 20. It is more noticeable to watch the hand moves directly downward after entering the water (MD1 of item 4), as it is considered an extreme movement, than the subtle differences between directing the hand forward, a slightly inwards or outwards1010. Soares S, Fernandes R, Carmo C, Santos Silva J, Vilas-Boas J. Avaliação qualitativa da técnica em natação: apreciação da consistência de resultados produzidos por avaliadores com experiência e formação similares. Rev. Port. Ciênc. Desporto. 2001; 1(3): 22-32.. There are the same perception and reliability for the hand crossing the midline of the trunk (MD1 of item 13) than the other subtle variations of hand's path. In this case, for example, while the inter-rater reliability of the hand crossing the midline of the trunk in the underwater phase was perfect and significant (the best in the whole study), the inter-rater reliability of the hand passing away from the midline of the body (MD2) and the hand passing near the midline of the trunk (MD4) was only poor. This may also indicate that inter-rater reliability for movements with less execution variance is more difficult to perceive in the same way, especially for the less experienced88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12.,1010. Soares S, Fernandes R, Carmo C, Santos Silva J, Vilas-Boas J. Avaliação qualitativa da técnica em natação: apreciação da consistência de resultados produzidos por avaliadores com experiência e formação similares. Rev. Port. Ciênc. Desporto. 2001; 1(3): 22-32.. In some cases this can cause difficulties in perceiving movement error bringing consequences for learning1010. Soares S, Fernandes R, Carmo C, Santos Silva J, Vilas-Boas J. Avaliação qualitativa da técnica em natação: apreciação da consistência de resultados produzidos por avaliadores com experiência e formação similares. Rev. Port. Ciênc. Desporto. 2001; 1(3): 22-32.,1313. Mendes R, Clemente F, Rocha R, Damásio A. Observação como instrumento no processo de avaliação em educação física. Exedra Journal. 2012; 6(57-69)..

It can be thought that it is easier to find appropriate inter-rater reliability when MDs are mutually exclusive. In fact, this happened at items 3 (Which part of the hand touches the water first?) and at item 5 (How is the arm positioned after the hand touches the water?). However, not at item 16 (how is elbow angle at the end of the push phase?) and item 19 (how is elbow movement throughout arm recovery?). It may seem easy, at item 16, to indicate only whether there is an elbow extension at the end of the pushing phase. However, the analysis involves the correct understanding of (i) the end of this phase, (ii) the hand speed, and (iii) whether the elbow extension is appropriate. Thus, great movements can be confused with their performance consequence (displacement efficiency)11. Lees A. Technique analysis in sports: a critical review. J Sports Sci. 2002; 20(10): 813-28.,1010. Soares S, Fernandes R, Carmo C, Santos Silva J, Vilas-Boas J. Avaliação qualitativa da técnica em natação: apreciação da consistência de resultados produzidos por avaliadores com experiência e formação similares. Rev. Port. Ciênc. Desporto. 2001; 1(3): 22-32.,2222. Madureira F, Bastos FH, Corrêa UC, Rogel T, Freudenheim AM. Assessment of beginners' front-crawl stroke efficiency. Percept Mot Skills. 2012; 115(1): 300-8.. Item 19 becomes complex because it subjectively involves a kinematic and temporal analysis. Between a flexed and extended elbow may be an angle that raises doubts. In addition, the movement may start in one way, and along its path assume another behavior.

Importantly, while technical analysis is imperative for teachers, coaches, and scientists, their interests and tools are different11. Lees A. Technique analysis in sports: a critical review. J Sports Sci. 2002; 20(10): 813-28.. This study was concerned with the swimming teachers and their particularities in the teaching process. Of the five items selected by the evaluators as most relevant, four comprised underwater movements and one out of water. The evaluators' focus seems to be on hand entry into the water, arm positioning at the beginning of the propulsive phase, and the direction and trajectory of the hand in this propulsive action. The relevance of these items was also highlighted in another study88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12..

Despite the recognition of the importance of these five items for an adequate technique, the observation and evaluation criteria were very different between evaluators. In fact, the inter-rater reliability for item 10 (How is elbow positioned at the catch phase), item 11 (how is hand positioned at the beginning of the pull until the end of the push phase?) and item 15 (how is the hand trajectory throughout the end of the push phase?) were small and not significant88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12.,1010. Soares S, Fernandes R, Carmo C, Santos Silva J, Vilas-Boas J. Avaliação qualitativa da técnica em natação: apreciação da consistência de resultados produzidos por avaliadores com experiência e formação similares. Rev. Port. Ciênc. Desporto. 2001; 1(3): 22-32.. This seems to be a problem for the learning process because with different assessment swimmers would take different feedbacks2424. Wulf G. Attentional focus and motor learning: a review of 15 years. International Review of Sport and Exercise Psychology. 2013; 6(1): 77-104.doi: 10.1080/1750984X.2012.723728,2525. Marques-Dahi MTdSP, Bastos FH, de Araujo UO, Walter C, Freudenheim AM. Verbal instructions on learning the front-crawl: emphasizing a single component or the interaction between components? Human Movement. 2016; 17(2): 80-6..

It was hypothesized that the inter-rater reliability for the five selected items would increase due to the implemented intervention. However, the overall reliability rating for the five items did not change. Nevertheless, MD2 (it is flexed and lies below the hand) and MD3 (it is flexed and at the same level as the hand) of item 10 (How is elbow positioned at the catch phase), MD3 (initially the palm is facing backward, but throughout the phase, it turns outward) of item 11 (how is hand positioned at the beginning of the pull until the end of the push phase?) and MD2 (it is at full or almost complete extension near the water level) of item 20 (how is elbow angle throughout arm recovery?) showed an improvement in their reliability rating, although they are still low.

The MD3 (it is at full or almost complete extension, and far from the water level) and MD4 (it is flexed, elevated, away from water and above hand level) of item 20 (how is elbow angle throughout arm recovery?) had their reliability ratings reduced, respectively, from practically perfect and moderate, to moderate and poor. The same happened with MD1 (the hand is directed straight downward after touching the water) of item 4 (What is the direction of the hand when it enters the water?), which had its reliability rating changed from moderate to poor. Item 15 (how is the hand trajectory throughout the end of the push phase?) and its MD continued to show random reliability or even marked unreliability. These results indicate that some observation and evaluation points could be slightly adjusted among the evaluators due to the intervention implemented, but not enough for the item to present substantial or practically perfect reliability. Since it is an important item, an effort should be made to approximate the observation and evaluation strategy used88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12.. Maybe it can occur with a greater period of discussion between teachers.

Intra-rater reliability was lower for the evaluator with less experience in swimming teaching. This may indicate that the evaluator is still forming his observation, comparison, and analysis references88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12.,1313. Mendes R, Clemente F, Rocha R, Damásio A. Observação como instrumento no processo de avaliação em educação física. Exedra Journal. 2012; 6(57-69).. However, even evaluator B, with more than 10 years of teaching experience, presented only one item with moderate reliability. Anyway, in these two cases, it is worth noting that there was an intervention between the two moments of evaluation, which may have been a factor that led to some behavioral change and the absence of more expressive reliability. Explicitly, evaluators may have used new observation and analysis criteria that contributed to these results88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12.. Usually the teacher compares the swimmer's movement with a standard model that he has as a reference and then makes his decisions11. Lees A. Technique analysis in sports: a critical review. J Sports Sci. 2002; 20(10): 813-28.,1010. Soares S, Fernandes R, Carmo C, Santos Silva J, Vilas-Boas J. Avaliação qualitativa da técnica em natação: apreciação da consistência de resultados produzidos por avaliadores com experiência e formação similares. Rev. Port. Ciênc. Desporto. 2001; 1(3): 22-32.. Thus, it seems that the standard models created by evaluators are different, and, for some movements, their internal standard models of comparison are not well defined, making even their internal reliability difficult88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12..

The evaluator with the longest experience and academic background was the one who presented substantial intra-rater reliability for two items (4 and 10) and moderate intra-rater reliability for one (20). More experienced evaluators were able to differentiate swimmers' performance levels more accurately than those less experienced88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12.. On the other hand, it is not known whether the intervention strengthened or was unable to modify some of its observation and evaluation criteria in light of its pre-established concepts. Regardless, it should be noted that even in the most experienced evaluator, items 11 and 15 presented an unsatisfactory classification of reliability, which had already happened in the inter-rater reliability. Therefore, one must reinforce that the perception and evaluation of the movement and direction of the hand during the underwater actions of the stroke are difficult even with video. These points should be addressed, as they are extremely important in swimming technique and may lead to inadequate feedback by swimming teachers88. Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12.,2424. Wulf G. Attentional focus and motor learning: a review of 15 years. International Review of Sport and Exercise Psychology. 2013; 6(1): 77-104.doi: 10.1080/1750984X.2012.723728,2525. Marques-Dahi MTdSP, Bastos FH, de Araujo UO, Walter C, Freudenheim AM. Verbal instructions on learning the front-crawl: emphasizing a single component or the interaction between components? Human Movement. 2016; 17(2): 80-6..

The findings of this study have to be seen in the light of some limitations. The first is how teachers use the computer and its resources to better observe and understand the movement. At step 1 they assessed the swimmers from personal criteria and at step 2 some observational tips were shown. Nevertheless, they used the computer as their personal preferences. The second limitation concerns the observational focus1414. Knudson D, Morrison CS. Análise qualitativa do movimento humano. 1 ed. São Paulo, SP: Manole, 2001.. We did not evaluate the front crawl, but just the right upper limb movement. To enumerate, breath and arm coordination were not evaluated, and the outcome could be different. Breathing, for instance, may cause technical changes1818. Lerda R, Cardelli C. Breathing, and propelling in crawl as a function of skill and swim velocity. Int J Sports Med. 2003; 24(01): 75-80.,1919. Psycharakis SG, McCabe C. Shoulder and hip roll differences between breathing and non-breathing conditions in front crawl swimming. J Biomech. 2011; 44(9): 1752-6. and it should be one specific observational focus, as proposed in literature44. Federolf P, Reid R, Gilgien M, Haugen P, Smith G. The application of principal component analysis to quantify technique in sports. Scand J Med Sci Sports. 2014; 24(3): 491-9.,1414. Knudson D, Morrison CS. Análise qualitativa do movimento humano. 1 ed. São Paulo, SP: Manole, 2001..

Conclusion

Four of the 20-item checklist had moderate and substantial inter-rater reliability. In addition, 12 movement descriptors had at least moderate inter-rater reliability. These items and movement descriptors proposed at the 20-items checklist can be used in practical settings. Although five items were considered as the most relevant for the swimmer's technical evaluation, only two movement descriptors showed adequate inter-rater reliability. Therefore, the novelty proposed discussion method for improving inter-rater reliability did not cause the expected effect. Hence, it was unable to modify some of the teacher observation and evaluation criteria in light of its pre-established concepts. Moreover, the intra-rater reliability can be influenced by the evaluator degree and experience. The better intra-rater reliability was found with a higher degree and experience swimming teaching.

References

  • 1.
    Lees A. Technique analysis in sports: a critical review. J Sports Sci. 2002; 20(10): 813-28.
  • 2.
    Gonzalez DH, Chiviacowsky S. Relatedness support enhances motor learning. Psychol Res. 2018; 82(3): 439-47.
  • 3.
    Wulf G, Shea C, Lewthwaite R. Motor skill learning and performance: a review of influential factors. Medical Education. 2010; 44(1): 75-84.
  • 4.
    Federolf P, Reid R, Gilgien M, Haugen P, Smith G. The application of principal component analysis to quantify technique in sports. Scand J Med Sci Sports. 2014; 24(3): 491-9.
  • 5.
    Potdevin F, Vors O, Huchez A, Lamour M, Davids K, Schnitzler C. How can video feedback be used in physical education to support novice learning in gymnastics? Effects on motor learning, self-assessment, and motivation. Phys Educ Sport Pedagogy. 2018; 23(6): 559-74.
  • 6.
    Wizer R, Franken M, Castro FAdS. Concordância intra e inter-observador de protocolo de avaliação de habilidades aquáticas de crianças. R. Bras. Ci. e Mov. 2016; 24(1): 101-7.
  • 7.
    Madureira F, Gollegã DG, Rodrigues HF, de Oliveira TAC, Dubas JP, Freudenheim AM. Validação de um instrumento para avaliação qualitativa do nado “Crawl”. Rev Bras Educ Fís Esporte. 2008; 22(4): 273-84.
  • 8.
    Zamparo P, Carrara S, Cesari P. Movement evaluation of front crawl swimming: technical skill versus aesthetic quality. PloS One. 2017; 12(9): 1-12.
  • 9.
    Rosado A, Virtuoso L, Mesquita I. Relação entre as competências de diagnóstico de erros das habilidades técnicas e a prescrição pedagógica no voleibol. Rev Bras Educ Fís Esporte. 2004; 18(2): 151-7.
  • 10.
    Soares S, Fernandes R, Carmo C, Santos Silva J, Vilas-Boas J. Avaliação qualitativa da técnica em natação: apreciação da consistência de resultados produzidos por avaliadores com experiência e formação similares. Rev. Port. Ciênc. Desporto. 2001; 1(3): 22-32.
  • 11.
    Nadeau L, Richard J-F, Godbout P. The validity and reliability of a performance assessment procedure in ice hockey. Phys Educ Sport Pedagogy. 2008; 13(1): 65-83.
  • 12.
    Currell K, Jeukendrup AE. Validity, reliability, and sensitivity of measures of sporting performance. Sports Med. 2008; 38(4): 297-316.
  • 13.
    Mendes R, Clemente F, Rocha R, Damásio A. Observação como instrumento no processo de avaliação em educação física. Exedra Journal. 2012; 6(57-69).
  • 14.
    Knudson D, Morrison CS. Análise qualitativa do movimento humano. 1 ed. São Paulo, SP: Manole, 2001.
  • 15.
    Silveira RP, de Souza Castro FA, Figueiredo P, Vilas-Boas JP, Zamparo P. The effects of leg kick on swimming speed and arm-stroke efficiency in the front crawl. Int J Sports Physiol Perform. 2017; 12(6): 728-35.
  • 16.
    Chollet D, Chalies S, Chatard J. A new index of coordination for the crawl: description and usefulness. Int J Sports Med. 2000; 21(1): 54-9.
  • 17.
    Maglischo EW. Nadando o mais rápido possível. 3 ed. Barueri, SP: Manole, 2010.
  • 18.
    Lerda R, Cardelli C. Breathing, and propelling in crawl as a function of skill and swim velocity. Int J Sports Med. 2003; 24(01): 75-80.
  • 19.
    Psycharakis SG, McCabe C. Shoulder and hip roll differences between breathing and non-breathing conditions in front crawl swimming. J Biomech. 2011; 44(9): 1752-6.
  • 20.
    Field A. Descobrindo a estatística usando o SPSS. 1 ed. Porto Alegre: Artmed, 2009.
  • 21.
    Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977: 159-74.
  • 22.
    Madureira F, Bastos FH, Corrêa UC, Rogel T, Freudenheim AM. Assessment of beginners' front-crawl stroke efficiency. Percept Mot Skills. 2012; 115(1): 300-8.
  • 23.
    Costa MJ, Bragada JA, Mejias JE, Louro H, Marinho DA, Silva AJ, et al. Tracking the performance, energetics, and biomechanics of international versus national level swimmers during a competitive season. Eur J Appl Physiol. 2012; 112(3): 811-20.
  • 24.
    Wulf G. Attentional focus and motor learning: a review of 15 years. International Review of Sport and Exercise Psychology. 2013; 6(1): 77-104.doi: 10.1080/1750984X.2012.723728
  • 25.
    Marques-Dahi MTdSP, Bastos FH, de Araujo UO, Walter C, Freudenheim AM. Verbal instructions on learning the front-crawl: emphasizing a single component or the interaction between components? Human Movement. 2016; 17(2): 80-6.

Publication Dates

  • Publication in this collection
    20 July 2020
  • Date of issue
    Jan-Mar 2020

History

  • Received
    03 Feb 2020
  • Accepted
    20 Apr 2020
Universidade Estadual Paulista Universidade Estadual Paulista, Av. 24-A, 1515, 13506-900 Rio Claro, SP/Brasil, Tel.: (55 19) 3526-4330 - Rio Claro - SP - Brazil
E-mail: motriz.rc@unesp.br