Table 3

Inter-rater reliability

| Outcome measure | Suspected CSA group | Control group | Total sample |
|---|---|---|---|
| Verbal scoring form (52 items) | | | |
| Cohen’s kappa, median (IQR) | 1.00 (0.69-1.00)* | 1.00 (0.76-1.00)† | 0.91 (0.66-1.00)‡ |
| POA, median (IQR) | 100 (94-100) | 100 (94-100) | 98 (95-100) |
| Non-verbal scoring form (360 items) | | | |
| Cohen’s kappa, median (IQR) | 0.37 (-0.03-0.55)§ | 0.47 (0.22-0.79)¶ | 0.36 (-0.01-0.53)** |
| POA, median (IQR) | 97 (92-100) | 100 (97-100) | 97 (94-100) |
| Red flag scoring form (3 items) | | | |
| Cohen’s kappa, median (min-max) | 0.42 (0.27-0.47) | (0.38-0.52)†† | 0.51 (0.45-0.61) |
| POA, median (min-max) | 74 (73-87) | 77 (72-97) | 82 (73-83) |
  • *kappa could be calculated for 45 out of 52 questions.

  • †kappa could be calculated for 41 out of 52 questions.

  • ‡kappa could be calculated for 48 out of 52 questions.

  • §kappa could be calculated for 183 out of 360 reactions.

  • ¶kappa could be calculated for 87 out of 360 reactions.

  • **kappa could be calculated for 206 out of 360 reactions.

  • ††kappa could be calculated for 2 out of 3 questions; therefore, only the minimum and maximum values are given.

  • IQR, interquartile range; min-max, lowest and highest value.
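
The two per-item statistics in the table can be illustrated with a minimal Python sketch. It is not the authors' analysis code: the function names and the toy ratings are hypothetical, and POA is computed here simply as the percentage of cases on which two raters give the same score. The sketch shows two patterns visible in the table: an item can combine a high POA with a kappa near zero when one category dominates, and kappa is undefined (returned as `None`) when both raters use a single identical category for every case, which is one common reason kappa cannot be calculated for an item.

```python
from collections import Counter
from typing import Hashable, Optional, Sequence


def percent_agreement(a: Sequence[Hashable], b: Sequence[Hashable]) -> float:
    """Percentage of cases on which the two raters give the same score."""
    if len(a) != len(b) or len(a) == 0:
        raise ValueError("ratings must be non-empty and of equal length")
    return 100.0 * sum(x == y for x, y in zip(a, b)) / len(a)


def cohens_kappa(a: Sequence[Hashable], b: Sequence[Hashable]) -> Optional[float]:
    """Cohen's kappa for two raters; None when it is undefined."""
    if len(a) != len(b) or len(a) == 0:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(a)
    agree = sum(x == y for x, y in zip(a, b))
    count_a, count_b = Counter(a), Counter(b)
    # Chance agreement (times n*n), from each rater's marginal counts.
    expected = sum(count_a[c] * count_b[c] for c in count_a)
    if expected == n * n:
        # Both raters used one identical category throughout:
        # chance agreement is 1, so kappa would divide by zero.
        return None
    p_o = agree / n
    p_e = expected / (n * n)
    return (p_o - p_e) / (1.0 - p_e)


# Hypothetical item: 20 cases, the behaviour is scored as present (1)
# once by each rater, but never for the same case.
rater_a = [0] * 18 + [1, 0]
rater_b = [0] * 18 + [0, 1]

# Hypothetical item that neither rater ever scored as present.
constant_a = [0] * 10
constant_b = [0] * 10

if __name__ == "__main__":
    print(percent_agreement(rater_a, rater_b), cohens_kappa(rater_a, rater_b))
    print(percent_agreement(constant_a, constant_b), cohens_kappa(constant_a, constant_b))
```

For the first toy item the raters agree on 18 of 20 cases (POA 90), yet kappa is slightly negative (about -0.05) because nearly all of that agreement is expected by chance. For the second item POA is 100 but kappa cannot be calculated.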