METHODS: A total of 2 meetings and 3 pilot trials were held with raters before commencement of the study to establish an agreed grading system protocol. Interrater reproducibility was investigated using a Latin square repeated measures design. Thirty-four subjects (62% male and 38% female; age range, 15-70 years) with a history of low back pain participated. A total of 4 practicing physiotherapists and 1 sports medicine physician graded subjects using the Wisbey-Roth grading system protocol.
RESULTS: Pairwise weighted kappa values ranged from -0.01 (95% confidence interval [CI], -0.33 to 0.31) to 0.56 (95% CI, 0.25 to 0.87), with an average weighted kappa of 0.29. The intraclass correlation coefficient (2,1) was 0.30 (95% CI, 0.15 to 0.48), and the standard error of the measurement was 1.6 units.
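For readers unfamiliar with the pairwise statistic reported above, the following is a minimal sketch of a weighted kappa for two raters on an ordinal scale. It is illustrative only: the abstract does not state whether linear or quadratic weights were used or how many grade levels the protocol has, so the function name `weighted_kappa`, the `weights` option, and the example grades are all assumptions, not the authors' analysis or data.

```python
def weighted_kappa(rater_a, rater_b, categories, weights="linear"):
    """Weighted kappa for two raters grading the same subjects.

    `categories` is the ordered list of possible grades. The weighting
    scheme is an assumption; the abstract does not specify one.
    """
    k = len(categories)
    n = len(rater_a)
    if k < 2 or n == 0 or n != len(rater_b):
        raise ValueError("need >= 2 categories and equal-length, non-empty ratings")
    index = {c: i for i, c in enumerate(categories)}

    # Observed joint proportions of (grade by A, grade by B).
    observed = [[0.0] * k for _ in range(k)]
    for a, b in zip(rater_a, rater_b):
        observed[index[a]][index[b]] += 1.0 / n

    # Marginal proportions for each rater.
    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    def disagreement(i, j):
        # Disagreement weight grows with the distance between grades.
        d = abs(i - j) / (k - 1)
        return d if weights == "linear" else d * d

    obs = sum(disagreement(i, j) * observed[i][j]
              for i in range(k) for j in range(k))
    exp = sum(disagreement(i, j) * row[i] * col[j]
              for i in range(k) for j in range(k))
    if exp == 0:
        raise ValueError("kappa is undefined when both raters use a single grade")
    return 1.0 - obs / exp


# Hypothetical grades for five subjects on an assumed 1-5 scale.
grades = [1, 2, 3, 4, 5]
physio = [1, 2, 3, 4, 5]
physician = [1, 2, 3, 4, 5]
print(weighted_kappa(physio, physician, grades))
```

A value of 1 indicates perfect agreement, 0 indicates agreement no better than chance, and negative values indicate agreement worse than chance. Applying such a function to every pair of the 5 raters would give the 10 pairwise values whose range and average are summarized in the results.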
CONCLUSIONS: The Wisbey-Roth grading system shows fair to poor reproducibility between raters. Therefore, it should not be used to exchange meaningful information between clinicians. Recommendations are made for further research and toward improving its reproducibility.
This abstract is reproduced with the permission of the publisher.