The measurement of observer agreement for categorical data

JR Landis, GG Koch�- biometrics, 1977 - JSTOR
JR Landis, GG Koch
biometrics, 1977JSTOR
This paper presents a general statistical methodology for the analysis of multivariate
categorical data arising from observer reliability studies. The procedure essentially involves
the construction of functions of the observed proportions which are directed at the extent to
which the observers agree among themselves and the construction of test statistics for
hypotheses involving these functions. Tests for interobserver bias are presented in terms of
first-order marginal homogeneity and measures of interobserver agreement are developed�…
This paper presents a general statistical methodology for the analysis of multivariate categorical data arising from observer reliability studies. The procedure essentially involves the construction of functions of the observed proportions which are directed at the extent to which the observers agree among themselves and the construction of test statistics for hypotheses involving these functions. Tests for interobserver bias are presented in terms of first-order marginal homogeneity and measures of interobserver agreement are developed as generalized kappa-type statistics. These procedures are illustrated with a clinical diagnosis example from the epidemiological literature.
JSTOR