Fleiss' Kappa

Fleiss' kappa (named after Joseph L. Fleiss) is a statistical measure for assessing the reliability of agreement between a fixed number of raters when assigning categorical ratings to, or classifying, a number of items. This contrasts with other kappas such as Cohen's kappa, which only works for assessing agreement between two raters. The measure calculates the degree of agreement in classification over that which would be expected by chance; a value of 1 indicates complete agreement, while values at or below 0 indicate no agreement beyond chance. There is no generally agreed-upon measure of significance, although guidelines have been given.
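
In the usual notation, with \bar{P} denoting the mean observed agreement across items and \bar{P}_e the agreement expected by chance, the statistic takes the form

\[
\kappa = \frac{\bar{P} - \bar{P}_e}{1 - \bar{P}_e}
\]

The denominator 1 - \bar{P}_e gives the degree of agreement attainable above chance, and the numerator \bar{P} - \bar{P}_e gives the degree of agreement actually achieved above chance.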

Fleiss' kappa can be used only with binary or nominal-scale ratings. No version is available for ordered-categorical ratings.
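
As an illustration only, the following is a minimal sketch of the computation in Python with NumPy; fleiss_kappa is a hypothetical helper, not part of any particular library. It assumes the ratings are arranged as an N-by-k count matrix in which entry (i, j) is the number of raters who assigned item i to nominal category j, with every item rated by the same number of raters.

import numpy as np

def fleiss_kappa(counts):
    """Fleiss' kappa from an N x k matrix of per-item category counts."""
    counts = np.asarray(counts, dtype=float)
    N, k = counts.shape
    n = counts[0].sum()                                    # raters per item (assumed constant)
    p_j = counts.sum(axis=0) / (N * n)                     # proportion of all assignments per category
    P_i = ((counts ** 2).sum(axis=1) - n) / (n * (n - 1))  # observed agreement on each item
    P_bar = P_i.mean()                                     # mean observed agreement
    P_e = (p_j ** 2).sum()                                 # agreement expected by chance
    return (P_bar - P_e) / (1 - P_e)

Perfect agreement yields a value of 1 (provided more than one category is actually used), while agreement no better than chance yields values at or below 0.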
