Resources for Score Reliability
Reliability - Books and Articles
Brennan, R. L. (2006). Educational measurement. Praeger Pub Text.
Crocker, L., & Algina, J. (1986). Introduction to classical and modern test theory (p. 527). Holt, Rinehart and Winston.
Thompson, B. (2002). Score reliability: Contemporary thinking on reliability issues. Sage Publications.
Traub, R. E. (1994). Reliability for the social sciences: Theory and applications. (R. M. Jaeger, Ed.) (Vol. 3). Sage.
Generalizability Theory - Books and Articles
Brennan, R. L. (1992). Generalizability Theory. Educational Measurement: Issues and Practice, 11(4), 27–34.
Brennan, R. L. (1992). The NCME instructional module on generalizability theory. Instructional Topics in Educational Measurement, 11(4), 225–232.
Brennan, R. L. (1997). A Perspective on the History of Generalizability Theory. Educational Measurement: Issues and Practice, 16(4), 14–20.
Brennan, R. L. (2000). Performance Assessments from the Perspective of Generalizability Theory. Applied Psychological Measurement, 24(4), 339–353.
Ruiz-Primo, M. A., & Shavelson, R. J. (1996). Rhetoric and reality in science performance assessments: An update. Journal of Research in Science Teaching, 33(10), 1045–1063.
Shavelson, R. J., Baxter, G. P., & Gao, X. (1993). Sampling Variability of Performance Assessments. Journal of Educational Measurement, 30(3), 215–232.
Shavelson, R. J., & Webb, N. M. (1991). Generalizability theory: A primer (Vol. 1). Sage Publications.
Software