

  • Hurtz, G. M. (forthcoming). Xcalibre item parameter calibration software for item response theory and Rasch models [software review]. Measurement: Interdisciplinary Research and Perspectives.

  • Hurtz, G. M., & Weiner, J. A. (2022). Comparability and integrity of online remote vs. onsite proctored credentialing exams. Journal of Applied Testing Technology, 23 (Special Issue), 36-45.

  • Hurtz, G. M., & Weiner, J. A. (2019). Analysis of test-taker profiles across a suite of statistical indices for detecting the presence and impact of cheating. Journal of Applied Testing Technology, 20, 1-15.

  • Weiner, J. A., & Hurtz, G. M. (2017). A comparative study of online remote proctored versus onsite proctored high-stakes exams. Journal of Applied Testing Technology, 18, 13-20.

  • Hurtz, G. M., Chinn, R. N., Barnhill, G., & Hertz, N. R. (2012). Measuring clinical decision making: Do key features problems measure higher-level cognitive processes? Evaluation & the Health Professions, 35, 396-415.

  • Marentette, B. J., Meyers, L. S., Hurtz, G. M., & Kuang, D. C. (2012). Order effects on situational judgment test items: A case of construct-irrelevant difficulty. International Journal of Selection and Assessment, 20, 319-332.

  • Hurtz, G. M., & Wright, C. W. (2012). Designing work descriptions to maximize the utility of employee recruitment efforts. In M. A. Wilson, S. G. Gibson, G. M. Alliger, & W. Bennett, Jr., The Handbook of Work Analysis: The Methods, Systems, Applications, & Science of Work Measurement in Organizations. New York: Routledge Academic.

  • Hurtz, G. M. (2011). Investigating test bias. In D. A. Biddle, Adverse Impact and Test Validation: A Practitioner's Handbook (3rd ed.). Folsom, CA: Biddle Consulting Group, Inc.

  • Hurtz, G. M., & Williams, K. J. (2009). Attitudinal and motivational antecedents of participation in voluntary employee development activities. Journal of Applied Psychology, 94, 635–653.

  • Pasisz, D. J., & Hurtz, G. M. (2009). Testing for between-group differences in within-group interrater agreement. Organizational Research Methods, 12, 590-613.

  • Hurtz, G. M., & Jones, J. P. (2009). Innovations in measuring rater accuracy in standard-setting: Assessing “fit” to item characteristic curves. Applied Measurement in Education, 22, 120-143.

  • Hurtz, G. M., Jones, J. P., & Jones, C. N. (2008). Conversion of proportion-correct standard-setting judgments to cutoff scores on the IRT theta scale. Applied Psychological Measurement, 32, 385–406.

  • Hurtz, G. M., & Auerbach, M. A. (2003). A meta-analysis of the effects of modifications to the Angoff method on cutoff scores and judgment consensus. Educational and Psychological Measurement, 63, 584–601.

  • Donovan, J. J., Dwight, S. A., & Hurtz, G. M. (2003). An assessment of the prevalence, severity, and verifiability of entry-level applicant faking using the randomized response technique. Human Performance, 16, 81–106.

  • Hurtz, G. M., & Alliger, G. M. (2002). Influence of coaching on integrity test performance and unlikely virtues scale scores. Human Performance, 15, 255–273.

  • Hurtz, G. M., & Donovan, J. J. (2000). Personality and job performance: The Big Five revisited. Journal of Applied Psychology, 85, 869–879.

  • Hurtz, G. M., & Hertz, N. R. (1999). How many raters should be used for establishing cutoff scores with the Angoff method?: A generalizability theory study. [Featured article] Educational and Psychological Measurement, 59, 885–897.


(past 10 years)

  • Hurtz, G. M., & Mucino, R. (2021, October). Analysis of item response time patterns through the lens of profile similarity and model fit metrics. Presentation at the Conference On Test Security, Virtual/Online.

  • Hurtz, G. M. (2020, October). Test Security During a Global Pandemic: If Remote Proctoring is Our Vaccine Against Cheating, Then Data Forensics is Our Antibody Test. Keynote address at the Personnel Testing Council of Northern & Southern California Virtual Conference.

  • Tucker, N., Hurtz, G. M., & Zu, D. (2019, October). A picture is worth a thousand words: Using data visualization to find and communicate patterns of test fraud. Presentation at the Conference On Test Security, Miami, Florida.

  • Hurtz, G. M., Weiner, J., Tucker, N., & Dreseden, B. (2019, July). Advances in test security - detecting potential cheating and other fraudulent behavior. Presentation at the International Personnel Assessment Council Conference, Minneapolis, Minnesota.

  • Hurtz, G. M. (2019, June). Psychometricians playing detective: Use of data forensics techniques for spotting signs of misconduct in test-takers’ item response data. Presentation at the Personnel Testing Council of Northern California Summer Conference, Sacramento, California.

  • Rios, O., Storz, M., Hoang, H., & Hurtz, G. (2019, April). Testing the frame-of-reference effect in personality testing across multiple life contexts: A replication and extension. Poster presented at the 99th Annual Convention of the Western Psychological Association, Pasadena, California.

  • Bonfert, C., Medina, Y., & Hurtz, G. (2019, April). Measuring school-oriented work ethic: Modification of the Multidimensional Work Ethic Profile for the schoolwork context. Poster presented at the 99th Annual Convention of the Western Psychological Association, Pasadena, California.

  • Underwood, J., & Hurtz, G. (2019, April). Biasing effects of scale anchor wording on job analysis frequency ratings. Poster presented at the 99th Annual Convention of the Western Psychological Association, Pasadena, California.

  • Hurtz, G. M., Weiner, J. A., & Du, Z. (2018, October). Development of a data forensics system for surveillance and ad hoc investigations of international testing programs. Presentation at the Conference On Test Security, Park City, Utah.

  • Rios, O., Hurtz, G. M., Storz, M. O., & Alejos, V. J. (2018, April). Evidence of a quasi-circumplex arrangement of subscales measuring subdimensions of the conscientiousness trait. Poster presented at the 98th Annual Convention of the Western Psychological Association, Portland, Oregon.

  • Rios, O., Hurtz, G. M., Hoang, H. N., Alejos, V. J., & Storz, M. O. (2018, April). Extending the frame-of-reference effect in personality testing beyond the work and school context. Poster presented at the 98th Annual Convention of the Western Psychological Association, Portland, Oregon.

  • Goodman, Z. T., Hurtz, G. M., & Whitehead, M. H. (2018, April). Clarification and recommendations regarding partial variance-explained effect size measures for ANOVA designs. Poster presented at the 98th Annual Convention of the Western Psychological Association, Portland, Oregon.

  • Hurtz, G. M., & Weiner, J. A. (2018, April). Integrating AIG into the Monte Carlo LOFT algorithm to reduce item exposure. Presentation at the 80th Annual Meeting of the National Council on Measurement in Education, New York, New York.

  • Hurtz, G. M., & Weiner, J. A. (2018, April). Enhancing the sensitivity of the J2 model for detecting test cheating. Presentation at the 80th Annual Meeting of the National Council on Measurement in Education, New York, New York.

  • Hurtz, G. M., & Weiner, J. A. (2017, September). Ockham’s razor and the selection of collusion indices: Variants of J2 provide a simple and effective diagnostic tool. Presentation at the Conference On Test Security, Madison, Wisconsin.

  • Hurtz, G. M., & Weiner, J. A. (2017, September). Forensic profiling of test-taker response patterns associated with distinct cheating-related behaviors. Presentation at the Conference On Test Security, Madison, Wisconsin.

  • Hurtz, G. M., & Weiner, J. A. (2016, October). Extension and cross-validation of a profile of statistical indices for detection of aberrant test responses. Presentation at the Conference On Test Security, Cedar Rapids, Iowa.

  • Hurtz, G. M. (2016, August). Test security systems: Using statistical and psychometric models for prevention and detection of test fraud. Keynote address at the International Personnel Assessment Council Conference, Sacramento, CA.

  • Hurtz, G. M., Saiar, A., & Weiner, J. (2016, April). Detecting two patterns of cheating with a profile of statistical indices. In Belov, D. (Chair). Test security and cheating. Symposium presented at the 78th Annual Meeting of the National Council on Measurement in Education, Washington, DC.

  • Weiner, J., & Hurtz, G. M. (2016, April). Automated test assembly methods using Monte-Carlo-based linear-on-the-fly (LOFT) techniques. Electronic poster board presented at the 78th Annual Meeting of the National Council on Measurement in Education, Washington, DC.

  • Abraham, J., Lambert, D., Jockin, V., Hurtz, G., & Weiner, J. (2016, April). SJT scoring methods can impact what we are measuring. In Jockin, V. (Chair). Advances in SJT and personality scoring methods. Symposium presented at the 31st Annual Conference of the Society for Industrial and Organizational Psychology, Anaheim, California.

  • Hurtz, G. M., Saiar, A., & Weiner, J. (2015, November). Design matters in Monte Carlo investigations of aberrant test-taker response patterns. Presentation at the Conference On Test Security, Lawrence, Kansas.

  • Hurtz, G. M., & Brown, R. (2015, April). Establishing meaningful expectations for test performance via invariant latent standards. In French, B. (Chair). Setting cut scores. Symposium presented at the 77th Annual Meeting of the National Council on Measurement in Education, Chicago, Illinois.

  • Sandoval, R. G., Rodriguez, B., Christianson, N., Holm, S., Jones, R., Hurtz, G. M., & Meyers, L. S. (2015, May). A structural model predicting multidimensional work ethic. Poster presented at the 95th Annual Convention of the Western Psychological Association, Las Vegas, Nevada.

  • Vadnais, M. K., & Hurtz, G. M. (2015, May). The effects of stereotype threat and identity salience on mathematics performance in females. Poster presented at the 95th Annual Convention of the Western Psychological Association, Las Vegas, Nevada.

  • Hurtz, G., Brown, R., & Tucker, N. (2015, March). Evolution of psychological measurement models and their applications in practical testing and assessment. Paper presented at the Association of Test Publishers Innovations in Testing Conference, Palm Springs, California.

  • Weiner, J., Saiar, A., & Hurtz, G. (2014, October). Follow-up study of an empirical method for the detection of potential test fraud. Presentation at the Conference on Test Security, Iowa City, Iowa.

  • Hurtz, G. M., Muh, V. P., Pierce, M. S., & Hertz, N. R. (2012, April). The Angoff method through the lens of latent trait theory: Theoretical and practical benefits of setting standards on the latent scale (where they belong). In Barney, M. (Chair). To raise or lower the bar: Innovations in standard setting. Symposium presented at the 27th Annual Conference of the Society for Industrial and Organizational Psychology, San Diego, CA.

  • Carroll, J. R., Hurtz, G. M., & Steinwert, M. L. (2012, April). Monte Carlo evidence favoring new measures of skewness and kurtosis. Poster presented at the 92nd Annual Convention of the Western Psychological Association, San Francisco, California.

  • Echeveste, D. I., Carroll, J. R., & Hurtz, G. M. (2012, April). Expanding cognitive tests: A validation study of nonverbal reasoning items. Poster presented at the 92nd Annual Convention of the Western Psychological Association, San Francisco, California.

  • Hurtz, G. M., & Carroll, J. R. (2012, April). Making Rasch and IRT more accessible to researchers: Estimation using SPSS. Poster presented at the 92nd Annual Convention of the Western Psychological Association, San Francisco, California.

  • Echeveste, D. I., & Hurtz, G. M. (2012, April). A DIF analysis of three- vs. four-option multiple-choice items. Poster presented at the 92nd Annual Convention of the Western Psychological Association, San Francisco, California.