浪花直播 Assessment Research Center (MARC)

Journal Articles

  • Jiao, H., Yadav*, C., & Li*, G. (2023, May 2). Integrating psychometric analysis and machine learning to augment data for cheating detection in large-scale assessment. Retrieved from
  • Fu, Y., Zhan, P., Chen, Q., & Jiao, H. (2023). Joint Modeling of Action Sequences and Action Times in Problem-solving Tasks. Behavioral Research Methods. 
  • Zhu, H., Jiao, H., Gao, W., & Meng, X (2023). Bayesian change point analysis approach to detecting aberrant test-taking behavior using response times. Journal of Educational and Behavioral Statistics. 
  • Jiao, H., He, Q., & Yao, L. (2023). Psychological Testing and Assessment Modeling. 
  • Yao, L. & Jiao, H. (2023) , Chinese/English Journal of Educational Measurement and Evaluation | 教育测量与评估双语期刊: Vol. 4: Iss. 3, Article 1.
  • Zhou*, T. & Jiao, H. (2022). Psychological Testing and Assessment Modeling. 64(4), 425-444.
  • Qiao, X., Jiao, H., & He, Q. (2022). A multiple group joint modeling of item responses, response times, and action counts with the Conway-Maxwell-Poisson distribution. Journal of Educational Measurement. 
  • Zhou*, T. & Jiao, H. (2022). Exploration of the stacking ensemble machine learning algorithm for cheating detection in large-scale assessment. Educational and Psychological Measuremet. 
  • Fu, Y., Zhan, P., & Jiao, H. (2022). Joint modeling of action sequences and action times in Problem-solving Tasks. Retrieved from
  • Liao*, M. & Jiao, H. (2022). Modeling multiple problem-solving strategies and strategy shift in cognitive diagnosis for growth. British Journal of Mathematical and Statistical Psychology.
  • Jiao, H. (2022). Psychological Testing and Assessment Modeling. 64(1), 65-86.
  • Qiao*, X. & Jiao, H. (2022). Explanatory cognitive diagnostic modeling incorporating response times. Journal of Educational Measurement.
  • Jiao, H, & Liao, M. (2021). Testlet response theory. Educational Measurement: Issues and Practices. 
  • Jiao, H., He, Q., & Veldkamp, B. P. (2021). Editorial: Process data in educational and psychological assessment. Frontiers in Psychology. 
  • Zhan, P., Li, F., & Jiao, H. (2021). Editorial: Cognitive Diagnostic Assessment for Learning. Frontiers in Psychology. 
  • Wu, F., Yang, X., & Jiao, H. (2021). [Principled Approaches to Test Development and Design]. Exploring Education Development, 41(10), 30-40.
  • Zhan, P., Jiao, H, Wang, W. & Man, K. (2021). Variable speed across dimensions of ability in the joint model of responses and response times. Frontiers in Psychology. 
  • Liao, M., Patton, J., Yan, R., & Jiao, H. (2021). Mining process data to detect aberrant test takers. Measurement: Interdisciplinary Research and Perspectives. 19(2), 93-105. 
  • Zhan, P., Jiao, H., & Man, K. (2020). The multidimensional log-normal response time model: An exploration of the multidimensionality of latent processing speed. Acta Psychologica Sinica.
  • Jiao, H. & Lissitz, R. W. (2020). What hath the coronavirus brought to assessment? Unprecedented challenges in educational assessment in 2020 and years to come. Educational Measurement: Issues and Practice
  • Liao*, D., He, Q., & Jiao, H. (2019). Mapping background variables with sequential patterns in problem solving environments: An investigation on US adults’ employment status in PIAAC. Frontiers in Psychology.  
  • Man*, K., Harring, J., Jiao, H., & Zhan*, P. (2019). Conditional joint modeling of compensatory multidimensional item responses and response times. Applied Psychological Measurement. Advanced Online Publication.
  • Zhan*, P., Jiao, H., Liao*, D. & Li, F (2019). A longitudinal higher-order diagnostic classification model. Journal of Educational and Behavioral Statistics. Advanced Online Publication.  
  • Zhan*, P., Ma, W., Jiao, H., & Ding, S. (2019). A sequential higher-order latent structural model for hierarchical attributes in cognitive diagnostic assessments. Applied Psychological Measurement. Advanced Online Publication.
  • Zhan*, P., Jiao, H., Man, K, & Wang, L. (2019). Using JAGS for Bayesian cognitive diagnosis modeling: A tutorial. Journal of Educational and Behavioral Statistics. Advanced Online Publication. 
  • Zhan*, P., Wang, W.-C., Jiao, H., & Bian, Y. (2018). The probabilistic-inputs, noisy conjunctive models for cognitive diagnosis. Frontiers in Psychology
  • Zhan*, P., Jiao, H., Liao*, M., & Bian, Y. (2018). Bayesian DINA modeling incorporating within-item characteristics dependency. Applied Psychological Measurement. 43, 143–158. 
  • Qiao*, X., & Jiao, H. (2018). Comparing data mining techniques in analyzing process data: A case study on PISA 2012 problem-solving items. Frontiers in Psychology. 
  • Zhan*, P. Jiao, H., & Liao*, D. (2017). Cognitive diagnosis modeling incorporating item response times. British Journal of Mathematical and Statistical Psychology
  • Li, Y., Panagiotou, O. A., Black, A., Liao, D., & Wacholder, S. (2016). Multivariate piecewise exponential survival modeling. Biometrics.
  • Luo, Y., & Jiao, H. (2017). Using the Stan program for Bayesian item response theory. Educational and Psychological Measurement.
  • Li*, T., Xie*, C., & Jiao, H. (2016). Assessing fit of alternative unidimensional polytomous item response models using posterior predictive model checking. Psychological Methods. 
  • Li*, T., Jiao, H., & Macready, G. (2015). Different approaches to covariate inclusion in the mixture Rasch model. Educational and Psychological Measurement
  • Jiao, H., & Zhang*, Y. (2015). Polytomous multilevel testlet models for testlet-based assessments with complex sampling designs. British Journal of Mathematical and Statistical Psychology, 1, 65-83. Online first,
  • Wolfe, E., Song, T. W., & Jiao, H. (2015). Features of difficult-to-score essays. Assessing Writing.27, 1-10. 
  • Chen*, Y.-F. & Jiao, H. (2014). Exploring the utility of background and cognitive variables in explaining latent differential item functioning: An example of the PISA 2009 reading assessment. Educational Assessment.19, 77-96. .

  • Jiao, H., Wang, S., & He, W. (2013). Estimation methods for one-parameter testlet models. Journal of Educational Measurement, 50, 186-203. 
  • Wang, S., Jiao, H., & Zhang, L. (2013). Validation of longitudinal achievement constructs of vertically scaled computerized adaptive tests: A multiple-indicator, latent-growth modeling approach. International Journal of Quantitative Research in Education, 1, 383-407. 
  • Tao, J., Xu, B., Shi, N., & Jiao, H. (2013). Refining the two-parameter testlet response model by introducing testlet discrimination parameters. Japanese Psychological Research. 55, 284-291 

  • Wang, S., McCall, M., Jiao, H., & Harris, G. (2013). Construct validity and measurement invariance of computerized adaptive testing: application to Measures of Academic Progress (MAP) using confirmatory factor analysis. Journal of Educational and Developmental Psychology. 3, 88-100. 
  • Li*, Y., Jiao, H., & Lissitz, R.W. (2012). Journal of Applied Testing Technology, vol. 13, n2
  • Jiao, H., Macready, G., Liu*, J., & Cho*, Y. (2012). A mixture Rasch model based computerized adaptive test for latent class identification. Applied Psychological Measurement, 36, 469-493.
  • Jiao, H., Kamata, A., Wang, S., & Jin, Y. (2012). A multilevel testlet model for dual local dependence. Journal of Educational Measurement, 49, 82-100. 
  • Jiao, H., Liu*, J., Haynie, K., Woo, A., & Gorham, J. (2012). Comparison between dichotomous and polytomous scoring of innovative items in a large-scale computerized adaptive test. Educational and Psychological Measurement, 72, 493 - 509. 
  • Jiao, H., Lissitz, B., Macready, G., Wang, S., & Liang*, S. (2011). . Psychological Testing and Assessment Modeling, 53, 499-522
  • Jiao, H., & Wang, S. (2010). A multifaceted approach to investigating the equivalence between computer-based and paper-and-pencil assessments: An example of Reading Diagnostics. International Journal of Learning Technology, 5, 264-288.
  • Wang, S., & Jiao, H. (2009). Construct equivalence across grades in a vertical scale for a K-12 large-scale reading assessment. Educational and Psychological Measurement, 69, 760-777. 
  • Wang, S., Jiao, H., Young, M. J., Brooks, T., & Olson, J. (2007). Comparability of computer- based and paper-and-pencil testing in K-12 reading assessments: A meta-analysis of testing mode effects. Educational and Psychological Measurement, 68(1), 5-24.
  • Wang, S., Jiao, H., Young, M. J., Brooks, T., & Olson, J. (2007). A meta-analysis of testing mode effects in Grade K-12 Mathematics Tests. Educational and Psychological Measurement, 67(2), 219-238. 
  • Lissitz, R. W., & Samuelsen, K. (2007). A suggested change in terminology and emphasis regarding validity and education. Educational Researcher, 36, 437-448.
  • Lissitz, R. W., & Samuelsen, K. (2007). Further clarification regarding validity and education. Educational Researcher, 36, 482-484. 
  • Jiao, H., Wang, S., & Kamata, A. (2005). . Journal of Applied Measurement, 6(3), 311-321.
  • Wang, S., & Jiao, H. (2005). Development and application of the Stanford achievement test, diagnostic test, and English proficiency test. Examinations Research, 1(1), 118-128. (in Chinese with English abstract)