Biblio
Gender differences in large-scale math assessments: PISA trend 2000 and 2003. Applied Measurement in Education, 22, 164–184.
. (2009). 
Mapping multiple dimensions of student learning: the ConstructMap program. Journal of Applied Measurement, 10, 1.
. (2009). Mapping student understanding in chemistry: The perspectives of chemists. Science Education, 93, 56–85.
. (2009). 
The mastery learner judgment consistency rate of Rasch model-based standard setting method: Focused on the comparison with raw-score and Angoff methods. In Criterion-referenced testing: Practice analysis to score reporting using Rasch measurement. Chicago: JAM Press.
. (2009). Measuring measuring: Toward a theory of proficiency with the Constructing Measures framework. Journal of Applied Measurement, 296.
. (2009). 
Measuring progressions: Assessment structures underlying a learning progression. Journal of Research in Science Teaching, 46, 716–730.
. (2009). 
Sources of self-efficacy belief: development and validation of two scales. Journal of Applied Measurement, 11, 24–37.
. (2009). 
Validating for use and interpretation: A mixed methods contribution illustrated. Journal of Mixed Methods Research, 3, 242–264.
. (2009). 
Cognitive diagnosis using item response models. Zeitschrift für Psychologie/Journal of Psychology, 216, 74–88.
. (2008). 
A comparative analysis of the ratings in performance assessment using generalizability theory and the many-facet Rasch model. Journal of Applied Measurement, 10, 408–423.
. (2008). 
ConstructMap Version 4 (computer program). University of California, Berkeley, CA: BEAR Center.
. (2008). 
Contributions of Middle Grade Students to the Validation Process of a National Science Assessment Study. Middle Grades Research Journal, 3, 1–22.
. (2008). 
Explanatory item response models: A brief introduction. Assessment of competencies in educational contexts, 91–120.
. (2008). 
A LLTM approach to the examination of teachers' ratings of classroom assessment tasks. Psychology Science, 50, 417.
. (2008). Making Sense of Student Responses to Assessment Items Using Scoring Exemplars. Berkeley Evaluation and Assessment Seminar. Berkeley, CA.
. (2008). Mixture models in a developmental context. Advances in Latent Variable Mixture Models, 199.
. (2008). 
A multidimensional Rasch analysis of gender differences in PISA mathematics. Journal of Applied Measurement, 9, 18.
. (2008). 
Random parameter structure and the testlet model: extension of the Rasch testlet model. Journal of Applied Measurement, 10, 394–407.
. (2008). 
A Study of Confidence and Accuracy Using the Rasch Modeling Procedures. ETS Research Report Series, 2008, i–25.
. (2008). 
Adaptive Technology for e-Learning: Principles and Case Studies of an Emerging Field. Journal of the American Society for Information Science and Technology, 58(14). doi:10.1002
. (2007). Adaptive technology for e-learning: principles and case studies of an emerging field. Journal of the American Society for Information Science and Technology, 58, 2295–2309.
. (2007). 
Application of the Saltus model to stagelike data: Some applications and current developments. In Multivariate and mixture distribution Rasch models (pp. 119–130). Springer.
. (2007). 