Biblio
NAEP Pilot Learning Progression Framework. Report to the National Assessment Governing Board.
. (2007). 
Using progress variables to map intellectual development. Berkeley, CA: Berkeley Evaluation & Assessment Research Center.
. (2007). 
Cognitive diagnosis using item response models. Zeitschrift für Psychologie/Journal of Psychology, 216, 74–88.
. (2008). 
A comparative analysis of the ratings in performance assessment using generalizability theory and the many-facet Rasch model. Journal of applied measurement, 10, 408–423.
. (2008). 
ConstructMap Version 4 (computer program). University of, Berkeley, CA: BEAR Center.
. (2008). 
Explanatory item response models: A brief introduction. Assessment of competencies in educational contexts, 91–120.
. (2008). 
A LLTM approach to the examination of teachers' ratings of classroom assessment tasks. Psychology Science, 50, 417.
. (2008). Making Sense of Student Responses to Assessment Items Using Scoring Exemplars. Berkeley Evaluation and Assessment Seminar. Berkeley, CA.
. (2008). Mixture models in a developmental context. Advances in Latent Variable Mixture Models, 199.
. (2008). 
A multidimensional Rasch analysis of gender differences in PISA mathematics. Journal of applied measurement, 9, 18.
. (2008). 
Random parameter structure and the testlet model: extension of the Rasch testlet model. Journal of applied measurement, 10, 394–407.
. (2008). 
A Study of Confidence and Accuracy Using the Rasch Modeling Procedures. ETS Research Report Series, 2008, i–25.
. (2008). 
A Study of Confidence and Accuracy Using the Rasch Modeling Procedures. ETS Research Report Series, 2008, i–25.
. (2008). 
Concrete, abstract, formal, and systematic operations as observed in a" Piagetian" balance-beam task series. Journal of applied measurement, 11, 11–23.
. (2009). 
. (2009).
Gender differences and similarities in PISA 2003 mathematics: A comparison between the United States and Hong Kong. International Journal of Testing, 9, 20–40.
. (2009). 
Gender differences and similarities in PISA 2003 mathematics: a comparison between the United States and Hong Kong. International Journal of Testing, 9, 20–40.
. (2009). Gender differences in large-scale math assessments: PISA trend 2000 and 2003. Applied Measurement in Education, 22, 164–184.
. (2009). 
Mapping student understanding in chemistry: The perspectives of chemists. Science Education, 93, 56–85.
. (2009). 
Measuring measuring: Toward a theory of proficiency with the Constructing Measures framework. Journal of applied measurement, 296.
. (2009). 
Measuring progressions: Assessment structures underlying a learning progression. Journal of Research in Science Teaching, 46, 716–730.
. (2009). 
Sources of self-efficacy belief: development and validation of two scales. Journal of applied measurement, 11, 24–37.
. (2009). 
Articulating Assessments Across Childhood: The Cross-Age Validity of the Desired Results Developmental Profile–Revised. Educational Assessment, 15, 1–26.
. (2010). 