Biblio
Explanatory item response models: A brief introduction. Assessment of competencies in educational contexts, 91–120.
. (2008). 
. (2004).
Explanatory secondary dimension modeling of latent differential item functioning. Applied Psychological Measurement, 35, 583–603.
. (2011). 
"Exploring Contexts of Assessment." In R. Lehrer (Chair), Constructing A Multidimensional Learning Progression of Data Modeling: Design Studies, Psychometric Modeling and Brokering Professional Development. National Countcil of Teachers of Mathematics. Paper, San Diego, CA.
. (2010). Exploring the Contexts of Assessment: Comparing Evidence of Learning from Within the Classroom. Jean Piaget Society. Berkeley, CA.
. (2011). . (2009).
Formative Evaluation of an Online Teaching Strategy: Using Mixed Methods to Learn From the Student Experience. Presented at the American Evaluation Association 2002 Conference, Washington, D.C.
. (2002, November). 
Formulating latent growth using an explanatory item response model approach. Journal of applied measurement, 13, 1.
. (2012). Formulating latent growth using an explanatory item response model approach. Journal of applied measurement, 13, 1–22.
. (2011). 
Formulating the Rasch Differential Item Functioning Model Under the Marginal Maximum Likelihood Estimation Context and Its Comparison With Mantel–Haenszel Procedure in Short Test and Small Sample Conditions. Educational and Psychological Measurement, 71, 1023–1046.
. (2011). 
From principles to practice: An embedded assessment system. Applied Measurement in Education, 13, 181–208.
. (2000). Gender differences and similarities in PISA 2003 mathematics: a comparison between the United States and Hong Kong. International Journal of Testing, 9, 20–40.
. (2009). Gender differences and similarities in PISA 2003 mathematics: A comparison between the United States and Hong Kong. International Journal of Testing, 9, 20–40.
. (2009). 
Gender differences in large-scale math assessments: PISA trend 2000 and 2003. Applied Measurement in Education, 22, 164–184.
. (2009). 
Generalizability in item response modeling. Journal of Educational Measurement, 44, 131–155. Retrieved from http://onlinelibrary.wiley.com/doi/10.1111/j.1745-3984.2007.00031.x/full
. (2007). A gentle introduction to Rasch measurement models for metrologists. Journal of Physics: Conference Series, 459, 012002. Retrieved from http://stacks.iop.org/1742-6596/459/i=1/a=012002
. (2013). How do curriculum developers measure success?. Presented at the ScienceGate II Conference, Xerox Document University, VA.
. (1998, Jannuary). The Imperial vs Metric Study. University of California, Berkeley.
. (2004). Improving assessment evidence in e-learning products: some solutions for reliability. International Journal of Learning Technology, 5, 191–208.
. (2010). 
Improving measurement in health education and health behavior research using item response modeling: introducing item response modeling. Health education research, 21, i4–i18.
. (2006). 
Improving measurement in health education and health behavior research using item response modeling: comparison with the classical test theory approach. Health education research, 21, i19–i32.
. (2006). Improving measurement in health education and health behavior research using item response modeling: comparison with the classical test theory approach. Health Education Research, 21, i19–i32.
. (2006). 