Sage Journals: Discover world-class research

Abstract

This study investigates item parameter recovery, standard error estimates, and fit statistics yielded by the WINSTEPS program under the Rasch model and the rating scale model through Monte Carlo simulations. The independent variables were item response model, test length, and sample size. WINSTEPS yielded practically unbiased estimates for the difficulty parameters under the Rasch model and the overall difficulty parameters under the rating scale model. However, the estimates for the intersection parameters under the rating scale model were substantially biased, especially for short tests. The standard errors of the overall difficulties and intersection parameters were slightly underestimated. The cube root-transformed weighted and unweighted item fit statistics did not follow the standard normal distribution in that their empirical sampling variances were much smaller than the expected value of unity. Correction procedures were proposed to make them follow approximately the standard normal distribution so that the usual critical ranges at the α nominal level could be used to screen the misfitting items.

Keywords

Rasch model rating scale model joint maximum likelihood estimation infit outfit

Get full access to this article

View all access options for this article.

References

Andrich, D. (1978). A rating formulation for ordered response categories. Psychometrika, 43, 561-573.

Bond, T. G. , & Fox, C. M. (2001). Applying the Rasch model: Fundamental measurement in the human sciences. Mahwah, NJ: Lawrence Erlbaum.

Embretson, S. E. , & Reise, S. P. (2000). Item response theory for psychologists. Mahwah, NJ: Lawrence Erlbaum.

Holland, P. W. (1990). On the sampling theory foundations of item response theory models. Psychometrika, 55, 577-601.

Linacre, J. M. (2001). WINSTEPS Rasch measurement computer program(Version 3.31) [Computer software]. Chicago: Winsteps.com.

Linacre, J. M. (2003). Facets Rasch measurement computer program [Computer software]. Chicago: Winsteps.com.

Linacre, J. M. , & Wright, B. D. (1998). A user’s guide to BIGSTEPS/WINSTEPS. Chicago: Measurement, Evaluation, Statistics, and Assessment Press.

Masters, G. N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47, 149-174.

Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. Copenhagen, Denmark: Institute of Educational Research.

10.

Smith, R. M. (1988). The distributional properties of Rasch standardized residuals. Educational and Psychological Measurement, 48, 657-667.

11.

Smith, R. M. (1991). The distributional properties of Rasch item fit statistics. Educational and Psychological Measurement, 51, 541-565.

12.

Smith, R. M. (1994). Detecting item bias in the Rasch rating scale model. Educational and Psychological Measurement, 54, 886-896.

13.

Smith, R. M. (1996). A comparison of the Rasch separate calibration and between-fit methods of detecting item bias. Educational and Psychological Measurement, 56, 403-418.

14.

Smith, R. M. , Schumacker, R. E. , & Bush, M. J. (1998). Using item mean squares to evaluate fit to the Rasch model. Journal of Outcome Measurement, 2, 66-78.

15.

Wright, B. D. , & Douglas, G. A. (1977). Best procedures for sample-free item analysis. Applied Psychological Measurement, 1, 281-295.

16.

Wright, B. D. , Linacre, J. M. , & Gustafson J.-E. , & Martin-Löf, P. (1994). Reasonable mean-square fit values. Rasch Measurement Transactions, 8(3), 370. Retrieved December 1, 2003, from http://www.rasch.org/rmt/rmt83b.htm

17.

Wright, B. D. , & Masters, G. N. (1982). Rating scale analysis. Chicago: Measurement, Evaluation, Statistics, and Assessment Press.

18.

Wright, B. D. , & Mead, R. J. (1978). BICAL: Calibrating items and scales with the Rasch model (Research Memorandum No. 23A). Chicago: University of Chicago, Department of Education, Statistical Laboratory.

19.

Wright, B. D. , & Panchapakesan, N. (1969). A procedure for sample-free item analysis. Educational and Psychological Measurement, 29, 23-48.

20.

Wright, B. D. , & Stone, M. H. (1979). Best test design. Chicago: Measurement, Evaluation, Statistics, and Assessment Press.

Item Parameter Recovery,Standard Error Estimates,and Fit Statistics of the Winsteps Program for the Family of Rasch Models

Abstract

Keywords

Get full access to this article

References