Sage Journals: Discover world-class research

Abstract

A brief review of various information criteria is presented for the detection of differential item functioning (DIF) under item response theory (IRT). An illustration of using information criteria for model selection as well as results with simulated data are presented and contrasted with the IRT likelihood ratio (LR) DIF detection method. Use of information criteria for general IRT model selection is discussed.

Keywords

Akaike information criterion Bayesian information criterion differential item functioning information criteria item response theory likelihood ratio test marginal maximum likelihood estimation

Get full access to this article

View all access options for this article.

References

Akaike

(1973). Information theory and an extension of the maximum likelihood principle. In Petrov

B. N.

Csáke

(Eds.), 2nd International Symposium on Information Theory (pp. 267-281). Budapest, Hungary: Akadémiai Kiadó.

Akaike

(1976). On entropy maximization principle. In Krishnaiah

P. R.

(Ed.), Applications of statistics: Proceedings of the symposium held at Wright State University, Dayton, Ohio. (pp. 27-53) Amsterdam, The Netherlands: North-Holland.

Baker

F. B.

Kim

S.-H.

(2004). Item response theory: Parameter estimation techniques (2nd ed.). New York, NY: Dekker.

Bishop

Y. M. M.

Fienberg

S. E.

Holland

P. W.

(1975). Discrete multivariate analysis: Theory and practice. Cambridge, MA: The MIT Press.

Bock

R. D.

Aitkin

(1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46, 443-459; 47, 369 (Errata).

Bock

R. D.

Lieberman

(1970). Fitting a response model for n dichotomously scored items. Psychometrika, 35, 179-197.

Bock

R. D.

Moustaki

(2007). Item response theory in a general framework. In Rao

C. R.

Sinharay

(Eds.), Handbook of statistics (Vol. 26, pp. 469-513). Amsterdam, The Netherlands: Elsevier.

Bozdogan

(1987). Model selection and Akaike’s Information Criterion (AIC): The general theory and its analytical extensions. Psychometrika, 52, 345-370.

Burnham

K. P.

Anderson

D. R.

(1998). Model selection and inference: A practical information-theoretic approach. New York, NY: Springer-Verlag.

10.

Cai

(2012). flexMIRT: A numerical engine for multilevel item factor analysis and test scoring (Version 1.88) [Computer software]. Seattle, WA: Vector Psychometric Group.

11.

Cai

Thissen

du Toit

(2011). IRTPRO: Item response theory for patient-reported outcomes [Computer software]. Lincolnwood, IL: Scientific Software International.

12.

Cohen

A. S.

Cho

S.-J.

(2016). Information criteria. In van der Linden

W. J.

(Ed.), Handbook of item response theory (Vol. 2, pp. 363-378). Boca Raton, FL: CRC Press.

13.

Cohen

A. S.

Kim

S.-H.

Wollack

J. A.

(1996). An investigation of the likelihood ratio test for detection of differential item functioning. Applied Psychological Measurement, 20, 15-26.

14.

de Ayala

R. J

. (2009). The theory and practice of item response theory. New York, NY: The Guilford Press.

15.

deLeeuw

(1992). Introduction to Akaike (1973) information theory and an extension of the maximum likelihood principle. In Kotz

Johnson

N. L.

(Eds.), Breakthroughs in statistics, Volume I: Foundations and basic theory (pp. 599-609). New York, NY: Springer-Verlag.

16.

Fujikoshi

Satoh

(1997). Modified AIC and C_p in multivariate linear regression. Biometrika, 84, 707-716.

17.

Holland

P. W.

Wainer

(Eds.). (1993). Differential item functioning. Hillsdale, NJ: Lawrence Erlbaum.

18.

Hurvich

C. M.

Tsai

C.-L.

(1989). Regression and time series model selection in small samples. Biometrika, 76, 297-307.

19.

Judge

G. G.

Griffiths

W. E.

Hill

R. C.

Lütkepohl

Lee

T.-C.

(1985). The theory and practice of econometrics (2nd ed.). New York, NY: John Wiley.

20.

Kang

T.-H.

Cohen

A. S.

(2007). IRT model selection methods for dichotomous items. Applied Psychological Measurement, 31, 331-358.

21.

Kang

T.-H.

Cohen

A. S.

Sung

H.-J.

(2009). IRT model selection methods for polytomous items. Applied Psychological Measurement, 33, 499-518.

22.

Kim

S.-H.

(2007). Some posterior standard deviations in item response theory. Educational and Psychological Measurement, 67, 258-279.

23.

Kim

S.-H.

Cohen

A. S.

(1998). Detection of differential item functioning under the graded response model with the likelihood ratio test. Applied Psychological Measurement, 22, 345-355.

24.

Klein Entink

R. H.

Fox

J.-P.

van der Linden

W. J

. (2009). A multivariate multilevel approach to the modeling of accuracy and speed of test takers. Psychometrika, 74, 21-48.

25.

Kullback

(1959). Information theory and statistics. New York, NY: John Wiley.

26.

Cohen

A. S.

Kim

S.-H.

Cho

S.-J.

(2009). Model selection methods for dichotomous mixture IRT models. Applied Psychological Measurement, 33, 353-373.

27.

Lord

F. M.

(1968). An analysis of the Verbal Scholastic Aptitude Test using Birnbaum’s three-parameter logistic model. Educational and Psychological Measurement, 28, 989-1020.

28.

Lord

F. M.

(1980). Applications of item response theory to practical testing problems. Hillsdale, NJ: Lawrence Erlbaum.

29.

Magis

Tuerlinckx

De Boeck

(2015). Detection of differential item functioning using the lasso approach. Journal of Educational and Behavioral Statistics, 40, 111-135.

30.

May

(2006). A multilevel Bayesian item response theory method for scaling socioeconomic status in international studies of education. Journal of Educational and Behavioral Statistics, 31, 63-79.

31.

Meade

A. W.

Wright

N. A.

(2012). Solving the measurement invariance anchor item problem in item response theory. Journal of Applied Psychology, 97, 1016-1031.

32.

Parzen

Tanabe

Kitagawa

(Eds.). (1998). Selected papers of Hirotugu Akaike. New York, NY: Springer-Verlag.

33.

Rao

C. R.

(1973). Linear statistical inference and its applications (2nd ed.). New York, NY: John Wiley.

34.

Rao

C. R.

(2001). On model selection (with discussions and rejoinder). In Lahiri

(Ed.), Model selection (pp. 1-64). Beachwood, OH: Institute of Mathematical Statistics.

35.

Rissanen

(1978). Modeling by shortest data description. Automatica, 14, 465-471.

36.

Sakamoto

Ishiguro

Kitagawa

(1986). Akaike information criterion statistics. Tokyo, Japan: KTK Scientific Publishers.

37.

Schwartz

(1978). Estimating the dimension of a model. The Annals of Statistics, 6, 461-464.

38.

Sclove

S. L.

(1987). Application of model-selection criteria to some problems in multivariate analysis. Psychometrika, 52, 333-343.

39.

Shibata

(1989). Statistical aspects of model selection (IIASA Working Paper, No. WP-89-077). Laxenburg, Austria: International Institute for Applied Systems Analysis. Retrieved from http://pure.iiasa.ac.at/3267/1/WP-89-077.pdf

40.

Spiegelhalter

D. J.

Best

N. G.

Carlin

B. P.

van der Linde

(2002). Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society, Series B, 64, 353-616.

41.

Sugiura

(1978). Further analysis of the data by Akaike’s information criterion and the finite corrections. Communications in Statistics: Part A–theory and Methods, A, 7, 13-26.

42.

Thissen

(1982). Marginal maximum likelihood estimation for the one-parameter logistic model. Psychometrika, 47, 175-186.

43.

Thissen

(1991). MULTILOG user’s guide: Multiple, categorical item analysis and test scoring using item response theory (Version 6.0). Chicago, IL: Scientific Software.

44.

Thissen

(2001). IRTLRDIF v.2.0b: Software for the computation of the statistics involved in item response theory likelihood-ratio tests for differential item functioning. Chapel Hill: L. L. Thurstone Psychometric Laboratory, University of North Carolina.

45.

Thissen

Chen

W.-H.

Bock

R. D.

(2002). MULTILOG: Multiple, categorical item analysis and test scoring using item response theory [Computer software]. Lincolnwood, IL: Scientific Software International.

46.

Thissen

Steinberg

Gerrard

(1986). Beyond group differences: The concept of item bias. Psychological Bulletin, 99, 118-128.

47.

Thissen

Steinberg

Wainer

(1988). Use of item response theory in the study of group differences in trace lines. In Wainer

Braun

H. I.

(Eds.), Test validity (pp. 147-169). Hillsdale, NJ: Lawrence Erlbaum.

48.

Thissen

Wainer

(1982). Some standard errors in item response theory. Psychometrika, 47, 397-412.

49.

Tutz

Schauberer

(2015). A penalty approach to differential item functioning in Rasch models. Psychometrika, 80, 21-43.

50.

Venables

W. N.

Smith

D. M.

, & The R Development Core Team. (2009). An introduction to R (2nd ed.). La Vergne, TN: Network Theory.

Use of Information Criteria in the Study of Group Differences in Trace Lines

Abstract

Keywords

Get full access to this article

References