SPSS Syntax for Missing Value Imputation in Test and Questionnaire Data Joost R. van Ginkel and L. Andries van der Ark, Department of Methodology and Statistics, Tilburg University

Description A well-known problem in the analysis of test and questionnaire data is that some item scores may be missing. Advanced methods for the imputation of missing data are available, such as multiple imputation under the multivariate normal model and imputation under the saturated logistic model (Schafer, 1997). Accompanying software was made available by, for example, Schafer (1998a, 1998b) and in SOLAS (2001) and S-Plus 6 for Windows (2001). However, these methods and software may be too complicated for a typical psychological researcher, and for the imputation of his or her missing data, he or she depends on the help of a trained statistician. If available, this statistician may not always have enough time or may not be an experienced software user, so the researcher may decide to simply delete all incomplete observations. To help researchers impute scores using simple methods, two SPSS subroutines were written. The aim of these subroutines is that researchers can apply them easily within SPSS and without experienced help. The subroutine “tw” performs two-way imputation, and the subroutine “rf” performs responsefunction imputation. Two-way imputation and response-function imputation are described by Sijtsma and Van der Ark (2003). Simulation studies by Van der Ark and Sijtsma (in press) indicate that these imputation methods work rather well when applied to an approximately unidimensional set of items (i.e., the items measure the same construct). The subroutines allow the researcher to transform an SPSS data file with missing values (an incomplete data file) into an SPSS data file without missing values (a completed data file). The researcher can use the completed data file for further analysis. To run the subroutines, one must select the variables containing the missing scores that need to be imputed, and some optional arguments also can be specified. For two-way imputation, the most important optional argument pertains to changing or removing the random error that is added to the imputed values by default. For response-function imputation, the most important optional argument pertains to changing the minimum group size used for estimating the response function (see Sijtsma & Van der Ark, 2003). Availability The SPSS syntax files, “tw.sps” and “rf.sps,” contain the subroutines that were written in SPSS 10 for Windows using the MATRIX-command (SPSS, 2000). They are available free of charge from http://www.uvt.nl/faculteiten/fsw/organisatie/departementen/mto/software2.html. A brief manual is also available from this Web site. The syntax files can be applied using SPSS 6 and later versions for Windows.

152

Applied Psychological Measurement, Vol. 29 No. 2, March 2005, 152–153 DOI: 10.1177/0146621603260688 © 2005 Sage Publications

J. R. VAN GINKEL and L. A. VAN DER ARK COMPUTER PROGRAM EXCHANGE

153

References Schafer, J. L. (1997). Analysis of incomplete multivariate data. London: Chapman & Hall. Schafer, J. L. (1998a). CAT: Software for S-PLUS Version 4.0 for Windows. Retrieved May 28, 2003, from www.stat.psu.edu/∼jls/sp40.html Schafer, J. L. (1998b). NORM: Software for S-PLUS Version 4.0 for Windows. Retrieved May 28, 2003, from www.stat.psu.edu/∼jls/sp40.html Sijtsma, K., & Van der Ark, L. A. (2003). Investigation and treatment of missing item scores in test and questionnaire data. Multivariate Behavioral Research, 38, 505-528. SOLAS. (2001). SOLAS for Missing Data Analysis 3.0 [Computer software]. Cork, Ireland: Statistical Solutions. S-Plus 6 for Windows [Computer software]. (2001). Seattle, WA: Insightful Corporation.

SPSS. (2000). SPSS Base 10.0 user’s guide [Software manual]. Chicago: Author. Van der Ark, L. A., & Sijtsma, K. (in press). The effect of missing data imputation on Mokken scale analysis. In L. A. van der Ark, M. A. Croon, & K. Sijtsma (Eds.), New developments in categorical data analysis for the social and behavioral sciences. Mahwah, NJ: Lawrence Erlbaum.

Author’s Address Joost R. van Ginkel, Department of Methodology and Statistics, Tilburg University, P.O. Box 90153, 5000 LE Tilburg, The Netherlands; phone: +31 13 466 8046; fax: +31 13 466 3002; e-mail: [email protected].

SPSS Syntax for Missing Value Imputation in Test and ...

scores using simple methods, two SPSS subroutines were written. The aim of ... http://www.uvt.nl/faculteiten/fsw/organisatie/departementen/mto/software2.html.

39KB Sizes 1 Downloads 152 Views

Recommend Documents

Investigation and Treatment of Missing Item Scores in Test and ...
May 1, 2010 - This article first discusses a statistical test for investigating whether or not the pattern of missing scores in a respondent-by-item data matrix is random. Since this is an asymptotic test, we investigate whether it is useful in small

Multiple Imputation of Item Scores in Test and Questionnaire Data, and ...
The performance of five simple multiple imputation methods for dealing with missing data were compared. In addition, random imputation and multivariate nor- mal imputation were used as lower and upper benchmark, respectively. Test data were simulated

The Role of Missing Data Imputation Methods on the Accuracy of ...
The Role of Missing Data Imputation Methods on the Accuracy of Data Mining Results .pdf. The Role of Missing Data Imputation Methods on the Accuracy of ...

test for independence of the variables with missing elements in one ...
not identified. Similar investigations with one missing element in the correlation ... correlation matrix P. We derive also the maximum likelihood ratio test for the.

Syntax and Semantics of Axial Expressions in Russian
Dec 7, 2011 - v-nutr'. IN-INSIDE.ACC iz-nutr-i. FROM-INSIDE-GEN. The Goal and Source expressions in Table 2 both combine with overt DP complements, and yield the semantics predicted by the structure of PathPs in (5):. (21) On zabrals'a v-nutr' tank-a

Chapter 8 The Effect of Missing Data Imputation on ...
criterion in Step 1. For confirmatory test construction, the MHM is fitted to the data cor- responding to the a priori defined test consisting of J items using methods.

Better Learning and Decoding for Syntax Based SMT ...
Data made available by the courtesy of Microsoft .... Part-of-Speech mapping template: whether the ..... clude that PSDIG and Pharaoh each excel on dif-.

The Role of Missing Data Imputation Methods on the Accuracy of Data ...
The Role of Missing Data Imputation Methods on the Accuracy of Data Mining Results .pdf. The Role of Missing Data Imputation Methods on the Accuracy of ...

CPP imputation codebook.pdf
... data in the non- imputed dataset. Imputation rule. Child Demographics Impute for all live-born children. Child's race (NCPP_RACE) Use data from all sources, ...

Two-way imputation: A Bayesian method for estimating ...
Dec 17, 2006 - Involved methods often use data augmentation (Tanner and Wong, 1987) for estimation of the imputation model. Examples are multiple.

SYNTAX AND INFORMATION STRUCTURE: FREE ...
I further argue that flexible relative prominence in Serbian is best captured by .... 2.3 The role of quantification and domain restriction in constituent order variation ..... 29 ...... This modification is named the Mapping Hypothesis, and states,

pdf syntax
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. pdf syntax. pdf ...

Merkelized Abstract Syntax Trees
2008. [3] P. Todd. Re: Which clients fully support p2sh and/or multisig? https://bitcointalk.org/index.php? topic=255145.msg2757327#msg2757327. Accessed:.

Partial Correlation in SPSS Minitab and R.pdf
Whoops! There was a problem loading this page. Retrying... Partial Correlation in SPSS Minitab and R.pdf. Partial Correlation in SPSS Minitab and R.pdf. Open.

r for sas and spss users - Meredith A. Kleykamp
can download the programs and data sets used in both documents at: http://r4stats. ...... “argument” that tells R what percent of the extreme values to exclude before calculating the ...... identical language with extensions to handle “big data