WinGen: Windows software that Generates IRT parameters and item responses
Citation
Han, K. T. (2007). WinGen2: Windows software that generates IRT parameters and item responses [computer program]. Amherst, MA: Center for Educational Assessment, University of Massachusetts Amherst. Retrieved February 2, 2007, from http://people.umass.edu/kha/wingen/
Han, K. T. (in press). WinGen: Windows software that generates IRT parameters and item responses. Applied Psychological Measurement.
Han, K. T., & Hambleton, R. K. (2007). User's Manual: WinGen.
I. Introduction
Item response theory (IRT) is a popular and valuable scaling method that is particularly useful for large-scale assessments due to its attractive properties such as the invariance of item and examinee parameter values. However, IRT relies heavily on two strict assumptions, unidimensionality and the mathematical form of the item characteristic functions, the violations of which can produce serious negative consequences to the measurement process. One way of testing the impact of violating these two critical assumptions, as well as studying factors such as the impact of examinee sample sizes, the shape of the ability distribution, and test length, and many other factors, is via Monte Carlo simulation studies.
The essential part of Monte Carlo studies is simulating examinee item response data given true parameter values. A number of computer programs for simulating IRT data have been developed since the early 1970s. However, most of them were developed in the DOS environment (e.g., DATAGEN, Hambleton & Rovinelli, 1973, GENIRV, Baker, 1989, and RESGEN, Muraki, 1992). As a result, these programs are limited today because of inherent problems in DOS: (1) slow performance speed (16-bit), (2) limited usable system resources, (3) incompatibility with recent 32-bit Windows-based OSs, and (4) not a user-friendly interface. Some recent computer simulation programs, such as WINIRT (Fang & Johanson, 2005) and PARDSIM (Yoes, 1997), were developed for Windows-based OSs, but they provide only limited options for generating data from IRT models. Therefore, the primary goal of this study was to develop computer software that can simulate IRT data using many IRT models and with many important options to facilitate the design and implementation of computer simulation studies in the field of psychometric methods.
A computer program, named WinGen (Han, 2006), was developed to generate dichotomous and polytomous item response data under IRT. WinGen has the following appealing features:
1. WinGen supports various IRT models.
There are several kinds of unidimensional and multidimensional IRT models that are widely used today: (1) dichotomous IRT models with one, two, and three parameters, (2) non-parametric models, (3) polytomous IRT models such as the partial credit model, generalized partial credit model, graded response model, rating scale model, and nominal response model, and, (4) multidimensional compensatory models. WinGen supports these IRT models and allows users to even have a mixture of more than one IRT model in a set of items (for example, data for the first 10 items might be generated from the two-parameter logistic model, the next 30 items might be generated from the three-parameter logistic model, and with the last 10 items, data might be generated by the graded response model). Such a data set might be useful when simulating state test data, for example.
2. WinGen generates parameter values from various distributions for realistic data.
WinGen can generate sets of item parameters and sets of examinee ability parameters to essentially create more realistic item response data from various kinds of distributions. Most programs that simulate IRT data only support normal and/or uniform distributions. Although uniform and normal distributions may be theoretically ideal or easy to understand, these distributions are not likely to be the only distributions seen in the real world or of interest to researchers. With WinGen a user can choose a normal, uniform, or beta distribution (which can be defined to allow the generation of skewed distributions, skewed from highly positive to highly negative) for the examinee parameter and a normal, uniform, beta, or lognormal distribution for item parameters so that the user can conduct research with more realistic IRT data sets, or even unrealistic parameters if that is the interest of the researcher.. WinGen provides a graphic function of plotting a histogram of examinee ability parameters so that a user can make sure the distribution of the generated parameters is what he/she wanted. This graphing feature is especially helpful when the goal is to produce a skewed distribution. Of course, WinGen gives a user substantial control over item and examinee parameters too by allowing a user to read in user specified parameter sets and/or to set SEED values for generating parameters.
3. WinGen provides an intuitive and user-friendly interface.
By taking advantage of running on a Windows platform, WinGen provides a virtually point-and-click solution for simulating IRT data. There are three stages for simulating data:
(1) generating or reading in ability parameter values, (2) generating or reading in item parameter values, and (3) simulating item response data. All three stages are shown on a one-stop interface screen, so a user can prevent possible mistakes by being able to check the simulated data at each stage. Also the intuitive interface of WinGen would be helpful for teaching IRT to those who are not familiar with the theory. Once item parameters and/or ability parameters are generated, WinGen immediately provides IRT plots such as item characteristic curves (ICC), test characteristic curves (TCC), item information curves (IFC), and test information curves (TIC) so that a user can make sure the generated parameters fit his/her research purposes. See Figure 1 for an example of the interface.
4. WinGen is based on the most recent computer environment.
WinGen was developed on Microsoft .NET frameworks 2.0, the most recent computer software development environment. The program can run on either 32bit Windows series (ex., Windows XP) or 64bit Windows series (ex., Windows vista) and is optimized to use system resources efficiently so that the program can handle a bigger data in a shorter time. This software can also easily be converted to web-based software.
5. WinGen provides a powerful research tool.
Several powerful tools are provided for various research purposes in WinGen. Replication data can be simulated up to 1,000,000 sets, and syntax files for other IRT programs, such as PARSCALE (Muraki & Bock, 2003), BILOG-MG (Zimowski, Muraki, Mislevy, & Bock, 2003), and MULTILOG (Thissen, 2003), are automatically generated (of course, a user can specify the syntax of the software by reading in a sample syntax file). Batch files can also be generated for handling multiple calibrations in a cue. WinGen provides a dialog input to introduce differential item functioning (DIF) or item parameter drift in the simulated data. With multiple file read-in option in WinGen, a user can have multiple groups of examinees and multiple sets of items/tests. WinGen can also read-in output files (parameter estimates) from other IRT programs and automatically analyze DIF by computing RMSD, MAD, and BIAS. Table 1 summarizes the features of WinGen and several of other simulation software available.
Educational Importance
Although many IRT simulation programs have been developed, most of them were used only for specifically narrow purposes since they were not publicly shared programs. WinGen is more general and definitely more user friendly than many of the IRT simulation programs available today. WinGen is easy to use with an intuitive, user-friendly interface while providing very strong research tools and performance at the same time. Thus, WinGen would seem to be a popular choice for researchers who want to do simulation research, and students who are eager to learn more about IRT.