vignettes/blog/2020-03-06-local-in-dependence-and-polytomization.Rmd
2020-03-06-local-in-dependence-and-polytomization.Rmd
dexter has a function, fit_domains
,
that deals with subtests. Domains are subsets of the items in the test
defined as a nominal variable: an item property. The items in each
subset are transformed into a large partial credit item whose item score
is the subtest score; the two models, ENORM and the interaction models,
are then fit on the new items.
To illustrate, let us go back to our standard example, the verbal aggression data. Each of the 24 items pertains to one of four frustrating situations. Treating the four situations as subtests or domains, we obtain for one of them:
To the left we see how the two models compare in predicting the category probabilities; to the right are the item-total regressions for the item (i.e. domain) score. Everything looks nice for this high-quality data set, but note in particular how closely the two models agree for the domain score. For all four situations:
Observations like these are ubiquitous. After all, the (polytomous) interaction model differs from ENORM in allowing for local dependence, and there is a certain tradition in dealing with local dependence through polytomization. We will review this tradition here, bearing in mind that its potential is probably far from exhausted. To spice things up even more, we will consider rated data.
The assumption of local stochastic independence (LSI) can be violated for many reasons. One example is when raters grade various aspects of the examinee’s work. The rubric may have the grading on one aspect depend on the grading on other aspects. Or there may be a halo effect: raters assign lower grades to an aspect if they have assigned higher grades to other aspects. To make matters concrete, suppose that an examinee has written an essay that is rated by rater on three aspects using rubric . We assume a one-point rubric: each aspect is graded if insufficient and if sufficient. Consider a model where the probability of the three grades, , given can be expressed as In this equation denotes the number of aspects that were judged sufficient. It is seen that the probability depends on , and on the combination of and . This is a special case of a model described by Kelderman (1984).
Now consider the grades assigned to different aspects as independent items such that Given LSI, the probability of a response pattern is given by which is equal to our model for rated data if . Thus, the parameter models the violation of LSI due to the rater (subscript ) or the scoring rubric (subscript ). In this example, it is assumed that grades given to the first aspect and the third aspect are related given , but more complex dependencies can be modeled by a straightforward extension of the model.
Under the Kelderman model, the probability of a score is modeled as The second term is positive and, given the beta parameters, it depends only on , the rubric , and the rater . Hence, we can write this factor as . If we define we can write This model is formally equivalent to a PCM with for all raters and all . Thus, polytomizing releases us of the need to model local dependencies. Hence, this is one reason why model fit improves when we polytomize. It will be clear that the same argument would hold if the original items follow an interaction model.
The argument has been worked out in greater generality by Verhelst and Verstralen (2008) who note that local dependencies can still be seen if we look at the item-category parameters of the polytomized items. Suppose that LSI holds and . In that case we find that for all raters . In addition, it can be shown that certain order relations must hold among the category parameters. To see this, we stay in the context of the example and define . We may then write where is the usual way to denotes the elementary symmetric function of order with argument . Thus, under LSI the same relationships that hold between elementary symmetric functions of different orders hold between the PCM parameters; e.g., the classical Newton inequalities. Verhelst and Verstralen (2008) propose to test whether these relations hold.