vignettes/blog/2020-05-27-the-information-game.Rmd
2020-05-27-the-information-game.Rmd
A typical task in adaptive testing is to select, out of a precalibrated item pool, the most appropriate item to ask, given an interim estimate of ability, . A popular approach is to select the item having the largest value of the item information function (IIF) at .
When the IRT model is the Rasch model, the item response function (IRF) is where is the item difficulty, and the IIF can be computed as . For all items, this function reaches a maximum of 0.25, encountered exactly where and . So picking the item with the maximum IIF is the same as picking the item for which a person of ability of has a probability of 0.5 to produce the correct answer.
If we use the two-parameter logistic (2PL) model instead, the IRF is where the new parameter is called the discrimination parameter, and the IIF can be computed as . Note that while the contribution of the product, , remains bounded between 0 and 0.25, the influence of can become quite large – for example, if , gets multiplied by 25, ten times the maximum value under the Rasch model. Selecting such an informative item gives us the happy feeling that our error of measurement will decrease a lot, but what happens to the person’s probability to give the correct response?
In our game, we have one person and two items. The person’s ability is 0, represented with a vertical gray line. One of the item is fixed to be a Rasch item with ; its IRF is shown as a solid black curve, and its IIF as a dotted black curve. The second item, shown in red, is 2PL, and you can control its two parameters with the sliders. Initially, and .
The purpose of the game is to find particularly awkward situations where the more informative item is the most inappropriate in the sense of:
having a difficulty as remote from as possible;
having a probability of a correct response at as low as possible.
Can you find the sweet spots? I have just computed the answer to the second question but I am not telling (in fact, I can tell you the value of : 0.0022, quite a bit off 0.5).