The mathematics behind PKH1982
Another departure from global error term models such as RW1972 (Rescorla & Wagner, 1972), the PKH1982 model (Pearce et al., 1982) does not use an error term for learning excitatory associations (but does for inhibitory associations), and ties stimulus associability () to absolute global prediction error.
note: The implementation of this model closely follows the technical note from the CAL-R group where possible. Divergences are noted.
1 - Generating expectations
Let denote the excitatory strength from stimulus to stimulus , and the inhibitory strength from stimulus to stimulus (effectively, a “no j” representation). On any given trial, the net expectation of stimulus , , is given by:
where denotes the presence (1) or absence (0) of stimulus , and the set represents all stimuli in the design.
2 - Learning associations
Changes to the excitatory and inhibitory associations between stimuli are given by:
where and represent learning rates for excitatory and inhibitory associations, respectively, as determined by stimulus , is the associability of stimulus , respectively, and and are the excitatory asymptote and the overexpectation of stimulus , respectively.
Importantly, in Eq.2a is a parameter that is equal to 1 if the expectation of stimulus , is lower than its excitatory asymptote (i.e., ), but 0 if not. This implies that the model stops strengthening if the expectation of is higher than its excitatory asymptote.
As mentioned in the introductory note, the PKH1982 model does not learn excitatory associations via correction error. However, the model does learn inhibitory associations via correction error, as the overexpectation term above, is equal to , where is the minimum function. This implies only takes non-zero values when the expectation of is higher than its intensity on the trial ().
3 - Learning to attend
The associability parameter changes completely from trial to trial as a function of learning (note the lack of below), with the change being equal to the difference of the absolute global error, via:
where denotes the contribution of the prediction error based on the jth stimulus. In this regard, it is important to note that Pearce et al. (1982) did not extend their model to account for the predictive power of within-compound associations, yet the implementation of the model in this package does. This can sometimes result in unexpected behaviour, and as such, Eq. 3 above includes the extra parameter (defaulting to 1/K) that denotes whether the expectation of stimulus contributes to attentional learning. As such, the user can set these parameters manually to reflect the contribution of the different experimental stimuli. For example, in a simple “AB>(US)” design, setting = 1 and leads to the behavior of the original model.
The PKH1982 model improves upon the Pearce & Hall (1980) model by adding an extra parameter that controls the rate at which associability changes. If we qualify the changes in associability described by Eq.3 via (meaning they happened after trial ), then we can quantify the total associability of stimulus after trial via:
where is a parameter determining both the rate at which associability decays (via ), and the rate at which increments in attention occur. Note that changes in associability only apply to stimuli presented on the trial (i.e., ); attention to absent stimuli remains unchanged.
4 - Generating responses
There is no specification of response-generating mechanisms in PKH1982. However, the simplest response function that can be adopted is the identity function on stimulus expectations. If so, the responses reflecting the nature of , , are given by: