Skip to contents

The mathematics behind RW1972

The most influential associative learning model, RW1972 (Rescorla & Wagner, 1972), learns from global error and posits no changes in stimulus associability.

1 - Generating expectations

Let vk,jv_{k,j} denote the associative strength from stimulus kk to stimulus jj. On any given trial, the expectation of stimulus jj, eje_j, is given by:

ej=kKxkvk,j \tag{Eq.1} e_j = \sum_{k}^{K}x_k v_{k,j}

xkx_k denotes the presence (1) or absence (0) of stimulus kk, and the set KK represents all stimuli in the design.

2 - Learning associations

Changes to the association from stimulus ii to jj, vi,jv_{i,j}, are given by:

Δvi,j=αiβj(λjej) \tag{Eq.2} \Delta v_{i,j} = \alpha_i \beta_j (\lambda_j - e_j)

where αi\alpha_i is the associability of stimulus ii, βj\beta_j is a learning rate parameter determined by the properties of jj1, and λj\lambda_j is a the maximum association strength supported by jj (the asymptote).

3 - Generating responses

There is no specification of response-generating mechanisms in RW1972. However, the simplest response function that can be adopted is the identity function on stimulus expectations. If so, the responses reflecting the nature of jj, rjr_j, are given by:

rj=ej \tag{Eq.3} r_j = e_j

References

Rescorla, R. A., & Wagner, A. R. (1972). A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In A. H. Black & W. F. Prokasy (Eds.), Classical conditioning II: Current research and theory. (pp. 64–69). Appleton-Century-Crofts.