To test recall, the sine-wave distribution was chosen, which included noise. Thus, this task could also demonstrate the ability to generalize. 15 principal components were extracted (explaining 84.8% of the total variance). The output follows the shape of the sine wave, and it does not get distorted by the outliers (figure 5.6). Here, the right balance of the number of principal components was important. With too many (q=40) extracted components, also the noise was included in the potential field.
The mixture of local PCA could also restore the input-output relationship despite the noise (figure 5.7). Here, all noise points were assigned to one big ellipse (in the center of the image). This ellipse did not disturb the recall because the algorithm punishes large ellipsoidal volumes (section 4.2).
|
|