XAI: Class Activation Map (CAM)
Table of Contents
Attention
Visualizing and Understanding Convolutional Networks
The conventional CNN can be conceptually divided into two parts. One part is feature extraction and the other is classification. In the feature extraction process, convolution is used to extract the features of the input data so that the classification can be performed well. The classification process classifies which group each input data belongs to by using the extracted features from the input data.
When we visually identify images, we do not look at the whole image; instead, we intuitively focus on the most important parts of the image. CNN learning is similar to the way humans focus. When its weights are optimized, the more important parts are given higher weights. But generally, we are not able to recognize this because the generic CNN goes through a fully connected layer and makes the features extracted by the convolution layer more abstract.
But works as a black box
When we visually identify images, we do not look at the whole image
When CNN weights are optimized, the more important parts are given higher weights
Class activation map (CAM)