The mask approach [1] generates heatmaps by solving an optimization problem, which aims to find the smallest and smoothest area that maximally decreases the output of a neural network. It can generate very good heatmaps, but usually takes a long time to converge