gradcam optimization
There are a lot of things we can optimize in our gradcam class/processes. This has been neglected for a bit, but it deserves a closer look given the recent timing information from Hall B, where bad images (ones where a heatmap is produced) seemingly increase the predict time by an order of magnitude.
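As a starting point, it may help to confirm the slowdown in isolation before changing anything. The sketch below times a predict-only path against a predict-plus-heatmap path using only the standard library. `predict_only` and `predict_with_heatmap` are hypothetical stand-ins, not our real code; swap in the actual predict call and the gradcam call to get meaningful numbers.

```python
import time
from statistics import median


def time_call(fn, *args, repeats=5):
    """Return the median wall-clock seconds of fn(*args) over `repeats` runs."""
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn(*args)
        samples.append(time.perf_counter() - start)
    return median(samples)


# Hypothetical stand-in for the plain predict path (no heatmap).
def predict_only(image):
    return sum(image)


# Hypothetical stand-in for the path taken on bad images, where a
# heatmap is also produced. The extra list build only simulates added work.
def predict_with_heatmap(image):
    score = sum(image)
    heatmap = [x * score for x in image for _ in range(20)]
    return score, heatmap


image = list(range(10_000))  # placeholder input, not a real image
base = time_call(predict_only, image)
with_cam = time_call(predict_with_heatmap, image)
print(f"predict only: {base:.6f}s")
print(f"with heatmap: {with_cam:.6f}s")
print(f"ratio: {with_cam / base:.1f}x")
```

Using the median over a few repeats keeps one-off stalls (first-call warmup, GC pauses) from skewing the comparison. If the real ratio matches the order-of-magnitude figure from Hall B, the next step would be profiling inside the heatmap path to see which stage dominates.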