How do we interpret these vision models?

Visualizing what they have learned

Filters

Can simply visualize the first layer’s filters by seeing they’re shapes

Higher than first layer get much more complicated- intractable

Final Layer Features

Can do cool stuff for a given images’s final feature vectors

For example

Get L2 neighbors in that feature space
Visualize the space of feature vectors (using dimensionality reduction)
- Can even plot the images on an x-y grid and see its underanding

Activations

Understanding input pixels

Important Pixels

Process

Run images through network- record values of a chosen channel
Visualize image patches which correspond to maximal activations

Saliency via occlusion

Mask part of an image and see how much predicted probabilities change

Saliency via backprop

Do a forward pass on an image and compute the gradient with respect to image pixels

Absolute value and max over RGB channels

This will generate a saliency map where white corresponds to impact on the gradient

This can help illuminate biases

e.g. classifying husky with white snow

Guided backprop to visualize features

Process

Pick a single intermediate channel
Compute gradient of neuron value with respect to image pixels
Illuminates intermediate features

Gradient ascent to visualize features

Gradient ascent

Generate synthetic image which maximally activates a neuron

Process

Initialize image to zeros
Repeat the following
1. Forward pass to get current score
2. Backprop to get gradient of neuron value
3. Make small update (gradient ascent) to the image

Asecent : $I arg max S_{c} (I) - λ ∣∣ I ∣ ∣_{2}^{2}$

$S_{c} (I)$ : score for class c (before softmax)
$λ ∣∣ I ∣ ∣_{2}^{2}$ : simple regularizer Can do cool stuff with “muti-faceted” visualization

Adversarial perturbations

General process

Pick an artbitrayry image
Pick an arbitrary class
Modify image to maximize class
Repeat until network is fooled

Very subtle changes!

Style Transfer

Features Inversion

Given CNN feature vector, get new image whcih

matches feature vector
looks natural

Basically

$x^{*} = x \in R^{H \times W \times C} arg min l (ϕ (x), ϕ_{0}) + λ R (x)$
- But not $ϕ$ todo instead is a similar symbol
- $l (ϕ (x), ϕ_{0}) = ∣∣ ϕ (x) - ϕ_{0} ∣ ∣^{2}$

Deep dream

Instead of synthesizing image to maximize a specific neuron, amlify neuron activations at some layer in the network

Basic process

Choose image + layer in CNN
Repeat
1. Compute layer’s activations
2. Set gradient of layer equal to activation
  1. $I^{*} = arg max_{l} \sum_{i} f_{i} (l)^{2}$
3. Compute gradient on image
4. Update image

Texture Synthesis

Goal: patch of texture → bigger image of same texture

Couple of methods

Nearest neighbor

Typical nearest neighbors
Generate pixel one at a time in scanline order- form neighborhood of already generated pixels and copy nearest neighbor from input

Neural Texture Synthesis: Gram Matrix

Each layer of CNN gives
- C x H x W tensor of features
- Equal to: H x W grid of C-dimensoinal vectors
From outer product of two C-dimensional vectors, get C x C matrix measuring co-occurence
Average over all HW pairs of vectors, gives
- Gram matrix of shape C x C Process

Pretrain CNN
Run input texture forward through CNN, record activations
At each layer compute gram matrix
Initialize generated image frmo random noise
Pass image through CNN, compute gram matrix on each layer
1. $G_{ij}^{l} = \sum_{k} F_{ik}^{l} F_{jk}^{l}$
2. Shape is $C_{i} \times C_{i}$
Compute loss
1. Weighted sum of L2 distance between Gram matrices
Backprop to get gradient on image
Gradient step on image
Go to step 5

Neural Style Transfer

Feature + gram reconstruction

Basic idea- Content Image + Style Image → Stylized image (ie style transfer)

TODO review

Cons

Many forward / bacpward passes

Solution: fast style transfer

Train another neural network to perform style transfer for us

Quick review

Lots of ways to understand CNN’s representations
Activatoins
- NN
- Dimensionality reduction
- Maximal patches
- Occlusion
Gradients
- Saliency maps
- Class visualiation
- Fooling images
- Feature inversion
Fun stuff
- DeepDream- amplify neuron activations at some layer in the network
- Style transfer- usage of gram matrices

Pablo's Reference Notes

Explorer

Visualization and understanding

Visualizing what they have learned

Filters

Final Layer Features

Activations

Understanding input pixels

Important Pixels

Saliency via occlusion

Saliency via backprop

Guided backprop to visualize features

Gradient ascent to visualize features

Adversarial perturbations

Style Transfer

Features Inversion

Deep dream

Texture Synthesis

Neural Style Transfer

Graph View

Table of Contents

Backlinks