Link to paper

The full paper is available here.

You can also find the paper on PapersWithCode here.

Abstract

Current Deep Network visualization and interpretability methods rely on data space visualizations.
SplineCam is the first provably exact method for computing the geometry of a DN’s mapping.
SplineCam applies to any DN architecture based on CPWL nonlinearities.
SplineCam enables comparison of architectures, measuring generalizability and sampling from the decision boundary.

Paper Content

Introduction

Deep learning and in particular Deep Networks (DNs) have redefined machine learning and pattern recognition
DNs employ a variety of techniques to improve performance
DNs consist of sequentially mapping an input vector to a sequence of feature maps
Weight matrix, bias vector and activation operator control the type of layer
Rectified Linear Unit (ReLU) is a popular choice for activation operator
Interpreting the geometry of a DN is a nontrivial task
Activation based interpretability methods can be susceptible to feature adversarial attacks
Finding closest point to a training sample that lies on the model’s decision boundary is an empirical method for model interpretation
Continuous Piece-Wise Linear (CPWL) activation functions are used in DNs
SplineCam is a sampling-free method to compute the exact partition of a DN
SplineCam can visualize a DN’s input space partition, compute partition statistics and sample from the decision boundary

The exact geometry and decision boundary of continuous piece-wise linear deep networks

Deep networks as continuous piece-wise linear operators

Spline operators are a form of nonlinear function
Each region of the input space has a degree P polynomial
The first P-1 derivatives of the polynomials are continuous
DNs with CPA activation can be expressed as a spline
Spline theory has been used in approximation theory, optimal control, and statistics

Exact computation of their partition and decision boundary

Suppose w and b are rows of W and b.
Lemma 2 provides a framework to back-project a hyperplane from layer with parameters w and b.
Theorem 1 states that the decision boundary in R S is the union of the projection of the hyperplane onto the tangent space of region ω.
SplineCam partitions P into Ω 1 via hyperplanes h 1 i from layer 1.
For each ω in Ω 1, Lemma 1 and Lemma 2 are used to obtain proj ω (h 2 i) for layer two.
SplineCam is scalable and vectorized except for the search algorithm.
The number of intersections, edges and cycles is ≤ O(n2).

Visualizing and understanding implicit neural representations

INRs are used in 3D view synthesis and inverse problems
MLPs are trained to produce a continuous mapping from signal coordinates to the value of the signal
ReLU MLPs are used in NeRF, a popular INR application
Current practice uses periodic encodings of the input coordinates and a ReLU-MLP
Visualize the geometry of the regions learned by these methods

Decision boundary of signed distance functions

We train an INR as a 2D SDF using an image from the MetFaces dataset.
We create two binary images and use them to create separate ground truth SDFs.
We train an identical ReLU-MLP architecture on both ESDF and HSDF.
The network creates more regions with higher density for the harder HSDF task.

The effect of positional encoding on inr geometry

INRs trained with periodically encoded coordinates can fit input signals better and faster
Little theoretical investigation of how positional encoding affects learning
ReLU-MLP used as INR backbone
Piecewise approximation of sine/cosine used while training
Periodic wrapping of space induced by P.E. increases number of regions and weight sharing across input space

How training hyper-parameters impact your spline

DNs with CPWL nonlinearities are CPWL mappings or affine splines.
Properties of affine splines can be used to measure complexity of a function.
This section proposes a quantitative approach to using SplineCam to measure how different training choices impact the partition of the DN.

Impact of architecture on partitions properties

Computing the exact partition boundary has many applications.
Choice of architecture can have a significant effect on the partitioning induced by a deep neural network.
Quantifying the characteristics of the partitions involves measures such as Average Region Volume, number of vertices, Number of Regions, and eccentricity.
Convolutional architectures have a significantly higher number of partition regions, smaller eccentricity and volume of the polytopes, and higher partition density.

Data-augmentation

Data-Augmentation is a technique used to improve the performance of Deep Neural Networks
An empirical study has been conducted to understand the impact of Data-Augmentation on Deep Neural Networks
SplineCam was used to quantify the changes within a Deep Neural Network when Data-Augmentation is applied
Results show that the average number of regions more than doubles between VGG11 and VGG16

Conclusions

We present a method to visualize and sample the decision boundary of deep neural networks.
We can use this method to gain insights into neural network geometries.
We can use this method to provide improved initialization or pruning schemes.
We can use this method to visualize the decision boundary dynamics of neural networks.
We use Pytorch and Graphtool for implementation.
We assess the computational complexity of SplineCam.
We vary the width of a single layer MLP.
We vary the area of the input domain.
We train an 8 layer CNN with 6 convolutional layers and 2 fully connected layers.
We provide SplineCam as a python toolbox.
We define a 2D input space region of interest.
We use a search algorithm to find cycles from a given graph.
We visualize the exact decision boundary of a 3D neural signed distance field.
We visualize the affine spline mapping.
We visualize the ANR distribution.
We visualize the evolution of ARV with training epochs.
We visualize the decision boundary of a 5 layer convnet.

Link to paper#

Abstract#

Paper Content#

Introduction#

The exact geometry and decision boundary of continuous piece-wise linear deep networks#

Deep networks as continuous piece-wise linear operators#

Exact computation of their partition and decision boundary#

Visualizing and understanding implicit neural representations#

Decision boundary of signed distance functions#

The effect of positional encoding on inr geometry#

How training hyper-parameters impact your spline#

Impact of architecture on partitions properties#

Data-augmentation#

Conclusions#

Link to paper

Abstract

Paper Content

Introduction

The exact geometry and decision boundary of continuous piece-wise linear deep networks

Deep networks as continuous piece-wise linear operators

Exact computation of their partition and decision boundary

Visualizing and understanding implicit neural representations

Decision boundary of signed distance functions

The effect of positional encoding on inr geometry

How training hyper-parameters impact your spline

Impact of architecture on partitions properties

Data-augmentation

Conclusions