Link to paper

The full paper is available here.

You can also find the paper on PapersWithCode here.

Abstract

  • Estimation of probability density functions is a difficult task.
  • Machine learning techniques have been used to tackle this task.
  • Boltzmann machine (BM) architecture has been used for successful applications.
  • Product Jacobi-Theta Boltzmann machine (pJTBM) is a restricted version of the Riemann-Theta Boltzmann machine (RTBM).
  • Score matching, based on the Fisher divergence, can be used to fit probability densities with the pJTBM more efficiently than with the original RTBM.

Paper Content

Introduction

  • Modelling general probability density functions is difficult
  • Kernel density estimation is commonly used
  • Variational autoencoders and normalizing flows have been successful in modelling arbitrary probability densities
  • Riemann-Theta Boltzmann machine is a novel method for modelling arbitrary probability densities
  • Product Jacobi-Theta Boltzmann machine is a simplified version of the RTBM which is more efficient

Riemann-theta boltzmann machine

  • RTBM is a variant of a Boltzmann machine
  • RTBM is defined by an energy model
  • RTBM has a visible sector and a hidden sector
  • Boltzmann distribution is defined for this model
  • Hidden state space is taken to be ZNh
  • Visible state space is RNv
  • Closed form analytic expression for visible sector probability density function can be derived

Product jacobi-theta boltzmann machine

  • θ-function factorizes for diagonal second argument matrix Ω
  • Computational complexity of non-factorized θ grows exponentially with dimension of Ω
  • Factorized computation runtime grows experimentally as d 1.2
  • Non-factorized shows exponential growth under increasing k
  • Similar to restricted Boltzmann machine

Derivation of the fisher cost function for the rtbm and pjtbm

  • Modeling probability density using RTBM allows us to compute Fisher cost function.
  • Calculating D and H can be computationally expensive.
  • For pJTBM, complexity of Fisher cost function grows linearly with N h.
  • Dedicated implementation of Jacobi-Theta function is more efficient.

Example

  • RTBMs are used to model the uranium dataset
  • pJTBM is compared to RTBM using Fisher cost function
  • Data is preprocessed using z-score normalization and PCA
  • Aim is to quantify speed-up and compare quality of fits
  • Results show pJTBM is exponentially faster and has comparable FF values
  • Variance for pJTBM is larger than RTBM
  • Future work will investigate optimization on manifolds