Differential entropy is a quantity used in many signal processing problems. Often we need to calculate not only the entropy itself, but also its gradient with respect to various variables, for efficient optimization, sensitivity analysis, etc. Entropy estimation can be based on an estimate of the probability density function, which is computationally costly if done naively. Some prior algorithms use computationally efficient non-parametric entropy estimators. However, differentiation of the previously proposed estimators is difficult and may even be undefined.
To counter these obstacles, we consider non-parametric kernel entropy estimation that is differentiable. We present two different accelerated kernel algorithms. The first accelerates the entropy gradient calculation based on a back propagation principle. It allows calculating the differential entropy gradient in the same complexity as that of calculating the entropy itself. The second algorithm accelerates the estimation of both entropy and its gradient by using fast convolution over a uniform grid. We expand this approach to fast optimization of mutual information (MI), and then apply it to BSS by independent component analysis (ICA). This yields both high separation performance and low computational complexity. For K sources with N samples, our ICA algorithm has an iteration complexity of at most O(KNlogN + K2N).
In our work about "convolutive mixtures" we are inspired by mixtures occurring in photgraphy, microscopy and tomography. Their formation process involves focusing on an object layer, over which defocused layers are superimposed. BSS of convolutive image mixtures by direct optimization of MI is very complex and suffers from local minima. Thus, we devise an efficient approach to solve these problems. The convolutive BSS problem is converted into a set of instantaneous (pointwise) problems, using a short time Fourier transform (STFT). Standard BSS solutions for instantaneous problems suffer, however, from scale and permutation ambiguities. We obtain diambiguition by exploiting a parametric model of the defocus point spread function. Moreover, we enhance the efficiency of the approach by exploiting the sparsity of the STFT representation as a prior.
We have also applied similar principles in order to overcome haze. I this case, a major problem is spatially-varying reduction of contrast by stray radiance (airlight), which is scattered by the haze particles towards the camera. We blindly estimate an airlight parameter, thus enabing blind airlight separation based on polarization-filtered images. We avoid common ICA ambiguities by a physics-based model.
Download the code for efficient non-parametric ICA optimization. The archive contains the following files:
For more details about the algorithm see Publication #2 above.
For bug reports: mzib@ee.technion.ac.il, psarit@tx.technion.ac.il and yoav@ee.technion.ac.il.