Simplify direct smoothing code
If I understand correctly, this functionality performs Gaussian smoothing on a (not necessarily equidistant) array of data.
I can implement this efficiently without Cython; I just need some help with the case where the array is distributed over several MPI tasks.
Implementing this would remove NIFTy's dependence on Cython.
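
As a rough illustration of the idea, here is a minimal pure-NumPy sketch for the single-process case. It assumes that "smoothing" means a normalized, Gaussian-weighted average over neighbouring samples, truncated at a few sigma; I have not verified that this matches the weighting used by the existing Cython routine. The function name `gaussian_smooth_nonuniform` and the `cutoff` parameter are hypothetical and not part of NIFTy.

```python
import numpy as np


def gaussian_smooth_nonuniform(x, y, sigma, cutoff=4.0):
    """Gaussian-weighted average of y sampled at (possibly irregular) positions x.

    Hypothetical helper, not NIFTy API. Sample spacing is ignored in the
    weights; only the distance between positions enters the kernel.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)

    # Sort by position so each smoothing window is a contiguous slice.
    order = np.argsort(x)
    xs, ys = x[order], y[order]

    # Window bounds: all samples within cutoff*sigma of each point.
    lo = np.searchsorted(xs, xs - cutoff * sigma, side="left")
    hi = np.searchsorted(xs, xs + cutoff * sigma, side="right")

    smoothed = np.empty_like(ys)
    for i in range(xs.size):
        d = xs[lo[i]:hi[i]] - xs[i]
        w = np.exp(-0.5 * (d / sigma) ** 2)
        # The window always contains the point itself, so w.sum() >= 1.
        smoothed[i] = np.dot(w, ys[lo[i]:hi[i]]) / w.sum()

    # Undo the sort so the output lines up with the input order.
    result = np.empty_like(smoothed)
    result[order] = smoothed
    return result
```

The per-point loop is kept for readability and could be vectorized or batched if it turns out to be a bottleneck. The sketch does not address the MPI-distributed case, where each task would presumably also need the samples of neighbouring tasks that fall inside its smoothing windows.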