NIFTy issues

NIFTy issues https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues 2016-05-26T11:10:36Z https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/1 Add out-array parameter to numerical d2o operations. 2016-05-26T11:10:36Z Theo Steininger

Add out-array parameter to numerical d2o operations.

Numpy supports to specify an out array in order to avoid memory reallocation. a = np.array([1,2,3,4]) b = np.array([5,6,7,8]) # slow: a = a + b # fast: np.add(a,b,out=a) Numpy supports to specify an out array in order to avoid memory reallocation. a = np.array([1,2,3,4]) b = np.array([5,6,7,8]) # slow: a = a + b # fast: np.add(a,b,out=a) d2o Theo Steininger Theo Steininger https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/4 Add `axis` keyword functionality to unary methods. 2016-05-26T11:10:16Z Theo Steininger

Add `axis` keyword functionality to unary methods.

Many numpy functions support the `axis` keyword in order to perform an operation only along certain directions of the array. The current implementation of d2o does not support this, e.g. for `all`, `any`, `sum`, etc... Related to: the... Many numpy functions support the `axis` keyword in order to perform an operation only along certain directions of the array. The current implementation of d2o does not support this, e.g. for `all`, `any`, `sum`, etc... Related to: theos/NIFTy#3 d2o https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/25 Slow speed of obj.copy() 2016-04-06T00:22:22Z Theo Steininger

Slow speed of obj.copy()

obj.copy() is slower than obj+0 obj.copy() is slower than obj+0 d2o Theo Steininger Theo Steininger https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/33 Add tensor-/outer-dot to d2o 2016-05-26T11:09:02Z Theo Steininger

Add tensor-/outer-dot to d2o

d2o Theo Steininger Theo Steininger https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/19 Add function d2o.arange 2016-05-26T11:09:22Z Theo Steininger

Add function d2o.arange

d2o Theo Steininger Theo Steininger https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/2 d2o: Contraction functions rely on non-degeneracy of distribution strategy 2017-09-20T21:36:14Z Theo Steininger

d2o: Contraction functions rely on non-degeneracy of distribution strategy

Several methods of the distributed_data_object rely on the fact, that the distribution strategy behaves as if the local data was non-degenerate. Currently the non-distributor fixes this by returning trivial (local) results in the _allgat... Several methods of the distributed_data_object rely on the fact, that the distribution strategy behaves as if the local data was non-degenerate. Currently the non-distributor fixes this by returning trivial (local) results in the _allgather and the _Allreduce_sum method. Affected d2o methods are at least: _contraction_helper, mean Fix: Move the functionality for sum, prod, etc... into the distributor. d2o Theo Steininger Theo Steininger https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/8 Profile the d2o.bincount method 2016-05-26T11:09:57Z Theo Steininger

Profile the d2o.bincount method

The d2o.bincount method scales well with MPI parallelization but compared to single-core np.bincount has a rather big overhead. The d2o.bincount method scales well with MPI parallelization but compared to single-core np.bincount has a rather big overhead. d2o Theo Steininger Theo Steininger https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/3 d2o: _contraction_helper does not work when using numpy keyword arguments 2017-09-20T21:44:34Z Theo Steininger

d2o: _contraction_helper does not work when using numpy keyword arguments

The `_contraction_helper` passes keyword arguments to the underlying numpy functions (axis=, keepdims=). The result of the _contraction_helper's local computation is then an array and not a scalar. Therefore the dtype check fails. Fi... The `_contraction_helper` passes keyword arguments to the underlying numpy functions (axis=, keepdims=). The result of the _contraction_helper's local computation is then an array and not a scalar. Therefore the dtype check fails. Fix: After solving theos/NIFTy#2, adopt to the case that the local run's result object is an array and make a further distinction of cases, i.e for something like axis=0 for the slicing_distributor. d2o Theo Steininger Theo Steininger https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/14 The d2o_librarian will fail when mixing different MPI comms 2016-05-26T11:09:36Z Theo Steininger

The d2o_librarian will fail when mixing different MPI comms

Every local librarian instance on a node of a MPI cluster just increments its internal counter by one when a new d2o is registered. This gets out of sync, when only a part of the full cluster is covered by a special comm. ?Possible s... Every local librarian instance on a node of a MPI cluster just increments its internal counter by one when a new d2o is registered. This gets out of sync, when only a part of the full cluster is covered by a special comm. ?Possible solution: The individual librarians store the id of 'their' d2o and communicate a common id for their dictionary. Con: Involves MPI communication. d2o Theo Steininger Theo Steininger https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/13 Add support for `from array` indexing 2016-05-26T11:09:42Z Theo Steininger

Add support for `from array` indexing

When building the kdict from pindex and kindex something of the following form must be done (a==kindex, b==pindex): a = np.arange(16)*2 b = np.array([[3,2],[1,0]]) In [1]: a[b] Out[1]: array([[6, 4], ... When building the kdict from pindex and kindex something of the following form must be done (a==kindex, b==pindex): a = np.arange(16)*2 b = np.array([[3,2],[1,0]]) In [1]: a[b] Out[1]: array([[6, 4], [2, 0]]) Currently, this is solved using a hack: p.apply_scalar_function(lambda z: obj[z]) This functionality could easily be added to the get_data interface. d2o Theo Steininger Theo Steininger https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/10 Semi-advanced indexing is not recognized 2024-04-10T08:04:20Z Theo Steininger

Semi-advanced indexing is not recognized

a = np.arange(24).reshape((3, 4,2)) obj = distributed_data_object(a) Semi-advanced indexing a[(2,1,1),1] yields array([[18, 19], [10, 11], [10, 11]]) The ``indexinglist'' scheme in d2o expects... a = np.arange(24).reshape((3, 4,2)) obj = distributed_data_object(a) Semi-advanced indexing a[(2,1,1),1] yields array([[18, 19], [10, 11], [10, 11]]) The ``indexinglist'' scheme in d2o expects either scalars or numpy arrays as tuple elements and therefore: obj[(2,1,1),1] -> AttributeError However, obj[np.array((2,1,1)), 1] works. Solution: Parse the elements and in doubt cast them to numpy arrays. d2o Theo Steininger Theo Steininger https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/5 Add `copy` parameter to d2o.get_data() 2016-04-06T00:56:53Z Theo Steininger

Add `copy` parameter to d2o.get_data()

A `copy` parameter should be added to d2o.get_data in order to control, whether the resulting d2o should contain a view on or a copy of the old data. A `copy` parameter should be added to d2o.get_data in order to control, whether the resulting d2o should contain a view on or a copy of the old data. d2o Theo Steininger Theo Steininger https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/6 Add `source_rank` parameter to d2o.set_full_data() 2016-05-26T11:10:11Z Theo Steininger

Add `source_rank` parameter to d2o.set_full_data()

A source_rank parameter should be added to the distributed_data_object in order to specify on which node the source data-array resides on. A source_rank parameter should be added to the distributed_data_object in order to specify on which node the source data-array resides on. d2o Theo Steininger Theo Steininger https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/7 Move `flatten` method into the distributor. 2016-05-26T11:10:06Z Theo Steininger

Move `flatten` method into the distributor.

At the moment `flatten` is performed by the distributed_data_object itself. Thereby it assumes, that flattening the local arrays produces the right result. In general with arbitrary distribution strategies this is wrong. At the moment `flatten` is performed by the distributed_data_object itself. Thereby it assumes, that flattening the local arrays produces the right result. In general with arbitrary distribution strategies this is wrong. d2o Theo Steininger Theo Steininger https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/15 d2o cumsum and flatten rely on certain features of distribution strategy 2016-05-26T11:09:31Z Theo Steininger

d2o cumsum and flatten rely on certain features of distribution strategy

cumsum and flatten assume: if the shape of the d2o changes through flattening, the distribution strategy was "slicing". cumsum and flatten assume: if the shape of the d2o changes through flattening, the distribution strategy was "slicing". d2o Theo Steininger Theo Steininger https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/23 d2o initialization from d2o is slow 2016-04-06T00:54:57Z Theo Steininger

d2o initialization from d2o is slow

The slicig distributor is slower than it could be for: a = np.arange(200000) obj = distributed_data_object(a, distribution_strategy='fftw') distributed_data_object(obj, distribution_strategy='equal') This is connected... The slicig distributor is slower than it could be for: a = np.arange(200000) obj = distributed_data_object(a, distribution_strategy='fftw') distributed_data_object(obj, distribution_strategy='equal') This is connected to the _enfold and _defold methods. d2o Theo Steininger Theo Steininger https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/24 Slow speed of d2o slicing distribibutor during slicing access. 2016-04-06T00:56:28Z Theo Steininger

Slow speed of d2o slicing distribibutor during slicing access.

obj[::-1] obj[::-1] d2o Theo Steininger Theo Steininger https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/26 Add unit tests for `copy=True/False` functionality 2016-05-26T11:09:16Z Theo Steininger

Add unit tests for `copy=True/False` functionality

d2o Theo Steininger Theo Steininger https://gitlab.mpcdf.mpg.de/ift/nifty/-/issues/28 Move obj.bincount and obj.unique into distributor and make them more efficient. 2016-05-26T11:09:09Z Theo Steininger

Move obj.bincount and obj.unique into distributor and make them more efficient.

For efficiency, use Allreduce instead of allgather in bincount. Use `fast-summation` in obj.unique <http://materials.jeremybejarano.com/MPIwithPython/collectiveCom.html#fastsum> For efficiency, use Allreduce instead of allgather in bincount. Use `fast-summation` in obj.unique <http://materials.jeremybejarano.com/MPIwithPython/collectiveCom.html#fastsum> d2o Theo Steininger Theo Steininger