In the xarray documentation for the function apply_ufunc it says:
dask: ‘forbidden’, ‘allowed’ or ‘parallelized’, optional
How to handle applying to objects containing lazy data in the form of dask arrays:
‘forbidden’ (default): raise an error if a dask array is encountered.
‘allowed’: pass dask arrays directly on to func.
‘parallelized’: automatically parallelize func if any of the inputs are a dask array.
If used, the output_dtypes argument must also be provided.
Multiple output arguments are not yet supported.
and in the documentation's page on Parallel Computing then there is a note:
For the majority of NumPy functions that are already wrapped by dask, it’s usually a better idea to use the pre-existing dask.array function, by using either a pre-existing xarray methods or apply_ufunc() with dask='allowed'. Dask can often have a more efficient implementation that makes use of the specialized structure of a problem, unlike the generic speedups offered by dask='parallelized'.
However, I'm still not clear as to what the difference between these two options is. Does allowed
still operate on chunks one by one to lower memory usage? Will allowed
still parallelize if the applied ufunc only uses dask operations? Why does parallelized
require you to give more information about the ufunc outputs (i.e. the arguments output_dtypes
, output_sizes
) than allowed
does?