Is there a GPU-accelerated version of numpy.max(X, axis=None) in Theano?
I looked through the documentation and found theano.tensor.max(X, axis=None), but it is 4-5 times slower than the NumPy implementation.
I can assure you it is not slow because of a bad choice of matrix size: on the same matrix, theano.tensor.exp is 40 times faster than its NumPy counterpart.
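For reference, here is a minimal sketch of how the two calls could be timed. The matrix size, the helper `time_call`, and the use of a shared variable are assumptions for illustration, not necessarily the exact setup behind the numbers above.

```python
import timeit

import numpy as np


def time_call(fn, repeats=5):
    """Return the best wall-clock time in seconds for one call of fn()."""
    return min(timeit.repeat(fn, number=1, repeat=repeats))


# Hypothetical matrix size; the real one is not stated in this post.
X = np.random.rand(1000, 1000).astype("float32")

numpy_time = time_call(lambda: np.max(X, axis=None))
print("numpy.max:", numpy_time)

try:
    import theano
    import theano.tensor as T
except Exception:
    # Theano is missing or unusable in this environment; skip that half.
    theano = None

if theano is not None:
    # Keep the data in a shared variable so the timed call does not
    # include copying X to the device on every invocation.
    X_shared = theano.shared(X, name="X_shared")
    f_max = theano.function([], T.max(X_shared, axis=None))
    f_max()  # warm-up call before timing
    print("theano.tensor.max:", time_call(f_max))
```

Timing a compiled function over a shared variable should keep the per-call transfer cost out of the measurement, so any remaining gap would come from the reduction itself.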
Any suggestions?