I'm working on implementation of LSTM Neural Network for sequence classification. I want to design a network with the following parameters:
- Input : a sequence of
n
one-hot-vectors. - Network topology : two-layer LSTM network.
- Output: a probability that a sequence given belong to a class (binary-classification). I want to take into account only last output from second LSTM layer.
I need to implement that in CNTK but I struggle because its documentation is not written really well. Can someone help me with that?