I need some precision about what a decoder (such as the SelfAttention one) is taking in input, in your example your wrote
Here your give
inputs for your
decoder.decode() and the
memory parameter is containing the output of the encoder, which is optional
Could you explain why the
memory parameter is not mandatory, don’t we always need something to be decoded ?
And why the
inputs parameter is mandatory, because it contains the target sentence embeddings and sometimes you don’t have a target sentence (at translation time for example) ?