End to End Neural Machine Translatation

Hi, I am new to NMT, can anyone suggest me to some End to End NMT Learning material.
I am familiar with deep learning, also familiar with LSTM and Transformers, but looking for a guideline to use Tokenizer, BPE, how and when to use.

Many thanks in Advance.