INPUT:
!python OpenNMT-py/preprocess.py -src_seq_length 80 -tgt_seq_length 80 -src_vocab_size 30000 -tgt_vocab_size 30000 -lower -share_vocab .
-when i run this command vocab-size and seq_lenght no change .
-share_vocab it is merging src and tgt vocab, What is its role and can it be dispensed with.
-lower , what can do?
OUTPUT:
[2021-02-10 09:58:59,525 INFO] * tgt vocab size: 10207.
[2021-02-10 09:58:59,540 INFO] * src vocab size: 10287.
[2021-02-10 09:58:59,540 INFO] * merging src and tgt vocab…
[2021-02-10 09:58:59,578 INFO] * merged vocab size: 17928.
best regards
Abas.