I could not find any information on the forum about how sentences are shuffled during training.
I found this parameter, but I am not sure how it works:
# (optional) The number of elements from which to sample during shuffling (default: 500000).
# Set 0 or null to disable shuffling, -1 to match the number of training examples.
sample_buffer_size: 500000
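My guess is that this behaves like a generic shuffle buffer, which I have sketched below in Python to make the question concrete. This is only an assumption on my side, not the toolkit's actual code: the function name `buffered_shuffle` and its arguments are hypothetical, and the handling of 0, null and -1 simply follows the comment above.

```python
import random
from typing import Iterable, Iterator, List, Optional, TypeVar

T = TypeVar("T")


def buffered_shuffle(
    examples: Iterable[T], buffer_size: Optional[int], seed: int = 0
) -> Iterator[T]:
    """Hypothetical sketch of sampling from a fixed-size shuffle buffer."""
    rng = random.Random(seed)

    # Assumption: 0 or None (null) disables shuffling, as the config comment says.
    if not buffer_size:
        yield from examples
        return

    # Assumption: -1 means the buffer covers every training example,
    # which amounts to a full shuffle of the whole dataset.
    if buffer_size < 0:
        everything = list(examples)
        rng.shuffle(everything)
        yield from everything
        return

    buffer: List[T] = []
    for example in examples:
        if len(buffer) < buffer_size:
            # Fill the buffer before emitting anything.
            buffer.append(example)
            continue
        # Emit a random buffered element and put the new one in its slot.
        index = rng.randrange(buffer_size)
        yield buffer[index]
        buffer[index] = example

    # Input exhausted: emit what is left in the buffer in random order.
    rng.shuffle(buffer)
    yield from buffer


if __name__ == "__main__":
    sentences = [f"sentence {i}" for i in range(10)]
    print(list(buffered_shuffle(sentences, buffer_size=3)))
```

If this guess is right, a buffer smaller than the dataset would only mix sentences that are close to each other in the file, and the shuffling would happen on individual sentences before they are grouped into batches.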
Could someone explain the behavior of this parameter?
Is the shuffling done across all sentences or at the batch level?
Regards, and thanks in advance.