Thank you very much Yasmin.
My training dataset size is 76184 samples. I can’t add more data, it is a specific domain. Do you think that because of early_stopping criteria, the model is not learning enough? I am asking because most of the models in the literature did not use this criteria and they were trained till the end of train_steps of 250000 at least.
Well, early stopping simply means the model is not learning (enough) anymore. However, if you increase its value or remove it, you can test the results on the two checkpoints, one with early stopping and one after further steps.