Hi,
what actually does max_generator_batches do?
Some discussions here also:
so it is relevant to the loss back propagation using sharding hm… alright I will have a look into them thanks