Hello,
I have started training on a new dataset, but every time training begins, the terminal prints messages such as “Empty line in corpus_1#4332…”. I am training on my CPU because PyTorch is not using my GPU, though I am not sure whether that is related. Should I reformat the training files to remove any empty lines?
The dataset contains Cyrillic text, and the target data is in English. I have already cleaned the .txt files to remove empty lines, but the message about empty lines in the corpus still appears.
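
For reference, this is a minimal sketch of the kind of check I could run to confirm whether any lines are still effectively empty. It assumes the corpus is a pair of line-aligned UTF-8 files, one sentence per line. The function names and file paths are hypothetical and not part of any training toolkit. It treats whitespace-only lines (tabs, non-breaking spaces) as empty, since an editor's "remove empty lines" command may miss those, and it drops the same line from both files so the pairs stay aligned. I do not know whether the number in the warning is zero-based or one-based, so the indices reported here (one-based) may be off by one relative to it.

```python
def read_lines(path):
    """Read a UTF-8 text file into a list of lines without trailing newlines."""
    with open(path, encoding="utf-8") as f:
        return [line.rstrip("\n") for line in f]


def find_blank_pairs(src_lines, tgt_lines):
    """Return 1-based indices where either side is empty or whitespace-only."""
    if len(src_lines) != len(tgt_lines):
        raise ValueError("source and target have different line counts")
    return [
        i
        for i, (s, t) in enumerate(zip(src_lines, tgt_lines), start=1)
        if not s.strip() or not t.strip()
    ]


def drop_blank_pairs(src_lines, tgt_lines):
    """Remove every pair in which either side is blank, keeping alignment."""
    if len(src_lines) != len(tgt_lines):
        raise ValueError("source and target have different line counts")
    kept = [(s, t) for s, t in zip(src_lines, tgt_lines) if s.strip() and t.strip()]
    return [s for s, _ in kept], [t for _, t in kept]


def clean_parallel_files(src_path, tgt_path, src_out, tgt_out):
    """Write cleaned copies of both files and return the blank line indices."""
    src_lines, tgt_lines = read_lines(src_path), read_lines(tgt_path)
    blanks = find_blank_pairs(src_lines, tgt_lines)
    src_clean, tgt_clean = drop_blank_pairs(src_lines, tgt_lines)
    for path, lines in ((src_out, src_clean), (tgt_out, tgt_clean)):
        with open(path, "w", encoding="utf-8") as f:
            f.writelines(line + "\n" for line in lines)
    return blanks


if __name__ == "__main__":
    # Hypothetical file names; replace with the actual corpus paths.
    blanks = clean_parallel_files(
        "src-train.txt", "tgt-train.txt",
        "src-train.clean.txt", "tgt-train.clean.txt",
    )
    print(f"{len(blanks)} blank pairs found; first few: {blanks[:10]}")
```

If this reports zero blank pairs and the warning still appears, the training configuration may be pointing at different files than the ones that were cleaned.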