I’m running it in a docker image and installed opennmt via
pip. Here’s what that shows:
root@a33d70122f86:/mh# pip list | grep nmt
onmt_train from a shell script. It does a bunch of data wrangling, builds the vocab, and then seems to hit the error. That error seems to generate a bunch of additional problems, but I’ve cut them off below.
Corpus corpus_1's weight should be given. We default it to 1 for you.
[2021-04-15 13:42:42,294 INFO] Counter vocab from 7000 samples.
[2021-04-15 13:42:42,294 INFO] Build vocab on 7000 transformed examples/corpus.
[2021-04-15 13:42:42,301 INFO] corpus_1's transforms: TransformPipe(BARTNoiseTransform(None))
[2021-04-15 13:42:42,301 INFO] Loading ParallelCorpus(/workspace/big/BIG_src-train.txt, /workspace/big/BIG_tgt-train.txt, align=None)...
Traceback (most recent call last):
File "/opt/conda/lib/python3.8/multiprocessing/pool.py", line 125, in worker
result = (True, func(*args, **kwds))
File "/opt/conda/lib/python3.8/site-packages/onmt/inputters/corpus.py", line 298, in build_sub_vocab
maybe_example = DatasetAdapter._process(item, is_train=True)
File "/opt/conda/lib/python3.8/site-packages/onmt/inputters/corpus.py", line 69, in _process
maybe_example = transform.apply(
File "/opt/conda/lib/python3.8/site-packages/onmt/transforms/transform.py", line 189, in apply
example = transform.apply(
File "/opt/conda/lib/python3.8/site-packages/onmt/transforms/bart.py", line 380, in apply
if is_train and self.vocabs is not None:
AttributeError: 'BARTNoiseTransform' object has no attribute 'vocabs'
The above exception was the direct cause of the following exception: