Hello!
I have some questions about using the case_feature option.
I used the -case_feature and -segment_case options when tokenizing my data and training on it.
These are my scripts:
th tools/tokenize.lua -case_feature true -segment_case true < data/en_train.txt > data/output_en.tok.txt
th train.lua -data data/data-train.t7 -save_model model -gpuid 1 2 -layers 8 -rnn_size 1000 -tok_src_case_feature true -tok_src_segment_case true > log.txt
th tools/rest_translation_server.lua -model model_checkpoint.t7 -host xxx -port xxxx -case_feature true -segment_case true -replace_unk_tagged -gpuid 2
My first question: do I have to use the -case_feature and -segment_case options together?
What is the difference if I use only -case_feature, without -segment_case?
Also, I got an error when I sent a request to this server:
500 Internal Server Error - Error in application: tools/rest_translation_server.lua:99: unicode error in line ./tools/utils/case.lua:83: assertion failed!
Please help me fix this problem.
Thank you!