Scheduled Sampling for Sequence Tagger Model

Scheduled sampling has been widely applied to sequence-to-sequence models, and it could also help improve the performance of the sequence tagger model. Any comments are welcome!
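To make the idea concrete, here is a minimal sketch of how scheduled sampling might look for a tagger. It assumes the tagger conditions each prediction on the previous tag, which may not hold for every tagger architecture. The function names `teacher_forcing_prob` and `choose_previous_tags` and the decay constant `k` are hypothetical, chosen only for illustration. The schedule is the inverse sigmoid decay commonly used with scheduled sampling.

```python
import math
import random


def teacher_forcing_prob(step, k=100.0):
    """Inverse sigmoid decay: probability of feeding the gold previous tag.

    Starts close to 1 and decays towards 0 as `step` grows.
    `k` (hypothetical default) controls how quickly the decay happens.
    """
    # Cap the exponent so math.exp does not overflow for very large steps.
    exponent = min(step / k, 700.0)
    return k / (k + math.exp(exponent))


def choose_previous_tags(gold_tags, predicted_tags, step, rng, k=100.0):
    """For each position, pick either the gold or the model-predicted tag.

    The chosen tags would then be fed to the tagger as the "previous tag"
    input during training (an assumption about the model's design).
    """
    eps = teacher_forcing_prob(step, k)
    return [
        gold if rng.random() < eps else pred
        for gold, pred in zip(gold_tags, predicted_tags)
    ]


if __name__ == "__main__":
    rng = random.Random(0)
    gold = ["B-PER", "I-PER", "O", "B-LOC"]
    pred = ["B-PER", "O", "O", "B-ORG"]
    for step in (0, 500, 1000):
        mixed = choose_previous_tags(gold, pred, step, rng)
        print(step, round(teacher_forcing_prob(step), 3), mixed)
```

Early in training the model mostly sees gold tags, and later it increasingly sees its own predictions, which is meant to reduce the mismatch between training and inference.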