1
/
5

Neural sequence model training via α-divergence minimization

https://arxiv.org/abs/1706.10031