Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning
arXiv (2018) - Comments
arxiv: 1801.06176  issn: 2331-8422 

Baolin Peng, Xiujun Li, Jianfeng Gao, Jingjing Liu, Kam-Fai Wong, Shang-Yu Su