PubPeer
The online Journal club
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning
arXiv (2021)
arXiv: 2106.04895, ISSN: 2331-8422
Tengyang Xie, Nan Jiang, Huan Wang, Caiming Xiong, Yu Bai