Biorobotics Lab
Biorobotics Lab
Home
People
Research
Publications
News
Careers
Contact
Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic with Advantage Weighted Mixture Policy(SAC-AWMP)
Hou Zhimin
,
Zhang, Kuangen
,
Wan, Yi
,
Li Dongyu
,
Fu, Chenglong
,
Haoyong Yu
January 2020
Cite
Source Document
DOI
Type
Preprint
Publication
arXiv
Cite
×