【論文紹介】 PGQ: Combining Policy Gradient And Q-learning O’Donoghue et al. ICLR 2017 紹介者: Sotetsu KOYAMADA (@sotetsuk)Read less