Rollout, policy iteration, and distributed reinforcement learning Foundations of deep reinforcement learning Human-robot interaction control using reinforcement learning 并列题名:Foundations of deep reinforcement learning