Policy Gradients

Group: 5 #group-5

Relations

  • Reinforcement Learning: Policy gradient methods are a class of reinforcement learning algorithms that directly optimize the policy function.