Policy Gradients § Group: 5 #group-5 Relations § Reinforcement Learning: Policy gradient methods are a class of reinforcement learning algorithms that directly optimize the policy function.