Home ‣ Similarities between policy gradient methods (PGM) in reinforcement learning (RL) and supervised learning (SL)