Astarag Mohapatra

Dec 25, 2022

5 stories

Reinforcement Learning Algorithms

Covers PPO and TRPO, and describes how trust region methods can enhance the optimization process in deep learning.
DDPG is an actor-critic extension of policy gradient algorithms. You will learn about the improvements of DDPG over vanilla PG algorithms.
Policy gradient algorithms encourage exploration more than value-iteration methods do. Learn about their advantages and disadvantages in this article.
Astarag Mohapatra

Hi, Astarag here. I am interested in deep learning and other topics. If you have any queries, I am one comment away.