Policy-Gradient-Methods

Implementation of Policy Gradient Methods for Continuous and Discrete Action Spaces