Reinforcement Algorithms - Policy Gradient, Q Learning, Double Q Learning, Deep Q Learning and Double Deep Q Learning