Reinforcement learning framework for implementing custom models on custom environments using state-of-the-art RL algorithms