Searching for optimal paths in a custom grid-world environment using imitation learning, specifically Variational Adversarial Imitation Learning (VAIL)