Proximal Policy Optimisation