I\'m trying to make an RL agent for a drone. My main constraint is to return the drone back to its initial location at the end of each episode. Do you have any ideas about h