I am coding right now an DRL algorithm that uses the principle of Double DQN with a model and a target which gets updated over time.
So in my code i go this line: