DQN with policy and target networks doesn't learn properly on LunarLander environment

無奈伤痛 2020-12-12 16:09

I'm trying to get the hang of reinforcement learning, so I'm following a guide at: pytorch.org/tutorials/

They've implemented a DQN that solves CartPole with computer
