RL: Self-Play with On-Policy and Off-Policy

后端 未结 0 1192
再見小時候
再見小時候 2020-12-10 01:41

:) I try to implement self play with PPO. Suppose we have a game with 2 agents. We control one player on each side and get information like observation and reward after each

相关标签:
回答
  • 消灭零回复
提交回复
热议问题