indices of sublist, except one

后端 未结 0 1090
清酒与你
清酒与你 2020-12-13 01:44

I am trying to calculate Q learning values following the equation:

$$Q_{k + 1}(s, a) \\leftarrow \\sum_{s\'} P(s, a, s\')[ R(s, a, s\') + \\gamma \\max_{a\'} Q_k(s\',

相关标签:
回答
  • 消灭零回复
提交回复
热议问题