markov-decision-process

What is a policy in reinforcement learning? [closed]

六眼飞鱼酱① submitted on 2019-12-31 08:42:05
Question: Closed 2 years ago. I've seen statements such as: A policy defines the learning agent's way of behaving at a given time. Roughly speaking, a policy is a mapping from perceived states of the environment to actions to be taken when in those states. But I still don't fully understand. What exactly is a policy in reinforcement learning? Answer 1:

What is a policy in reinforcement learning? [closed]

狂风中的少年 submitted on 2019-12-02 17:43:42
Question: I've seen statements such as:

A policy defines the learning agent's way of behaving at a given time. Roughly speaking, a policy is a mapping from perceived states of the environment to actions to be taken when in those states.

But I still don't fully understand. What exactly is a policy in reinforcement learning?

Answer 1: The definition is correct, though not instantly obvious if you see it for the first time. Let me put it this way: a policy is an agent's strategy. For example, imagine a world where a robot moves across the room and the task is to get to the target point (x, y), where it gets a reward.
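
To make the "mapping from states to actions" idea concrete, here is a minimal sketch of the robot example. It is only an illustration under assumptions that are not in the original post: the room is treated as an integer grid, the state is the robot's position, the target is fixed at a hypothetical point (3, 2), and the names `greedy_policy`, `rollout`, `ACTIONS`, and `TARGET` are made up for this example.

```python
from typing import Callable, List, Tuple

State = Tuple[int, int]  # assumed state: the robot's (x, y) grid position

TARGET: State = (3, 2)   # hypothetical target point; reaching it gives reward 1

# Assumed action set: each action name maps to a displacement on the grid.
ACTIONS = {
    "left": (-1, 0),
    "right": (1, 0),
    "down": (0, -1),
    "up": (0, 1),
    "stay": (0, 0),
}


def greedy_policy(state: State) -> str:
    """A deterministic policy: for every state, return the action to take.

    This particular strategy closes the x gap first, then the y gap.
    """
    x, y = state
    tx, ty = TARGET
    if x < tx:
        return "right"
    if x > tx:
        return "left"
    if y < ty:
        return "up"
    if y > ty:
        return "down"
    return "stay"


def rollout(policy: Callable[[State], str], start: State,
            max_steps: int = 20) -> Tuple[List[State], int]:
    """Follow the policy from `start` and return the visited states and reward."""
    state = start
    path = [state]
    for _ in range(max_steps):
        if state == TARGET:
            break
        dx, dy = ACTIONS[policy(state)]  # the policy alone decides the action
        state = (state[0] + dx, state[1] + dy)
        path.append(state)
    reward = 1 if state == TARGET else 0
    return path, reward


if __name__ == "__main__":
    path, reward = rollout(greedy_policy, start=(0, 0))
    print(path, reward)
```

Here the policy is just the function `greedy_policy`: give it a state and it tells the robot what to do. A different strategy (for example, closing the y gap first, or choosing actions at random) would be a different policy for the same environment, and reinforcement learning is about finding one that collects as much reward as possible.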