markov-decision-process

What is a policy in reinforcement learning? [closed]

Question (closed as off-topic 2 years ago; not currently accepting answers):

I've seen statements such as: "A policy defines the learning agent's way of behaving at a given time. Roughly speaking, a policy is a mapping from perceived states of the environment to actions to be taken when in those states." But I still don't fully understand it. What exactly is a policy in reinforcement learning?

Answer 1:


The definition is correct, though it is not obvious the first time you see it. Let me put it this way: a policy is an agent's strategy, that is, the rule it uses to choose an action in whatever state it finds itself. For example, imagine a world where a robot moves across a room and the task is to reach the target point (x, y), where it gets a reward.
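To make the robot example concrete, here is a minimal Python sketch of a policy as a plain function from state to action. Everything in it is an assumption made for illustration and does not come from the original answer: the room is discretised into integer grid cells, the target is fixed at (3, 2), the action names are invented, and the reward is 1 on reaching the target and 0 otherwise.

```python
from typing import Callable, List, Tuple

State = Tuple[int, int]   # robot position on a grid (assumed discretisation)
Action = str              # hypothetical action names, see MOVES below

TARGET: State = (3, 2)    # assumed target point (x, y)

MOVES = {
    "up": (0, 1),
    "down": (0, -1),
    "left": (-1, 0),
    "right": (1, 0),
    "stay": (0, 0),
}


def greedy_policy(state: State) -> Action:
    """A deterministic policy: every state is mapped to exactly one action."""
    x, y = state
    tx, ty = TARGET
    if x < tx:
        return "right"
    if x > tx:
        return "left"
    if y < ty:
        return "up"
    if y > ty:
        return "down"
    return "stay"


def step(state: State, action: Action) -> Tuple[State, float]:
    """Toy environment: move the robot, reward 1.0 only on reaching TARGET."""
    dx, dy = MOVES[action]
    new_state = (state[0] + dx, state[1] + dy)
    reward = 1.0 if new_state == TARGET else 0.0
    return new_state, reward


def run_episode(
    policy: Callable[[State], Action], start: State, max_steps: int = 20
) -> Tuple[List[State], float]:
    """Follow the policy from `start` until the target or the step limit."""
    state = start
    trajectory = [state]
    total_reward = 0.0
    for _ in range(max_steps):
        if state == TARGET:
            break
        state, reward = step(state, policy(state))
        trajectory.append(state)
        total_reward += reward
    return trajectory, total_reward


if __name__ == "__main__":
    path, ret = run_episode(greedy_policy, start=(0, 0))
    print(path, ret)
```

The agent's behaviour is fully determined by `greedy_policy`: swap in a different function with the same signature and the same environment produces a different trajectory. A stochastic policy would instead return a probability distribution over actions for each state, and a learning algorithm would adjust the policy over time rather than hard-coding it as done here.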