openai-gym

Low GPU utilisation when running TensorFlow

狂风中的少年 submitted on 2021-01-27 07:09:29
Question: I've been doing deep reinforcement learning using TensorFlow and OpenAI Gym. My problem is low GPU utilisation. Googling the issue, I understood that it's wrong to expect much GPU utilisation when training small networks (e.g. for training MNIST). But my neural network is not so small, I think. The architecture is similar to the one given in the original DeepMind paper (more or less). The architecture of my network is summarized below: Convolution layer 1 (filters=32, kernel_size=8x8, strides=4)
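
A network of this size is still small by GPU standards, so the usual culprit is per-step overhead (environment stepping, replay-buffer sampling, CPU-to-GPU transfers) rather than the forward/backward pass itself. For reference, a minimal sketch of the full DeepMind-style architecture in tf.keras (an assumption: the excerpt is truncated after the first layer, so the remaining layers below are taken from the paper, not from the asker's code):

    import tensorflow as tf

    def build_dqn(num_actions):
        # Nature-DQN style network: 84x84x4 stacked grayscale frames in,
        # one Q-value per action out.
        return tf.keras.Sequential([
            tf.keras.Input(shape=(84, 84, 4)),
            tf.keras.layers.Conv2D(32, kernel_size=8, strides=4, activation="relu"),
            tf.keras.layers.Conv2D(64, kernel_size=4, strides=2, activation="relu"),
            tf.keras.layers.Conv2D(64, kernel_size=3, strides=1, activation="relu"),
            tf.keras.layers.Flatten(),
            tf.keras.layers.Dense(512, activation="relu"),
            tf.keras.layers.Dense(num_actions),
        ])

Batching many transitions per gradient step, and keeping replay sampling off the hot path, usually raises utilisation more than enlarging the network does.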

How does DQN work in an environment where reward is always -1

删除回忆录丶 submitted on 2021-01-05 07:14:05
Question: Given that the OpenAI Gym environment MountainCar-v0 ALWAYS returns -1.0 as a reward (even when the goal is achieved), I don't understand how DQN with experience replay converges, yet I know it does, because I have working code that proves it. By working, I mean that when I train the agent, the agent quickly (within 300-500 episodes) learns how to solve the mountain-car problem. Below is an example from my trained agent. It is my understanding that ultimately there needs to be a "sparse reward"
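
The short answer is that the discount factor does the work: with a reward of -1 per step, shorter episodes accumulate fewer penalties, so the bootstrapped Q-values are less negative along trajectories that reach the goal quickly. A minimal sketch of the standard one-step DQN target that makes this visible (hypothetical variable names, not the asker's code):

    import numpy as np

    def dqn_target(reward, next_q_values, done, gamma=0.99):
        # Terminal transitions stop the bootstrap. Reaching the goal sooner
        # means fewer accumulated -1 rewards before termination, so Q(s, a)
        # is less negative on shorter trajectories even though every single
        # step pays the same -1.
        return reward + gamma * np.max(next_q_values) * (1.0 - float(done))

Once the replay buffer contains even a few episodes that terminate at the goal, those less-negative targets propagate backwards through bootstrapping and give the agent a gradient to follow.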

python OpenAI gym monitor creates json files in the recording directory

只愿长相守 submitted on 2021-01-02 07:56:32
Question: I am implementing value iteration on the gym CartPole-v0 environment and would like to record a video of the agent's actions in a video file. I have been trying to implement this using the Monitor wrapper, but it generates JSON files instead of a video file in the recording directory. This is my code:

    env = gym.make('FrozenLake-v0')
    env = gym.wrappers.Monitor(env, 'recording', force=True)
    env.seed(0)
    optimalValue = valueIteration(env)
    st = time.time()
    policy = cal_policy(optimalValue)
    policy
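
One likely explanation (an assumption, since the excerpt is truncated): the code actually records FrozenLake-v0, not CartPole-v0, and FrozenLake renders in 'ansi' (text) mode, for which gym's Monitor writes JSON text recordings rather than mp4 video. A sketch that should produce an mp4, assuming a classic gym version with the Monitor wrapper and ffmpeg installed:

    import gym

    env = gym.make('CartPole-v0')  # CartPole supports rgb_array rendering
    env = gym.wrappers.Monitor(env, 'recording', force=True,
                               video_callable=lambda episode_id: True)  # record every episode

    obs = env.reset()
    done = False
    while not done:
        obs, reward, done, info = env.step(env.action_space.sample())
    env.close()  # finalizes the .mp4 files under ./recording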

Extracting state-space from Atari games at specific frames and hard coding agents?

随声附和 submitted on 2020-12-26 04:03:28
Question: I am trying to extract the state space from Amidar in order to hard-code an agent for some specific purposes. For example, I want the agent to go down whenever an enemy is 2 cells away, or up until they hit a wall and then go down again. However, I'm not quite sure how to extract the state space at a specific frame, or in general for that instance, and how to go about interpreting the output. I have tried env.observation_space, but that just returns the frame size (i.e. Box(250, 160, 3)). Anyone
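
One common workaround (an assumption about what the asker needs): the '-ram' variants of the Atari environments return the 128-byte Atari 2600 RAM as the observation, which is a much more tractable "state" than raw pixels, although which byte encodes which game entity still has to be reverse-engineered per game. A minimal sketch:

    import gym

    env = gym.make('Amidar-ram-v0')  # observations are the 128-byte console RAM
    obs = env.reset()
    print(env.observation_space)     # Box of shape (128,), dtype uint8

    obs, reward, done, info = env.step(env.action_space.sample())
    # obs[i] is one RAM byte; finding the bytes that track player and enemy
    # positions is game-specific, empirical work.

On the pixel-based environments the same bytes are also reachable via env.unwrapped.ale.getRAM() in atari-py-backed gym versions.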

OpenAI Gym: Understanding `action_space` notation (spaces.Box)

微笑、不失礼 submitted on 2020-12-01 08:18:23
Question: I want to set up an RL agent on the OpenAI CarRacing-v0 environment, but before that I want to understand the action space. In the code on GitHub, line 119 says:

    self.action_space = spaces.Box( np.array([-1,0,0]), np.array([+1,+1,+1]))  # steer, gas, brake

How do I read this line? Although my problem is concrete with respect to CarRacing-v0, I would like to understand the spaces.Box() notation in general. Answer 1: Box means that you are dealing with real-valued quantities. The first array, np.array([-1,0,0]), are
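
To make the notation concrete, a short sketch that constructs and samples the same space (assuming gym.spaces and numpy):

    import numpy as np
    from gym import spaces

    # Per-dimension bounds: steer in [-1, 1], gas in [0, 1], brake in [0, 1].
    action_space = spaces.Box(low=np.array([-1.0, 0.0, 0.0]),
                              high=np.array([1.0, 1.0, 1.0]),
                              dtype=np.float32)

    print(action_space.shape)        # (3,)
    a = action_space.sample()        # e.g. array([0.31, 0.72, 0.05], dtype=float32)
    print(action_space.contains(a))  # True: every component is within its bounds

So an action is a length-3 float vector, with the i-th entry bounded below by the i-th entry of the first (low) array and above by the i-th entry of the second (high) array.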
