Only Coders - Where knowledge meets opportunity

Questions - openai-gym

Rollout summary statistics not being monitored for CustomEnv using Stable-Baselines3

I am trying to train a custom environment using PPO via Stable-Baselines3 and OpenAI Gym. For some reason the rollout statistics are not being reported for this custom environment when I try to train ...

Alex Hill

reinforcement-learning

openai-gym

stable-baselines

openai-api

Votes: 0

Answers: 1

Latest Answer

SOLVED: There was an edge case in which the environment never ended, so the done variable remained False indefinitely. After fixing this bug, the rollout statistics reappeared.

Alex Hill
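The fix described in the answer above (making sure done eventually becomes True) can be sketched without any library. This is only an illustration, not the poster's actual environment: CountdownEnv, max_steps, the goal threshold and run_episode are hypothetical names and values, and the 4-tuple returned by step() assumes the older Gym-style API.

```python
class CountdownEnv:
    """Hypothetical toy environment with a Gym-style reset/step interface."""

    def __init__(self, max_steps=200):
        self.max_steps = max_steps  # assumed hard cap on episode length
        self.steps = 0
        self.state = 0.0

    def reset(self):
        self.steps = 0
        self.state = 0.0
        return self.state

    def step(self, action):
        self.steps += 1
        self.state += action
        reward = -abs(self.state)

        # Task-specific termination (assumed goal: |state| reaches 10).
        goal_reached = abs(self.state) >= 10.0
        # Safety net: end the episode after max_steps even if the
        # task-specific condition is never met.
        timed_out = self.steps >= self.max_steps

        done = goal_reached or timed_out
        return self.state, reward, done, {"timed_out": timed_out}


def run_episode(env, policy):
    """Run one episode and return its length in steps."""
    obs = env.reset()
    done = False
    length = 0
    while not done:
        obs, reward, done, info = env.step(policy(obs))
        length += 1
    return length
```

With the step cap in place, an episode finishes even under a policy that never reaches the goal, so per-episode statistics have something to report.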

Why is the Stable-Baselines3 evaluate_policy() function never finishing/completing?

I have created my own custom environment using OpenAI Gym and Stable-Baselines3. Once I've trained the agent, I try to evaluate the policy using the evaluate_policy() function from stable_baselines3.c...

Alex Hill

reinforcement-learning

openai-gym

stable-baselines

openai-api

Votes: 0

Answers: 1

Latest Answer

Roy_L

How do I discretise a continuous observation and action space in Python?

My professor has asked me to apply a Policy Iteration method to the Pendulum-v1 environment in OpenAI Gym. Pendulum-v1 has the following environment: Observation Type: Box(3) Num Observation Min M...

Dzartz94

python

reinforcement-learning

openai-gym

discretization

openai-api

Votes: 0

Answers: 0
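One possible approach to the discretisation question above is uniform binning. The sketch below is a rough illustration under stated assumptions, not an accepted answer: the bin counts (10 per observation dimension, 5 torques) and all helper names are hypothetical, and the bounds used are the Pendulum-v1 ranges (cos and sin of the angle in [-1, 1], angular velocity in [-8, 8], torque in [-2, 2]).

```python
import bisect


def make_bins(low, high, n_bins):
    """Return the n_bins - 1 interior edges that split [low, high] evenly."""
    width = (high - low) / n_bins
    return [low + width * i for i in range(1, n_bins)]


def discretise(value, edges):
    """Map a continuous value to a bin index in 0..len(edges)."""
    return bisect.bisect_right(edges, value)


# Pendulum-v1 observation bounds: cos(theta), sin(theta), angular velocity.
OBS_BOUNDS = [(-1.0, 1.0), (-1.0, 1.0), (-8.0, 8.0)]
N_OBS_BINS = 10  # assumed resolution per dimension
OBS_EDGES = [make_bins(lo, hi, N_OBS_BINS) for lo, hi in OBS_BOUNDS]


def discretise_observation(obs):
    """Turn a 3-element continuous observation into a tuple of bin indices."""
    return tuple(discretise(x, edges) for x, edges in zip(obs, OBS_EDGES))


# Discrete action set: evenly spaced torques in [-2, 2].
N_ACTIONS = 5  # assumed number of discrete actions
TORQUES = [-2.0 + 4.0 * i / (N_ACTIONS - 1) for i in range(N_ACTIONS)]
```

The tuple returned by discretise_observation can index a value table, and an action index i maps back to the continuous torque TORQUES[i]. Finer bins reduce approximation error at the cost of a larger table.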

MuJoCo via mujoco-py interface FetchReach-v1 scenario robotic action delay

Dear MuJoCo community, in the last few days I have been working with a simple FetchReach-v1 scenario in the OpenAI Gym MuJoCo environment. I was trying to apply MPC (Model Predictive Control) to this scenario ...

mazy

openai-gym

mujoco

Votes: 0

Answers: 1

Latest Answer

It sounds like you are expecting to literally set the velocities or accelerations? That is not how actuation works. Actuators apply forces; the resulting motion depends on many other things (inertia, ...

yuval
