Rollout summary statistics not being monitored for CustomEnv using Stable-Baselines3
I am trying to train a custom environment using PPO via Stable-Baselines3 and OpenAI Gym. For some reason the rollout statistics are not being reported for this custom environment when I try to train ...
Alex Hill
Votes: 0
Answers: 1
Why is the Stable-Baselines3 evaluate_policy() function never finishing/completing?
I have created my own custom environment using OpenAI Gym and Stable-Baselines3. Once I've trained the agent, I try to evaluate the policy using the evaluate_policy() function from stable_baselines3.c...
Alex Hill
Votes: 0
Answers: 1
How do I discretise a continuous observation and action space in Python?
My professor has asked me to apply a policy iteration method to the Pendulum-v1 environment in OpenAI Gym.
Pendulum-v1 has the following environment:
Observation
Type: Box(3)
Num  Observation  Min  M...
Dzartz94
Votes: 0
Answers: 0
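One common way to approach the discretisation asked about above is uniform binning: split each continuous dimension into equal-width bins and work with the bin indices. The following is a minimal sketch, not an answer from the thread. The bounds used here (cos and sin of the angle in [-1, 1], angular velocity in [-8, 8], torque in [-2, 2]) are assumptions taken from the usual Pendulum-v1 description, the excerpt itself is truncated before the limits appear, and the bin counts and all function names are hypothetical.

```python
from bisect import bisect_right

def make_edges(low, high, n_bins):
    """Return the n_bins - 1 interior edges that split [low, high] evenly."""
    width = (high - low) / n_bins
    return [low + width * i for i in range(1, n_bins)]

def to_bin(value, edges):
    """Map a continuous value to a bin index in 0..len(edges).

    Values below the first edge fall in bin 0 and values above the last
    edge fall in the last bin, so out-of-range inputs are clamped.
    """
    return bisect_right(edges, value)

def bin_centre(index, low, high, n_bins):
    """Return the continuous value at the centre of bin `index`."""
    width = (high - low) / n_bins
    return low + width * (index + 0.5)

# Assumed bounds (not shown in the truncated excerpt):
# cos(theta), sin(theta), angular velocity.
OBS_BOUNDS = [(-1.0, 1.0), (-1.0, 1.0), (-8.0, 8.0)]
ACT_BOUNDS = (-2.0, 2.0)  # assumed torque range
N_OBS_BINS = 10           # hypothetical resolution per observation dimension
N_ACT_BINS = 5            # hypothetical number of discrete torques

OBS_EDGES = [make_edges(lo, hi, N_OBS_BINS) for lo, hi in OBS_BOUNDS]

def discretise_observation(obs):
    """Turn a 3-element continuous observation into a tuple of bin indices."""
    return tuple(to_bin(x, edges) for x, edges in zip(obs, OBS_EDGES))

def action_from_index(index):
    """Turn a discrete action index back into a continuous torque."""
    low, high = ACT_BOUNDS
    return bin_centre(index, low, high, N_ACT_BINS)

if __name__ == "__main__":
    state = discretise_observation((0.95, 0.05, 0.1))
    torque = action_from_index(2)
    print(state, torque)
```

The tuple of bin indices can serve as a key into a value table for policy iteration, and `action_from_index` gives the torque to pass to the environment. Finer bins give a closer approximation at the cost of a larger table.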
MuJoCo via mujoco-py interface FetchReach-v1 scenario robotic action delay
Dear MuJoCo community,
For the last few days I have been working with a simple FetchReach-v1 scenario in the OpenAI Gym MuJoCo environment. I was trying to apply MPC (Model Predictive Control) to this scenario ...
mazy
Votes: 0
Answers: 1