Training Error with DDPG/TD3/PPO/TRPO

**Describe the bug**
1)I am not able to train using DDPG/TD3/PPO/TRPO. Everything is default except the agent and timestep which was chosen by me.


**To Reproduce**
Steps to reproduce the behavior:
1)python train.py -agent DDPG --timesteps 100000

**Expected behavior**
The script should be able to run and train using the chosen agent. It worked for option_critic and dac_ppo.

**Screenshots**
![image](https://user-images.githubusercontent.com/88356438/132971290-d0e88c31-1ac2-44c7-b5f5-058a62e1893a.png)


**Desktop (please complete the following information):**
 - OS: Linux (Workstation) ; Windows 10 (PC)
 - Browser: Chrome
 - Version: 93.0.4577.63

**Smartphone (please complete the following information):**
 - Device: iphone 11
 - OS: iOS 14.71
 - Browser: Safari
 - Version: Can't find but it's the latest

**Additional context**
I think the issue is with the environment being 1 dimensional instead of giving 3 dimensions. I'm not sure how to troubleshoot this part as I don't know where the environment is taken from.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Training Error with DDPG/TD3/PPO/TRPO #1

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Training Error with DDPG/TD3/PPO/TRPO #1

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions