-
Notifications
You must be signed in to change notification settings - Fork 6
Open
Description
Describe the bug
1)I am not able to train using DDPG/TD3/PPO/TRPO. Everything is default except the agent and timestep which was chosen by me.
To Reproduce
Steps to reproduce the behavior:
1)python train.py -agent DDPG --timesteps 100000
Expected behavior
The script should be able to run and train using the chosen agent. It worked for option_critic and dac_ppo.
Desktop (please complete the following information):
- OS: Linux (Workstation) ; Windows 10 (PC)
- Browser: Chrome
- Version: 93.0.4577.63
Smartphone (please complete the following information):
- Device: iphone 11
- OS: iOS 14.71
- Browser: Safari
- Version: Can't find but it's the latest
Additional context
I think the issue is with the environment being 1 dimensional instead of giving 3 dimensions. I'm not sure how to troubleshoot this part as I don't know where the environment is taken from.
Metadata
Metadata
Assignees
Labels
No labels