self.target_critic.load_state_dict(critic_state_dict)
The line above seems to make the target critic network's parameters always identical to the critic network's. If so, what is the purpose of having a separate target network? Is it meant to make the network learn more slowly?
I hope somebody can help me!
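
To make the question concrete, here is a minimal sketch of one common pattern. It assumes, without confirmation from the code in question, that `critic_state_dict` is blended with the target's current values before `load_state_dict` is called (a "soft" or Polyak update). Plain dicts of floats stand in for PyTorch state dicts so the sketch runs without torch; `soft_update` and `tau` are hypothetical names, not taken from the repository.

```python
# Plain dicts of floats stand in for PyTorch state dicts (an assumption so
# the sketch runs without torch). soft_update and tau are hypothetical names.

def soft_update(target_state, critic_state, tau):
    """Blend key by key: tau * critic + (1 - tau) * target."""
    return {
        name: tau * critic_state[name] + (1.0 - tau) * target_state[name]
        for name in critic_state
    }

critic_state = {"w": 1.0, "b": 0.0}
target_state = dict(critic_state)  # hard copy at initialisation: identical

# Pretend one gradient step moved the critic's parameters.
critic_state = {"w": 2.0, "b": 1.0}

# The target moves only a fraction tau of the way towards the critic.
target_state = soft_update(target_state, critic_state, tau=0.25)
print(target_state)
```

If the dict passed to `load_state_dict` is blended like this, the target lags behind the critic instead of matching it. If it is the critic's unmodified `state_dict()`, the two networks are equal right after the call and drift apart only until the next copy.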