The PyTorch version of the network you provided trains very inefficiently and takes too long.