Skip to content

the issue of train iterations in your paper #21

@lhehejunl

Description

@lhehejunl

Dear author:
In your paper, The learning rate decays by a factor of 0.5 every 200,000 iterations.
I want to confirm if it's 200,000 iterations, because in my opinion, 200,000 iterations is a very large number.
The learning rate starts from 1e-4 and decays by a factor of 0.5 every 200,000 iterations and finally ends with 3.125e-6. Thus, a total of 100,000,0 iterations are required, right?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions