Gradient accumulation bug? #2601

Open · opened on Dec 12, 2024

Is the common gradient accumulation bug described in the post below present in OpenNMT-py?

It was reported by the Unsloth developers:
https://unsloth.ai/blog/gradient
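
For context, here is a minimal sketch of the problem as I understand it from the blog post: if each micro-batch loss is averaged over its own tokens and the results are then divided by the number of accumulation steps, the total differs from the loss of the equivalent full batch whenever the micro-batches contain different numbers of non-padding tokens. The per-token loss values and the function names below are made up for illustration, and the sketch says nothing about how OpenNMT-py actually normalizes its loss.

```python
# Hypothetical per-token losses for two micro-batches that contain
# different numbers of non-padding tokens (values are illustrative only).
micro_batches = [
    [2.0, 2.0],                      # 2 tokens
    [1.0, 1.0, 1.0, 1.0, 1.0, 1.0],  # 6 tokens
]


def mean(values):
    return sum(values) / len(values)


def naive_accumulated_loss(batches):
    """Average each micro-batch on its own, then divide by the step count."""
    return sum(mean(b) for b in batches) / len(batches)


def token_normalized_loss(batches):
    """Sum every token loss, then divide once by the total token count."""
    total_tokens = sum(len(b) for b in batches)
    return sum(sum(b) for b in batches) / total_tokens


naive = naive_accumulated_loss(micro_batches)
exact = token_normalized_loss(micro_batches)
print(f"naive accumulation: {naive}, full-batch equivalent: {exact}")
```

With these numbers the naive scheme gives 1.5 while the token-normalized loss is 1.25, because the short micro-batch is weighted as heavily as the long one. The two only agree when every micro-batch has the same token count.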
