-
Notifications
You must be signed in to change notification settings - Fork 36
Open
Description
Hi, thank you very much for providing the open source code!
However, I meet a problem when I use this code.
The model is not converged when I replace the Softmax function in attention with the Sparsemax function in my model.
Next, I try to use the Sparsemax function which is implemented by https://github.com/msobroza/SparsemaxPytorch, the model is converged as soon as possible.
So, I want to know whether some errors are in this code.
Thank you!
Metadata
Metadata
Assignees
Labels
No labels