-
Notifications
You must be signed in to change notification settings - Fork 228
Open
Description
I got an AssertionError: Mask is silently ignored due to the use of a custom kernel
when training GPT-2 with examples/pretrain_gpt.sh
.
This line leads to the assertion error:
assert mask is None, "Mask is silently ignored due to the use of a custom kernel" |
Is this assertion necessary? And is it even correct?
Metadata
Metadata
Assignees
Labels
No labels