Fix padded batch items lengthening generation time #419

jonatanklosko · 2025-06-17T18:27:33Z

When generating text with batch_size > 1, any padded batch items feed the model zeroed input, so it may output arbitrary tokens for them. For some models that token happens to be EOS, and the padding makes no difference. Other models keep outputting a random non-EOS token until the max length is reached, so the whole batch keeps generating long after the real items have finished.
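To illustrate the effect, here is a minimal, library-free sketch in Python. It is not the code from this PR: `fake_next_token`, `generate`, the `mark_padding_finished` flag, and the token values are all hypothetical stand-ins. It assumes a generation loop that stops only once every batch item has emitted EOS or the max length is reached, and shows how treating padded items as already finished lets the loop stop as soon as the real items are done.

```python
EOS = 0          # assumed end-of-sequence token id (illustrative)
MAX_LENGTH = 8   # assumed maximum number of generation steps


def fake_next_token(item_index, step):
    """Hypothetical stand-in for a model's next-token prediction."""
    if item_index == 0:
        # The real item emits EOS at step 2.
        return EOS if step == 2 else 5
    # The padded item gets zeroed input and never happens to emit EOS.
    return 7


def generate(is_padding, mark_padding_finished):
    """Run a toy generation loop and return the number of steps taken."""
    # Optionally treat padded items as finished before the first step.
    finished = [pad and mark_padding_finished for pad in is_padding]
    steps = 0
    while steps < MAX_LENGTH and not all(finished):
        for i in range(len(finished)):
            if not finished[i] and fake_next_token(i, steps) == EOS:
                finished[i] = True
        steps += 1
    return steps


# One real item plus one padded item.
print(generate([False, True], mark_padding_finished=False))  # 8 (runs to max length)
print(generate([False, True], mark_padding_finished=True))   # 3 (stops after the real item)
```

In this toy setup the padded item alone forces the loop to run to the max length; once it is excluded from the stopping condition, the step count depends only on the real input.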

Fix padded batch items lengthening generation time

2dccaf8

jonatanklosko merged commit 7617ff5 into main Jun 17, 2025
0 of 2 checks passed

jonatanklosko deleted the jk-padded-generation branch June 17, 2025 18:27

jonatanklosko mentioned this pull request Jun 17, 2025

Use m2m100 #417

Closed
