You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Handle label imbalance in binary classification tasks on text benchmark (#376)
Labels in the text benchmarks are imbalanced and weighting the positive
labels improves performance.
Experiments done on `fake` dataset (5% positive labels) with
`text_embedded` and `RoBERTa` encodings:
- `ResNet` result changes 91.1% -> 93.4%
- `FTTransformer` result remains unchanged
- `Trompt` result changes 95.2% -> 95.8%
The differences were even more stark with distilled roberta, but we
aren't reporting those anywhere so I didn't note them down.
More results are pending
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
0 commit comments