Skip to content

Significant difference in results with different GPU counts on WMDP bio dataset using GradDiff #144

@ZeguanXiao

Description

@ZeguanXiao

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task
  • My own task or dataset (give details below)

Reproduction

I’ve encountered a very strange issue when running the WMDP bio dataset with the GradDiff method. Specifically, I observed that the number of GPUs used (I tested with 4 vs 8 GPUs) leads to significant differences in the results, even though the effective batch size was kept constant (32).

Has anyone else experienced this? Could there be any known reason why the GPU count would have such a strong impact on the outcomes?

Expected behavior

The results should be identical or similar.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions