Skip to content

Now, does tensor_parallel no longer support the huggingface trainer? #132

@HanGyeol-Yoo

Description

@HanGyeol-Yoo

I'm trying to use the huggingface trainer after using tensor_parallel with the Llama2 7b model, by calling

model = tp.tensor_parallel(model)

but I'm getting the following error.

ValueError: Model parameters were moved to incorrect devices, did call on model.cuda() or model.to(device)? If so, please avoid doing that

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions