Conversation

@yao-matrix (Contributor):

@SunMarc, please help review, thanks very much.

yao-matrix changed the title from "fix continuous batching issues on XPU, extend ut cases to xpu" to "fix continuous batching issues, extend ut cases to xpu" on Oct 29, 2025.
@yao-matrix (Contributor, Author):

@SunMarc, could you please take a look? Thanks very much.

SunMarc requested a review from remi-or on November 4, 2025.
@remi-or (Collaborator) left a comment:

LGTM! Thanks for adding, I just have 2 nits :)

Diff context (imports from transformers.testing_utils):
    Expectations,
    require_kernels,
    require_torch_accelerator,
    require_torch_gpu,
@remi-or (Collaborator):

Is @require_torch_gpu still used after those changes?

@yao-matrix (Contributor, Author):

@remi-or Paged attention enablement on XPU in the kernels library is still in progress and should land soon; once it does, we will move the remaining cases from CUDA-only to XPU. Until then, some paged attention cases are still CUDA-only.
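As a minimal illustration (not part of the PR), the gating could look like the sketch below, using the require_torch_accelerator and require_torch_gpu decorators from transformers.testing_utils shown in the diff above; the test names and bodies are hypothetical:

    import torch
    from transformers.testing_utils import require_torch_accelerator, require_torch_gpu

    @require_torch_accelerator  # runs on any supported accelerator, e.g. CUDA or XPU
    def test_continuous_batching_generation():  # hypothetical test name
        # Resolve the device type the same way the PR's one-liner does ("cuda", "xpu", ...).
        device_type = (
            torch.accelerator.current_accelerator().type if hasattr(torch, "accelerator") else "cuda"
        )
        print(device_type)  # a real test would build its model and inputs on this device

    @require_torch_gpu  # kept CUDA-only until paged attention kernels are enabled on XPU
    def test_paged_attention_generation():  # hypothetical test name
        ...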

Diff context:
    blocks = blocks.reshape(local_experts, -1, module.intermediate_size // 2)
    if getattr(target_device, "type", target_device) == "cpu":
    -    target_device = "cuda"
    +    target_device = torch.accelerator.current_accelerator().type if hasattr(torch, "accelerator") else "cuda"
@remi-or (Collaborator):

If @SunMarc can OK this; I'm not familiar with the different accelerators.

@SunMarc (Member):

Should be fine!
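For reference, a hedged sketch of the device resolution that the one-liner above relies on, assuming a PyTorch build that ships the torch.accelerator API (2.6+); the is_available() guard is an extra safety check, not part of the PR:

    import torch

    # Prefer the runtime-detected accelerator type ("cuda" on NVIDIA, "xpu" on Intel GPUs, ...);
    # fall back to "cuda" on torch builds that predate the torch.accelerator API.
    if hasattr(torch, "accelerator") and torch.accelerator.is_available():
        target_device = torch.accelerator.current_accelerator().type
    else:
        target_device = "cuda"

    print(target_device)  # e.g. "xpu" on an Intel GPU host, "cuda" on an NVIDIA one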

@yao-matrix (Contributor, Author):

@SunMarc, I think we are OK to go now, thanks very much.

SunMarc enabled auto-merge (squash) on November 10, 2025.
@HuggingFaceDocBuilderDev:

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

SunMarc merged commit dba6aeb into huggingface:main on Nov 10, 2025.
23 checks passed
Abdennacer-Badaoui pushed a commit to Abdennacer-Badaoui/transformers that referenced this pull request on Nov 10, 2025:
…1830)

* extend continuous batching cases to xpu

Signed-off-by: Yao, Matrix <matrix.yao@intel.com>

* fix style

Signed-off-by: Yao, Matrix <matrix.yao@intel.com>

* fix style

Signed-off-by: Yao, Matrix <matrix.yao@intel.com>

---------

Signed-off-by: Yao, Matrix <matrix.yao@intel.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
yao-matrix deleted the cb-xpu branch on November 10, 2025.