[GPU] enable simd16 version for convolution_gpu_mmad_b_fs_yx_fsv32 #32501
Conversation
Lyamin-Roman left a comment:
LGTM, but please add tests
src/plugins/intel_gpu/src/kernel_selector/cl_kernels/convolution_gpu_mmad_b_fs_yx_fsv32.cl
    os_is_yx_isa8_osv8_isv4,    ///< format for weights for MMAD convolution
    os_is_zyx_isa8_osv8_isv4,   ///< format for weights for MMAD convolution
    os_is_yx_isa8_osv16_isv4,   ///< format for weights for fully connected MMAD
    os_is_zyx_isa8_osv16_isv4,  ///< format for weights for fully connected MMAD
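The `isa8_osv8_isv4` suffixes describe nested channel blocking. As a rough illustration of how such a layout could be linearized (this is my own reading of the format name, not code taken from the OpenVINO sources; the function and the blocking interpretation are assumptions):

```python
def fsv_weight_offset(o, i, y, x, O, I, Y, X):
    """Hypothetical linearization of an os_is_yx_isa8_osv8_isv4-style layout.

    Assumption: the name encodes nested blocking where the input channel is
    split as i = is_block*32 + isa*4 + isv (isa8 * isv4 = 32 channels per
    block) and the output channel as o = os_block*8 + osv, with dimension
    order (outermost to innermost): os, is, y, x, isa, osv, isv.
    """
    os_block, osv = divmod(o, 8)
    is_block, rem = divmod(i, 32)
    isa, isv = divmod(rem, 4)
    i_blocks = (I + 31) // 32  # number of 32-channel input blocks

    offset = os_block
    offset = offset * i_blocks + is_block
    offset = offset * Y + y
    offset = offset * X + x
    offset = offset * 8 + isa
    offset = offset * 8 + osv
    offset = offset * 4 + isv
    return offset
```

The point of such layouts is that the innermost `isv4`/`osv8` blocks line up with the operand shapes the MMAD instruction consumes, so the kernel can load contiguous memory.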
(random spot) Could you check why the oneDNN convolution kernel is not chosen? For GPUs with XMX, our expectation is to use oneDNN convolutions instead of cldnn convolutions.
Because the weights are u8.
Oh, I see. I think we need to file a ticket with the oneDNN team to support it. Performance won't be very good with the cldnn kernel.
Ticket created: https://jira.devtools.intel.com/browse/MFDNN-14355
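The thread above boils down to a simple dispatch rule. A minimal sketch of that rule (the function name, signature, and supported-dtype set are hypothetical, not OpenVINO's actual kernel-selector API):

```python
def pick_conv_impl(has_xmx: bool, weights_dtype: str) -> str:
    """Hypothetical dispatch rule inferred from the discussion above.

    Assumption: XMX-capable GPUs prefer oneDNN convolutions, but u8
    weights are not yet supported by oneDNN (hence the MFDNN ticket),
    so the selector falls back to the cldnn kernel in that case.
    """
    onednn_supported = {"i8", "f16", "f32"}  # assumed set, for illustration
    if has_xmx and weights_dtype in onednn_supported:
        return "onednn"
    return "cldnn"  # u8 weights, or hardware without XMX
```

Until oneDNN grows u8-weight support, the cldnn path is the only option for this model, which is why the simd16 variant of the cldnn kernel matters.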
class convolution_random_smoke_test : public testing::TestWithParam<convolution_random_test_all_params> {};
class convolution_random_fsv32_test : public testing::TestWithParam<convolution_random_test_all_params> {};
(random spot)
- How does it perform on old platforms with SIMD16? If your new implementation performs well, I'd suggest removing the simd8 implementation. That would simplify the code.
- Does this test code validate the simd16 version on old platforms? We don't have precommit tests for the new platforms.
On the model from the ticket, on Intel UHD 770 it is 34 fps (simd8) vs 39 fps (simd16).
On the model from the ticket, on Intel Iris (DUT2069-ADLP) it is 36 fps (simd8) vs 42 fps (simd16).
On the model from the ticket, on A770 it is 438 fps (simd8) vs 418 fps (simd16).
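Summarizing the three measurements above: simd16 is roughly 15% faster on the integrated GPUs but about 5% slower on A770. The relative changes can be computed directly from the reported fps numbers:

```python
# Relative change of simd16 vs simd8, from the fps numbers reported above.
measurements = {
    "UHD 770":            (34, 39),   # (simd8 fps, simd16 fps)
    "Iris (DUT2069-ADLP)": (36, 42),
    "A770":               (438, 418),
}

for gpu, (simd8, simd16) in measurements.items():
    change = (simd16 - simd8) / simd8 * 100
    print(f"{gpu}: {change:+.1f}% with simd16")
```

This is why the reviewer's suggestion to drop simd8 entirely is not clear-cut: the discrete A770 regresses slightly while the integrated parts improve.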
isanghao left a comment:
LGTM
…penvinotoolkit#32501)

Details:
- new platforms do not support simd8 (LNL, BMG)

Tickets:
- 174772