refine fp8_utils #10848
Conversation
Thanks for your contribution!
paddlenlp/transformers/fp8_utils.py (outdated)

return res

@staticmethod
def run_deep_gemm(a, a_scale, b, b_scale, out=None, num_sms=112, m_indices=None, is_grouped=False):
This run_deep_gemm needs some thought on how it differs from wgrad_gemm.
OK.
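[Editor's note] To make the reviewer's point concrete, here is a minimal sketch of a dispatch along the lines of the diff's signature. It assumes the kernel names from the open-source DeepGEMM library; treat them and the module layout as assumptions, not the actual code in this PR.

import paddle
import deep_gemm  # assumed import; module layout may differ


def run_deep_gemm(a, a_scale, b, b_scale, out=None, num_sms=112, m_indices=None, is_grouped=False):
    # num_sms: kept for signature parity; the real helper presumably
    # forwards it to the kernel scheduler.
    if out is None:
        out = paddle.empty([a.shape[0], b.shape[0]], dtype=paddle.bfloat16)
    if is_grouped:
        # Grouped over M: m_indices maps each row of `a` to its expert/group,
        # so one launch covers all experts.
        deep_gemm.m_grouped_gemm_fp8_fp8_bf16_nt_contiguous((a, a_scale), (b, b_scale), out, m_indices)
    else:
        deep_gemm.gemm_fp8_fp8_bf16_nt((a, a_scale), (b, b_scale), out)
    return out

# A wgrad GEMM differs in at least two ways, so it likely cannot share this
# helper blindly: it typically accumulates in fp32 (out += a^T @ b) across
# micro-batches, and for MoE it is grouped over K (tokens) rather than M.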
paddlenlp/transformers/fp8_utils.py (outdated)

if (x.shape[axis] + 128 - (x.shape[axis] % 128)) % 512 != 0:
    padding_size = 512

@staticmethod
def kitchen_fp8_gemm(
There's no need to single out fp8 in the name.
OK.
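[Editor's note] A possible clean rewrite of the padding rule above, with helper names of my own invention. Note that the original expression `x.shape[axis] + 128 - (x.shape[axis] % 128)` adds a full extra 128 even when the size is already 128-aligned, which may or may not be intended; the sketch below rounds up instead.

def _round_up(n, multiple):
    # Smallest multiple of `multiple` that is >= n.
    return ((n + multiple - 1) // multiple) * multiple


def choose_padding_alignment(dim):
    # Align to 128 first; if the 128-aligned size is still not a multiple
    # of 512, fall back to 512 so downstream kernels see a friendly shape.
    padded = _round_up(dim, 128)
    return 512 if padded % 512 != 0 else 128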
do3_orig_shape = do3.shape
do3 = do3.reshape([-1, do3_orig_shape[-1]])

@staticmethod
def fp8_mlp_bwd_norm_rc(do3, x, norm_w, norm_eps, w1, w2):
Has the correctness of this function been verified?
Let me double-check.
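[Editor's note] One way to answer the reviewer's question: compare the fp8 backward against a reference backward built from plain autograd. A minimal sketch; `reference_bwd` is a placeholder the test author would supply, and the tolerances are placeholders too, since fp8 quantization error is large.

import paddle


def check_fp8_mlp_bwd(fp8_mlp_bwd_norm_rc, reference_bwd, do3, x, norm_w, norm_eps, w1, w2):
    got = fp8_mlp_bwd_norm_rc(do3, x, norm_w, norm_eps, w1, w2)
    want = reference_bwd(do3, x, norm_w, norm_eps, w1, w2)
    for g, r in zip(got, want):
        # fp8 quantization introduces noticeable error, so tolerances are loose.
        assert paddle.allclose(
            g.astype("float32"), r.astype("float32"), rtol=1e-2, atol=1e-2
        ), "fp8_mlp_bwd_norm_rc diverges from the reference backward"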
paddlenlp/transformers/fp8_utils.py (outdated)

# ===== compute norm grad =====
dx, d_rms_norm_weight = fused_ln.fused_rms_norm_grad_func(x, norm_w, invar, d_norm_output, norm_eps)
if if_keep_x:
"if if_keep_x": the doubled "if" reads very oddly.
Let's just call it keep_x.
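[Editor's note] For reference alongside the snippet above, a plain-Paddle sketch of the RMSNorm backward that fused_rms_norm_grad_func computes. It assumes `invar` is the saved 1/sqrt(mean(x**2) + eps) from the forward pass; that convention is an assumption about the fused op.

import paddle


def rms_norm_grad_reference(x, norm_w, invar, d_norm_output):
    # Forward: y = norm_w * x * invar, with invar = 1/sqrt(mean(x**2) + eps).
    h = x.shape[-1]
    r = invar.unsqueeze(-1)  # broadcast over the hidden dimension
    # Weight grad: sum of dy * x * invar over all rows.
    d_rms_norm_weight = (d_norm_output * x * r).reshape([-1, h]).sum(axis=0)
    # Input grad: dy * w * r minus the correction term from d(invar)/dx.
    inner = (d_norm_output * norm_w * x).sum(axis=-1, keepdim=True)
    dx = d_norm_output * norm_w * r - x * (r**3) * inner / h
    return dx, d_rms_norm_weight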
(merge commit) …into rewrite_fp8_utils
if (x.shape[axis] + 128 - (x.shape[axis] % 128)) % 512 != 0:
    padding_size = 512

@staticmethod
def kitchen_gemm(
All of the logic under USE_DS_GEMM has been deleted.
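[Editor's note] To restore the dropped branch, a sketch of an env-guarded dispatch. The flag name follows the comment above; how the real code reads the flag, and the two path helpers, are assumptions (the stubs stand in for the real kernels).

import os

# Assumed flag: the real code may read this differently.
USE_DS_GEMM = os.getenv("USE_DS_GEMM", "0").lower() in ("1", "true")


def run_deep_gemm_path(a, a_scale, b, b_scale, out=None):
    raise NotImplementedError("stand-in for the DeepSeek-style GEMM kernel")


def run_kitchen_path(a, a_scale, b, b_scale, out=None):
    raise NotImplementedError("stand-in for the kitchen kernel")


def kitchen_gemm(a, a_scale, b, b_scale, out=None):
    if USE_DS_GEMM:
        return run_deep_gemm_path(a, a_scale, b, b_scale, out=out)
    return run_kitchen_path(a, a_scale, b, b_scale, out=out)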
Compare 1fb7393 to 6f04692
Before submitting
Add test cases into the tests folder. If there are codecov issues, please add test cases first.
PR types
Others
PR changes
Others
Description
Refactored the functions in fp8_utils: reduced duplicated implementations and improved maintainability. The main changes are as follows: