Commit 2aa8cd0
Add better assertions for MXFP8 group gemm (#5203)
Summary:
Pull Request resolved: #5203
X-link: https://github.com/facebookresearch/FBGEMM/pull/2200
Currently only supporst BF16 out, but would silently fail if someone tries FP32 out. Add explicit assertion for this. And some small code refactor.
Reviewed By: q10, summerdengfb, jianyuh
Differential Revision: D88747914
fbshipit-source-id: da8337816d7792c3abb64ce67e768bd8ef5398731 parent c246916 commit 2aa8cd0
File tree
2 files changed
+3
-7
lines changed- fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions
- mx8mx8bf16_grouped
2 files changed
+3
-7
lines changedLines changed: 2 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
165 | 165 | | |
166 | 166 | | |
167 | 167 | | |
| 168 | + | |
168 | 169 | | |
169 | | - | |
| 170 | + | |
170 | 171 | | |
171 | 172 | | |
172 | 173 | | |
| |||
Lines changed: 1 addition & 6 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
65 | 65 | | |
66 | 66 | | |
67 | 67 | | |
68 | | - | |
| 68 | + | |
69 | 69 | | |
70 | 70 | | |
71 | 71 | | |
72 | 72 | | |
73 | | - | |
74 | | - | |
75 | | - | |
76 | | - | |
77 | | - | |
78 | 73 | | |
79 | 74 | | |
80 | 75 | | |
| |||
0 commit comments