feat: add batchnorm kernel #19

0xrushi · 2025-09-21T23:56:19Z

❯ python main.py batchnorm_benchmark --benchmark=True
Running BatchNorm benchmark: 512x2048, use_triton=True
Output shape: torch.Size([512, 2048]), dtype: torch.float32
Benchmark results
                             kernel_path  non_triton    triton  non_triton-triton
kernel                                                                           
batchnorm_benchmark  batchnorm_benchmark     0.00917  0.000153           0.009017

==================================

❯ python main.py batchnorm_benchmark --benchmark=True --batch_size=1024 --channels=4096
Running BatchNorm benchmark: 1024x4096, use_triton=True
Output shape: torch.Size([1024, 4096]), dtype: torch.float32
Benchmark results
                             kernel_path  non_triton    triton  non_triton-triton
kernel                                                                           
batchnorm_benchmark  batchnorm_benchmark    0.008819  0.000216           0.008603

==================================

0xrushi added 5 commits September 21, 2025 18:31

add bn

e9cc36b

bm

d9c950d

benchmark

0060d5b

gitignore

564b1be

deps

87c9bc6

0xrushi mentioned this pull request Sep 22, 2025

Implement BatchNorm in triton triton-lang/triton#900

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add batchnorm kernel #19

feat: add batchnorm kernel #19

Uh oh!

0xrushi commented Sep 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat: add batchnorm kernel #19

Are you sure you want to change the base?

feat: add batchnorm kernel #19

Uh oh!

Conversation

0xrushi commented Sep 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant