Support L2 cache clearing in do_bench_cudagraph #8384

yf225 · 2025-10-06T22:05:15Z

Different from triton.testing.do_bench, triton.testing.do_bench_cudagraph currently does not have L2 cache clearing (which is useful for measuring performance in cache-miss scenario common in real-world model training/inference). This PR adds clear_cache option arg to allow L2 cache clearing in do_bench_cudagraph.

I have written a PR description following these
rules.
I have run pre-commit run --from-ref origin/main --to-ref HEAD.
Select one of the following.
- I have added tests.
  - /python/test for end-to-end tests

Jokeren · 2025-10-07T00:01:30Z

It seems fine to me as the default is False.

What do you think? @ThomasRaoux @peterbell10

peterbell10 · 2025-10-07T10:54:09Z

python/triton/testing.py

                    for x in grad_to_none:
                        x.grad = None
+                maybe_clear_cache()
                fn()


It's a bit weird to have the cache flushing time included in the benchmark measurement. I suppose for autotuning purposes it should be fine as all calls are effected the same way, but there should at least be a warning in the doc string.

yf225 requested a review from ptillet as a code owner October 6, 2025 22:05

Support L2 cache clearing in do_bench_cudagraph

9943a49

yf225 force-pushed the do_bench_cudagraph_cache_clear branch from bd5ed9e to 9943a49 Compare October 6, 2025 22:06

yf225 mentioned this pull request Oct 6, 2025

Add L2 cache clearing to do_bench_cudagraph, for more realistic timing measurement meta-pytorch/tritonbench#519

Merged

Merge branch 'main' into do_bench_cudagraph_cache_clear

7a720b0

peterbell10 reviewed Oct 7, 2025

View reviewed changes

yf225 mentioned this pull request Oct 7, 2025

Exclude L2 cache clear time from timing measurement meta-pytorch/tritonbench#527

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support L2 cache clearing in do_bench_cudagraph #8384

Support L2 cache clearing in do_bench_cudagraph #8384

Uh oh!

yf225 commented Oct 6, 2025 •

edited

Loading

Uh oh!

Jokeren commented Oct 7, 2025

Uh oh!

peterbell10 Oct 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Support L2 cache clearing in do_bench_cudagraph #8384

Are you sure you want to change the base?

Support L2 cache clearing in do_bench_cudagraph #8384

Uh oh!

Conversation

yf225 commented Oct 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Jokeren commented Oct 7, 2025

Uh oh!

peterbell10 Oct 7, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

yf225 commented Oct 6, 2025 •

edited

Loading