Skip to content

Conversation

yf225
Copy link
Contributor

@yf225 yf225 commented Oct 6, 2025

Triton do_bench_cudagraph (link) doesn't have L2 cache clearing which makes the result unreliable. This PR adds a version of do_bench_cudagraph that has L2 cache clearing and switch tritonbench to use it.

@yf225 yf225 requested review from oulgen and xuzhao9 October 6, 2025 19:25
@yf225 yf225 temporarily deployed to docker-s3-upload October 6, 2025 19:25 — with GitHub Actions Inactive
@yf225 yf225 temporarily deployed to docker-s3-upload October 6, 2025 19:25 — with GitHub Actions Inactive
@yf225 yf225 temporarily deployed to docker-s3-upload October 6, 2025 19:25 — with GitHub Actions Inactive
@meta-cla meta-cla bot added the cla signed label Oct 6, 2025
@xuzhao9
Copy link
Contributor

xuzhao9 commented Oct 6, 2025

Fantastic! Do you think we can upstream this to triton-lang/triton?

@yf225
Copy link
Contributor Author

yf225 commented Oct 6, 2025

Fantastic! Do you think we can upstream this to triton-lang/triton?

I believe so! will open a PR for this

@yf225 yf225 merged commit 379a315 into main Oct 6, 2025
7 checks passed
@yf225 yf225 deleted the do_bench_cudagraph_cache_clear branch October 6, 2025 20:09
@yf225
Copy link
Contributor Author

yf225 commented Oct 6, 2025

Open PR to triton-lang/triton at triton-lang/triton#8384

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants