Skip to content

Conversation

yf225
Copy link
Contributor

@yf225 yf225 commented Oct 7, 2025

triton-lang/triton#8384 added L2 cache clearing to do_bench_cudagraph, but it includes cache clearing time in timing measurement which is wrong. This PR fixes it.

@yf225 yf225 force-pushed the do_bench_cudagraph_exclude_cache_clear_time branch from 3069726 to dbd40e9 Compare October 7, 2025 22:07
@yf225 yf225 temporarily deployed to docker-s3-upload October 7, 2025 22:07 — with GitHub Actions Inactive
@yf225 yf225 temporarily deployed to docker-s3-upload October 7, 2025 22:07 — with GitHub Actions Inactive
@yf225 yf225 temporarily deployed to docker-s3-upload October 7, 2025 22:07 — with GitHub Actions Inactive
@yf225 yf225 force-pushed the do_bench_cudagraph_exclude_cache_clear_time branch from dbd40e9 to 0a608cd Compare October 7, 2025 23:13
@yf225 yf225 temporarily deployed to docker-s3-upload October 7, 2025 23:13 — with GitHub Actions Inactive
@yf225 yf225 temporarily deployed to docker-s3-upload October 7, 2025 23:13 — with GitHub Actions Inactive
@yf225 yf225 force-pushed the do_bench_cudagraph_exclude_cache_clear_time branch from 0a608cd to 1eb387b Compare October 7, 2025 23:57
@yf225 yf225 force-pushed the do_bench_cudagraph_exclude_cache_clear_time branch from 1eb387b to 2eb490c Compare October 8, 2025 00:02
@yf225 yf225 temporarily deployed to docker-s3-upload October 8, 2025 00:03 — with GitHub Actions Inactive
@yf225 yf225 temporarily deployed to docker-s3-upload October 8, 2025 00:03 — with GitHub Actions Inactive
@yf225 yf225 temporarily deployed to docker-s3-upload October 8, 2025 00:03 — with GitHub Actions Inactive
@yf225 yf225 merged commit 5d05cb9 into main Oct 8, 2025
7 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants