Commit ecaaefb
committed
Add kernelretsnoop and threadhist tools for CUDA kernel profiling
- Introduced `kernelretsnoop`, an eBPF-based tool to trace CUDA kernel thread exit timestamps, providing insights into thread execution times and performance bottlenecks.
- Implemented `threadhist`, a tool to analyze per-thread execution counts in CUDA kernels, helping to identify load imbalances and optimize thread configurations.
- Added example CUDA application (`vec_add.cu`) for testing both tools, demonstrating their usage in real scenarios.
- Created README files for both tools, detailing their functionality, usage, and examples.
- Included necessary build files and configurations for compiling and running the tools.1 parent 12a4f1e commit ecaaefb
File tree
15 files changed
+441
-357
lines changed- example/gpu
- cuda-counter-gpu-array
- cuda-counter-gpu-ringbuf
- kernelretsnoop
- threadhist
15 files changed
+441
-357
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
48 | 48 | | |
49 | 49 | | |
50 | 50 | | |
51 | | - | |
52 | | - | |
| 51 | + | |
| 52 | + | |
53 | 53 | | |
54 | 54 | | |
55 | 55 | | |
| |||
This file was deleted.
This file was deleted.
example/gpu/cuda-counter-gpu-ringbuf/.gitignore renamed to example/gpu/kernelretsnoop/.gitignore
Lines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | | - | |
| 1 | + | |
2 | 2 | | |
3 | 3 | | |
4 | 4 | | |
| |||
Lines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
21 | 21 | | |
22 | 22 | | |
23 | 23 | | |
24 | | - | |
| 24 | + | |
25 | 25 | | |
26 | 26 | | |
27 | 27 | | |
| |||
0 commit comments