ca7cce9e81
* update installation steps * Github Issue #50 Adding README's for samples * Making name change to ROCprofiler-SDK for consistency * Fix HIP trace documentation * Fix HSA trace in docs * Fix kernel trace in docs * Fixing memory copy and memory allocation traces * runtime trace and sys trace doc update * Fix scratch memory doc * kernel naming and filtering options * Adding collection period in docs * Perfetto configs update * summary output file * kernel trace format fix * update CHANGELOG * Agent index doc update * rocm-smi output * group by queue option * Updated --group-by-queue description * perfetto visualization --------- Co-authored-by: Ian Trowbridge <Ian.Trowbridge@amd.com>
1.8 KiB
1.8 KiB
| 1 | Kind | Agent_Id | Queue_Id | Stream_Id | Thread_Id | Dispatch_Id | Kernel_Id | Kernel_Name | Correlation_Id | Start_Timestamp | End_Timestamp | Private_Segment_Size | Group_Segment_Size | Workgroup_Size_X | Workgroup_Size_Y | Workgroup_Size_Z | Grid_Size_X | Grid_Size_Y | Grid_Size_Z |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2 | KERNEL_DISPATCH | Agent 4 | 3 | 4 | 77 | 9 | 12 | void addition_kernel<float>(float*, float const*, float const*, int, int) | 9 | 1057678125307992 | 1057678125319352 | 0 | 0 | 64 | 1 | 1 | 1024 | 1024 | 1 |
| 3 | KERNEL_DISPATCH | Agent 4 | 2 | 3 | 77 | 8 | 11 | divide_kernel(float*, float const*, float const*, int, int) | 8 | 1057678125310912 | 1057678125319952 | 0 | 0 | 64 | 1 | 1 | 1024 | 1024 | 1 |
| 4 | KERNEL_DISPATCH | Agent 4 | 2 | 3 | 77 | 7 | 13 | multiply_kernel(float*, float const*, float const*, int, int) | 7 | 1057678125298952 | 1057678125310912 | 0 | 0 | 64 | 1 | 1 | 1024 | 1024 | 1 |
| 5 | KERNEL_DISPATCH | Agent 4 | 2 | 3 | 77 | 6 | 14 | subtract_kernel(float*, float const*, float const*, int, int) | 6 | 1057678125288912 | 1057678125298952 | 0 | 0 | 64 | 1 | 1 | 1024 | 1024 | 1 |
| 6 | KERNEL_DISPATCH | Agent 4 | 2 | 3 | 77 | 5 | 12 | void addition_kernel<float>(float*, float const*, float const*, int, int) | 5 | 1057678125279112 | 1057678125288912 | 0 | 0 | 64 | 1 | 1 | 1024 | 1024 | 1 |
| 7 | KERNEL_DISPATCH | Agent 4 | 3 | 4 | 77 | 10 | 14 | subtract_kernel(float*, float const*, float const*, int, int) | 10 | 1057678125319352 | 1057678125328912 | 0 | 0 | 64 | 1 | 1 | 1024 | 1024 | 1 |
| 8 | KERNEL_DISPATCH | Agent 4 | 3 | 4 | 77 | 12 | 11 | divide_kernel(float*, float const*, float const*, int, int) | 12 | 1057678125356832 | 1057678125366712 | 0 | 0 | 64 | 1 | 1 | 1024 | 1024 | 1 |
| 9 | KERNEL_DISPATCH | Agent 4 | 3 | 4 | 77 | 11 | 13 | multiply_kernel(float*, float const*, float const*, int, int) | 11 | 1057678125344352 | 1057678125356832 | 0 | 0 | 64 | 1 | 1 | 1024 | 1024 | 1 |
| 10 | KERNEL_DISPATCH | Agent 4 | 4 | 5 | 77 | 13 | 12 | void addition_kernel<float>(float*, float const*, float const*, int, int) | 13 | 1057678125382792 | 1057678125394032 | 0 | 0 | 64 | 1 | 1 | 1024 | 1024 | 1 |