8778237298
* update installation steps
* Github Issue #50 Adding README's for samples
* Making name change to ROCprofiler-SDK for consistency
* Fix HIP trace documentation
* Fix HSA trace in docs
* Fix kernel trace in docs
* Fixing memory copy and memory allocation traces
* runtime trace and sys trace doc update
* Fix scratch memory doc
* kernel naming and filtering options
* Adding collection period in docs
* Perfetto configs update
* summary output file
* kernel trace format fix
* update CHANGELOG
* Agent index doc update
* rocm-smi output
* group by queue option
* Updated --group-by-queue description
* perfetto visualization
---------
Co-authored-by: Ian Trowbridge <Ian.Trowbridge@amd.com>
[ROCm/rocprofiler-sdk commit: ca7cce9e81]
1.8 KiB
1.8 KiB
| 1 | Kind | Agent_Id | Queue_Id | Stream_Id | Thread_Id | Dispatch_Id | Kernel_Id | Kernel_Name | Correlation_Id | Start_Timestamp | End_Timestamp | Private_Segment_Size | Group_Segment_Size | Workgroup_Size_X | Workgroup_Size_Y | Workgroup_Size_Z | Grid_Size_X | Grid_Size_Y | Grid_Size_Z |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2 | KERNEL_DISPATCH | Agent 4 | 3 | 4 | 77 | 9 | 12 | void addition_kernel<float>(float*, float const*, float const*, int, int) | 9 | 1057678125307992 | 1057678125319352 | 0 | 0 | 64 | 1 | 1 | 1024 | 1024 | 1 |
| 3 | KERNEL_DISPATCH | Agent 4 | 2 | 3 | 77 | 8 | 11 | divide_kernel(float*, float const*, float const*, int, int) | 8 | 1057678125310912 | 1057678125319952 | 0 | 0 | 64 | 1 | 1 | 1024 | 1024 | 1 |
| 4 | KERNEL_DISPATCH | Agent 4 | 2 | 3 | 77 | 7 | 13 | multiply_kernel(float*, float const*, float const*, int, int) | 7 | 1057678125298952 | 1057678125310912 | 0 | 0 | 64 | 1 | 1 | 1024 | 1024 | 1 |
| 5 | KERNEL_DISPATCH | Agent 4 | 2 | 3 | 77 | 6 | 14 | subtract_kernel(float*, float const*, float const*, int, int) | 6 | 1057678125288912 | 1057678125298952 | 0 | 0 | 64 | 1 | 1 | 1024 | 1024 | 1 |
| 6 | KERNEL_DISPATCH | Agent 4 | 2 | 3 | 77 | 5 | 12 | void addition_kernel<float>(float*, float const*, float const*, int, int) | 5 | 1057678125279112 | 1057678125288912 | 0 | 0 | 64 | 1 | 1 | 1024 | 1024 | 1 |
| 7 | KERNEL_DISPATCH | Agent 4 | 3 | 4 | 77 | 10 | 14 | subtract_kernel(float*, float const*, float const*, int, int) | 10 | 1057678125319352 | 1057678125328912 | 0 | 0 | 64 | 1 | 1 | 1024 | 1024 | 1 |
| 8 | KERNEL_DISPATCH | Agent 4 | 3 | 4 | 77 | 12 | 11 | divide_kernel(float*, float const*, float const*, int, int) | 12 | 1057678125356832 | 1057678125366712 | 0 | 0 | 64 | 1 | 1 | 1024 | 1024 | 1 |
| 9 | KERNEL_DISPATCH | Agent 4 | 3 | 4 | 77 | 11 | 13 | multiply_kernel(float*, float const*, float const*, int, int) | 11 | 1057678125344352 | 1057678125356832 | 0 | 0 | 64 | 1 | 1 | 1024 | 1024 | 1 |
| 10 | KERNEL_DISPATCH | Agent 4 | 4 | 5 | 77 | 13 | 12 | void addition_kernel<float>(float*, float const*, float const*, int, int) | 13 | 1057678125382792 | 1057678125394032 | 0 | 0 | 64 | 1 | 1 | 1024 | 1024 | 1 |