7e3ea0c58e
Cache packet creation in all cases to reduce the number of allocations/ destruction operations made down to KFD. There is a bug that we encounter after a period of runtime in KFD where allocations fail to be visable to the GPU (suspect this is a FW issue, similar to other FW issues they have had along the same lines). This sidesteps that issue in rocprof (and likely should be done regardless) Co-authored-by: Benjamin Welton <bewelton@amd.com>