SWDEV-459778 - Remove CPU wait for profiler

- No cpu wait is needed when profiler is attached, Doing this changes
the application profile when roctracer is attached.

Change-Id: I2b9cfc48d697cf5ed54bb6a240d8c12bdb079171


[ROCm/clr commit: 51e4368723]
このコミットが含まれているのは:
Saleel Kudchadker
2024-05-08 21:11:17 +00:00
committed by Maneesh Gupta
コミット 3a67addd48
-3
ファイルの表示
@@ -123,9 +123,6 @@ void HostQueue::finish(bool cpu_wait) {
// If command doesn't contain HW event and runtime didn't request CPU wait,
// then force marker submit
bool force_marker = false;
// Force CPU wait if profiler is enabled. Pytorch tests may use tracer's plugin and rely on
// profiling information to be available right after finish.
cpu_wait |= activity_prof::IsEnabled(OP_ID_DISPATCH);
if (AMD_DIRECT_DISPATCH && (command != nullptr) && !cpu_wait) {
void* hw_event =
(command->NotifyEvent() != nullptr) ? command->NotifyEvent()->HwEvent() : command->HwEvent();