نمودار کامیت

4663 کامیت‌ها

مولف SHA1 پیام تاریخ
Chauncey Hui cff74ae551 SWDEV-2 - Change OpenCL version number from 3518 to 3519 2022-10-25 03:00:08 -04:00
German e223b0f678 SWDEV-352487 - Don't add notifications as the last command
Change-Id: Ifed34485839ef2c9491e8e8f6bb3569932160b1c
2022-10-24 09:39:03 -04:00
Chauncey Hui a65ed69c1b SWDEV-2 - Change OpenCL version number from 3517 to 3518 2022-10-22 03:00:06 -04:00
Ajay 261620a2ef SWDEV-357207 - Linux: enable StreamWrite/Wait APIs in deviceAttribute
Change-Id: I593df2dbb7a0233dd42f8d873510d85bbb27feff
2022-10-21 12:44:35 -04:00
German c1c5127875 SWDEV-360183 - Finish the trace if queue was destroyed
- Make sure SQQT trace is captured for RGP server if the queue is destroyed before normal capture is done.
- Remove prepare queue from the logic. It's not really used for any HW capture and can cause RGP server abort if destroyed before capture is even started(delayed capture)

Change-Id: I6eb19963190a5769c6477a5496c1b831a6d59b89
2022-10-21 10:44:10 -04:00
Alex (Bin) Xie 06867f0c04 SWDEV-361591 - clean up header files
Change-Id: I25ca37a94524da0521f5b856a9fa9bab73f103c1
2022-10-21 10:23:55 -04:00
Chauncey Hui 904f3e3d75 SWDEV-2 - Change OpenCL version number from 3516 to 3517 2022-10-21 03:00:52 -04:00
Tao Sang 3d281114fb SWDEV-360706 - Fix WriteAqlArgAt() crash in X86
In X86 on Windows, sizeof(size_t)=4, but size=8, for
amd::KernelParameterDescriptor::HiddenGlobalOffsetX/Y/Z items.
Loose the condition to prevent crash.

Change-Id: I2216f71f4d4fd6dd3766023b1c821cb3d35d7848
2022-10-20 18:04:49 -04:00
Sourabh Betigeri b2665ccc4b SWDEV-361730 - Fix to match the kernel arg allocation sizes between gpu visible mem and host/runtime mem
Change-Id: I6495aa57e97b46373c1d90cc230435780fdcf3ef
2022-10-20 14:20:37 -04:00
Ajay a5b550e828 SWDEV-360944 - palBlit setArgument desc.size_ --> argSize
Fixes AMF tests that were impacted on Windows

Change-Id: I0326e794ec5a34fdec7a5ff4ed79d5bd055997d2
2022-10-20 12:44:15 -04:00
Laurent Morichetti 9a82118c85 SWDEV-362046 - Report HIP_OPS activities using the ROCr driver_node_id instead of the device's index
The ROCclr assigns zero-based IDs to GPUs in the order they are
discovered. That zero-based ID is what is used to identify the GPU
on which the HIP_OPS activity took place.

When multiple ranks are used, each rank's first logical device always
has GPU ID 0, regardless of which physical device is selected with
CUDA_VISIBLE_DEVICES. Because of this, when merging trace files from
multiple ranks, GPU IDs from different processes may overlap.

The long term solution is to use the KFD's gpu_id which is stable
across APIs and processes. Unfortunately the gpu_id is not yet exposed
by the ROCr, so for now use the driver's node id.

Change-Id: Ib78854527d600d175bb76e2df0747c33f898c615
2022-10-20 12:31:30 -04:00
Julia Jiang dacd55f3d7 SWDEV-357122 - fix failure in vdi so as to handle unreasonable input size in MallocManagedNegativeTest
Change-Id: I1ed1916b652afc67327b0935c3c60fc2a404df30
2022-10-20 12:19:22 -04:00
Chauncey Hui 05b2bd7995 SWDEV-2 - Change OpenCL version number from 3515 to 3516 2022-10-20 03:00:06 -04:00
Christophe Paquot 470c4d7335 SWDEV-322620 - Virtual Memory Management
Hooked up the start VA hint address to PAL.

Change-Id: I4301d3eaaabcc85faf63fb25291291dcc3e8ebd1
2022-10-19 10:50:07 -04:00
Chauncey Hui 5cad630425 SWDEV-2 - Change OpenCL version number from 3514 to 3515 2022-10-19 03:00:13 -04:00
kjayapra-amd 951a5b9e7d SWDEV-307411 - Require comment/message string in guarantee().
Change-Id: I55a699c5366f6a7d167ede4c9be2ec5b15cea9d0
2022-10-13 16:21:21 -04:00
Chauncey Hui 4e0e0666d6 SWDEV-2 - Change OpenCL version number from 3513 to 3514 2022-10-13 03:00:09 -04:00
Saleel Kudchadker 6405b6cdba SWDEV-359548 - Detect fence optimization
- Use a dirty flag to determine fence optimization
- If fence is dirty submit a marker at top level to sync.

Change-Id: I53fb19b5bb05b7c7b37c41637a6c7aaf870b639a
2022-10-10 14:15:16 -04:00
Chauncey Hui 621faa03ec SWDEV-2 - Change OpenCL version number from 3512 to 3513 2022-10-08 03:00:09 -04:00
Saleel Kudchadker eef2f99513 SWDEV-356482 - Improve logging
Change-Id: I0c6286fcdb9d94398ed66262e95e8837898ff9fe
2022-10-06 11:13:24 -04:00
Chauncey Hui 3e3fa68657 SWDEV-2 - Change OpenCL version number from 3511 to 3512 2022-10-06 03:00:06 -04:00
Sourabh Betigeri 84fbb30b7c SWDEV-357246 - Adds a missing return statement
Change-Id: I2216f71f4d4fb6dd3766023b0c821cb3d35d7849
2022-10-05 16:29:32 -04:00
Laurent Morichetti e00965df50 SWDEV-351980 - Add FillBuffer byte count to the record
Change-Id: I90c791f5810b8a3f6b1d6a9e81c165b1a7515c92
2022-09-30 21:20:14 -07:00
Chauncey Hui 800fe4b239 SWDEV-2 - Change OpenCL version number from 3510 to 3511 2022-09-30 03:00:10 -04:00
Jeremy Newton 65fd4f5f07 SWDEV-1 - Fix typo in compiler utils
Found by the Debian maintainers:
https://github.com/ROCm-Developer-Tools/ROCclr/issues/28

Signed-off-by: Jeremy Newton <Jeremy.Newton@amd.com>
Change-Id: I8c09913f7b7f4ad02f12a0630f15e66bdd5360a7
2022-09-29 18:14:29 -04:00
Chauncey Hui f93d921c4f SWDEV-2 - Change OpenCL version number from 3509 to 3510 2022-09-23 03:00:11 -04:00
Saleel Kudchadker 9b5cbd37a2 SWDEV-352001 - Store last scopes for dispatch
- Store last fence scopes and use the last value to determine if we need a cache flush again. This helps cases where hipExtLaunchKernel API is
used.
- Purge code for ROC_EVENT_NO_FLUSH

Change-Id: I531cf9c9c60d5e2b3a9e265d0f52f79ed2fa8a8c
2022-09-22 11:34:10 -04:00
Chauncey Hui ae46d57519 SWDEV-2 - Change OpenCL version number from 3508 to 3509 2022-09-22 03:00:08 -04:00
Laurent Morichetti 52eb28930a SWDEV-351980 - Consolidate registration tables in the roctracer library
Remove the activity_prof::CallbacksTable. The table was redundant with
the information already stored in the roctracer library. Instead use a
single callback into the roctracer library to query whether the activity
is enabled, and to report it.

Change-Id: I2e05b0881bb4a1953c14361d00ea310d02eb6e0c
2022-09-21 05:54:09 -04:00
Laurent Morichetti e713b5c7d0 SWDEV-351980 - Enable profiling for commands reporting activities
Profiling should be enabled for any command reporting activities as the
activity record captures the profilingInfo's start and end timestamps.

Since IS_PROFILER_ON is only used to determine whether API tracing is
enabled, there is no need to expose it globally, it should be a property
of the activity_prof::CallbacksTable.

Change-Id: I44a0d19ed2862606cfbc9a98c1a07a336ab7e26c
2022-09-21 05:53:59 -04:00
Laurent Morichetti 4fbae91468 SWDEV-351980 - Move activity_ to the ProfilingInfo
The activity_ is only instantiated if profiling is enabled.

Remove the HIP private global record ID. Instead, use the correlation ID
stored in the hip_api_data_t by the profiler while the last HIP function
is in scope.

For NDRange and Copy commands, store the kernel name and byte size
(respectively) in the record.

General cleanups to improve the code's readability.

Change-Id: I01907484b0d9611eb9440c3a7c4865479dc42289
2022-09-21 05:53:47 -04:00
Chauncey Hui 5f77772e6e SWDEV-2 - Change OpenCL version number from 3507 to 3508 2022-09-21 03:00:08 -04:00
German df77380cee SWDEV-286150 - Switch PAL_DISABLE_SDMA to release build
Change-Id: Ibf800e1c767fd1e6bc9ebc4fc56b767b0a90bdef
2022-09-20 13:47:14 -04:00
Chauncey Hui ae8b862f1c SWDEV-2 - Change OpenCL version number from 3506 to 3507 2022-09-20 03:00:08 -04:00
Saleel Kudchadker 348a33434a SWDEV-301667 - Log file out per process
Change-Id: If1e797b8ffb397c03166d9b8e5b5369c42c39e60
2022-09-19 13:19:14 -04:00
Chauncey Hui df40e927c8 SWDEV-2 - Change OpenCL version number from 3505 to 3506 2022-09-17 03:00:06 -04:00
Ajay 373a7d1195 SWDEV-347670 - GPU StreamWait and StreamWrite support in Windows PAL backend
Change-Id: Ic4881305b6332e217f3d3127dce7e9d9d0a7df11
2022-09-15 13:57:40 -04:00
Chauncey Hui cf04d40a1a SWDEV-2 - Change OpenCL version number from 3504 to 3505 2022-09-15 03:00:09 -04:00
kjayapra-amd 362e7961d9 SWDEV-351338 - Increase AQL Queue depth and reduce ROC_ACTIVE_WAIT_TIME to 0.
Change-Id: Iec0005d57933d0df2f55b4f7bdb84ada8a386a0c
2022-09-14 10:49:53 -04:00
Maneesh Gupta 4afca0647e SWDEV-352502 - SWDEV-355630 - Revert "SWDEV-299940 - Query global memory available on device with HSA attribute HSA_AMD_AGENT_INFO_MEMORY_AVAIL"
This reverts commit 73021582d3.

Change-Id: I86175ccf2b543c3ff3dffec6ccae410976972f22
2022-09-12 07:34:32 -04:00
Chauncey Hui c54da46b0c SWDEV-2 - Change OpenCL version number from 3503 to 3504 2022-09-08 03:00:07 -04:00
Rakesh Roy f097cda948 SWDEV-353941 - Fix hipMemset latency issue for hipMallocManaged
- In case of HMM, use blit kernel instead of CPU memcpy for hipMemset

Change-Id: I89bfc96ff01a2375ed8df1b1c6bc05357dea84f7
2022-09-07 03:20:58 -04:00
Chauncey Hui 94e8ded315 SWDEV-2 - Change OpenCL version number from 3502 to 3503 2022-09-07 03:00:07 -04:00
Sourabh Betigeri b15b3173f4 SWDEV-326798 - Avoids waiting for the signal with a timeout and instead queries the current status
Change-Id: I8b4052d5439ca930ceb924da30cb0335f0826a95
2022-09-06 12:49:46 -04:00
Chauncey Hui b2d412bcf1 SWDEV-2 - Change OpenCL version number from 3501 to 3502 2022-09-02 03:00:06 -04:00
Yifan Zhang d6c35273e3 SWDEV-354850 - Enable gfx1103 on ROCm backend.
Signed-off-by: Yifan Zhang <yifan1.zhang@amd.com>
Change-Id: Ia420444fd0cb703fcdca1ed7e3d2e262ffd7f4db
2022-09-01 18:58:00 +08:00
Chauncey Hui 38e9bc9540 SWDEV-2 - Change OpenCL version number from 3500 to 3501 2022-09-01 03:00:06 -04:00
Joseph Greathouse 6b956f7627 SWDEV-330307 - Avoid releasing command before last use
The fix for SWDEV-329789 moved down the last use of the a
command object pointer in order to prevent a race condition.
However, the previous patch did not move down the release of
that command. By releasing the command early, another thread
could get a command with the same pointer. That second thread
could later submit work to the queue using that new command.
The first thread could then perform a comparison against the
queue's last command using its own now-stale pointer. This
could eventually allow the second thread to skip synchornizing
on the queue. This would result in host synchronizations
completing before their device work was actually complete.

Change-Id: I292b7b369743251ceafe453a4c5cae14a6d01046
2022-08-31 16:07:49 -04:00
Alex Xie c00f10daea SWDEV-351337 - Add asic ID to staging for PAL
Change-Id: I64a24cc0118457d492d76063c808465d212b47bd
2022-08-31 15:56:40 -04:00
Chauncey Hui 2587a700b8 SWDEV-2 - Change OpenCL version number from 3499 to 3500 2022-08-31 03:00:06 -04:00