rocm-systems

Autors	SHA1	Ziņojums	Datums
Chauncey Hui	b96ddeed4c	SWDEV-2 - Change OpenCL version number from 3522 to 3523 [ROCm/clr commit: `cfd9ced308`]	2022-11-04 03:00:06 -04:00
kjayapra-amd	9f4ef7089f	SWDEV-329423 - Enable NV3x target ID related changes on mainline. Change-Id: I54fafb371976aaa16a452ec65b0c340c5682c264 [ROCm/clr commit: `ea0ce45ad3`]	2022-11-03 21:11:24 +00:00
German	bc14b745e8	SWDEV-365121 - Use CP DMA for tiny transfers Sync between compute and SDMA engines can be very expensive under Windows. Use CP DMA for tiny transfers (< 1KiB) to avoid syncs and improve performance. Change-Id: I9db39a2199f7b9e337ed08fd36d9cbc150502f1f [ROCm/clr commit: `473621c008`]	2022-11-03 10:04:53 -04:00
Tao Sang	441effd3d7	SWDEV-363140 - Support performance counter on gfx11 Change-Id: I2216f71f4d4db6dd3766333b0c821cb3d35d7851 [ROCm/clr commit: `06573ac92f`]	2022-11-03 07:59:24 -04:00
Chauncey Hui	fb16e20571	SWDEV-2 - Change OpenCL version number from 3521 to 3522 [ROCm/clr commit: `4b3ff92abd`]	2022-11-03 03:00:09 -04:00
Laurent Morichetti	6d3939cd49	SWDEV-351980 - Store the correlation ID in the dispatch packet The profiler can retrieve this correlation ID to attribute waves to specific dispatch locations. Change-Id: I700e8a91219d612f6a2028c0dda0c92753f3526a [ROCm/clr commit: `b043b4f5a2`]	2022-11-02 23:40:10 -04:00
Chauncey Hui	a2df4f8400	SWDEV-2 - Change OpenCL version number from 3520 to 3521 [ROCm/clr commit: `b070ba9cc3`]	2022-11-01 03:00:05 -04:00
jatang	53f289e0ac	SWDEV-293816 - Only dispatch Counter Aql packet when it is created successfully. Change-Id: Ic8a7374d19012754c6de5d483ec5d07a56661d1b [ROCm/clr commit: `166ef6b622`]	2022-10-31 15:11:41 -04:00
Saleel Kudchadker	0d1d52d249	SWDEV-301667 - Use queue base address for AQL log Change-Id: Ie525991b341deeb93ab3cb2d77550e8e1d87ee20 [ROCm/clr commit: `cf1fd98074`]	2022-10-31 13:54:49 -04:00
German	c04e2300c8	SWDEV-363074 - Clean-up sync between SDMA and compute HIP can't rely on the resource tracking, used in OCL and requires different explicit sync. Make sure ROCCLR syncs compute only when SDMA is used and vise versa. The new logic will allow to enable CPDMA without unnecessary waits. Change-Id: Ib9d1788cfd5afa5ea2fec4c96a37d8b9c4d0059d [ROCm/clr commit: `ff6b4db70b`]	2022-10-31 10:02:01 -04:00
Chauncey Hui	30ec1ca8df	SWDEV-2 - Change OpenCL version number from 3519 to 3520 [ROCm/clr commit: `3bce4df27d`]	2022-10-29 03:00:09 -04:00
German	c15bee3157	SWDEV-363074 - Enable resource cache for SVM Blender creates and destroys big allocations during the benchmark. That causes big delays, because vidmm has to page-in/page-out memory. Change-Id: I2baf4545807127406e3d2870a7581ff9ae7bcdb5 [ROCm/clr commit: `dc4ad8c99c`]	2022-10-27 16:39:29 -04:00
Chauncey Hui	b0af040a7d	SWDEV-2 - Change OpenCL version number from 3518 to 3519 [ROCm/clr commit: `cff74ae551`]	2022-10-25 03:00:08 -04:00
German	f5f0a6c618	SWDEV-352487 - Don't add notifications as the last command Change-Id: Ifed34485839ef2c9491e8e8f6bb3569932160b1c [ROCm/clr commit: `e223b0f678`]	2022-10-24 09:39:03 -04:00
Chauncey Hui	4c1741acfd	SWDEV-2 - Change OpenCL version number from 3517 to 3518 [ROCm/clr commit: `a65ed69c1b`]	2022-10-22 03:00:06 -04:00
Ajay	09e359f925	SWDEV-357207 - Linux: enable StreamWrite/Wait APIs in deviceAttribute Change-Id: I593df2dbb7a0233dd42f8d873510d85bbb27feff [ROCm/clr commit: `261620a2ef`]	2022-10-21 12:44:35 -04:00
German	6ab9293146	SWDEV-360183 - Finish the trace if queue was destroyed - Make sure SQQT trace is captured for RGP server if the queue is destroyed before normal capture is done. - Remove prepare queue from the logic. It's not really used for any HW capture and can cause RGP server abort if destroyed before capture is even started(delayed capture) Change-Id: I6eb19963190a5769c6477a5496c1b831a6d59b89 [ROCm/clr commit: `c1c5127875`]	2022-10-21 10:44:10 -04:00
Alex (Bin) Xie	d77a150bf9	SWDEV-361591 - clean up header files Change-Id: I25ca37a94524da0521f5b856a9fa9bab73f103c1 [ROCm/clr commit: `06867f0c04`]	2022-10-21 10:23:55 -04:00
Chauncey Hui	7347172bc3	SWDEV-2 - Change OpenCL version number from 3516 to 3517 [ROCm/clr commit: `904f3e3d75`]	2022-10-21 03:00:52 -04:00
Tao Sang	326049efb8	SWDEV-360706 - Fix WriteAqlArgAt() crash in X86 In X86 on Windows, sizeof(size_t)=4, but size=8, for amd::KernelParameterDescriptor::HiddenGlobalOffsetX/Y/Z items. Loose the condition to prevent crash. Change-Id: I2216f71f4d4fd6dd3766023b1c821cb3d35d7848 [ROCm/clr commit: `3d281114fb`]	2022-10-20 18:04:49 -04:00
Sourabh Betigeri	644fc9421c	SWDEV-361730 - Fix to match the kernel arg allocation sizes between gpu visible mem and host/runtime mem Change-Id: I6495aa57e97b46373c1d90cc230435780fdcf3ef [ROCm/clr commit: `b2665ccc4b`]	2022-10-20 14:20:37 -04:00
Ajay	e81e8965c9	SWDEV-360944 - palBlit setArgument desc.size_ --> argSize Fixes AMF tests that were impacted on Windows Change-Id: I0326e794ec5a34fdec7a5ff4ed79d5bd055997d2 [ROCm/clr commit: `a5b550e828`]	2022-10-20 12:44:15 -04:00
Laurent Morichetti	860e33dbed	SWDEV-362046 - Report HIP_OPS activities using the ROCr driver_node_id instead of the device's index The ROCclr assigns zero-based IDs to GPUs in the order they are discovered. That zero-based ID is what is used to identify the GPU on which the HIP_OPS activity took place. When multiple ranks are used, each rank's first logical device always has GPU ID 0, regardless of which physical device is selected with CUDA_VISIBLE_DEVICES. Because of this, when merging trace files from multiple ranks, GPU IDs from different processes may overlap. The long term solution is to use the KFD's gpu_id which is stable across APIs and processes. Unfortunately the gpu_id is not yet exposed by the ROCr, so for now use the driver's node id. Change-Id: Ib78854527d600d175bb76e2df0747c33f898c615 [ROCm/clr commit: `9a82118c85`]	2022-10-20 12:31:30 -04:00
Julia Jiang	a8c33d177a	SWDEV-357122 - fix failure in vdi so as to handle unreasonable input size in MallocManagedNegativeTest Change-Id: I1ed1916b652afc67327b0935c3c60fc2a404df30 [ROCm/clr commit: `dacd55f3d7`]	2022-10-20 12:19:22 -04:00
Chauncey Hui	cc132b1aec	SWDEV-2 - Change OpenCL version number from 3515 to 3516 [ROCm/clr commit: `05b2bd7995`]	2022-10-20 03:00:06 -04:00
Christophe Paquot	b2d02fe704	SWDEV-322620 - Virtual Memory Management Hooked up the start VA hint address to PAL. Change-Id: I4301d3eaaabcc85faf63fb25291291dcc3e8ebd1 [ROCm/clr commit: `470c4d7335`]	2022-10-19 10:50:07 -04:00
Chauncey Hui	3db95d8b68	SWDEV-2 - Change OpenCL version number from 3514 to 3515 [ROCm/clr commit: `5cad630425`]	2022-10-19 03:00:13 -04:00
kjayapra-amd	9bfae25a7c	SWDEV-307411 - Require comment/message string in guarantee(). Change-Id: I55a699c5366f6a7d167ede4c9be2ec5b15cea9d0 [ROCm/clr commit: `951a5b9e7d`]	2022-10-13 16:21:21 -04:00
Chauncey Hui	bfe5591f69	SWDEV-2 - Change OpenCL version number from 3513 to 3514 [ROCm/clr commit: `4e0e0666d6`]	2022-10-13 03:00:09 -04:00
Saleel Kudchadker	434df167df	SWDEV-359548 - Detect fence optimization - Use a dirty flag to determine fence optimization - If fence is dirty submit a marker at top level to sync. Change-Id: I53fb19b5bb05b7c7b37c41637a6c7aaf870b639a [ROCm/clr commit: `6405b6cdba`]	2022-10-10 14:15:16 -04:00
Chauncey Hui	9fdcdf6cd3	SWDEV-2 - Change OpenCL version number from 3512 to 3513 [ROCm/clr commit: `621faa03ec`]	2022-10-08 03:00:09 -04:00
Saleel Kudchadker	41d911d637	SWDEV-356482 - Improve logging Change-Id: I0c6286fcdb9d94398ed66262e95e8837898ff9fe [ROCm/clr commit: `eef2f99513`]	2022-10-06 11:13:24 -04:00
Chauncey Hui	3c01989266	SWDEV-2 - Change OpenCL version number from 3511 to 3512 [ROCm/clr commit: `3e3fa68657`]	2022-10-06 03:00:06 -04:00
Sourabh Betigeri	f25d8dae77	SWDEV-357246 - Adds a missing return statement Change-Id: I2216f71f4d4fb6dd3766023b0c821cb3d35d7849 [ROCm/clr commit: `84fbb30b7c`]	2022-10-05 16:29:32 -04:00
Laurent Morichetti	7bcdb29caa	SWDEV-351980 - Add FillBuffer byte count to the record Change-Id: I90c791f5810b8a3f6b1d6a9e81c165b1a7515c92 [ROCm/clr commit: `e00965df50`]	2022-09-30 21:20:14 -07:00
Chauncey Hui	9c0157e582	SWDEV-2 - Change OpenCL version number from 3510 to 3511 [ROCm/clr commit: `800fe4b239`]	2022-09-30 03:00:10 -04:00
Jeremy Newton	d4274a37c7	SWDEV-1 - Fix typo in compiler utils Found by the Debian maintainers: https://github.com/ROCm-Developer-Tools/ROCclr/issues/28 Signed-off-by: Jeremy Newton <Jeremy.Newton@amd.com> Change-Id: I8c09913f7b7f4ad02f12a0630f15e66bdd5360a7 [ROCm/clr commit: `65fd4f5f07`]	2022-09-29 18:14:29 -04:00
Chauncey Hui	f1c029e147	SWDEV-2 - Change OpenCL version number from 3509 to 3510 [ROCm/clr commit: `f93d921c4f`]	2022-09-23 03:00:11 -04:00
Saleel Kudchadker	0dd9add8e1	SWDEV-352001 - Store last scopes for dispatch - Store last fence scopes and use the last value to determine if we need a cache flush again. This helps cases where hipExtLaunchKernel API is used. - Purge code for ROC_EVENT_NO_FLUSH Change-Id: I531cf9c9c60d5e2b3a9e265d0f52f79ed2fa8a8c [ROCm/clr commit: `9b5cbd37a2`]	2022-09-22 11:34:10 -04:00
Chauncey Hui	a045c28998	SWDEV-2 - Change OpenCL version number from 3508 to 3509 [ROCm/clr commit: `ae46d57519`]	2022-09-22 03:00:08 -04:00
Laurent Morichetti	0cd3ec5056	SWDEV-351980 - Consolidate registration tables in the roctracer library Remove the activity_prof::CallbacksTable. The table was redundant with the information already stored in the roctracer library. Instead use a single callback into the roctracer library to query whether the activity is enabled, and to report it. Change-Id: I2e05b0881bb4a1953c14361d00ea310d02eb6e0c [ROCm/clr commit: `52eb28930a`]	2022-09-21 05:54:09 -04:00
Laurent Morichetti	353f9bc86c	SWDEV-351980 - Enable profiling for commands reporting activities Profiling should be enabled for any command reporting activities as the activity record captures the profilingInfo's start and end timestamps. Since IS_PROFILER_ON is only used to determine whether API tracing is enabled, there is no need to expose it globally, it should be a property of the activity_prof::CallbacksTable. Change-Id: I44a0d19ed2862606cfbc9a98c1a07a336ab7e26c [ROCm/clr commit: `e713b5c7d0`]	2022-09-21 05:53:59 -04:00
Laurent Morichetti	cbcc94b9e3	SWDEV-351980 - Move activity_ to the ProfilingInfo The activity_ is only instantiated if profiling is enabled. Remove the HIP private global record ID. Instead, use the correlation ID stored in the hip_api_data_t by the profiler while the last HIP function is in scope. For NDRange and Copy commands, store the kernel name and byte size (respectively) in the record. General cleanups to improve the code's readability. Change-Id: I01907484b0d9611eb9440c3a7c4865479dc42289 [ROCm/clr commit: `4fbae91468`]	2022-09-21 05:53:47 -04:00
Chauncey Hui	939199b98f	SWDEV-2 - Change OpenCL version number from 3507 to 3508 [ROCm/clr commit: `5f77772e6e`]	2022-09-21 03:00:08 -04:00
German	46409cc4e6	SWDEV-286150 - Switch PAL_DISABLE_SDMA to release build Change-Id: Ibf800e1c767fd1e6bc9ebc4fc56b767b0a90bdef [ROCm/clr commit: `df77380cee`]	2022-09-20 13:47:14 -04:00
Chauncey Hui	0226858812	SWDEV-2 - Change OpenCL version number from 3506 to 3507 [ROCm/clr commit: `ae8b862f1c`]	2022-09-20 03:00:08 -04:00
Saleel Kudchadker	b6696ed932	SWDEV-301667 - Log file out per process Change-Id: If1e797b8ffb397c03166d9b8e5b5369c42c39e60 [ROCm/clr commit: `348a33434a`]	2022-09-19 13:19:14 -04:00
Chauncey Hui	1b7f4194b5	SWDEV-2 - Change OpenCL version number from 3505 to 3506 [ROCm/clr commit: `df40e927c8`]	2022-09-17 03:00:06 -04:00
Ajay	c2a7fe7bd6	SWDEV-347670 - GPU StreamWait and StreamWrite support in Windows PAL backend Change-Id: Ic4881305b6332e217f3d3127dce7e9d9d0a7df11 [ROCm/clr commit: `373a7d1195`]	2022-09-15 13:57:40 -04:00
Chauncey Hui	46df61f614	SWDEV-2 - Change OpenCL version number from 3504 to 3505 [ROCm/clr commit: `cf04d40a1a`]	2022-09-15 03:00:09 -04:00

1 2 3 4 5 ...

4671 Revīzijas