Commit Graph

2930 Commits

Author SHA1 Message Date
Wilkin c7af4e8857 ROCm Runtime Support for respecting target xnack setting
This includes the changes provided by Konstantin, "Add xnack from elf header" (Change 136389).

Change-Id: I95e51141caa0d7c21903b09212c02e4906ec54a3


[ROCm/ROCR-Runtime commit: 8e3d26c617]
2018-03-20 16:57:15 -04:00
Felix Kuehling 1dc430eed2 Let KFD use VM from DRM render node
Move opening of DRM render nodes from topology to FMM aperture
initialization. Keep the same FDs open for the lifetime of the
process to match how KFD uses the VMs in the FDs. Call acquire_vm
ioctl during aperture initialization to let KFD use the VMs from
the render nodes.

Change-Id: Ie07d57788cbe685b1841cccc00820c12894a0356
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>


[ROCm/ROCR-Runtime commit: 8ac2150e81]
2018-03-20 15:42:45 -04:00
Philip Yang 09aff7edc9 Export microcode version of sDMA
Change-Id: I86fa5da5e72af13a2e76e6e3be4667a7220923d5
Signed-off-by: Philip Yang <Philip.Yang@amd.com>


[ROCm/ROCR-Runtime commit: 1bf93d4e89]
2018-03-19 08:38:50 -04:00
Konstantin Zhuravlyov 544b3a88ec Bring loader in sync with stg sc.
Change-Id: Ib4d9231ca61048557acdad8eb8f632688c4aadd8


[ROCm/ROCR-Runtime commit: b7915e9248]
2018-03-12 15:00:50 -04:00
Felix Kuehling ea79b3d1da Update kfd_ioctl.h from kernel
This adds new acquire_vm ioctl.

Change-Id: Ia6794bfd291706cecdb2d06f4902b324b48577df
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>


[ROCm/ROCR-Runtime commit: 19dacdecd3]
2018-03-09 16:36:52 -05:00
Felix Kuehling 16c8b8add8 Rework SVM aperture initialization
Query GPUVM aperture limits of all dGPUs to determine SVM aperture
base and limit. This depends on a recent KFD change that reports
the GPUVM aperture limits for dGPUs in the
AMDKFD_IOC_GET_PROCESS_APERTURES_NEW ioctl (drm/amdkfd: Simplify
dGPU SVM aperture handling).

Only initialize SVM aperture once, instead of once per GPU.

Don't call AMDKFD_IOC_SET_PROCESS_DGPU_APERTURE. It's not needed any
more and will not be upstreamed.

Change-Id: Ib3389e8ba18505ba15fc33f45fe8a57e690a565d
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>


[ROCm/ROCR-Runtime commit: 85e1a9bf5e]
2018-03-09 16:36:49 -05:00
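
Commit 16c8b8add8 derives a single SVM aperture from the GPUVM aperture limits of all dGPUs. The following is a minimal sketch of that idea, assuming the shared aperture is the intersection of the per-GPU ranges. The `aperture` type and the `common_aperture` helper are hypothetical names for illustration and are not taken from the Thunk sources.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical per-GPU aperture; `limit` is the last valid address (inclusive). */
typedef struct {
    uint64_t base;
    uint64_t limit;
} aperture;

/* Compute the address range that every GPU in `gpus` can reach.
 * Returns false if there are no GPUs or the ranges do not overlap. */
bool common_aperture(const aperture *gpus, size_t count, aperture *out)
{
    assert(out != NULL);
    if (gpus == NULL || count == 0)
        return false;

    uint64_t base = gpus[0].base;
    uint64_t limit = gpus[0].limit;

    for (size_t i = 1; i < count; i++) {
        if (gpus[i].base > base)
            base = gpus[i].base;    /* highest base wins */
        if (gpus[i].limit < limit)
            limit = gpus[i].limit;  /* lowest limit wins */
    }

    if (base > limit)
        return false;               /* empty intersection */

    out->base = base;
    out->limit = limit;
    return true;
}
```

Computing the range once over all GPUs, rather than once per GPU, matches the commit's note that the SVM aperture is initialized only once.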
Felix Kuehling e0046526ea Move dGPU memory aperture initialization
Define dgpu_mem_init before it's used and keep the code close to the
rest of the aperture initialization code.

Change-Id: I14ad11a364524a15affee9186b1298ba7d56d2c9
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>


[ROCm/ROCR-Runtime commit: c5cfb7e25b]
2018-03-09 15:00:12 -05:00
Sean Keely 3f4f6f778a Use atomic variable for Runtime ref_count_.
Change-Id: Ic4d0ad9ff93d0cc52cfe2df006ee3436d5960b07


[ROCm/ROCR-Runtime commit: ac5ccb45b7]
2018-03-06 03:45:14 -06:00
Sean Keely 282dba1036 Add exception safety to Runtime::Acquire.
Change-Id: Ia2a9baf08bb56971412f1ac3914592612de5f134


[ROCm/ROCR-Runtime commit: 31c05d2fc7]
2018-02-28 05:21:07 -06:00
Yong Zhao 6d4838e570 Add pkg config support in the hsakmt-roct-dev package
Change-Id: Ida6b3083bfc9405ef9b6b8e426dc7dc51d61a811
Signed-off-by: Yong Zhao <yong.zhao@amd.com>


[ROCm/ROCR-Runtime commit: 15e525af45]
2018-02-27 11:21:38 -05:00
Yong Zhao 21f0985b1e Turn off the verbose building message
Change-Id: If4ebdb6f87fde9c3cc76b16c57e862bfb972ed5e
Signed-off-by: Yong Zhao <yong.zhao@amd.com>


[ROCm/ROCR-Runtime commit: 2c426a026a]
2018-02-26 18:17:13 -05:00
Philip Yang 5372649b06 Close shmem file handle, to fix file handle leak
kfdtest hsaKmtOpenKFD failed after 1019 iterations when run with
--gtest_loop=-1, because the default limit on open file handles is 1024.
lsof output showed that the shmem file handle was never closed.

Change-Id: I474de2bae6c03e879a219dedf5f18639118b73e5
Signed-off-by: Philip Yang <Philip.Yang@amd.com>


[ROCm/ROCR-Runtime commit: 105291849f]
2018-02-23 10:50:52 -05:00
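
The leak described in commit 5372649b06 follows a common pattern: a file is opened to back a shared mapping and the descriptor is never closed. The sketch below shows the general fix, assuming a plain file stands in for the shared-memory object. `map_shared_file` is a hypothetical helper, not the Thunk's actual function.

```c
#define _DEFAULT_SOURCE
#include <assert.h>
#include <fcntl.h>
#include <stddef.h>
#include <sys/mman.h>
#include <sys/types.h>
#include <unistd.h>

/* Hypothetical helper: map `size` bytes of the file at `path` as shared
 * memory. Returns the mapping, or NULL on failure. */
void *map_shared_file(const char *path, size_t size)
{
    assert(path != NULL && size > 0);

    int fd = open(path, O_CREAT | O_RDWR, 0600);
    if (fd < 0)
        return NULL;

    void *addr = MAP_FAILED;
    if (ftruncate(fd, (off_t)size) == 0)
        addr = mmap(NULL, size, PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);

    /* The mapping keeps its own reference to the file, so the descriptor
     * is closed on both the success and the failure path. */
    close(fd);

    return addr == MAP_FAILED ? NULL : addr;
}
```

With the `close` call in place, calling the helper in a loop and unmapping each result no longer consumes one descriptor per iteration, so the per-process open-file limit is not reached.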
Jay Cornwall 5b079d80cf Allocate EOP queue local to GPU
On discrete GPUs place the EOP queue in VRAM. The reader/writer of this
queue is the CP and the size is small. Dispatch latency improves
through lower read latency in AQL completion phase.

Change-Id: Id8351dcddbd21fd7c7d699803c96434c9132db71
Signed-off-by: Jay Cornwall <Jay.Cornwall@amd.com>


[ROCm/ROCR-Runtime commit: e2c353dc0d]
2018-02-22 18:14:05 -05:00
Oak Zeng ad1f1ee5dc Support ptrace access to invisible VRAM
Invisible device memory is mmapped as PROT_NONE.
Normal CPU access to the memory is still not allowed but
struct vm_area_struct will be created for the memory address
so ptrace can access the memory via the vma.

Change-Id: I07c69208716c920ccce33e6b494b610b61a0a7c1
Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>


[ROCm/ROCR-Runtime commit: 25170c3c57]
2018-02-20 14:13:00 -05:00
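
Commit ad1f1ee5dc relies on the fact that a `PROT_NONE` mapping still creates a VMA for the address range even though ordinary loads and stores to it fault. The sketch below shows only that mapping step, using an anonymous mapping as a stand-in for device memory. `reserve_inaccessible` is a hypothetical name, and the ptrace path through the driver is not modelled here.

```c
#define _DEFAULT_SOURCE
#include <assert.h>
#include <stddef.h>
#include <sys/mman.h>

/* Hypothetical helper: reserve `size` bytes of address space that the CPU
 * cannot read or write. The kernel still records a VMA for the range.
 * Returns NULL on failure. */
void *reserve_inaccessible(size_t size)
{
    assert(size > 0);

    void *addr = mmap(NULL, size, PROT_NONE,
                      MAP_PRIVATE | MAP_ANONYMOUS | MAP_NORESERVE, -1, 0);

    return addr == MAP_FAILED ? NULL : addr;
}
```

A range reserved this way appears in `/proc/self/maps` with `---p` permissions; touching it raises SIGSEGV until the protection is changed with `mprotect`.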
Chris Freehill 827e36d1b8 Reenable Memory_Max_Mem test
Change-Id: I2da50f886cd8d28d0f9ad8b8f77cfa13d392bf01


[ROCm/ROCR-Runtime commit: 81c923b26f]
2018-02-13 17:37:25 -06:00
Chris Freehill 942295129d Completely disable rocm-smi (take 2)
Change-Id: I68f403c539163bfe00ee2b59dbd36d1c6d7669f1


[ROCm/ROCR-Runtime commit: 146b3871df]
2018-02-12 06:43:47 -06:00
Chris Freehill 17c20b9d22 Completely disable rocm-smi from rocrtst until rocm-smi-lib is updated
Change-Id: I5cce06a2bbde7a3a48e391022c793a462794c6d1


[ROCm/ROCR-Runtime commit: bd0c4efc34]
2018-02-11 21:42:25 -06:00
Sean Keely f64d45a66d Improve fragment map reporting format.
Change-Id: I85d09d085b08de46271ec902c766a8609a4b921a


[ROCm/ROCR-Runtime commit: 95c926059d]
2018-02-09 14:03:03 -05:00
Sean Keely de3912dae4 Emit fragment map and thunk ptr info with VM faults.
Change-Id: If1302f674df7a636529c64bf66dfdda755a70c32


[ROCm/ROCR-Runtime commit: 9212e7a09f]
2018-02-09 14:02:26 -05:00
Sean Keely c1147397e0 Reduce to only one internal compute queue.
Change-Id: Ie42ecb3b242077624d74caeabfcd418dbbd9ff3e


[ROCm/ROCR-Runtime commit: 0eb534e3cf]
2018-02-09 14:02:15 -05:00
Sean Keely e844aa939d Defer creation of internal queues and blits until first needed.
Change-Id: I2e61d7e102f38389d806d9eb24beda910573157b


[ROCm/ROCR-Runtime commit: bd5dd47ca1]
2018-02-09 14:02:07 -05:00
Harish Kasiviswanathan 1a9af8b82d CMA: Initialize SizeCopied return parameter
UCX test cases are reporting uninitialized values when CMA fails. The
application should ideally ignore SizeCopied when the function fails but
it doesn't. This is leading to wrong diagnosis.

v2: Fill in partial SizeCopied in case of failure

Change-Id: I6b7e1c19a8b702ec91ca64201a3dda27bd897877
Signed-off-by: Harish Kasiviswanathan <Harish.Kasiviswanathan@amd.com>


[ROCm/ROCR-Runtime commit: 7de0199e99]
2018-02-08 12:46:40 -05:00
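
Commit 1a9af8b82d makes the `SizeCopied` output parameter well defined on every return path, including partial progress on failure. The following is a minimal sketch of that convention. The `copy_range` type and the `copy_ranges` function are hypothetical and only mimic the shape of a multi-range copy; they are not the CMA API.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical description of one range to copy. */
typedef struct {
    void *dst;
    const void *src;
    size_t size;
} copy_range;

/* Copy each range in order. Returns 0 on success and -1 on the first
 * invalid range. *size_copied is always written: it holds the number of
 * bytes copied before the function returned. */
int copy_ranges(const copy_range *ranges, size_t count, size_t *size_copied)
{
    assert(size_copied != NULL);

    *size_copied = 0; /* defined even if the first range fails */

    for (size_t i = 0; i < count; i++) {
        if (ranges[i].dst == NULL || ranges[i].src == NULL)
            return -1; /* *size_copied reports the partial progress */

        memcpy(ranges[i].dst, ranges[i].src, ranges[i].size);
        *size_copied += ranges[i].size;
    }

    return 0;
}
```

A caller that reads the output after a failure then sees a meaningful byte count instead of whatever happened to be on its stack.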
James Edwards 27ce54a0aa Fix compilation errors with gcc 7.2.1 for hsa runtime.
Change-Id: I3356388753ca78cc0f1e0c3188220d7f3f60283d


[ROCm/ROCR-Runtime commit: b913795c31]
2018-02-07 09:22:39 -06:00
Evgeny f22fab8263 ExecutePM4 queue full check fix
Change-Id: Id56ece6d3f5eab1ef3a2758922022f0996c1efe4


[ROCm/ROCR-Runtime commit: 8a8d7ad814]
2018-02-05 19:35:39 -06:00
Chris Freehill 67598cc036 Don't support platform atomics for gfx9XX
Change-Id: I302c862494e221ae2b6b3e1a843f06586b0b28ba


[ROCm/ROCR-Runtime commit: 3449f7dea6]
2018-02-02 18:21:16 -05:00
Yong Zhao 1d401fe538 Revert "Workaround: make mmap memory resident for gfx902"
This reverts commit cbcae4eecc.

Change-Id: I9f4f0b6b426aeae4cb652b33cf0d4c0f57270ca5
Signed-off-by: Yong Zhao <yong.zhao@amd.com>


[ROCm/ROCR-Runtime commit: 55bb61ff9c]
2018-02-02 12:31:06 -05:00
Sean Keely 00b5aec5d4 Guard against IPC signal use when profiling copy APIs.
Also update IPC signal API text to allow single process profiling with IPC signals.

Change-Id: I90b246623129d57183acb4ba1789beec360547c3


[ROCm/ROCR-Runtime commit: f59b001c75]
2018-01-31 19:05:32 -05:00
Sean Keely 9c247a4c6b Revert CRAT table workaround.
Change-Id: Ic2bf9e1fb1d00c5a31d52560e0eb37e0ae1ab08a


[ROCm/ROCR-Runtime commit: 91f559802d]
2018-01-30 18:26:53 -06:00
Tony Tye ab3c53761b Add support for R_AMDGPU_RELATIVE64
- Add support for R_AMDGPU_RELATIVE64 relocation record.
- Return status error if any unsupported relocation record encountered.

Change-Id: Icbb5dcb81109a70c1f2195412a0df58a11be9da1


[ROCm/ROCR-Runtime commit: d472b24d05]
2018-01-30 18:20:26 -05:00
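
Commit ab3c53761b adds handling for `R_AMDGPU_RELATIVE64` and makes the loader fail on any relocation type it does not support. The sketch below illustrates both points, assuming the relocation computes load base plus addend (B + A) and has type number 13, as I understand the LLVM AMDGPU backend documentation to list it. The `reloc_sketch` type and the `apply_relocation` function are hypothetical and are not the loader's real interface.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Type number assumed from the LLVM AMDGPU ELF relocation table. */
#define R_AMDGPU_RELATIVE64 13u

/* Hypothetical relocation record; field names are illustrative. */
typedef struct {
    uint32_t type;
    uint64_t offset; /* where in the image to write */
    int64_t addend;  /* A */
} reloc_sketch;

/* Apply one relocation to `image`, which is loaded at address `load_base`.
 * Returns 0 on success, -1 for an unsupported type or out-of-range offset. */
int apply_relocation(uint8_t *image, size_t image_size, uint64_t load_base,
                     const reloc_sketch *rel)
{
    assert(image != NULL && rel != NULL);

    switch (rel->type) {
    case R_AMDGPU_RELATIVE64: {
        if (rel->offset > image_size ||
            image_size - rel->offset < sizeof(uint64_t))
            return -1;

        uint64_t value = load_base + (uint64_t)rel->addend; /* B + A */
        memcpy(image + rel->offset, &value, sizeof(value));
        return 0;
    }
    default:
        /* Report unsupported records instead of silently skipping them. */
        return -1;
    }
}
```

Returning an error from the `default` branch means a code object with an unknown relocation fails to load rather than running with an unpatched address.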
Chris Freehill 2b39004aa6 Temporarily disable rocm-smi to integrate with new rocm-smi
Change-Id: I06701cd4ac80bb4f3a9ae48d5374b7d4a788f8a4


[ROCm/ROCR-Runtime commit: 8bf85cc668]
2018-01-26 06:44:01 -06:00
Laurent Morichetti 84945d39f8 Silence Valgrind warnings
Change-Id: I8803f3d310fccd69d0d04b2464b00dccc40270e3
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>


[ROCm/ROCR-Runtime commit: 056ddbbc82]
2018-01-25 16:48:17 -05:00
rohit pathania ea550637f0 Added CPU to GPU and GPU to CPU MemoryAccess Tests, Added enqueue latency Tests
Change-Id: I18643d283101b792fa25705c8149ddc5a9eefe73


[ROCm/ROCR-Runtime commit: ee917eca68]
2018-01-08 04:11:32 -05:00
Amber Lin 83bbc28ad9 Update README to reflect cmake change
New CMakeLists.txt sets a default module search so -DCMAKE_MODULE_PATH is
no longer required in the command.

Change-Id: I95189ce2f36016b7c4929239d0e512851bec5ef6
Signed-off-by: Amber Lin <Amber.Lin@amd.com>


[ROCm/ROCR-Runtime commit: 7031a77428]
2018-01-04 15:55:20 -05:00
Chris Freehill f113a2dadc Report physical memory instead of virtual memory
Change-Id: I18105e3982a96aea40e05cd78521c0c3acf75de4


[ROCm/ROCR-Runtime commit: 563581223c]
2017-12-20 22:11:50 -04:00
Sean Keely 640ca3ba4f Merge system heap info.
Workaround pending thunk spec clarification.

Change-Id: I9d96227efde3a551157733cf4050d474d1e658f2


[ROCm/ROCR-Runtime commit: fe1763848a]
2017-12-19 18:57:29 -06:00
Amber Lin 91ee9033f1 Update README to include new requirement
Latest Thunk requires the user to belong to video group. Add this
statement to README.md to notify external users on Github.

Change-Id: Id9843abf09de5b63a3b7c3f7b322bc9099c6ff1a
Signed-off-by: Amber Lin <Amber.Lin@amd.com>


[ROCm/ROCR-Runtime commit: 8bc83e1e9b]
2017-12-18 12:10:24 -05:00
rohit pathania 72073d5f4a Resubmitted "added and modified common utilities functions for rocrtest" with a fix for the failure to open file
Change-Id: Ie45668df1a15c1be7e8bdb10b967b98fb3024252


[ROCm/ROCR-Runtime commit: 7310312291]
2017-12-18 05:06:22 -04:00
Kent Russell 061789c3fd Revert "added and modified common utilities functions for rocrtest"
This reverts commit 4c4869bf50.

Change-Id: I825b210ce4fc831f8a978faf1c7d83d54408efa4


[ROCm/ROCR-Runtime commit: c3a880db7d]
2017-12-15 06:04:50 -05:00
Sean Keely ecbd2eb820 Remove region/pool size limits for 902.
Temporary measure. Must be reverted once CRAT tables have been fixed.

Change-Id: Id2f2673edbf7b6fc5752f8d871042b4bf4de653c


[ROCm/ROCR-Runtime commit: b49e5b4917]
2017-12-14 16:02:05 -05:00
Yong Zhao cbcae4eecc Workaround: make mmap memory resident for gfx902
Change-Id: I5f90f316740f7995d54cb083a6d7e05bc4e2966e
Signed-off-by: Yong Zhao <yong.zhao@amd.com>


[ROCm/ROCR-Runtime commit: 716755b1de]
2017-12-14 15:11:01 -05:00
rohit pathania 4c4869bf50 added and modified common utilities functions for rocrtest
Change-Id: I80afa33a46b3d95058be306869e7ed54b2b7df64


[ROCm/ROCR-Runtime commit: 7e46704abb]
2017-12-14 12:01:16 -05:00
Sean Keely b8934623ee Don't use double mappings on GFX9 APUs.
Change-Id: I1225696211d4eac9ce982243ea0a1a9e8b2a318f


[ROCm/ROCR-Runtime commit: 1addb5e684]
2017-12-08 20:18:02 -05:00
Yong Zhao 738a69caeb Report gfx902 as GFX 9.0.2
This change is needed to match other higher level components.

Change-Id: I45114d23f2ed428dfbbb836061b3020c5ab166ec
Signed-off-by: Yong Zhao <yong.zhao@amd.com>


[ROCm/ROCR-Runtime commit: 0f83774635]
2017-12-07 16:08:10 -05:00
Sean Keely e155bd0cb1 Report library load errors in debug builds.
Change-Id: I24e63b15ad74fb86ecfe839f543800c2140c09d9


[ROCm/ROCR-Runtime commit: ca4c884306]
2017-12-05 18:49:33 -05:00
Oak Zeng c3dcb0a7bb Revert "Revert "More cleanup of fmm.c""
This reverts commit 5b0ba9550c.

Change-Id: I31afe4889794df8cf1e96f5f18771bed75a213d9
Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>


[ROCm/ROCR-Runtime commit: c2dc301792]
2017-12-04 15:48:11 -05:00
Oak Zeng 6c763e99fe Revert "Revert "Cleanup fmm.c""
This reverts commit dbd9a8736c and adds a bug fix to the
"Cleanup fmm.c" patch: call id_in_array with the correct parameter.
The third parameter of id_in_array is the size of the array in bytes,
not the number of array items.

Change-Id: I72d8e2fcc0df32af76c72967386e92c1be18c159
Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>


[ROCm/ROCR-Runtime commit: 786e470241]
2017-12-04 15:48:11 -05:00
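
The bug fixed in commit 6c763e99fe is a mismatch between a byte size and an element count. The sketch below reproduces that mismatch with a lookup function whose third parameter is a size in bytes, as the commit message describes. `id_in_array_sketch` is a hypothetical stand-in written for illustration, not a copy of the function in fmm.c.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Return true if `id` occurs in `ids`. `size_in_bytes` is the size of the
 * whole array in bytes, not the number of elements. */
bool id_in_array_sketch(uint32_t id, const uint32_t *ids, size_t size_in_bytes)
{
    assert(ids != NULL);

    size_t count = size_in_bytes / sizeof(ids[0]);

    for (size_t i = 0; i < count; i++) {
        if (ids[i] == id)
            return true;
    }
    return false;
}
```

For an array `uint32_t ids[4]`, passing `sizeof(ids)` searches all four entries, while passing the element count `4` is read as four bytes and searches only the first entry.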
Felix Kuehling cfd86f5b7e Rename fmm_allocate_memory_in_device
to fmm_allocate_memory_object. This function name was confusingly
similar to fmm_allocate_device and __fmm_allocate_device. The new name
reflects its function better: allocate the VM object and the kernel
mode buffer object.

Change-Id: I6604d228004b4d41e871d4de784786823608b5d6
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>


[ROCm/ROCR-Runtime commit: 587d4f4bdf]
2017-12-04 10:23:55 -05:00
Oak Zeng dbd9a8736c Revert "Cleanup fmm.c"
This reverts commit b1a482dd52.
This change caused a regression ()
Revert temporarily

Change-Id: Ic3829264151e37d1f8c6927c6f464006234ba17f
Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>


[ROCm/ROCR-Runtime commit: f7689d4fef]
2017-11-29 09:43:11 -05:00
Oak Zeng 5b0ba9550c Revert "More cleanup of fmm.c"
This reverts commit 5883a2c86b.
This change caused a regression ()
Revert temporarily

Change-Id: I5af59d319afeb7f0b03e5a09e8397e3853b8b37b
Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>


[ROCm/ROCR-Runtime commit: 52f6a61970]
2017-11-29 09:42:19 -05:00
Sean Keely a427fde26a Cleanup Signal interfaces for doorbells.
Create an interface for doorbell signals to reduce code duplication.
No functional changes.

Change-Id: I101a8997dd582ff99e1537758c804b21fe3bb6af


[ROCm/ROCR-Runtime commit: d2e70bb999]
2017-11-28 22:12:19 -06:00