Commit graph

244 Commits

Author SHA1 Message Date
Julia Jiang 43a2f75882 SWDEV-286322 - clean up trailing space (#2361)
Change-Id: I03c07e67a8d1fa1a874718ffba43eb396c2aa05c
2021-09-24 16:27:51 +05:30
Tao Sang 395f11c64c SWDEV-294588 - Enable NV printf DTests
Enable NV printf DTests as many as possible.
Fix the bugs due to behavior differences between
Hip-Rocclr and Cuda.
Add hipLimitPrintfFifoSize.

Change-Id: I3fe6dbc35a7a140a9919df197b7885df83d28049
2021-07-20 21:22:11 -04:00
Satyanvesh Dittakavi 7f2e5437cb SWDEV-284207 - Modify HIP samples to run on Navi and MI100 Asics
Change-Id: I5b8d4f8df36a5be2ea4c381b88454832c59253e6
2021-07-20 02:56:19 -04:00
Tao Sang d5142afe58 SWDEV-292273 - Add cmake tests
Add test on CXX and Fortran build in cmake.
Add test on hip::device interface linking in cmake.

Change-Id: I3fe6dba05a7a140a9a19df107b7885df83d28042
2021-07-08 12:51:46 -04:00
agunashe e2c6bb5b4e SWDEV-293742 - Update copyright end year
Change-Id: I2cb7bbd9a6d9da28116ba9dd9cec4e60525444e2
2021-07-07 12:57:23 -04:00
agodavar 56b3cb554a SWDEV-288510 - Add hip shared library path to run-time search for linker
Change-Id: I7eaf712c01556933b881b983a8994f4e06333116
2021-06-02 12:57:12 -04:00
Julia Jiang e0312fb454 SWDEV-272921 - Remove non-existing link for square sample
Change-Id: If8c87a6ded6aacaa4301b1b80ee218897219a6fb
2021-05-10 12:09:28 -04:00
Aaron En Ye Shi 458a09f151 SWDEV-222423 - Add HIP Sample 2_Cookbook/15_static_library
HIP supports emitting two types of static libraries. One type
will export host functions and is compatible with host linkers.
The second type exports device functions, but is generated with
ar manually. Also, add a README with steps on how to run these
samples with Makefile or CMake.

Change-Id: I1be15c2884583b370092bc8e4bf04f726f8f5a27
2021-04-27 16:05:15 -04:00
Julia Jiang cf02c80e13 SWDEV-270961 - update HIP documents
Change-Id: I5168d83a124e9c7d856fb7a46d9960d9f99143c6
2021-04-05 14:34:33 -04:00
Tao Sang 562282039e SWDEV-270973 - Add missing copyright headers
Change-Id: I6bb27650f74372dae6e29c79fd6bb2022cc062fe
2021-03-01 19:46:24 -05:00
Rahul Garg eff5d6cf6c SWDEV-269604 - Revert "Update dispatch latency with wall time measurements"
This reverts commit 024e78b865.

Hold wall time related updates till direct dispatch is ready.

Change-Id: I53b232f6f51bc2fc71b6b639fe0081e2907e9707
2021-02-25 04:50:44 +00:00
Julia Jiang ce2dff449d SWDEV-271416 - Remove HIP_DYNAMIC_SHARED macro in hip
Change-Id: I12f39ea8438eb7ce76d8ffb2151b4faa93689048
2021-02-08 13:17:46 -05:00
agodavar 1d1fd4d9f6 SWDEV-268552 - HIP sample segfaults during hipModuleLoad, fix mmap bug
Change-Id: I51c6061e7ac6b2524586d96fdd69b9f084b1906a
2021-01-27 23:39:50 -05:00
agodavar 3e11da745d SWDEV-268552 - HIP sample hipDispatchEnqueueRateMT segfaults, updated to use hipModuleLoadData
Change-Id: Ibf34c780d5bd4278adcedba1d6e057637c258485
2021-01-19 09:32:14 -05:00
Rahul Garg 024e78b865 Update dispatch latency with wall time measurements
Change-Id: I73bfed129171d6909b4638f714b694111b31ef56
2021-01-12 22:52:33 +00:00
Tao Sang 22094572e1 Remove hip-hcc codes: Part four
Remove __HCC__, __HCC_ONLY__, __HCC_CPP__, __HCC_C__,
__HCC_OR_HIP_CLANG__, __HIP_ROCclr__ and their guarded codes.

Remove Hcc codes from directed_tests and samples.

Remove __HIP_PLATFORM_HCC__ and __HIP_PLATFORM_NVCC__ from
some files where they are not necessary.

Add deprecation notice.

Change-Id: I1ae467eafd749d6c25bca204c1724b026be21fce
2021-01-08 10:53:34 -05:00
Tao Sang 693b89b3dd Remove hip-hcc codes: Part three
1.Rename include/hip/hcc_detail/ as include/hip/amd_detail/

2.Rename include/hip/nvcc_detail/ as include/hip/nvidia_detail/

3.Create __HIP_PLATFORM_AMD__ to replace __HIP_PLATFORM_HCC__

4.Create __HIP_PLATFORM_NVIDIA__ to replace __HIP_PLATFORM_NVCC__

After hcc_detail, nvcc_detail, __HIP_PLATFORM_HCC__ and __HIP_PLATFORM_NVCC__
have been removed from upstream, they will be removed from hip runtime.

Change-Id: I1ae457effd739d6c25bca203c1724b026be21fce
2021-01-05 10:39:31 -05:00
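The macro rename in this commit can be illustrated with a small sketch. It assumes a code base that may still be built with only the legacy `__HIP_PLATFORM_HCC__` or `__HIP_PLATFORM_NVCC__` macro defined. The shim, the `HipPlatform` enum, and the `hip_platform()` helper are hypothetical names introduced for illustration and are not part of HIP.

```cpp
// Hypothetical compatibility shim (not part of HIP): when only a legacy
// platform macro is defined, also define the replacement named in the commit.
// Double-underscore identifiers are reserved for the toolchain, so a real
// project would more likely set these macros through its build system.
#if defined(__HIP_PLATFORM_HCC__) && !defined(__HIP_PLATFORM_AMD__)
#define __HIP_PLATFORM_AMD__ 1
#endif
#if defined(__HIP_PLATFORM_NVCC__) && !defined(__HIP_PLATFORM_NVIDIA__)
#define __HIP_PLATFORM_NVIDIA__ 1
#endif

// Hypothetical helper type, used only to make the selection observable.
enum class HipPlatform { None, Amd, Nvidia };

// Reports which platform macro is in effect, using only the new names.
constexpr HipPlatform hip_platform() {
#if defined(__HIP_PLATFORM_AMD__)
    return HipPlatform::Amd;
#elif defined(__HIP_PLATFORM_NVIDIA__)
    return HipPlatform::Nvidia;
#else
    return HipPlatform::None;
#endif
}
```

Code written this way tests only the new macro names, so it should keep compiling once the legacy names are removed from the runtime.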
Aaron En Ye Shi 301d6d8f00 Add HIP Sample 2_Cookbook/17_llvm_ir_to_executable
HIP supports compiling kernels from LLVM IR into executable.
The device LLVM IR needs to be compiled into a fat binary
object. This device object is embedded into a host object using
llvm-mc directives. Then, any host linker may link the host and
device objects together into an executable. A README was added.

Change-Id: I8ebb6ae86b7ab4290f7cba2eea5584d73a7c453e
2020-12-09 15:48:53 -05:00
Aaron En Ye Shi 96c330c1af Add HIP Sample 2_Cookbook/16_assembly_to_executable
HIP supports compiling kernels from assembly into an executable.
The device assembly needs to be compiled into a fat binary
object. This device object is embedded into a host object using
llvm-mc directives. Then, any host linker may link the host and
device objects together into an executable. A README is added.

Change-Id: I59d3a8b5363073810ffc3aa0d57f21b0df272369
2020-12-09 15:48:41 -05:00
Tao Sang ef2751c120 Make directed_test support static libs
1.Make directed_test apps linked against static libs
of hip, rocclr, rocr, roct and amd_comgr.
2.Remove custom_target amdhip64_static_combiner.
3.Support EXCLUDE_HIP_LIB_TYPE <static|shared>.
4.Simplify argument list parsing.
5.Install rocclr when rocm is installed.
6.Fix some original small bugs.

Revert "Revert "Make directed_test support static libs""
This reverts commit 144a6fb100.

Change-Id: I918eeae94487e5e2ff5bfde083667ac65fb6e702
2020-10-26 12:39:58 -04:00
Sarbojit Sarkar f40e7d71cc Added sample with __gfx*__ macro used
Change-Id: I80a474947ea1b755082dad5329d1aff3ee78840c
2020-10-14 06:56:39 -04:00
Tao Sang 144a6fb100 Revert "Make directed_test support static libs"
This reverts commit d00d627b9d.

Reason for revert: <INSERT REASONING HERE>

Change-Id: Ib2d1084ea21461f70f96f95c6025a08f88288357
2020-10-12 00:01:13 -04:00
Tao Sang d00d627b9d Make directed_test support static libs
1.Make directed_test apps linked against static libs
of hip, rocclr, rocr, roct and amd_comgr.
2.Remove custom_target amdhip64_static_combiner.
3.Support EXCLUDE_HIP_LIB_TYPE <static|shared>.
4.Simplify argument list parsing.
5.Fix hip_pch.o missing issue in samples' build.
6.Fix some original small bugs.

Change-Id: I30f5e65367613152ce86b80b13e1c9f2a28da803
2020-10-08 22:46:58 -04:00
Vladislav Sytchenko 08c35854c0 Revert "Revert "Merge branch 'amd-master-next' into amd-npi-next""
This reverts commit 051c84bba3.

Reason for revert: <INSERT REASONING HERE>

Change-Id: I92ceb171e31026ed1864704cef2fc1497b883ef9
2020-10-05 13:20:58 -04:00
Vladislav Sytchenko 051c84bba3 Revert "Merge branch 'amd-master-next' into amd-npi-next"
This reverts commit 607ac99389.

Reason for revert: <INSERT REASONING HERE>

Change-Id: I53322718dadde2c98f96140b8e260ec7ee9ef721
2020-10-05 13:02:39 -04:00
Julia Jiang ecbe97b9a9 SWDEV-253250-update instruction for hip sample
Change-Id: Iebc2887a3687e6c565999f4faf1cd4cf893e24cd
2020-09-25 13:51:19 -04:00
Satyanvesh Dittakavi cc28186c90 Amend hipBusBandwidth sample
Change-Id: I9230b553275146e984c6e7d9f11b76e520e14809
2020-09-18 13:45:19 -04:00
Tao Sang 5b8651c825 Make samples support cmake
Only cmake can support static lib of hip rt.
Thus samples will support static lib of hip
rt when this is done.

Change-Id: I70e8d06e85084369a035b42c5d1d56287c874ac9
2020-09-18 07:37:02 -04:00
Rahul Garg aea45f4b58 Fix hipDispatchEnqueueRateMT sample
Change-Id: I7ade0dcf8b32ac90a311022d12339740b2b3df44
2020-08-14 18:56:39 -04:00
Jason Tang c7212506c9 Add asicRevision
Change-Id: I59f3ad20b9bdadf77bd1e0725f7a401d7ad423a3
2020-06-16 17:54:20 -04:00
Jason Tang 593b8a093f SWDEV-227909 - Add gcnArchName
Change-Id: Iea6d16b5d693dd0d900fa424d7a321c39315430e
2020-06-05 15:33:55 -04:00
Dittakavi Satyanvesh ea913c964a enable occupancy sample on HIP-Clang
Change-Id: I236daad743bbbc7f8daa41e8157a4ff73058518b
2020-05-19 03:08:19 -04:00
Rahul Garg a8f78c4c7e Merge "Remove HIP_MARKER left overs due to HIP PR 2032" into amd-master-next
2020-05-07 01:36:57 -04:00
Rahul Garg 9ac4bb8627 Remove HIP_MARKER left overs due to HIP PR 2032
Change-Id: Ieae68dd3b12c92b1d6830619ca4c6ae43c400225
2020-05-06 20:26:18 -04:00
agodavar 7dda0bb8a0 SWDEV-233749 Added support for parallel build and link
Change-Id: Id227ea1fe7574612c33a8e6d91fc59a29490a35f
2020-05-06 14:54:39 -04:00
Aaron En Ye Shi 6c98a12a9b Merge "find_package(hip) changes for clang" into amd-master-next
2020-05-04 10:19:51 -04:00
Alex Xie 9136212b51 SWDEV-221166 - Detect support for large bar access through HIP runtime API
Change-Id: Iaa9756c1b5e40c1ab5afb38e44a6699fa5f6c13f
2020-05-01 20:39:52 -04:00
agodavar 14b060bfd3 find_package(hip) changes for clang
Change-Id: I9dc1be1dd5f9424e26ebcd2961207e1c4467550f
2020-05-01 20:48:32 +00:00
root 3c8b7f1e38 Merge master into amd-master-next
Change-Id: I3fc1dc0c860d627053537581e75561e8a7efe327
2020-04-26 22:19:37 +00:00
Michael LIAO 2ab06b307d [vdi] Fix texture reference sample.
- The driver code should not re-define `tex` again as it's already
  defined in the kernel code. Eventually, the driver code should be
  regular C++ code instead of HIP code.

Change-Id: I8c7cab204b98990619d6e7109b990d7089ea9261
2020-04-23 23:41:56 -04:00
Maneesh Gupta e46a4e3e48 Merge in the rocclr based hip runtime (#2032)
* Merge master-next changes in master (include vdi development in master branch)
2020-04-23 09:12:06 -07:00
Michael LIAO 8ec28891de [vdi] Refactor texture/surface reference support.
Change-Id: I8014d82aae7139ef5f95e4b50c4fc6da200dbc9d
2020-04-21 11:56:48 -04:00
Tao Sang 6c0a08ba4f Solve issues with hip-vdi runtime static lib
1.Combine libamdhip64_static_base.a and libamdvdi_static.a into libamdhip64_static.a.
2.Let hipcc use -use-staticlib to link libamdhip64_static.a.
3.Add some samples for static lib.
4.Fix compiling failure of code object.

Change-Id: Ic8c95228eb139058da8b5d66ba8439486154ca6f
2020-04-20 15:37:02 -04:00
Tao Sang af5a4ca38b Revert "Solve issues with hip-vdi runtime static lib"
This reverts commit ed3b0eb391.

Reason for revert: It breaks the dkms-no-npi-hipclang build.
Maintaining the dkms-no-npi-hipclang build is the top priority; otherwise we lose track of regression analysis.
So revert the change for now and recommit it after fixing it.

Change-Id: Ia5136e888baecb6148c6c18eedbf37066fcb1eaa
2020-04-17 10:13:08 -05:00
Tao Sang ed3b0eb391 Solve issues with hip-vdi runtime static lib
1.Combine libamdhip64_static_base.a and libamdvdi_static.a into libamdhip64_static.a.
2.Let hipcc use -use-staticlib to link libamdhip64_static.a.
3.Add some samples for static lib.
4.Fix compiling failure of code object.
Change-Id: Ia2333622a8d05639b90974c4c5d3d85654ba0138
2020-04-15 13:46:36 -04:00
Maneesh Gupta 22e9bcf424 Merge branch 'amd-master' into amd-master-next
Change-Id: I3094c15008093f2072bcd38aca4ea90aeae2d97b
2020-04-09 06:31:00 -04:00
Evgeny Mankov a93ae7cbed [HIP][doc] Update docs due to moving of HIPIFY to a separate repo (#2001)
2020-04-07 14:03:19 +05:30
Rahul Garg 177457e54c Rename hipDrvOccupancy to hipModuleOccupancy and match CUDA syntax (#1943)
2020-04-07 14:02:52 +05:30
Sarbojit2019 5144ee8c04 Fix few memory leaks in HIP (#1969)
2020-03-27 14:08:30 +05:30
Joseph Greathouse 449e2c931b Fix maxSharedMemoryPerMultiProcessor attribute (#1927)
The maxSharedMemoryPerMultiProcessor attribute is meant to describe
the number of bytes of shared memory (LDS space in AMD terminology)
in each SM (CU in AMD terminology). For instance, on AMD GPUs this
is often 64KB per CU, and on some Nvidia GPUs it's 96KB per SM.

This shared memory is a different address space from the normal
global memory. However, the current HIP-HCC properties fill this
in with a size that matches the totalGlobalMem property. This gives
a drastically too-high calculation for the amount of LDS space that
each CU has -- tens of GBs vs. tens of KBs.

This patch fixes this by pulling the maxSharedMemoryPerMultiProcessor
property from the HSA pool that describes how much workgroup-local
space is available on each CU. The HSA runtime eventually pulls
this from the topology information about LDSSizeInKB, defined as
"Size of Local Data Store in Kilobytes per SIMD".

Previously, this HSA query was used to fill in the value of the
sharedMemPerBlock property. On today's AMD GPUs, we know that
the amount of LDS available to the workgroup is identical to the
amount of LDS space in the CU. However, in the future this may
differ. As such, this patch changes around the order and fills
in the "PerMultiProcessor" property from the HSA query (since
that's what the query is defined to return), and then separately
fills in the "PerBlock" property as we know it.
2020-03-17 14:00:51 +05:30
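The fix described in this commit can be sketched as follows. This is a minimal illustration, not the actual HIP runtime code: `SharedMemPropsSketch` and `fill_shared_mem_props` are hypothetical names, the struct only mirrors three fields of `hipDeviceProp_t`, and the LDS size is assumed to arrive in kilobytes as the commit message describes for the topology information.

```cpp
#include <cstddef>

// Hypothetical stand-in for the three device properties discussed above.
struct SharedMemPropsSketch {
    std::size_t totalGlobalMem;                    // bytes of global memory
    std::size_t sharedMemPerBlock;                 // bytes of LDS per workgroup
    std::size_t maxSharedMemoryPerMultiProcessor;  // bytes of LDS per CU
};

// Fills the per-CU property from the LDS size (the "query" result) and then
// sets the per-block property separately. The two are equal here only because
// the commit states they are identical on today's AMD GPUs.
constexpr SharedMemPropsSketch fill_shared_mem_props(std::size_t total_global_bytes,
                                                     std::size_t lds_size_kb) {
    return SharedMemPropsSketch{
        total_global_bytes,
        lds_size_kb * 1024,  // per block: assumed equal to the per-CU LDS today
        lds_size_kb * 1024   // per CU: taken from the LDS size, not global memory
    };
}
```

With a 64 KB LDS and 16 GiB of global memory, `fill_shared_mem_props(16ull << 30, 64)` reports 65536 bytes for both shared-memory properties, instead of a per-CU value that matches the global memory size.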