Commit Graph

6428 Commits

Author SHA1 Message Date
Rahul Garg 2380cf7805 Bump version to 3.9
Change-Id: I5dfc997336fac42f3b6b0998a4bb212462de38f9
2020-08-24 21:52:22 -04:00
kjayapra-amd 9c75797d9f SWDEV-249499 - Add message to Guarantee assert code in hip runtime.
Change-Id: I1850463675bbc3e6ff5e3cef8211b1fd40e55f96
2020-08-24 17:35:39 -04:00
Jenkins 1d83fef5da Merge 'amd-master-next' into 'amd-npi-next'
Change-Id: I2121bcceb5d3321fdc4248de337101d628411ab1
2020-08-23 21:09:31 +00:00
Yaxun (Sam) Liu d537bb3859 Fix hipcc for extracting clang version
On teamcity clang --version returns something like

clang version 12.0.0
Target: x86_64-unknown-linux-gnu

Currently hipcc does not handle this correctly, because [^ ] also matches the
newline, so "Target:" is matched as well. \S ensures that only the version string is matched.

Change-Id: I3659d6daf89b709d2b001e6b8024d69d8b79efc0
2020-08-21 18:21:46 -04:00
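
The failure mode described in the commit above can be reproduced in a few lines. This is a minimal Python sketch, not hipcc's actual code: the commit does not show the full pattern, so `version ([^ ]+)` and `version (\S+)` are assumed patterns chosen only to contrast the two character classes.

```python
import re

# Sample output quoted in the commit message.
output = "clang version 12.0.0\nTarget: x86_64-unknown-linux-gnu\n"

# Assumed pattern: [^ ] excludes only the space character,
# so the match runs across the newline into the next line.
loose = re.search(r"version ([^ ]+)", output).group(1)

# Assumed pattern: \S excludes all whitespace, including the newline,
# so the match stops at the end of the version string.
strict = re.search(r"version (\S+)", output).group(1)

print(repr(loose))
print(repr(strict))
```

The loose match picks up the newline and the `Target:` label, while the strict match stops at `12.0.0`.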
Todd tiantuo Li 99eb486937 SWDEV-240803 - add hipFuncSetSharedMemConfig
Change-Id: I160b04677b3e7b99b3981ae7ecc84a0e3811d5e8
2020-08-20 18:18:24 -04:00
Jenkins d01cbc7220 Merge 'amd-master-next' into 'amd-npi-next'
Change-Id: I4d7ded0a1316a469f0880490e563c3e3f00cc970
2020-08-20 21:09:41 +00:00
Lakhan singh Thakur 858b7e027e [dtest] Additional tests for Memcpy
APIs tested: hipMemcpy, hipMemcpyAsync, hipMemcpyHtoD, hipMemcpyHtoDAsync
             hipMemcpyDtoH, hipMemcpyDtoHAsync, hipMemcpyDtoD,
             hipMemcpyDtoDAsync

-Scenario-1:: This test case covers all the negative test cases
              for the 8 hipMemcpy APIs.
-Scenario-2:: This test launches NUM_THREADS threads. Each thread in turn tests
              the 8 hipMemcpy APIs.
-Scenario-3:: This test case verifies the Memcpy APIs across a range of
              memory sizes, from the smallest one-unit transfer to 1GB.

SWDEV-238517 for enhancing hip unit tests

Change-Id: Iace19cc54e865fae8450448d6f37abed88597f19
2020-08-20 16:41:57 -04:00
Sudheer Kumar f648d2f2f5 [dtest] Enhanced tests for hipSetDevice and hipGetDevice
SWDEV-238517 for enhancing hip unit tests

Change-Id: I4fe169c560ce2e1fb471df67f03263903b12463b
2020-08-20 13:52:05 -04:00
Sourabh Betigeri cb2026c945 SWDEV-246104: Added new performance test to verify stream concurrency
Change-Id: Ieceaafd4d404b137936eb8aceecff27bbba42026
2020-08-20 13:42:55 -04:00
Aryan Salmanpour aafc12e7c1 [HipPerf] add a test for measuring create/copy/destroy for hip streams
SWDEV-245290/SWDEV-247767

Change-Id: Id52e3d2f7afee52763da107eb5ed04627942d9f0
2020-08-20 13:28:36 -04:00
Jenkins e2fdd12153 Merge 'amd-master-next' into 'amd-npi-next'
Change-Id: Ibbb31c0a6d1d92e96a6c9abed0d4d4eb0b357008
2020-08-18 09:09:47 +00:00
Todd tiantuo Li efcb882e19 move hipFuncSetCacheConfig from hip_device to hip_module
Change-Id: If5a930e1210e76531f5ba9a0e5f5ec98ad473a19
2020-08-17 17:39:01 -04:00
Aryan Salmanpour 4f400bc5e9 SWDEV-248057 - fix the calculation of allGridSize used in multi_grid_group() API
Change-Id: Ib470094e28dcacaa4769dc5c7ab08924f5b7fa41
2020-08-17 12:52:40 -04:00
Jatin Chaudhary 1340b3f07f Adding Anyorder flag to HIP
Change-Id: Ie20931541b3febe23fa9ac36ebc0c90de75a5f0a
2020-08-17 08:46:52 -04:00
Jenkins 6583e307ca Merge 'amd-master-next' into 'amd-npi-next'
Change-Id: I56b40b684a8be0dfbf330d0d1ed1ad703fde8417
2020-08-16 09:09:40 +00:00
Rahul Garg dd40abe96c Fix hipDispatchEnqueueRateMT sample
Change-Id: I7ade0dcf8b32ac90a311022d12339740b2b3df44
2020-08-14 18:56:39 -04:00
Todd tiantuo Li 9dfe15a843 SWDEV-240803 - add hipFuncSetAttribute and hipFuncAttribute
Change-Id: I3f4d67b19d89fd348fa5b884af4a2542ee4aba60
2020-08-14 17:39:29 -04:00
kjayapra-amd 6f8909be34 SWDEV-237658 - Do not attach events to user invisible Marker.
Change-Id: I3a74fd2614d6e2e1d51543471bbd5ae51a713a47
2020-08-14 10:55:34 -04:00
Jenkins d95ca630b5 Merge 'amd-master-next' into 'amd-npi-next'
Change-Id: I935fc8f681fad2df4e932407287a29a6a797351a
2020-08-14 09:09:52 +00:00
Freddy Paul 6e50c0f755 Enable developers to build hip without CMAKE_PREFIX_PATH
Recent changes in ROCclr to use the HSA cmake target require
HIP to have CMAKE_PREFIX_PATH set to the install locations of HSA,
amd_comgr, and llvm.

Users can provide CMAKE_PREFIX_PATH if the above
packages are built or installed at a custom location:
cmake -DCMAKE_PREFIX_PATH="<path_hsa> <path_amd_comgr> <path_llvm>"

For a default ROCm installation, adding a default path helps.

Change-Id: Iefa443dc198da76b57422a641c9821c63cde4922
2020-08-13 23:13:40 -04:00
Aryan Salmanpour 0d0efcdbf4 [HipPerf] relocating/renaming some hip perf tests
Change-Id: Ie85d242cd68cd14a858e07ed27875a5196014688
2020-08-13 17:28:11 -04:00
Aryan Salmanpour a8e7816726 [HipPerf] report performance based on wall time only for hipPerfDevMemReadSpeed/hipPerfDevMemWriteSpeed
Change-Id: I1fda2ec76da6fad6852d328e0a3fc39e28af57bb
2020-08-13 14:23:58 -04:00
Jason Tang e2169ce6d0 SWDEV-232197 - Remove references to gfxipVersion_
Change-Id: I91b6b972e26b18dabf6deda5a142bff491c8bb45
2020-08-12 16:34:18 -04:00
German Andryeyev 505877acea Allow prefetch on the null stream
Change-Id: I87e950460361fab65ab140988fc2288bcb194f08
2020-08-12 13:07:43 -04:00
Rahul Garg d620a25b64 Return error from hipMallocManaged for size = 0
Change-Id: Ida0cecf05e1df731176d41e59422d1e606cc9bd9
2020-08-10 18:25:31 +00:00
Vlad Sytchenko 844dab3b22 Don't align 2D image buffer size
CUDA allows creating 2D textures from memory with an unaligned size.
Aligning the size would cause us to overshoot the size of the underlying
allocation.

Change-Id: Ia20dec2105e36ca4a8448c9c6299d0001b388d25
2020-08-09 15:53:52 -04:00
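
The overshoot mentioned in the commit above is simple arithmetic. The sketch below assumes a hypothetical 256-byte alignment and a hypothetical 1000-byte allocation; neither value comes from HIP, and `align_up` is an illustrative helper, not a HIP function.

```python
def align_up(size: int, alignment: int) -> int:
    """Round size up to the next multiple of alignment."""
    return (size + alignment - 1) // alignment * alignment

# Hypothetical values chosen for illustration only.
allocation_size = 1000   # bytes actually backing the 2D texture
alignment = 256          # assumed alignment requirement

aligned_size = align_up(allocation_size, alignment)
overshoot = aligned_size - allocation_size

print(aligned_size, overshoot)
```

With these values the aligned size claims 24 bytes past the end of the real allocation, which is the situation the commit avoids by leaving the size unaligned.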
Julia jiang 9276429691 SWDEV-246878-Update document HIP_porting_guide
Change-Id: I7f4c92637f31da76293048133fff5af816c752ca
2020-08-07 14:50:01 -04:00
Aryan Salmanpour 19692d891f [HipPerf] add two subtests for measuring maximum device memory read/write speed
SWDEV-245290 / SWDEV-246220 [HIPPerf] Port OCLPerfDevMemWriteSpeed/OCLPerfDevMemReadSpeed into hip performance subtests

Change-Id: I5dc323c75cebbc17596dcb4ed9492e18c5246868
2020-08-07 14:21:27 -04:00
David Salinas 9a0f5fa32c correct -x option in hip::device
Change-Id: I8c288e678b4357019144a024395cda1fcead61fc
2020-08-07 12:33:43 -04:00
Aryan Salmanpour 0e7498071e [HipPerf] add a test for measuring shared memory read speed
SWDEV-245290/SWDEV-247330

Change-Id: If78fa0879ff58aab84775e412a86665c7e5959e7
2020-08-07 10:07:05 -04:00
Yaxun (Sam) Liu 0964d64071 Let hipcc not escape space between obj files extracted from library
hipcc extracts bundles from static libraries and passes them to clang. It should
not escape the spaces between objects extracted from a library.

Fixes: SWDEV-246544
Change-Id: Iad3902f28a43a3986917ee37a49f10382c329940
2020-08-07 09:34:27 -04:00
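
The effect of escaping the separator can be illustrated with Python's `shlex` module. This is a sketch of the general problem only; hipcc's own escaping logic is not shown in this log, and the object file names are hypothetical.

```python
import shlex

# Hypothetical object files extracted from a static library.
objects = ["lib-a.o", "lib-b.o"]

# Escaping the joined string makes the separating space part of one argument.
escaped_whole = shlex.quote(" ".join(objects))

# Escaping each path separately keeps the space as an argument boundary.
escaped_each = " ".join(shlex.quote(path) for path in objects)

# Split the strings the way a POSIX shell would.
args_whole = shlex.split(escaped_whole)
args_each = shlex.split(escaped_each)

print(args_whole)
print(args_each)
```

In the first case the compiler would receive a single, nonexistent file name; in the second it receives two separate object files.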
Jatin Chaudhary e1ddc75542 Bring .clang-format file in sync with ROCclr's
Change-Id: I1990722c0779d61f66d79a5d0649d1b64e42eaf7
2020-08-07 05:34:44 -04:00
Jatin Chaudhary 36b420ce68 Fixing the memchannel calculation
Change-Id: Ib5f5931cc0e1ee5b7cb356c2ede0cb951ee8c9de
2020-08-07 05:34:23 -04:00
Jatin Chaudhary 1e886f432d CUDA 11 changes: https://docs.nvidia.com/cuda/cuda-runtime-api/structcudaPointerAttributes.html#structcudaPointerAttributes
Change-Id: I8a5389b88df286043c365a734983a4c5de352102
2020-08-07 05:33:33 -04:00
Saleel Kudchadker 2ef062480b Enable queue profile only if we attach a profiler
Submit explicit profile marker for hipEventRecord to record
timestamps. Enable explicit signal profiling if the API specifies
start and stop events.

Toggle this with env var HIP_FORCE_QUEUE_PROFILING=0

Change-Id: Iae449a63ec3ebf6c2880e65d7b1dd1031a29018f
2020-08-06 11:28:16 -04:00
Payam 39d681b6be adding duration to print log for hip api calls
Change-Id: Icffa8fcacfb8ba24861d980b9553faeee4efec8c
2020-08-06 06:42:21 -04:00
Sarbojit Sarkar d768406ec4 Added free device memory info
Several OOM (out of memory) issues have surfaced recently. Adding a trace helps identify genuine OOM issues.
Change-Id: Ie2a56bba95cead24caf1af5c807e73c383c2f5d9
2020-08-06 01:16:36 -04:00
Anusha Godavarthy Surya 9ea54f7797 SWDEV-243961 Added overflow check for globalWorkSize
Change-Id: I153d99398eb4619baad8c0a52466e0d5d5f898ca
2020-08-05 11:18:07 -04:00
Aryan Salmanpour ee149e4610 [dtest] disable hipStreamCreateWithPriority test temporarily as it fails randomly on Jenkins
Change-Id: I4edcd555653ef8e7562245da3a6f310d60884fe0
2020-07-30 16:18:04 -04:00
jujiang 5f5486d4b1 SWDEV-242207-Update deprecated APIs
Change-Id: Icb64227be0eb44497de99240c0044a162ff3c085
2020-07-30 15:15:41 -04:00
Rahul Garg 3abb559f72 Support empty sources in add_executable and add_library
Change-Id: I1441f2c045aeda2bba99bffe1c99fcc4a59cea9b
2020-07-30 13:34:46 -04:00
Rahul Garg dd82a30348 Bump version to 3.8
Change-Id: I710ee1456aa108eb979eb5bef8f32081dda593d9
2020-07-30 13:30:55 -04:00
Jatin Chaudhary e7eaafe165 Fixing/Disabling tests on nvcc path
Change-Id: I7ed4a1246943867482ff595d5f98bb25bd79074b
2020-07-30 00:39:07 -04:00
Freddy Paul eb94afcff9 hip-rocclr:Remove dependency to rocm-utils
rocm-utils is a meta-package that pulls in many more packages than
hip requires.

As of today, the correct dependency is rocminfo.

Change-Id: I065ca74b4520e0c9ce183d82d6026554ff43e1f0
2020-07-28 11:42:09 -04:00
Jason Tang 678b3eeed5 SWDEV-243576 - Fix hipMemcpy regression.
If the queue device doesn't match the device on any memory object, use the queue device from the memory object.

Change-Id: I5fdcf00494f8391574f4443332c01788b8da44ef
2020-07-28 03:04:00 -04:00
Jatin Chaudhary c5264eaa67 Removing redundant .hipInfo Install
Change-Id: Id8f7a4668d8b6965a47a464c94c19bc1036f3537
2020-07-28 02:01:53 -04:00
Rahul Garg dc904abd8a Remove libhip_hcc symlinks to libamdhip64
Change-Id: I7b5b47fc7ca9760316c2424c16b454d7919c665c
2020-07-27 23:58:29 +00:00
jujiang 448008348e SWDEV-242344-hipMemcpyAll failed on MGPU
Change-Id: I01671e322c84213964d3d48bd6b5dc704dba731a
2020-07-27 13:30:30 -04:00
Tao Sang ef9176db03 Remove __gnu_h2f_ieee and __gnu_f2h_ieee
Change-Id: Ide24d245d851e20961020323e52f33322a33fff9
2020-07-23 14:50:24 -04:00
Vlad Sytchenko ed26013ec1 2D/3D copy optimizations
SWDEV-244798

If {src/dst} ptr is marked as hipMemoryTypeHost, check if the memory was
prepinned. If it was, upgrade the copy type to hipMemoryTypeDevice to
avoid extra pinning.

Change-Id: Id287ef5b14ae67dfbcf80c4caa1b08a311191948
2020-07-23 11:24:16 -04:00
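
The decision described in the commit above can be sketched as follows. This is an illustrative Python model, not the HIP runtime code: `MemoryType`, `effective_memory_type`, and the `pinned_ranges` bookkeeping are hypothetical names, and the real runtime tracks pinned memory internally in a way this log does not describe.

```python
from enum import Enum


class MemoryType(Enum):
    HOST = "hipMemoryTypeHost"
    DEVICE = "hipMemoryTypeDevice"


def effective_memory_type(ptr, declared, pinned_ranges):
    """Return the memory type to use for a copy.

    pinned_ranges is a hypothetical list of (start, size) tuples
    describing host ranges that were already pinned.
    """
    if declared is MemoryType.HOST:
        for start, size in pinned_ranges:
            if start <= ptr < start + size:
                # Already pinned: treat as device memory to avoid extra pinning.
                return MemoryType.DEVICE
    return declared


# Hypothetical pinned range: 4096 bytes starting at address 0x1000.
pinned = [(0x1000, 4096)]
upgraded = effective_memory_type(0x1010, MemoryType.HOST, pinned)
unchanged = effective_memory_type(0x9000, MemoryType.HOST, pinned)
```

A pointer inside the pinned range is upgraded, while a pointer outside it keeps its declared host type and would be pinned as usual.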