İşleme Grafiği

1481 İşleme

Yazar SHA1 Mesaj Tarih
Maneesh Gupta ee45f273fd 4_shfl and 5_2dshfl samples are unsupported on gfx701
Change-Id: I81eb880350f25e89573ba14c62b549c6c43f8c91


[ROCm/clr commit: f91583b294]
2017-03-14 15:56:18 +05:30
Maneesh Gupta f400aa48f5 default to gfx803 instead of fatbin if no arch specified
Change-Id: I83d56c6ede11c356d383b09d7eb3a5f08c8d8c84


[ROCm/clr commit: f32980847f]
2017-03-14 14:34:25 +05:30
Maneesh Gupta 59f1401f54 hipcc: Support targets specified via HCC_AMDGPU_TARGET
Change-Id: I69fda40d9f666325d377f4b4335e7ee693069214


[ROCm/clr commit: d29a097905]
2017-03-14 14:29:30 +05:30
Maneesh Gupta 5cf4c4e440 Add gfx900 support
Change-Id: I3be2fbdcb6d3fa776c4fe668586c67245a1323f2


[ROCm/clr commit: 1b92ae9917]
2017-03-14 13:51:38 +05:30
Maneesh Gupta 8d526e796e Merge branch 'amd-master' into amd-develop
[ROCm/clr commit: 0853c8ac2e]
2017-03-14 13:44:41 +05:30
Aditya Atluri 725f20ab1e make sure the inter-thread intrinsics are working post hawaii
Change-Id: I30ea5284c2160276f5bc0f937dfd386ca8640ce8


[ROCm/clr commit: af56898ea0]
2017-03-13 11:16:05 -05:00
Ben Sander 72b420bab4 Update hiphostregister test.
Move check to correct place.


[ROCm/clr commit: 9adbbd2980]
2017-03-12 09:51:33 -05:00
Ben Sander 49d7ea94f5 Refactor registered memory calls.
[ROCm/clr commit: b7acb85fa8]
2017-03-11 09:18:27 -06:00
Ben Sander e1c95b083d Add first step to a "registerd" mode in hipBusBandwidth.
[ROCm/clr commit: f23b5a1f90]
2017-03-11 09:18:27 -06:00
Ben Sander a0932fbcc3 Update hipHostRegister debug and pointerTracker debug and notes
[ROCm/clr commit: e43592721e]
2017-03-11 09:18:27 -06:00
Ben Sander c8595b0373 Fix copying of registered memory.
Set device properly so copying can recover context.
Enhance test to catch this case.


[ROCm/clr commit: 23a58775df]
2017-03-11 09:18:27 -06:00
Aditya Atluri c8a1b6bc30 fixed warning raised by g++ using hip_vector_types.h
Change-Id: I9e7cdfc8b28b03b690eecd068529cf7629296d68


[ROCm/clr commit: a47066153f]
2017-03-10 15:14:26 -06:00
Rahul Garg b2b8361763 IPC supported using ROCR APIs
Change-Id: I0a353b1240098f4b20fa266a871f5f5826290af9


[ROCm/clr commit: 3af487007b]
2017-03-10 23:45:28 +05:30
pensun a49f5c7e1d update porting guide for updated __HIP_DEVICE_COMPILE__
Change-Id: I0f025d354f76e2d728231bf112a77e8c8fcacc8c


[ROCm/clr commit: 1a2844e3a3]
2017-03-10 10:01:12 -06:00
Aditya Atluri a816a9eb54 Added architecture guards around __shfl, dpp and ds_permute device functions
Change-Id: I10f9b08618fbf25b61c1932278fc5759e41c0d66


[ROCm/clr commit: 046ec0375b]
2017-03-10 08:40:59 -06:00
Rahul Garg 4dd856eb67 Fix for HCSWAP-128, make 5_2dshfl cookbook sample only for fiji
Change-Id: I8869c28151bca1bd47a053a2808e93a801d16d00


[ROCm/clr commit: d48943699d]
2017-03-10 10:29:52 +05:30
Aditya Atluri 974cb587a8 make 4_shfl cookbook sample only for fiji
1. __shfl is not supported on hawaii gfx701

Change-Id: Iac09f5d30ee0674b8f58a6e74ec5c49b02be32ad


[ROCm/clr commit: 7f4b24886f]
2017-03-09 08:52:50 -06:00
pensun f4e9b51d35 fix typo in hip_porting_guide
Change-Id: I42553d9a4de2901dfdd57384b52a04e8fb22edde


[ROCm/clr commit: 308638c911]
2017-03-08 23:37:50 -06:00
pensun 3152223be6 add inline to all hip_complex operators
Change-Id: Ifba5966c297cbc9299c39ecfc45c7296003ebb5d


[ROCm/clr commit: 14a5d3c80d]
2017-03-08 14:06:13 -06:00
Ben Sander 0710dbde55 Fix bug in hipModuleGetFunction.
Modules with > 1 function didn't return the function correctly.
Also fix coding convention issues


[ROCm/clr commit: 09df0977c0]
2017-03-08 13:52:38 -06:00
Maneesh Gupta 27cf10c094 Disable hipMemPtrGetInfo test on nvcc path
Change-Id: I864e571314abfe5ae614e6792c86d7b457c920ee


[ROCm/clr commit: 8bd20732f9]
2017-03-08 16:16:08 +05:30
Aditya Atluri c0d91d1c3a fixed atan2f arguments
Change-Id: I0bb621e94d57594c3899e51d0c34ef43306cead0


[ROCm/clr commit: 2ea7c5d28a]
2017-03-07 14:06:03 -06:00
Aditya Atluri c861d10d1e Added new API, hipMemPtrGetInfo
1. This API returns memory allocation size of pointer
2. Added test to check its functionality

Change-Id: I87976d817b5a6ca5530336c09e7cb0420601cb2c


[ROCm/clr commit: 7b7d53f875]
2017-03-07 13:46:29 -06:00
Wen-Heng (Jack) Chung 9591c5a447 Changes to HIP to cope with Promote-free HCC
Squashed commit of the following:

commit c111b5bd10d7c2a5b0b1ad8b07f6e81185b47b39
Author: Wen-Heng (Jack) Chung <whchung@gmail.com>
Date:   Sat Mar 4 17:06:46 2017 +0800

    Use __device__ for all variables and functions to be used in kernel path

    Abolish __device and adopt [[hc]] in HIP implementation, so __device__ can be
    used on all HIP applications, no matter they are variables or functions.

    Change-Id: I20ca25857ce3bc3e42a5ebf65cafea2c8492f4c7

commit 30c0e4e4701bbf6bd9a7182e0320a71ff73d3a83
Author: Wen-Heng (Jack) Chung <whchung@gmail.com>
Date:   Thu Mar 2 12:14:11 2017 +0800

    XXX FIXME get around LDS spills caused in Promote-free HCC

    hipDynamicShared2 uses all 64KB of LDS for computation. But in Promote-free HCC
    there are cases where LDS spills would occur, which would make the test case to
    hang.

    In this workaround commit we reduce the size of dynamic LDS used to get around
    this known issue, and will revert this commit when LDS spills are resolved in
    HCC.

    Change-Id: If648b36200a4f9143951a8129192bcb7ed0bef5e

commit e803173be2d73e2f132a7ff7f61e7a20b4083d34
Author: Wen-Heng (Jack) Chung <whchung@gmail.com>
Date:   Wed Mar 1 21:41:41 2017 +0800

    Fix math functions which take pointer arguments

    Change-Id: I332c997e640edbc44824691e2a9434c6b3dadefa

commit de590c469e213c42090ff83dbd060f25bb1d6047
Author: Wen-Heng (Jack) Chung <whchung@gmail.com>
Date:   Wed Mar 1 18:38:54 2017 +0800

    Changes to cope with Promote-free HCC

    - abolish usage of address_space GNU attribute
    - use __device in file-scope global variables which would be accessed by GPU kernels
    - temporarily disable some math functions which take pointer arguments

    Change-Id: I730311dee848e20e763e35cd3980317fce0dce0d

Change-Id: I1f6b970b53b9401eeaaab08f04a7b9fed0fb8cf0


[ROCm/clr commit: 0005dd5f66]
2017-03-08 01:32:59 +08:00
Aditya Atluri 106030f2c6 added new field to hipDeviceProp_t structure gcnArch.
1. It is an integer containing gfx values 701, 801, 802, 803
2. On NV path, it is zero

Change-Id: I2b4c7f48981d0214d8c6b1905d2cc85b16203419


[ROCm/clr commit: 9f575721aa]
2017-03-07 11:24:32 -06:00
Maneesh Gupta 6cf5ff9372 Merge branch 'amd-develop' into amd-master
Change-Id: I393d1a8c7c5cdb2ca63098bf81ff9b98ea5e7b16


[ROCm/clr commit: 5bf174fc20]
2017-03-07 14:41:30 +05:30
Maneesh Gupta 928832a432 FindHIP: better handling for custom HCC_HOME
Change-Id: Ica267de11cde58d1e759cd1fd053b699649ea76a


[ROCm/clr commit: 90478d90d4]
2017-03-07 14:40:04 +05:30
Ben Sander 115631e210 Modify memcpy and memset to follow C/C++ standard:
- memcpy src is const
    - memset val is int


[ROCm/clr commit: 8e50134d31]
2017-03-06 16:38:22 -06:00
Rahul Garg d410e0f9ca Removed hsakmt headers
Change-Id: I4ffc95d5823489195ebc5638226b49ea2995f603


[ROCm/clr commit: 158cb58c36]
2017-03-06 22:37:05 +05:30
Siu Chi Chan f005cea396 fix hcc version detection in hipcc
Change-Id: I880be03ad67e99280a259369bfe25488bf53f0bd


[ROCm/clr commit: f9db1690c3]
2017-03-06 15:34:30 +05:30
Siu Chi Chan 02abf38a75 fix hcc version string extraction
Change-Id: Ie209b6deae55c779a577aaccb1bc21f969f69e14


[ROCm/clr commit: bb5df5b6c9]
2017-03-06 15:33:43 +05:30
Maneesh Gupta 1f6639bacc CMakeLists: Create test targets only if HIP install location is writable
Change-Id: I79f28884b0f117f2824ca8877c25b586bce62e5b


[ROCm/clr commit: 5df7b775f6]
2017-03-06 15:32:21 +05:30
Evgeny Mankov 0d33e087b3 [HIP] [DOC] Update CUDA_Runtime_API_functions_supported_by_HIP
+ update all Cuda 7.5 RT API enum values.


[ROCm/clr commit: b8dc082ffb]
2017-03-03 22:05:23 +03:00
Evgeny Mankov ec3679990d [HIPIFY] Sync HIPIFY with HIP by enum values.
+ all Cuda 7.5 RT API enum values are synced.
+ a few missing functions are added.
+ CONV_EXEC type is added for Execution control functions and data types.


[ROCm/clr commit: 4ba96241b4]
2017-03-03 21:59:05 +03:00
Evgeny Mankov 460971e93c [HIP] [FIX] Memcpy Async functions on nvcc path.
+ in hipMemcpyDtoDAsync: cuMemcpyDtoD -> cuMemcpyDtoDAsync
+ in hipMemcpyDtoHAsync: cuMemcpyDtoH -> cuMemcpyDtoHAsync

P.S.
"The types CUstream and cudaStream_t are identical and may be used interchangeably", thus explicit c-like type cast is not needed, aka CUstream(stream).


[ROCm/clr commit: 430856ca35]
2017-03-01 23:04:34 +03:00
Aditya Atluri b430c3be71 changed __half enabling from 4 to >3
Change-Id: Id974c6d5326e87a4c5941f831c5bb2747cdebd2d


[ROCm/clr commit: 37bc29e44b]
2017-02-28 17:13:29 -06:00
pensun 077e789db5 Define __HIPCC__ flag at compile time when using HIPCC on HCC path
Change-Id: I5e967e0e2327264d5d3b0ca705c2504fcd33d75e


[ROCm/clr commit: be95cc914f]
2017-02-28 16:20:48 -06:00
Evgeny Mankov 8c237b0837 [HIP] Add missing Device attribute on nvcc path.
+ missing cudaDevAttrComputeCapabilityMinor case as added for hipDeviceGetAttribute query for hipDeviceAttributeComputeCapabilityMinor.


[ROCm/clr commit: 47a27e6b17]
2017-02-28 18:40:13 +03:00
pensun e5bf4ec700 update hip_porting_guide regarding platform depended macros
Change-Id: I3029c6ae6cb280500bba294925ed6e9dc9dcc94d


[ROCm/clr commit: ae1c6ee68a]
2017-02-28 07:33:01 -06:00
Maneesh Gupta ee3f396aab packaging script changes for hip_hcc
Change-Id: I06cce6048204315a891e3a12638a3067644cfb43


[ROCm/clr commit: 1d5ce89b1b]
2017-02-28 12:02:53 +05:30
pensun 72f8c29529 remove extra spaces for hip_common.h platform defines
Change-Id: Ie0e39256abba307429985371671cde01f5ea2cc9


[ROCm/clr commit: c5ae8c423a]
2017-02-27 12:55:22 -06:00
Rahul Garg b4d190a445 Context management related changes in HIP.
-
-Contexts across threads are listed under device
-Device reset cleans up all contexts and re-initializes _primaryCtx

Change-Id: Ie1cfbb26d43a8dc6869be3e6ebaf7344ce374643


[ROCm/clr commit: c837b8d713]
2017-02-27 15:24:17 +05:30
Maneesh Gupta 37ab88ae1e Fix nvcc path samples that include math_functions.h
Change-Id: I94bb577b93983535178d8f0dcae57aaa72871534


[ROCm/clr commit: 528736cf58]
2017-02-27 13:20:05 +05:30
Maneesh Gupta 66e9900325 Disable some tests which are broken on nvcc path
Change-Id: I6f8df7687ff1798dc17f6c9b8a7f4cd029ce45d8


[ROCm/clr commit: 578dc68b0a]
2017-02-27 13:19:06 +05:30
Maneesh Gupta 26c65c826b Fix compilation of some broken tests on nvcc path
Change-Id: I92406af00c2de09d728e9b7f661c1f3873470560


[ROCm/clr commit: 40bcf5ed3b]
2017-02-27 13:17:51 +05:30
Maneesh Gupta 045c76a97d directed tests no longer run in an subdirectory
- target "make test" will no longer build and run tests. It will only run the tests.
- added new target "make check" which will build and run the tests.
- target "make check" will build tests serially. Use -j<N> to build tests in parallel.

Change-Id: I24c7932bf9798364a59f44631fbabcf9a5da5e17


[ROCm/clr commit: fe01dd74a2]
2017-02-27 13:14:08 +05:30
Maneesh Gupta f109c9e6da FindHIP: added new macro HIP_RESET_FLAGS
Change-Id: I0af491f6689abf1c1b5691261fe1f3e61a5d916d


[ROCm/clr commit: 58272083dd]
2017-02-27 13:11:11 +05:30
Aditya Atluri 78f1d54ad6 Added initial support for hipMemcpyFromSymbol. But not working!
Change-Id: I48d8c7de4ec9f85c6c942be995fb488a3931f5d7


[ROCm/clr commit: 7ac5017cb9]
2017-02-23 11:29:06 -06:00
Maneesh Gupta ae8fc88790 Merge branch 'amd-develop' into amd-master
Change-Id: I0e856db61fa4a50e190bd1d4c464ceb4a709b550


[ROCm/clr commit: 64db70756b]
2017-02-23 11:19:23 +05:30
Maneesh Gupta affb657bca Fix export interfaces in hip-config.cmake
Change-Id: Ifad4661ab17d7e6edb6ab300f1e92552ed917950


[ROCm/clr commit: d193187601]
2017-02-23 11:18:06 +05:30