Maneesh Gupta
822fc802a0
Enable promote free HCC by default
...
Change-Id: I5f82a8d958dd675a8a46d9d17458c71321daab7c
2017-04-03 11:18:30 +05:30
Sun, Peng
c82c84949c
Fix warpSize, for related issue in hipeigen and torch
...
Change-Id: Ic66b24923a363304dca189011869ba7a0a6f8895
2017-04-01 15:42:53 -05:00
Sun, Peng
b1ed910942
GGL update, fix for thread-safe access to streams (accelerator_views).
...
Change-Id: I6dd329a85b3ba7de23d52823febee0c53857a981
2017-04-01 14:50:39 -05:00
Aditya Atluri
1ef7222c3a
Fixed copyright and header names
...
Change-Id: Id595c65ea3b7289e87be4c42db5d8a31905a4fdd
2017-03-31 12:40:29 -05:00
Aditya Atluri
e0133e627d
fixed header names
...
Change-Id: I21650d6398187d3767b28e8ac81b2642d3b89a0e
2017-03-31 12:18:55 -05:00
Aditya Atluri
b9091ba818
added new api hipHccModuleLaunchKernel
...
1. hipHccModuleLaunchKernel is same as hipModuleLaunchKernel with OpenCL workitem model
2. Added copy right
3. Fixed header naming
Change-Id: I6a7c35a3566e2f8d3f5056613e34193775d4b236
2017-03-31 12:11:34 -05:00
Sun, Peng
071f19521c
remove extra GGL header info
...
Change-Id: I09f0d1b64a7a31eb2e926f19b69b7bafbacc7787
2017-03-30 18:16:56 -05:00
Sun, Peng
f4efa422bf
Rename hipLaunchKernelV3 to hipLaunchKernelGGL
...
Change-Id: I303daae006db41e9b04eb591e0b09b2717a7cf66
2017-03-30 18:10:17 -05:00
Sun, Peng
cfc2d455e1
Enable GGL as the default kernel launch method
...
Change-Id: I8022d126ee28ff7e4d9a96209e399d4243d39d8b
2017-03-30 13:36:46 -05:00
Sun, Peng
d067c884be
Update GGL to fix one Torch build issue
...
Change-Id: I95a2a335902e3c368ed29f075ac72eabbb64c97e
2017-03-29 09:03:21 -05:00
sunway513
43f76be76e
resolve GGL merge conflicts
...
Change-Id: I7a5ec3696cf2dd1a77b1686536a1cb84cbfed66e
2017-03-27 22:46:41 -05:00
Rahul Garg
1d18006ab4
Fix for MemcpyFromSymbol on HIP/NVCC path
...
Change-Id: Ice38307f72870ae468cbf0861e104f0fa46dfd56
2017-03-27 00:35:10 +05:30
Rahul Garg
ecc0e14cf7
Added support for Primary Context Management APIs
...
Change-Id: I70f91b4492e112dd8e12ecf511fdc18a27944a06
2017-03-26 23:45:54 +05:30
Sun, Peng
91274394dc
update GGL implementation to use hipLaunchKernel
...
Change-Id: Ibc08185c814bb07d54f3e68016b10eb7b9f2bf4b
2017-03-21 15:46:56 -05:00
Sun, Peng
ec04521617
Update GGL implementation to extended overload set for make_lambda_wrapper
...
Change-Id: I949f113671ddf155db8689e8a7f23d415839a7b5
2017-03-21 15:46:44 -05:00
Sun, Peng
329e2182d6
revert workaround for square sample and update doc on GGL
...
Change-Id: I731c68ca4111e7dc2e45bef51c4cad2c23fc81f8
2017-03-21 10:26:09 -05:00
Sun, Peng
d09afd23b8
merge Alex' GGL fix for non-specialized kernel function launch
...
Change-Id: Idbf7ca669c38ee5c0f654bcabdd1b498abb29f69
2017-03-20 16:34:24 -05:00
Aditya Atluri
d9f0bd25be
added support for lgammaf and lgamma
...
1. Implementation inside HIP
Change-Id: I657263b7276a57c56081d3336fef816b5f204eff
2017-03-17 18:26:10 -05:00
Sun, Peng
e7689e9e6e
Disable additional debug warning message
...
Change-Id: Ic5c374589bfad387a7c4c5346430a490e2c6e2a7
2017-03-17 15:03:03 -05:00
pensun
30d5f4ea10
Change the #define of GENERIC_GRID_LAUNCH to take valueat compilation, disable warning messages
...
Change-Id: Ic6c011529e26de359bcda1e7083727e7ee52887b
2017-03-17 14:59:34 -05:00
pensun
33c38de407
Initial integration with Alex' Generic Grid Launch
...
Change-Id: I559afb80e9e39ec0d119bb3bf3b85ef9e448caf6
2017-03-17 14:59:34 -05:00
Ben Sander
8cbe310870
Move USE_PROMOTE_FREE_HCC
2017-03-17 12:04:13 -05:00
Ben Sander
29232ff283
Add __device__ to needful functions for promote-free.
2017-03-17 11:19:48 -05:00
Ben Sander
e85c167171
Add USE_PROMOTE_FREE_HCC for smooth transition to new HCC caps.
...
ADDRESS_SPACE_1 defines
2017-03-17 11:04:39 -05:00
Ben Sander
ecd8179a71
Add simple device-side assert macro
...
Currently swallows asserts but will compile.
2017-03-17 10:25:56 -05:00
Maneesh Gupta
1337ed9159
Merge branch 'amd-master' into amd-develop
...
Change-Id: I01a77c34e11b1b02cb20e11b105741f6bd383543
2017-03-15 09:56:45 +05:30
Rahul Garg
913867fe6a
hipMemsetD8 support for HIP/NVCC path
...
Change-Id: I48eee8266afd7b45a12d5ce2c4849b687a006c0f
2017-03-14 23:49:21 +05:30
Rahul Garg
1aba3c4375
Added hipMemsetD8
...
Change-Id: I6a230a036c9c46c72a77d5f93c16ce8a00c3f837
2017-03-14 22:11:34 +05:30
Wen-Heng (Jack) Chung
77e21dc09f
Revert "Changes to HIP to cope with Promote-free HCC"
...
This reverts commit efb9b9e86c .
Change-Id: I20a9bab3883ad09913b320210344d37599cb8fcd
2017-03-14 22:59:27 +08:00
Maneesh Gupta
675dd1cedf
Merge branch 'amd-master' into amd-develop
2017-03-14 13:44:41 +05:30
Aditya Atluri
c8969811db
make sure the inter-thread intrinsics are working post hawaii
...
Change-Id: I30ea5284c2160276f5bc0f937dfd386ca8640ce8
2017-03-13 11:16:05 -05:00
Ben Sander
524e007db5
Refactor registered memory calls.
2017-03-11 09:18:27 -06:00
Aditya Atluri
e79dd9f9c6
fixed warning raised by g++ using hip_vector_types.h
...
Change-Id: I9e7cdfc8b28b03b690eecd068529cf7629296d68
2017-03-10 15:14:26 -06:00
Rahul Garg
32d8a58f18
IPC supported using ROCR APIs
...
Change-Id: I0a353b1240098f4b20fa266a871f5f5826290af9
2017-03-10 23:45:28 +05:30
pensun
37ed319a20
update porting guide for updated __HIP_DEVICE_COMPILE__
...
Change-Id: I0f025d354f76e2d728231bf112a77e8c8fcacc8c
2017-03-10 10:01:12 -06:00
Aditya Atluri
1567d20aa8
Added architecture guards around __shfl, dpp and ds_permute device functions
...
Change-Id: I10f9b08618fbf25b61c1932278fc5759e41c0d66
2017-03-10 08:40:59 -06:00
pensun
7488d8c7fa
add inline to all hip_complex operators
...
Change-Id: Ifba5966c297cbc9299c39ecfc45c7296003ebb5d
2017-03-08 14:06:13 -06:00
Aditya Atluri
1546732604
Added new API, hipMemPtrGetInfo
...
1. This API returns memory allocation size of pointer
2. Added test to check its functionality
Change-Id: I87976d817b5a6ca5530336c09e7cb0420601cb2c
2017-03-07 13:46:29 -06:00
Wen-Heng (Jack) Chung
efb9b9e86c
Changes to HIP to cope with Promote-free HCC
...
Squashed commit of the following:
commit c111b5bd10d7c2a5b0b1ad8b07f6e81185b47b39
Author: Wen-Heng (Jack) Chung <whchung@gmail.com >
Date: Sat Mar 4 17:06:46 2017 +0800
Use __device__ for all variables and functions to be used in kernel path
Abolish __device and adopt [[hc]] in HIP implementation, so __device__ can be
used on all HIP applications, no matter they are variables or functions.
Change-Id: I20ca25857ce3bc3e42a5ebf65cafea2c8492f4c7
commit 30c0e4e4701bbf6bd9a7182e0320a71ff73d3a83
Author: Wen-Heng (Jack) Chung <whchung@gmail.com >
Date: Thu Mar 2 12:14:11 2017 +0800
XXX FIXME get around LDS spills caused in Promote-free HCC
hipDynamicShared2 uses all 64KB of LDS for computation. But in Promote-free HCC
there are cases where LDS spills would occur, which would make the test case to
hang.
In this workaround commit we reduce the size of dynamic LDS used to get around
this known issue, and will revert this commit when LDS spills are resolved in
HCC.
Change-Id: If648b36200a4f9143951a8129192bcb7ed0bef5e
commit e803173be2d73e2f132a7ff7f61e7a20b4083d34
Author: Wen-Heng (Jack) Chung <whchung@gmail.com >
Date: Wed Mar 1 21:41:41 2017 +0800
Fix math functions which take pointer arguments
Change-Id: I332c997e640edbc44824691e2a9434c6b3dadefa
commit de590c469e213c42090ff83dbd060f25bb1d6047
Author: Wen-Heng (Jack) Chung <whchung@gmail.com >
Date: Wed Mar 1 18:38:54 2017 +0800
Changes to cope with Promote-free HCC
- abolish usage of address_space GNU attribute
- use __device in file-scope global variables which would be accessed by GPU kernels
- temporarily disable some math functions which take pointer arguments
Change-Id: I730311dee848e20e763e35cd3980317fce0dce0d
Change-Id: I1f6b970b53b9401eeaaab08f04a7b9fed0fb8cf0
2017-03-08 01:32:59 +08:00
Aditya Atluri
f86f3b3b33
added new field to hipDeviceProp_t structure gcnArch.
...
1. It is an integer containing gfx values 701, 801, 802, 803
2. On NV path, it is zero
Change-Id: I2b4c7f48981d0214d8c6b1905d2cc85b16203419
2017-03-07 11:24:32 -06:00
Ben Sander
d572e0616e
Modify memcpy and memset to follow C/C++ standard:
...
- memcpy src is const
- memset val is int
2017-03-06 16:38:22 -06:00
Evgeny Mankov
6421a1e79e
[HIP] [FIX] Memcpy Async functions on nvcc path.
...
+ in hipMemcpyDtoDAsync: cuMemcpyDtoD -> cuMemcpyDtoDAsync
+ in hipMemcpyDtoHAsync: cuMemcpyDtoH -> cuMemcpyDtoHAsync
P.S.
"The types CUstream and cudaStream_t are identical and may be used interchangeably", thus explicit c-like type cast is not needed, aka CUstream(stream).
2017-03-01 23:04:34 +03:00
Aditya Atluri
e7ccc995ee
changed __half enabling from 4 to >3
...
Change-Id: Id974c6d5326e87a4c5941f831c5bb2747cdebd2d
2017-02-28 17:13:29 -06:00
pensun
fd610e497b
Define __HIPCC__ flag at compile time when using HIPCC on HCC path
...
Change-Id: I5e967e0e2327264d5d3b0ca705c2504fcd33d75e
2017-02-28 16:20:48 -06:00
Evgeny Mankov
f9ad2dca7e
[HIP] Add missing Device attribute on nvcc path.
...
+ missing cudaDevAttrComputeCapabilityMinor case as added for hipDeviceGetAttribute query for hipDeviceAttributeComputeCapabilityMinor.
2017-02-28 18:40:13 +03:00
pensun
df9cbb6067
remove extra spaces for hip_common.h platform defines
...
Change-Id: Ie0e39256abba307429985371671cde01f5ea2cc9
2017-02-27 12:55:22 -06:00
Maneesh Gupta
4a6166cd86
Fix nvcc path samples that include math_functions.h
...
Change-Id: I94bb577b93983535178d8f0dcae57aaa72871534
2017-02-27 13:20:05 +05:30
Aditya Atluri
2e245ae58c
Added initial support for hipMemcpyFromSymbol. But not working!
...
Change-Id: I48d8c7de4ec9f85c6c942be995fb488a3931f5d7
2017-02-23 11:29:06 -06:00
Aditya Atluri
639fd4dd5e
added runtime api hipMemcpyFromSymbolAsync
...
Change-Id: Ibaf925faf0ba464dd0ed6c5ea74c224c2ce38889
2017-02-22 19:16:35 -06:00
Aditya Atluri
a1f3955860
added typedef for half and half2
...
Change-Id: Ic844fa31b64a0354484b418df71869c2807200cc
2017-02-21 14:44:37 -06:00