Ben Sander
8bd34535b4
Add integer abs (initial implementation, can be optimized with OCML)
...
Change-Id: I1f568c8c0e2333af1fda4c313dc48ea0c5b6ab00
2017-04-11 01:16:28 +00:00
Sun, Peng
3c04722d10
Fix ifndef guard in hip_fp16.h
...
Change-Id: I0215556e7aa98a74e8a984e4de3fb6e8cafdfb24
2017-04-10 11:17:05 -05:00
Sun, Peng
bfa61eac2c
add math.h to cover sqrtf function
...
Change-Id: Ia37752710cea4ca77e0a4e61f8e69a0355d9488d
2017-04-10 08:53:12 -05:00
Sun, Peng
9884f2e83f
Update the define of __global__ for GGL
...
Change-Id: I563bb2a132403bcbe9e9f279b55406cf0255af7d
2017-04-07 16:39:33 -05:00
Sun, Peng
19987ede22
Add more operator overloading for float2 type, contributed by Aditya
...
Change-Id: If1ab7fb24d64bb5304142aed0951c9bd5ad47d20
2017-04-07 14:17:41 -05:00
Sun, Peng
f699e02715
update GGL to log launched kernel information
...
Change-Id: Ied0aa6055673c687071b4a579aecd17f0f3f09ce
2017-04-07 14:06:31 -05:00
Ben Sander
e9eaadd135
Finish adding start/stop event support to hipHccModuleLaunchKernel.
...
Change interface to use hipEvent_t rather than hipEvent_t*
Change-Id: I259062dc087a13d51dc27f84e1e8861f332a104d
2017-04-06 21:02:50 -05:00
sunway513
fb4e2307f1
Improve documentation for hipModuleLaunch functions.
...
Change-Id: I0e22621e499775740c3301347b7416d5f98c2414
2017-04-06 21:02:50 -05:00
Aditya Atluri
1358bd8f07
fixed header structure for complex data types
...
Change-Id: I16bf19005d933f42e8c8603c5d0b2df8ea3ad04f
2017-04-06 16:43:26 -05:00
Sun, Peng
49a38da6e3
GGL update, add while 0 guard for hipLaunchKernel API
...
Change-Id: Ie48ef8ca2ab5e26a51febfcd92417902c33fbf66
2017-04-06 10:48:11 -05:00
Aditya Atluri
42739c37ef
added __host__ for complex functions and corrected memset and memcpy test
...
Change-Id: I9ffefb7a0025aa111a54d20d2766982df15532e7
2017-04-06 09:29:44 -05:00
Sun, Peng
335e107f5f
add extra guard to grid_launch_GGL header
...
Change-Id: I120619c08ea2d084804fcb1639efbe6c4648dde9
2017-04-05 22:25:41 -05:00
Sun, Peng
2ca9eec9fc
Add more include header file for GGL, to make it self-compilable
...
Change-Id: I833cb194784450fb86e7961a7f9fe196ce3c7da5
2017-04-03 11:16:35 -05:00
Maneesh Gupta
cea13c34c3
Enable promote free HCC by default
...
Change-Id: I5f82a8d958dd675a8a46d9d17458c71321daab7c
2017-04-03 11:18:30 +05:30
Sun, Peng
a7d7797781
Fix warpSize, for related issue in hipeigen and torch
...
Change-Id: Ic66b24923a363304dca189011869ba7a0a6f8895
2017-04-01 15:42:53 -05:00
Sun, Peng
df4cee1a57
GGL update, fix for thread-safe access to streams (accelerator_views).
...
Change-Id: I6dd329a85b3ba7de23d52823febee0c53857a981
2017-04-01 14:50:39 -05:00
Aditya Atluri
3859f59189
Fixed copyright and header names
...
Change-Id: Id595c65ea3b7289e87be4c42db5d8a31905a4fdd
2017-03-31 12:40:29 -05:00
Aditya Atluri
614e7db513
fixed header names
...
Change-Id: I21650d6398187d3767b28e8ac81b2642d3b89a0e
2017-03-31 12:18:55 -05:00
Aditya Atluri
1cead6a4cd
added new api hipHccModuleLaunchKernel
...
1. hipHccModuleLaunchKernel is same as hipModuleLaunchKernel with OpenCL workitem model
2. Added copy right
3. Fixed header naming
Change-Id: I6a7c35a3566e2f8d3f5056613e34193775d4b236
2017-03-31 12:11:34 -05:00
Sun, Peng
e3a4547703
remove extra GGL header info
...
Change-Id: I09f0d1b64a7a31eb2e926f19b69b7bafbacc7787
2017-03-30 18:16:56 -05:00
Sun, Peng
9983fdb9e2
Rename hipLaunchKernelV3 to hipLaunchKernelGGL
...
Change-Id: I303daae006db41e9b04eb591e0b09b2717a7cf66
2017-03-30 18:10:17 -05:00
Sun, Peng
c081099de1
Enable GGL as the default kernel launch method
...
Change-Id: I8022d126ee28ff7e4d9a96209e399d4243d39d8b
2017-03-30 13:36:46 -05:00
Sun, Peng
a32f50b5c7
Update GGL to fix one Torch build issue
...
Change-Id: I95a2a335902e3c368ed29f075ac72eabbb64c97e
2017-03-29 09:03:21 -05:00
sunway513
6f38d0f55e
resolve GGL merge conflicts
...
Change-Id: I7a5ec3696cf2dd1a77b1686536a1cb84cbfed66e
2017-03-27 22:46:41 -05:00
Rahul Garg
41f0ebebf0
Fix for MemcpyFromSymbol on HIP/NVCC path
...
Change-Id: Ice38307f72870ae468cbf0861e104f0fa46dfd56
2017-03-27 00:35:10 +05:30
Rahul Garg
33a8cdc6d3
Added support for Primary Context Management APIs
...
Change-Id: I70f91b4492e112dd8e12ecf511fdc18a27944a06
2017-03-26 23:45:54 +05:30
Sun, Peng
4ca09e977d
update GGL implementation to use hipLaunchKernel
...
Change-Id: Ibc08185c814bb07d54f3e68016b10eb7b9f2bf4b
2017-03-21 15:46:56 -05:00
Sun, Peng
6b9be30b70
Update GGL implementation to extended overload set for make_lambda_wrapper
...
Change-Id: I949f113671ddf155db8689e8a7f23d415839a7b5
2017-03-21 15:46:44 -05:00
Sun, Peng
14f8c7cf74
revert workaround for square sample and update doc on GGL
...
Change-Id: I731c68ca4111e7dc2e45bef51c4cad2c23fc81f8
2017-03-21 10:26:09 -05:00
Sun, Peng
e21d4f03f1
merge Alex' GGL fix for non-specialized kernel function launch
...
Change-Id: Idbf7ca669c38ee5c0f654bcabdd1b498abb29f69
2017-03-20 16:34:24 -05:00
Aditya Atluri
52859a8a40
added support for lgammaf and lgamma
...
1. Implementation inside HIP
Change-Id: I657263b7276a57c56081d3336fef816b5f204eff
2017-03-17 18:26:10 -05:00
Sun, Peng
2585812fba
Disable additional debug warning message
...
Change-Id: Ic5c374589bfad387a7c4c5346430a490e2c6e2a7
2017-03-17 15:03:03 -05:00
pensun
faf0fbebe9
Change the #define of GENERIC_GRID_LAUNCH to take valueat compilation, disable warning messages
...
Change-Id: Ic6c011529e26de359bcda1e7083727e7ee52887b
2017-03-17 14:59:34 -05:00
pensun
ad882222b0
Initial integration with Alex' Generic Grid Launch
...
Change-Id: I559afb80e9e39ec0d119bb3bf3b85ef9e448caf6
2017-03-17 14:59:34 -05:00
Ben Sander
20ea5e179f
Move USE_PROMOTE_FREE_HCC
2017-03-17 12:04:13 -05:00
Ben Sander
92fd4c8f10
Add __device__ to needful functions for promote-free.
2017-03-17 11:19:48 -05:00
Ben Sander
f4256cb57c
Add USE_PROMOTE_FREE_HCC for smooth transition to new HCC caps.
...
ADDRESS_SPACE_1 defines
2017-03-17 11:04:39 -05:00
Ben Sander
c9f64bbd2d
Add simple device-side assert macro
...
Currently swallows asserts but will compile.
2017-03-17 10:25:56 -05:00
Maneesh Gupta
e3aa3d7ba6
Merge branch 'amd-master' into amd-develop
...
Change-Id: I01a77c34e11b1b02cb20e11b105741f6bd383543
2017-03-15 09:56:45 +05:30
Rahul Garg
09d1a46139
hipMemsetD8 support for HIP/NVCC path
...
Change-Id: I48eee8266afd7b45a12d5ce2c4849b687a006c0f
2017-03-14 23:49:21 +05:30
Rahul Garg
dccf9e2aa9
Added hipMemsetD8
...
Change-Id: I6a230a036c9c46c72a77d5f93c16ce8a00c3f837
2017-03-14 22:11:34 +05:30
Wen-Heng (Jack) Chung
17af837120
Revert "Changes to HIP to cope with Promote-free HCC"
...
This reverts commit 0005dd5f66 .
Change-Id: I20a9bab3883ad09913b320210344d37599cb8fcd
2017-03-14 22:59:27 +08:00
Maneesh Gupta
0853c8ac2e
Merge branch 'amd-master' into amd-develop
2017-03-14 13:44:41 +05:30
Aditya Atluri
af56898ea0
make sure the inter-thread intrinsics are working post hawaii
...
Change-Id: I30ea5284c2160276f5bc0f937dfd386ca8640ce8
2017-03-13 11:16:05 -05:00
Ben Sander
b7acb85fa8
Refactor registered memory calls.
2017-03-11 09:18:27 -06:00
Aditya Atluri
a47066153f
fixed warning raised by g++ using hip_vector_types.h
...
Change-Id: I9e7cdfc8b28b03b690eecd068529cf7629296d68
2017-03-10 15:14:26 -06:00
Rahul Garg
3af487007b
IPC supported using ROCR APIs
...
Change-Id: I0a353b1240098f4b20fa266a871f5f5826290af9
2017-03-10 23:45:28 +05:30
pensun
1a2844e3a3
update porting guide for updated __HIP_DEVICE_COMPILE__
...
Change-Id: I0f025d354f76e2d728231bf112a77e8c8fcacc8c
2017-03-10 10:01:12 -06:00
Aditya Atluri
046ec0375b
Added architecture guards around __shfl, dpp and ds_permute device functions
...
Change-Id: I10f9b08618fbf25b61c1932278fc5759e41c0d66
2017-03-10 08:40:59 -06:00
pensun
14a5d3c80d
add inline to all hip_complex operators
...
Change-Id: Ifba5966c297cbc9299c39ecfc45c7296003ebb5d
2017-03-08 14:06:13 -06:00