Ben Sander
a55ce5bee4
Add initial HIP_SYNC_NULL_STREAM=0 mode.
...
This eliminates host-synchronization for null stream. Instead, the
null-stream uses GPU-side events to wait for other streams.
Default is OFF pending additional testing.
Add enhanced null-stream test.
Also refine HIP_TRACE_API.
[ROCm/hip commit: 27877f8854 ]
2017-05-16 19:04:25 -05:00
Rahul Garg
c6a2d65d5e
Added hipMallocPitch on HIP/NVCC path
...
Change-Id: Ie3ba7d3f95acac23805efa919531043b350a3f21
[ROCm/hip commit: 1223612331 ]
2017-05-16 07:15:13 +05:30
Ben Sander
b1b71c4c75
Add hipEventDisableSystemRelease flag.
...
[ROCm/hip commit: b306095ac2 ]
2017-05-12 16:06:03 -05:00
Aditya Atluri
bdce32238f
added guard against hip_runtime.h so that non-hcc compilers can use it
...
Change-Id: I3d68deda9ce8a5956e21e15a69e549d6c21e3e39
[ROCm/hip commit: a38e36ec2f ]
2017-05-09 10:14:16 -05:00
Rahul Garg
0557f54200
hipMemcpy2DAsync for HIP/NVCC
...
Change-Id: I46f0057fef49bdaaac41c1df80c3e27432b8f376
[ROCm/hip commit: 1cb51d614e ]
2017-05-04 06:47:55 +05:30
Rahul Garg
340cd36e8d
Added support for hipMemcpy2DAsync in HIP/HCC
...
Change-Id: Ia4a8306f2dc1e33a81a7195ec29aef652fcccc4b
[ROCm/hip commit: ec27c695c4 ]
2017-05-03 22:29:12 +05:30
Aditya Atluri
eb68f5d5af
fixed hipFuncSetCacheConfig on rocm path
...
Change-Id: I937a3afbf115edc94a753a0beb2230ed60a6f021
[ROCm/hip commit: a5cb2d40ec ]
2017-04-28 11:53:11 -05:00
Aditya Atluri
a2c405c265
added hipFuncSetCacheConfig API for nvcc path
...
Change-Id: I87fae35bc0e10a0dca5ae1c5015fe5d9e52a1d0d
[ROCm/hip commit: fc6248ce82 ]
2017-04-26 18:56:57 -05:00
Sun, Peng
bbcd4b3761
fix hip_complex.h header on NV path
...
Change-Id: Ia95d003ca1b284bab1c76723050e6b3b89178f65
[ROCm/hip commit: 54561c8af3 ]
2017-04-25 00:15:20 -05:00
Ben Sander
ba2f2f56b8
Fix hipMalloc to return error code if allocation fails.
...
[ROCm/hip commit: 59df70662a ]
2017-04-24 22:30:54 -05:00
Aditya Atluri
c5840d81ad
changed arguments for hipPointerGetAttributes
...
Change-Id: Ia7a7c4722c1f7d0a23f0e5cc3dd6dea6c01c1fd8
[ROCm/hip commit: 72bcfa438b ]
2017-04-24 15:24:16 -05:00
Sun, Peng
f762edf621
add support of hipLaunchKernelGGL on NV path
...
Change-Id: I0aeafd80c2181873be385d985f1d8ed86a98d136
[ROCm/hip commit: f4287a29b2 ]
2017-04-19 11:00:25 -05:00
Sun, Peng
2517984a65
fix broken header in NV path
...
Change-Id: Ia3aff2a89d9ba49547f51ce03a3304dfab58ba25
[ROCm/hip commit: fa15ee3ccd ]
2017-04-19 10:47:40 -05:00
Ben Sander
b716a7781e
Add integer abs (initial implementation, can be optimized with OCML)
...
Change-Id: I1f568c8c0e2333af1fda4c313dc48ea0c5b6ab00
[ROCm/hip commit: 22f3b91cad ]
2017-04-11 01:16:28 +00:00
Sun, Peng
da363cb59d
Fix ifndef guard in hip_fp16.h
...
Change-Id: I0215556e7aa98a74e8a984e4de3fb6e8cafdfb24
[ROCm/hip commit: 71447dacad ]
2017-04-10 11:17:05 -05:00
Sun, Peng
28f1993c9e
add math.h to cover sqrtf function
...
Change-Id: Ia37752710cea4ca77e0a4e61f8e69a0355d9488d
[ROCm/hip commit: 2848d7a222 ]
2017-04-10 08:53:12 -05:00
Sun, Peng
b860a40161
Update the define of __global__ for GGL
...
Change-Id: I563bb2a132403bcbe9e9f279b55406cf0255af7d
[ROCm/hip commit: a9fd0d4e0d ]
2017-04-07 16:39:33 -05:00
Sun, Peng
2841eff506
Add more operator overloading for float2 type, contributed by Aditya
...
Change-Id: If1ab7fb24d64bb5304142aed0951c9bd5ad47d20
[ROCm/hip commit: a5ca430e5c ]
2017-04-07 14:17:41 -05:00
Sun, Peng
b4dbf01cdb
update GGL to log launched kernel information
...
Change-Id: Ied0aa6055673c687071b4a579aecd17f0f3f09ce
[ROCm/hip commit: 6d4af1ab1f ]
2017-04-07 14:06:31 -05:00
Ben Sander
cbfc58660e
Finish adding start/stop event support to hipHccModuleLaunchKernel.
...
Change interface to use hipEvent_t rather than hipEvent_t*
Change-Id: I259062dc087a13d51dc27f84e1e8861f332a104d
[ROCm/hip commit: 6656d33f75 ]
2017-04-06 21:02:50 -05:00
sunway513
7502166e5a
Improve documentation for hipModuleLaunch functions.
...
Change-Id: I0e22621e499775740c3301347b7416d5f98c2414
[ROCm/hip commit: 5995965558 ]
2017-04-06 21:02:50 -05:00
Aditya Atluri
6c7450d414
fixed header structure for complex data types
...
Change-Id: I16bf19005d933f42e8c8603c5d0b2df8ea3ad04f
[ROCm/hip commit: 0a07382d92 ]
2017-04-06 16:43:26 -05:00
Sun, Peng
9d1a5f886e
GGL update, add do-while(0) guard for hipLaunchKernel API
...
Change-Id: Ie48ef8ca2ab5e26a51febfcd92417902c33fbf66
[ROCm/hip commit: ad18bf3b29 ]
2017-04-06 10:48:11 -05:00
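The "while 0 guard" named in the commit above is a common macro idiom. The sketch below shows the general technique only; it is not the macro from this commit. `RECORD_AND_BUMP` and `count_evens` are hypothetical names invented for illustration. Wrapping a multi-statement macro body in `do { ... } while (0)` makes the expansion a single statement, so it can sit in an unbraced `if`/`else` followed by a semicolon.

```cpp
#include <cassert>

// Hypothetical macro (not HIP's): two statements wrapped in do { } while (0)
// so that the expansion behaves as exactly one statement.
#define RECORD_AND_BUMP(counter, last, value) \
    do {                                      \
        (last) = (value);                     \
        ++(counter);                          \
    } while (0)

// Counts the even values in [0, n) and remembers the last one seen.
inline int count_evens(int n, int& last_even) {
    int count = 0;
    for (int i = 0; i < n; ++i) {
        if (i % 2 == 0)
            RECORD_AND_BUMP(count, last_even, i);  // one statement, so the else below still pairs with this if
        else
            continue;
    }
    return count;
}
```

If the macro body were a bare `{ ... }` block instead, the trailing semicolon after the macro call would end the `if` statement and the `else` would fail to compile.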
Aditya Atluri
26f6ce992d
added __host__ for complex functions and corrected memset and memcpy test
...
Change-Id: I9ffefb7a0025aa111a54d20d2766982df15532e7
[ROCm/hip commit: 6dff2714e9 ]
2017-04-06 09:29:44 -05:00
Sun, Peng
72238e2119
add extra guard to grid_launch_GGL header
...
Change-Id: I120619c08ea2d084804fcb1639efbe6c4648dde9
[ROCm/hip commit: 1a8ea82497 ]
2017-04-05 22:25:41 -05:00
Sun, Peng
5c46c8920c
Add more header includes for GGL so that it compiles standalone
...
Change-Id: I833cb194784450fb86e7961a7f9fe196ce3c7da5
[ROCm/hip commit: a92cdbaf29 ]
2017-04-03 11:16:35 -05:00
Maneesh Gupta
c8214a5b20
Enable promote-free HCC by default
...
Change-Id: I5f82a8d958dd675a8a46d9d17458c71321daab7c
[ROCm/hip commit: 822fc802a0 ]
2017-04-03 11:18:30 +05:30
Sun, Peng
dd0aa2c9e3
Fix warpSize, for related issue in hipeigen and torch
...
Change-Id: Ic66b24923a363304dca189011869ba7a0a6f8895
[ROCm/hip commit: c82c84949c ]
2017-04-01 15:42:53 -05:00
Sun, Peng
7559427041
GGL update, fix for thread-safe access to streams (accelerator_views).
...
Change-Id: I6dd329a85b3ba7de23d52823febee0c53857a981
[ROCm/hip commit: b1ed910942 ]
2017-04-01 14:50:39 -05:00
Aditya Atluri
fcb4331a6a
Fixed copyright and header names
...
Change-Id: Id595c65ea3b7289e87be4c42db5d8a31905a4fdd
[ROCm/hip commit: 1ef7222c3a ]
2017-03-31 12:40:29 -05:00
Aditya Atluri
7635e0a57e
fixed header names
...
Change-Id: I21650d6398187d3767b28e8ac81b2642d3b89a0e
[ROCm/hip commit: e0133e627d ]
2017-03-31 12:18:55 -05:00
Aditya Atluri
ccf799c453
added new api hipHccModuleLaunchKernel
...
1. hipHccModuleLaunchKernel is the same as hipModuleLaunchKernel but uses the OpenCL work-item model
2. Added copyright notice
3. Fixed header naming
Change-Id: I6a7c35a3566e2f8d3f5056613e34193775d4b236
[ROCm/hip commit: b9091ba818 ]
2017-03-31 12:11:34 -05:00
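To illustrate the difference between the two launch models mentioned in the commit above: a CUDA-style launch gives the grid size in blocks, while the OpenCL work-item model gives the total number of work-items per dimension. The sketch below is a host-only conversion under that assumption. `Dim3` and `to_global_work_size` are hypothetical names, not part of the HIP API, and the code does not call HIP.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for a 3-component launch dimension.
struct Dim3 {
    uint32_t x, y, z;
};

// CUDA-style sizes (grid in blocks, block in threads) converted to an
// OpenCL-style global work size (total work-items per dimension).
// Assumption: the local work size equals the block size unchanged.
inline Dim3 to_global_work_size(Dim3 grid, Dim3 block) {
    return Dim3{grid.x * block.x, grid.y * block.y, grid.z * block.z};
}
```

For example, a grid of 4 x 2 x 1 blocks with 256 x 1 x 1 threads per block would correspond to a global work size of 1024 x 2 x 1 with a local work size of 256 x 1 x 1.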
Sun, Peng
799a81029c
remove extra GGL header info
...
Change-Id: I09f0d1b64a7a31eb2e926f19b69b7bafbacc7787
[ROCm/hip commit: 071f19521c ]
2017-03-30 18:16:56 -05:00
Sun, Peng
c9dc0cf010
Rename hipLaunchKernelV3 to hipLaunchKernelGGL
...
Change-Id: I303daae006db41e9b04eb591e0b09b2717a7cf66
[ROCm/hip commit: f4efa422bf ]
2017-03-30 18:10:17 -05:00
Sun, Peng
7a0c077800
Enable GGL as the default kernel launch method
...
Change-Id: I8022d126ee28ff7e4d9a96209e399d4243d39d8b
[ROCm/hip commit: cfc2d455e1 ]
2017-03-30 13:36:46 -05:00
Sun, Peng
381a67f256
Update GGL to fix one Torch build issue
...
Change-Id: I95a2a335902e3c368ed29f075ac72eabbb64c97e
[ROCm/hip commit: d067c884be ]
2017-03-29 09:03:21 -05:00
sunway513
6e01fb345b
resolve GGL merge conflicts
...
Change-Id: I7a5ec3696cf2dd1a77b1686536a1cb84cbfed66e
[ROCm/hip commit: 43f76be76e ]
2017-03-27 22:46:41 -05:00
Rahul Garg
be8d979d08
Fix for MemcpyFromSymbol on HIP/NVCC path
...
Change-Id: Ice38307f72870ae468cbf0861e104f0fa46dfd56
[ROCm/hip commit: 1d18006ab4 ]
2017-03-27 00:35:10 +05:30
Rahul Garg
397088203d
Added support for Primary Context Management APIs
...
Change-Id: I70f91b4492e112dd8e12ecf511fdc18a27944a06
[ROCm/hip commit: ecc0e14cf7 ]
2017-03-26 23:45:54 +05:30
Sun, Peng
0215445fa0
update GGL implementation to use hipLaunchKernel
...
Change-Id: Ibc08185c814bb07d54f3e68016b10eb7b9f2bf4b
[ROCm/hip commit: 91274394dc ]
2017-03-21 15:46:56 -05:00
Sun, Peng
38feccc1d2
Update GGL implementation with an extended overload set for make_lambda_wrapper
...
Change-Id: I949f113671ddf155db8689e8a7f23d415839a7b5
[ROCm/hip commit: ec04521617 ]
2017-03-21 15:46:44 -05:00
Sun, Peng
c5b6222d62
revert workaround for square sample and update doc on GGL
...
Change-Id: I731c68ca4111e7dc2e45bef51c4cad2c23fc81f8
[ROCm/hip commit: 329e2182d6 ]
2017-03-21 10:26:09 -05:00
Sun, Peng
b5e6670807
merge Alex's GGL fix for non-specialized kernel function launch
...
Change-Id: Idbf7ca669c38ee5c0f654bcabdd1b498abb29f69
[ROCm/hip commit: d09afd23b8 ]
2017-03-20 16:34:24 -05:00
Aditya Atluri
5d1ae81def
added support for lgammaf and lgamma
...
1. Implementation inside HIP
Change-Id: I657263b7276a57c56081d3336fef816b5f204eff
[ROCm/hip commit: d9f0bd25be ]
2017-03-17 18:26:10 -05:00
Sun, Peng
90e79afc24
Disable additional debug warning message
...
Change-Id: Ic5c374589bfad387a7c4c5346430a490e2c6e2a7
[ROCm/hip commit: e7689e9e6e ]
2017-03-17 15:03:03 -05:00
pensun
81622d1263
Change the #define of GENERIC_GRID_LAUNCH to take its value at compilation, disable warning messages
...
Change-Id: Ic6c011529e26de359bcda1e7083727e7ee52887b
[ROCm/hip commit: 30d5f4ea10 ]
2017-03-17 14:59:34 -05:00
pensun
762090f564
Initial integration with Alex's Generic Grid Launch
...
Change-Id: I559afb80e9e39ec0d119bb3bf3b85ef9e448caf6
[ROCm/hip commit: 33c38de407 ]
2017-03-17 14:59:34 -05:00
Ben Sander
e10e2bd267
Move USE_PROMOTE_FREE_HCC
...
[ROCm/hip commit: 8cbe310870 ]
2017-03-17 12:04:13 -05:00
Ben Sander
8d5c39fd52
Add __device__ to the functions that need it for promote-free.
...
[ROCm/hip commit: 29232ff283 ]
2017-03-17 11:19:48 -05:00
Ben Sander
a9ec705ee4
Add USE_PROMOTE_FREE_HCC for smooth transition to new HCC caps.
...
ADDRESS_SPACE_1 defines
[ROCm/hip commit: e85c167171 ]
2017-03-17 11:04:39 -05:00