Ben Sander
75f691ec2f
Add hipHostMallocCoherent, hipHostMallocNonCoherent
...
Provide per-allocation control over coherent/non-coherent mem.
These overrid the default HIP_COHERENT_HOST_ALLOC setting.
2017-05-24 00:48:10 -05:00
pensun
6d0f58b939
fix GGL helper header file, reorder for C++17
...
Change-Id: I3d9ddfe670bf7e3e8e7bd85e52cc61f48c19c213
2017-05-22 08:52:43 -05:00
Evgeny Mankov
270f643c9c
[HIP] [HIPIFY] [FIX] cuModuleLoadDataEx -> hipModuleLoadDataEx
...
https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/81
1. Do not use JIT options on HCC path, call hipModuleLoadData instead.
2. NVCC path is unchanged, to call cuModuleLoadDataEx with all options.
3. Get rid of manual hipification, based on #ifdef #else for NVCC/HIP.
4. Update documentation accordingly.
2017-05-19 17:22:14 +03:00
Aditya Atluri
bdc08fcf10
fixed vector type issues by reverting to old code, changed __half2 to map to vector types in llvm
...
Change-Id: I7317408c25e8c1a0c02a346042c9137e160c8bbd
2017-05-18 10:51:07 -05:00
Aditya Atluri
9dceccf136
changed vector types to make sure it generate proper llvm vector types
...
Change-Id: I6c4616dae137dc4eac35e5827dc5b7f3251e0247
2017-05-16 21:35:40 -05:00
Ben Sander
27877f8854
Add initial HIP_SYNC_NULL_STREAM=0 mode.
...
This eliminates host-synchronization for null stream. Instead, the
null-stream uses GPU-side events to wait for other streams.
Default is OFF pending additional testing.
Add enhanced null-stream test.
Also refine HIP_TRACE_API.
2017-05-16 19:04:25 -05:00
Rahul Garg
1223612331
Added hipMallocPitch on HIP/NVCC path
...
Change-Id: Ie3ba7d3f95acac23805efa919531043b350a3f21
2017-05-16 07:15:13 +05:30
Ben Sander
b306095ac2
Add hipEventDisableSystemRelease flag.
2017-05-12 16:06:03 -05:00
Aditya Atluri
a38e36ec2f
added guard against hip_runtime.h so that non-hcc compilers can use it
...
Change-Id: I3d68deda9ce8a5956e21e15a69e549d6c21e3e39
2017-05-09 10:14:16 -05:00
Rahul Garg
1cb51d614e
hipMemcpy2DAsync for HIP/NVCC
...
Change-Id: I46f0057fef49bdaaac41c1df80c3e27432b8f376
2017-05-04 06:47:55 +05:30
Rahul Garg
ec27c695c4
Added support for hipMemcpy2DAsync in HIP/HCC
...
Change-Id: Ia4a8306f2dc1e33a81a7195ec29aef652fcccc4b
2017-05-03 22:29:12 +05:30
Aditya Atluri
a5cb2d40ec
fixed hipFuncSetCacheConfig on rocm path
...
Change-Id: I937a3afbf115edc94a753a0beb2230ed60a6f021
2017-04-28 11:53:11 -05:00
Aditya Atluri
fc6248ce82
added hipFuncSetCacheConfig API for nvcc path
...
Change-Id: I87fae35bc0e10a0dca5ae1c5015fe5d9e52a1d0d
2017-04-26 18:56:57 -05:00
Sun, Peng
54561c8af3
fix hip_complex.h header on NV path
...
Change-Id: Ia95d003ca1b284bab1c76723050e6b3b89178f65
2017-04-25 00:15:20 -05:00
Ben Sander
59df70662a
Fix hipMalloc to return error code if allocation fails.
2017-04-24 22:30:54 -05:00
Aditya Atluri
72bcfa438b
changed arguments for hipPointerGetAttributes
...
Change-Id: Ia7a7c4722c1f7d0a23f0e5cc3dd6dea6c01c1fd8
2017-04-24 15:24:16 -05:00
Sun, Peng
f4287a29b2
add support of hipLaunchKernelGGL on NV path
...
Change-Id: I0aeafd80c2181873be385d985f1d8ed86a98d136
2017-04-19 11:00:25 -05:00
Sun, Peng
fa15ee3ccd
fix broken header in NV path
...
Change-Id: Ia3aff2a89d9ba49547f51ce03a3304dfab58ba25
2017-04-19 10:47:40 -05:00
Ben Sander
22f3b91cad
Add integer abs (initial implementation, can be optimized with OCML)
...
Change-Id: I1f568c8c0e2333af1fda4c313dc48ea0c5b6ab00
2017-04-11 01:16:28 +00:00
Sun, Peng
71447dacad
Fix ifndef guard in hip_fp16.h
...
Change-Id: I0215556e7aa98a74e8a984e4de3fb6e8cafdfb24
2017-04-10 11:17:05 -05:00
Sun, Peng
2848d7a222
add math.h to cover sqrtf function
...
Change-Id: Ia37752710cea4ca77e0a4e61f8e69a0355d9488d
2017-04-10 08:53:12 -05:00
Sun, Peng
a9fd0d4e0d
Update the define of __global__ for GGL
...
Change-Id: I563bb2a132403bcbe9e9f279b55406cf0255af7d
2017-04-07 16:39:33 -05:00
Sun, Peng
a5ca430e5c
Add more operator overloading for float2 type, contributed by Aditya
...
Change-Id: If1ab7fb24d64bb5304142aed0951c9bd5ad47d20
2017-04-07 14:17:41 -05:00
Sun, Peng
6d4af1ab1f
update GGL to log launched kernel information
...
Change-Id: Ied0aa6055673c687071b4a579aecd17f0f3f09ce
2017-04-07 14:06:31 -05:00
Ben Sander
6656d33f75
Finish adding start/stop event support to hipHccModuleLaunchKernel.
...
Change interface to use hipEvent_t rather than hipEvent_t*
Change-Id: I259062dc087a13d51dc27f84e1e8861f332a104d
2017-04-06 21:02:50 -05:00
sunway513
5995965558
Improve documentation for hipModuleLaunch functions.
...
Change-Id: I0e22621e499775740c3301347b7416d5f98c2414
2017-04-06 21:02:50 -05:00
Aditya Atluri
0a07382d92
fixed header structure for complex data types
...
Change-Id: I16bf19005d933f42e8c8603c5d0b2df8ea3ad04f
2017-04-06 16:43:26 -05:00
Sun, Peng
ad18bf3b29
GGL update, add while 0 guard for hipLaunchKernel API
...
Change-Id: Ie48ef8ca2ab5e26a51febfcd92417902c33fbf66
2017-04-06 10:48:11 -05:00
Aditya Atluri
6dff2714e9
added __host__ for complex functions and corrected memset and memcpy test
...
Change-Id: I9ffefb7a0025aa111a54d20d2766982df15532e7
2017-04-06 09:29:44 -05:00
Sun, Peng
1a8ea82497
add extra guard to grid_launch_GGL header
...
Change-Id: I120619c08ea2d084804fcb1639efbe6c4648dde9
2017-04-05 22:25:41 -05:00
Sun, Peng
a92cdbaf29
Add more include header file for GGL, to make it self-compilable
...
Change-Id: I833cb194784450fb86e7961a7f9fe196ce3c7da5
2017-04-03 11:16:35 -05:00
Maneesh Gupta
822fc802a0
Enable promote free HCC by default
...
Change-Id: I5f82a8d958dd675a8a46d9d17458c71321daab7c
2017-04-03 11:18:30 +05:30
Sun, Peng
c82c84949c
Fix warpSize, for related issue in hipeigen and torch
...
Change-Id: Ic66b24923a363304dca189011869ba7a0a6f8895
2017-04-01 15:42:53 -05:00
Sun, Peng
b1ed910942
GGL update, fix for thread-safe access to streams (accelerator_views).
...
Change-Id: I6dd329a85b3ba7de23d52823febee0c53857a981
2017-04-01 14:50:39 -05:00
Aditya Atluri
1ef7222c3a
Fixed copyright and header names
...
Change-Id: Id595c65ea3b7289e87be4c42db5d8a31905a4fdd
2017-03-31 12:40:29 -05:00
Aditya Atluri
e0133e627d
fixed header names
...
Change-Id: I21650d6398187d3767b28e8ac81b2642d3b89a0e
2017-03-31 12:18:55 -05:00
Aditya Atluri
b9091ba818
added new api hipHccModuleLaunchKernel
...
1. hipHccModuleLaunchKernel is same as hipModuleLaunchKernel with OpenCL workitem model
2. Added copy right
3. Fixed header naming
Change-Id: I6a7c35a3566e2f8d3f5056613e34193775d4b236
2017-03-31 12:11:34 -05:00
Sun, Peng
071f19521c
remove extra GGL header info
...
Change-Id: I09f0d1b64a7a31eb2e926f19b69b7bafbacc7787
2017-03-30 18:16:56 -05:00
Sun, Peng
f4efa422bf
Rename hipLaunchKernelV3 to hipLaunchKernelGGL
...
Change-Id: I303daae006db41e9b04eb591e0b09b2717a7cf66
2017-03-30 18:10:17 -05:00
Sun, Peng
cfc2d455e1
Enable GGL as the default kernel launch method
...
Change-Id: I8022d126ee28ff7e4d9a96209e399d4243d39d8b
2017-03-30 13:36:46 -05:00
Sun, Peng
d067c884be
Update GGL to fix one Torch build issue
...
Change-Id: I95a2a335902e3c368ed29f075ac72eabbb64c97e
2017-03-29 09:03:21 -05:00
sunway513
43f76be76e
resolve GGL merge conflicts
...
Change-Id: I7a5ec3696cf2dd1a77b1686536a1cb84cbfed66e
2017-03-27 22:46:41 -05:00
Rahul Garg
1d18006ab4
Fix for MemcpyFromSymbol on HIP/NVCC path
...
Change-Id: Ice38307f72870ae468cbf0861e104f0fa46dfd56
2017-03-27 00:35:10 +05:30
Rahul Garg
ecc0e14cf7
Added support for Primary Context Management APIs
...
Change-Id: I70f91b4492e112dd8e12ecf511fdc18a27944a06
2017-03-26 23:45:54 +05:30
Sun, Peng
91274394dc
update GGL implementation to use hipLaunchKernel
...
Change-Id: Ibc08185c814bb07d54f3e68016b10eb7b9f2bf4b
2017-03-21 15:46:56 -05:00
Sun, Peng
ec04521617
Update GGL implementation to extended overload set for make_lambda_wrapper
...
Change-Id: I949f113671ddf155db8689e8a7f23d415839a7b5
2017-03-21 15:46:44 -05:00
Sun, Peng
329e2182d6
revert workaround for square sample and update doc on GGL
...
Change-Id: I731c68ca4111e7dc2e45bef51c4cad2c23fc81f8
2017-03-21 10:26:09 -05:00
Sun, Peng
d09afd23b8
merge Alex' GGL fix for non-specialized kernel function launch
...
Change-Id: Idbf7ca669c38ee5c0f654bcabdd1b498abb29f69
2017-03-20 16:34:24 -05:00
Aditya Atluri
d9f0bd25be
added support for lgammaf and lgamma
...
1. Implementation inside HIP
Change-Id: I657263b7276a57c56081d3336fef816b5f204eff
2017-03-17 18:26:10 -05:00
Sun, Peng
e7689e9e6e
Disable additional debug warning message
...
Change-Id: Ic5c374589bfad387a7c4c5346430a490e2c6e2a7
2017-03-17 15:03:03 -05:00