Sun, Peng
47921e39d2
Fix error related to undefined reference of __get_dynamicgroupbaseptr().
...
Change-Id: I14951e1725e35dd5f5e53805f81cdb58661f59f2
[ROCm/hip commit: 43df5ba660 ]
2017-06-08 19:24:32 -05:00
Sun, Peng
8b14507650
Add clang version guard so the hip_fp16.h header won't be picked up by gcc
...
Change-Id: Ia21335a455bc93210901b44bc8c76a7f4a385b55
[ROCm/hip commit: 06816fb68b ]
2017-06-08 19:24:32 -05:00
Sun, Peng
3f9b16c397
Improve HIP kernel names, attributes and codegen, contributed by Alex Voicu
...
Change-Id: I2cafbdc5a98e26c7f4fad84739c915e7dc09993c
[ROCm/hip commit: a9808961bd ]
2017-06-05 11:39:00 -05:00
Ben Sander
445042f916
Fix HIP_SYNC_NULL_STREAM=0 mode.
...
- Fix null-stream sync
- hipStreamDestroy of null stream returns hipErrorInvalidResourceHandle
- Update documentation.
- Add tests for null stream sync, hipEventElapsedTime.
- Rename internal enum hipEventStatusRecorded to hipEventStatusComplete
- refactor hipStreamWaitEvent to streamline control-flow
[ROCm/hip commit: 39c18e5e5f ]
2017-06-05 08:50:22 -05:00
Aditya Atluri
47ec040140
added half data type and vector destructors
...
1. Added half data types to hip_fp16.h
2. Added destructor to vector data types
Change-Id: Id5ae76a663bb90a4bde2839ec79c58fbaee5072f
[ROCm/hip commit: cb7c4c423c ]
2017-06-02 11:19:33 -05:00
Evgeny Mankov
7a3befc555
[HIP] [HIPIFY] CUDA Driver API 8.0.44 JIT options support.
...
[ROCm/hip commit: 6e99e388ea ]
2017-05-31 18:55:29 +03:00
Ben Sander
b3e01ed74f
Add event controls for release fences.
...
Env var : HIP_EVENT_SYS_RELEASE
Event allocation flags : hipEventReleaseToDevice, hipEventReleaseToSystem
(remove hipEventDisableSystemRelease)
Update test for new functionality.
[ROCm/hip commit: ec7102f461 ]
2017-05-27 16:02:34 -05:00
Ben Sander
c88190d698
Updates so hip compiles on CUDA.
...
[ROCm/hip commit: be8d0ba644 ]
2017-05-27 15:55:07 -05:00
Ben Sander
5b9ce41633
Add hipHostMallocCoherent, hipHostMallocNonCoherent
...
Provide per-allocation control over coherent/non-coherent mem.
These overrid the default HIP_COHERENT_HOST_ALLOC setting.
[ROCm/hip commit: 75f691ec2f ]
2017-05-24 00:48:10 -05:00
pensun
b19e862dc4
fix GGL helper header file, reorder for C++17
...
Change-Id: I3d9ddfe670bf7e3e8e7bd85e52cc61f48c19c213
[ROCm/hip commit: 6d0f58b939 ]
2017-05-22 08:52:43 -05:00
Evgeny Mankov
042de3e175
[HIP] [HIPIFY] [FIX] cuModuleLoadDataEx -> hipModuleLoadDataEx
...
https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/81
1. Do not use JIT options on HCC path, call hipModuleLoadData instead.
2. NVCC path is unchanged, to call cuModuleLoadDataEx with all options.
3. Get rid of manual hipification, based on #ifdef #else for NVCC/HIP.
4. Update documentation accordingly.
[ROCm/hip commit: 270f643c9c ]
2017-05-19 17:22:14 +03:00
Aditya Atluri
721cb0f7db
fixed vector type issues by reverting to old code, changed __half2 to map to vector types in llvm
...
Change-Id: I7317408c25e8c1a0c02a346042c9137e160c8bbd
[ROCm/hip commit: bdc08fcf10 ]
2017-05-18 10:51:07 -05:00
Aditya Atluri
4dbebe0409
changed vector types to make sure it generate proper llvm vector types
...
Change-Id: I6c4616dae137dc4eac35e5827dc5b7f3251e0247
[ROCm/hip commit: 9dceccf136 ]
2017-05-16 21:35:40 -05:00
Ben Sander
a55ce5bee4
Add initial HIP_SYNC_NULL_STREAM=0 mode.
...
This eliminates host-synchronization for null stream. Instead, the
null-stream uses GPU-side events to wait for other streams.
Default is OFF pending additional testing.
Add enhanced null-stream test.
Also refine HIP_TRACE_API.
[ROCm/hip commit: 27877f8854 ]
2017-05-16 19:04:25 -05:00
Rahul Garg
c6a2d65d5e
Added hipMallocPitch on HIP/NVCC path
...
Change-Id: Ie3ba7d3f95acac23805efa919531043b350a3f21
[ROCm/hip commit: 1223612331 ]
2017-05-16 07:15:13 +05:30
Ben Sander
b1b71c4c75
Add hipEventDisableSystemRelease flag.
...
[ROCm/hip commit: b306095ac2 ]
2017-05-12 16:06:03 -05:00
Aditya Atluri
bdce32238f
added guard against hip_runtime.h so that non-hcc compilers can use it
...
Change-Id: I3d68deda9ce8a5956e21e15a69e549d6c21e3e39
[ROCm/hip commit: a38e36ec2f ]
2017-05-09 10:14:16 -05:00
Rahul Garg
0557f54200
hipMemcpy2DAsync for HIP/NVCC
...
Change-Id: I46f0057fef49bdaaac41c1df80c3e27432b8f376
[ROCm/hip commit: 1cb51d614e ]
2017-05-04 06:47:55 +05:30
Rahul Garg
340cd36e8d
Added support for hipMemcpy2DAsync in HIP/HCC
...
Change-Id: Ia4a8306f2dc1e33a81a7195ec29aef652fcccc4b
[ROCm/hip commit: ec27c695c4 ]
2017-05-03 22:29:12 +05:30
Aditya Atluri
eb68f5d5af
fixed hipFuncSetCacheConfig on rocm path
...
Change-Id: I937a3afbf115edc94a753a0beb2230ed60a6f021
[ROCm/hip commit: a5cb2d40ec ]
2017-04-28 11:53:11 -05:00
Aditya Atluri
a2c405c265
added hipFuncSetCacheConfig API for nvcc path
...
Change-Id: I87fae35bc0e10a0dca5ae1c5015fe5d9e52a1d0d
[ROCm/hip commit: fc6248ce82 ]
2017-04-26 18:56:57 -05:00
Sun, Peng
bbcd4b3761
fix hip_complex.h header on NV path
...
Change-Id: Ia95d003ca1b284bab1c76723050e6b3b89178f65
[ROCm/hip commit: 54561c8af3 ]
2017-04-25 00:15:20 -05:00
Ben Sander
ba2f2f56b8
Fix hipMalloc to return error code if allocation fails.
...
[ROCm/hip commit: 59df70662a ]
2017-04-24 22:30:54 -05:00
Aditya Atluri
c5840d81ad
changed arguments for hipPointerGetAttributes
...
Change-Id: Ia7a7c4722c1f7d0a23f0e5cc3dd6dea6c01c1fd8
[ROCm/hip commit: 72bcfa438b ]
2017-04-24 15:24:16 -05:00
Sun, Peng
f762edf621
add support of hipLaunchKernelGGL on NV path
...
Change-Id: I0aeafd80c2181873be385d985f1d8ed86a98d136
[ROCm/hip commit: f4287a29b2 ]
2017-04-19 11:00:25 -05:00
Sun, Peng
2517984a65
fix broken header in NV path
...
Change-Id: Ia3aff2a89d9ba49547f51ce03a3304dfab58ba25
[ROCm/hip commit: fa15ee3ccd ]
2017-04-19 10:47:40 -05:00
Ben Sander
b716a7781e
Add integer abs (initial implementation, can be optimized with OCML)
...
Change-Id: I1f568c8c0e2333af1fda4c313dc48ea0c5b6ab00
[ROCm/hip commit: 22f3b91cad ]
2017-04-11 01:16:28 +00:00
Sun, Peng
da363cb59d
Fix ifndef guard in hip_fp16.h
...
Change-Id: I0215556e7aa98a74e8a984e4de3fb6e8cafdfb24
[ROCm/hip commit: 71447dacad ]
2017-04-10 11:17:05 -05:00
Sun, Peng
28f1993c9e
add math.h to cover sqrtf function
...
Change-Id: Ia37752710cea4ca77e0a4e61f8e69a0355d9488d
[ROCm/hip commit: 2848d7a222 ]
2017-04-10 08:53:12 -05:00
Sun, Peng
b860a40161
Update the define of __global__ for GGL
...
Change-Id: I563bb2a132403bcbe9e9f279b55406cf0255af7d
[ROCm/hip commit: a9fd0d4e0d ]
2017-04-07 16:39:33 -05:00
Sun, Peng
2841eff506
Add more operator overloading for float2 type, contributed by Aditya
...
Change-Id: If1ab7fb24d64bb5304142aed0951c9bd5ad47d20
[ROCm/hip commit: a5ca430e5c ]
2017-04-07 14:17:41 -05:00
Sun, Peng
b4dbf01cdb
update GGL to log launched kernel information
...
Change-Id: Ied0aa6055673c687071b4a579aecd17f0f3f09ce
[ROCm/hip commit: 6d4af1ab1f ]
2017-04-07 14:06:31 -05:00
Ben Sander
cbfc58660e
Finish adding start/stop event support to hipHccModuleLaunchKernel.
...
Change interface to use hipEvent_t rather than hipEvent_t*
Change-Id: I259062dc087a13d51dc27f84e1e8861f332a104d
[ROCm/hip commit: 6656d33f75 ]
2017-04-06 21:02:50 -05:00
sunway513
7502166e5a
Improve documentation for hipModuleLaunch functions.
...
Change-Id: I0e22621e499775740c3301347b7416d5f98c2414
[ROCm/hip commit: 5995965558 ]
2017-04-06 21:02:50 -05:00
Aditya Atluri
6c7450d414
fixed header structure for complex data types
...
Change-Id: I16bf19005d933f42e8c8603c5d0b2df8ea3ad04f
[ROCm/hip commit: 0a07382d92 ]
2017-04-06 16:43:26 -05:00
Sun, Peng
9d1a5f886e
GGL update, add while 0 guard for hipLaunchKernel API
...
Change-Id: Ie48ef8ca2ab5e26a51febfcd92417902c33fbf66
[ROCm/hip commit: ad18bf3b29 ]
2017-04-06 10:48:11 -05:00
Aditya Atluri
26f6ce992d
added __host__ for complex functions and corrected memset and memcpy test
...
Change-Id: I9ffefb7a0025aa111a54d20d2766982df15532e7
[ROCm/hip commit: 6dff2714e9 ]
2017-04-06 09:29:44 -05:00
Sun, Peng
72238e2119
add extra guard to grid_launch_GGL header
...
Change-Id: I120619c08ea2d084804fcb1639efbe6c4648dde9
[ROCm/hip commit: 1a8ea82497 ]
2017-04-05 22:25:41 -05:00
Sun, Peng
5c46c8920c
Add more include header file for GGL, to make it self-compilable
...
Change-Id: I833cb194784450fb86e7961a7f9fe196ce3c7da5
[ROCm/hip commit: a92cdbaf29 ]
2017-04-03 11:16:35 -05:00
Maneesh Gupta
c8214a5b20
Enable promote free HCC by default
...
Change-Id: I5f82a8d958dd675a8a46d9d17458c71321daab7c
[ROCm/hip commit: 822fc802a0 ]
2017-04-03 11:18:30 +05:30
Sun, Peng
dd0aa2c9e3
Fix warpSize, for related issue in hipeigen and torch
...
Change-Id: Ic66b24923a363304dca189011869ba7a0a6f8895
[ROCm/hip commit: c82c84949c ]
2017-04-01 15:42:53 -05:00
Sun, Peng
7559427041
GGL update, fix for thread-safe access to streams (accelerator_views).
...
Change-Id: I6dd329a85b3ba7de23d52823febee0c53857a981
[ROCm/hip commit: b1ed910942 ]
2017-04-01 14:50:39 -05:00
Aditya Atluri
fcb4331a6a
Fixed copyright and header names
...
Change-Id: Id595c65ea3b7289e87be4c42db5d8a31905a4fdd
[ROCm/hip commit: 1ef7222c3a ]
2017-03-31 12:40:29 -05:00
Aditya Atluri
7635e0a57e
fixed header names
...
Change-Id: I21650d6398187d3767b28e8ac81b2642d3b89a0e
[ROCm/hip commit: e0133e627d ]
2017-03-31 12:18:55 -05:00
Aditya Atluri
ccf799c453
added new api hipHccModuleLaunchKernel
...
1. hipHccModuleLaunchKernel is same as hipModuleLaunchKernel with OpenCL workitem model
2. Added copy right
3. Fixed header naming
Change-Id: I6a7c35a3566e2f8d3f5056613e34193775d4b236
[ROCm/hip commit: b9091ba818 ]
2017-03-31 12:11:34 -05:00
Sun, Peng
799a81029c
remove extra GGL header info
...
Change-Id: I09f0d1b64a7a31eb2e926f19b69b7bafbacc7787
[ROCm/hip commit: 071f19521c ]
2017-03-30 18:16:56 -05:00
Sun, Peng
c9dc0cf010
Rename hipLaunchKernelV3 to hipLaunchKernelGGL
...
Change-Id: I303daae006db41e9b04eb591e0b09b2717a7cf66
[ROCm/hip commit: f4efa422bf ]
2017-03-30 18:10:17 -05:00
Sun, Peng
7a0c077800
Enable GGL as the default kernel launch method
...
Change-Id: I8022d126ee28ff7e4d9a96209e399d4243d39d8b
[ROCm/hip commit: cfc2d455e1 ]
2017-03-30 13:36:46 -05:00
Sun, Peng
381a67f256
Update GGL to fix one Torch build issue
...
Change-Id: I95a2a335902e3c368ed29f075ac72eabbb64c97e
[ROCm/hip commit: d067c884be ]
2017-03-29 09:03:21 -05:00
sunway513
6e01fb345b
resolve GGL merge conflicts
...
Change-Id: I7a5ec3696cf2dd1a77b1686536a1cb84cbfed66e
[ROCm/hip commit: 43f76be76e ]
2017-03-27 22:46:41 -05:00