Graf commitů

538 Commity

Autor SHA1 Zpráva Datum
Ben Sander 75f691ec2f Add hipHostMallocCoherent, hipHostMallocNonCoherent
Provide per-allocation control over coherent/non-coherent mem.
These overrid the default HIP_COHERENT_HOST_ALLOC setting.
2017-05-24 00:48:10 -05:00
pensun 6d0f58b939 fix GGL helper header file, reorder for C++17
Change-Id: I3d9ddfe670bf7e3e8e7bd85e52cc61f48c19c213
2017-05-22 08:52:43 -05:00
Evgeny Mankov 270f643c9c [HIP] [HIPIFY] [FIX] cuModuleLoadDataEx -> hipModuleLoadDataEx
https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/81

1.	Do not use JIT options on HCC path, call hipModuleLoadData instead.
2.	NVCC path is unchanged, to call cuModuleLoadDataEx with all options.
3.	Get rid of manual hipification, based on #ifdef #else for NVCC/HIP.
4.	Update documentation accordingly.
2017-05-19 17:22:14 +03:00
Aditya Atluri bdc08fcf10 fixed vector type issues by reverting to old code, changed __half2 to map to vector types in llvm
Change-Id: I7317408c25e8c1a0c02a346042c9137e160c8bbd
2017-05-18 10:51:07 -05:00
Aditya Atluri 9dceccf136 changed vector types to make sure it generate proper llvm vector types
Change-Id: I6c4616dae137dc4eac35e5827dc5b7f3251e0247
2017-05-16 21:35:40 -05:00
Ben Sander 27877f8854 Add initial HIP_SYNC_NULL_STREAM=0 mode.
This eliminates host-synchronization for null stream.  Instead, the
null-stream uses GPU-side events to wait for other streams.
Default is OFF pending additional testing.

Add enhanced null-stream test.

Also refine HIP_TRACE_API.
2017-05-16 19:04:25 -05:00
Rahul Garg 1223612331 Added hipMallocPitch on HIP/NVCC path
Change-Id: Ie3ba7d3f95acac23805efa919531043b350a3f21
2017-05-16 07:15:13 +05:30
Ben Sander b306095ac2 Add hipEventDisableSystemRelease flag. 2017-05-12 16:06:03 -05:00
Aditya Atluri a38e36ec2f added guard against hip_runtime.h so that non-hcc compilers can use it
Change-Id: I3d68deda9ce8a5956e21e15a69e549d6c21e3e39
2017-05-09 10:14:16 -05:00
Rahul Garg 1cb51d614e hipMemcpy2DAsync for HIP/NVCC
Change-Id: I46f0057fef49bdaaac41c1df80c3e27432b8f376
2017-05-04 06:47:55 +05:30
Rahul Garg ec27c695c4 Added support for hipMemcpy2DAsync in HIP/HCC
Change-Id: Ia4a8306f2dc1e33a81a7195ec29aef652fcccc4b
2017-05-03 22:29:12 +05:30
Aditya Atluri a5cb2d40ec fixed hipFuncSetCacheConfig on rocm path
Change-Id: I937a3afbf115edc94a753a0beb2230ed60a6f021
2017-04-28 11:53:11 -05:00
Aditya Atluri fc6248ce82 added hipFuncSetCacheConfig API for nvcc path
Change-Id: I87fae35bc0e10a0dca5ae1c5015fe5d9e52a1d0d
2017-04-26 18:56:57 -05:00
Sun, Peng 54561c8af3 fix hip_complex.h header on NV path
Change-Id: Ia95d003ca1b284bab1c76723050e6b3b89178f65
2017-04-25 00:15:20 -05:00
Ben Sander 59df70662a Fix hipMalloc to return error code if allocation fails. 2017-04-24 22:30:54 -05:00
Aditya Atluri 72bcfa438b changed arguments for hipPointerGetAttributes
Change-Id: Ia7a7c4722c1f7d0a23f0e5cc3dd6dea6c01c1fd8
2017-04-24 15:24:16 -05:00
Sun, Peng f4287a29b2 add support of hipLaunchKernelGGL on NV path
Change-Id: I0aeafd80c2181873be385d985f1d8ed86a98d136
2017-04-19 11:00:25 -05:00
Sun, Peng fa15ee3ccd fix broken header in NV path
Change-Id: Ia3aff2a89d9ba49547f51ce03a3304dfab58ba25
2017-04-19 10:47:40 -05:00
Ben Sander 22f3b91cad Add integer abs (initial implementation, can be optimized with OCML)
Change-Id: I1f568c8c0e2333af1fda4c313dc48ea0c5b6ab00
2017-04-11 01:16:28 +00:00
Sun, Peng 71447dacad Fix ifndef guard in hip_fp16.h
Change-Id: I0215556e7aa98a74e8a984e4de3fb6e8cafdfb24
2017-04-10 11:17:05 -05:00
Sun, Peng 2848d7a222 add math.h to cover sqrtf function
Change-Id: Ia37752710cea4ca77e0a4e61f8e69a0355d9488d
2017-04-10 08:53:12 -05:00
Sun, Peng a9fd0d4e0d Update the define of __global__ for GGL
Change-Id: I563bb2a132403bcbe9e9f279b55406cf0255af7d
2017-04-07 16:39:33 -05:00
Sun, Peng a5ca430e5c Add more operator overloading for float2 type, contributed by Aditya
Change-Id: If1ab7fb24d64bb5304142aed0951c9bd5ad47d20
2017-04-07 14:17:41 -05:00
Sun, Peng 6d4af1ab1f update GGL to log launched kernel information
Change-Id: Ied0aa6055673c687071b4a579aecd17f0f3f09ce
2017-04-07 14:06:31 -05:00
Ben Sander 6656d33f75 Finish adding start/stop event support to hipHccModuleLaunchKernel.
Change interface to use hipEvent_t rather than hipEvent_t*

Change-Id: I259062dc087a13d51dc27f84e1e8861f332a104d
2017-04-06 21:02:50 -05:00
sunway513 5995965558 Improve documentation for hipModuleLaunch functions.
Change-Id: I0e22621e499775740c3301347b7416d5f98c2414
2017-04-06 21:02:50 -05:00
Aditya Atluri 0a07382d92 fixed header structure for complex data types
Change-Id: I16bf19005d933f42e8c8603c5d0b2df8ea3ad04f
2017-04-06 16:43:26 -05:00
Sun, Peng ad18bf3b29 GGL update, add while 0 guard for hipLaunchKernel API
Change-Id: Ie48ef8ca2ab5e26a51febfcd92417902c33fbf66
2017-04-06 10:48:11 -05:00
Aditya Atluri 6dff2714e9 added __host__ for complex functions and corrected memset and memcpy test
Change-Id: I9ffefb7a0025aa111a54d20d2766982df15532e7
2017-04-06 09:29:44 -05:00
Sun, Peng 1a8ea82497 add extra guard to grid_launch_GGL header
Change-Id: I120619c08ea2d084804fcb1639efbe6c4648dde9
2017-04-05 22:25:41 -05:00
Sun, Peng a92cdbaf29 Add more include header file for GGL, to make it self-compilable
Change-Id: I833cb194784450fb86e7961a7f9fe196ce3c7da5
2017-04-03 11:16:35 -05:00
Maneesh Gupta 822fc802a0 Enable promote free HCC by default
Change-Id: I5f82a8d958dd675a8a46d9d17458c71321daab7c
2017-04-03 11:18:30 +05:30
Sun, Peng c82c84949c Fix warpSize, for related issue in hipeigen and torch
Change-Id: Ic66b24923a363304dca189011869ba7a0a6f8895
2017-04-01 15:42:53 -05:00
Sun, Peng b1ed910942 GGL update, fix for thread-safe access to streams (accelerator_views).
Change-Id: I6dd329a85b3ba7de23d52823febee0c53857a981
2017-04-01 14:50:39 -05:00
Aditya Atluri 1ef7222c3a Fixed copyright and header names
Change-Id: Id595c65ea3b7289e87be4c42db5d8a31905a4fdd
2017-03-31 12:40:29 -05:00
Aditya Atluri e0133e627d fixed header names
Change-Id: I21650d6398187d3767b28e8ac81b2642d3b89a0e
2017-03-31 12:18:55 -05:00
Aditya Atluri b9091ba818 added new api hipHccModuleLaunchKernel
1. hipHccModuleLaunchKernel is same as hipModuleLaunchKernel with OpenCL workitem model
2. Added copy right
3. Fixed header naming

Change-Id: I6a7c35a3566e2f8d3f5056613e34193775d4b236
2017-03-31 12:11:34 -05:00
Sun, Peng 071f19521c remove extra GGL header info
Change-Id: I09f0d1b64a7a31eb2e926f19b69b7bafbacc7787
2017-03-30 18:16:56 -05:00
Sun, Peng f4efa422bf Rename hipLaunchKernelV3 to hipLaunchKernelGGL
Change-Id: I303daae006db41e9b04eb591e0b09b2717a7cf66
2017-03-30 18:10:17 -05:00
Sun, Peng cfc2d455e1 Enable GGL as the default kernel launch method
Change-Id: I8022d126ee28ff7e4d9a96209e399d4243d39d8b
2017-03-30 13:36:46 -05:00
Sun, Peng d067c884be Update GGL to fix one Torch build issue
Change-Id: I95a2a335902e3c368ed29f075ac72eabbb64c97e
2017-03-29 09:03:21 -05:00
sunway513 43f76be76e resolve GGL merge conflicts
Change-Id: I7a5ec3696cf2dd1a77b1686536a1cb84cbfed66e
2017-03-27 22:46:41 -05:00
Rahul Garg 1d18006ab4 Fix for MemcpyFromSymbol on HIP/NVCC path
Change-Id: Ice38307f72870ae468cbf0861e104f0fa46dfd56
2017-03-27 00:35:10 +05:30
Rahul Garg ecc0e14cf7 Added support for Primary Context Management APIs
Change-Id: I70f91b4492e112dd8e12ecf511fdc18a27944a06
2017-03-26 23:45:54 +05:30
Sun, Peng 91274394dc update GGL implementation to use hipLaunchKernel
Change-Id: Ibc08185c814bb07d54f3e68016b10eb7b9f2bf4b
2017-03-21 15:46:56 -05:00
Sun, Peng ec04521617 Update GGL implementation to extended overload set for make_lambda_wrapper
Change-Id: I949f113671ddf155db8689e8a7f23d415839a7b5
2017-03-21 15:46:44 -05:00
Sun, Peng 329e2182d6 revert workaround for square sample and update doc on GGL
Change-Id: I731c68ca4111e7dc2e45bef51c4cad2c23fc81f8
2017-03-21 10:26:09 -05:00
Sun, Peng d09afd23b8 merge Alex' GGL fix for non-specialized kernel function launch
Change-Id: Idbf7ca669c38ee5c0f654bcabdd1b498abb29f69
2017-03-20 16:34:24 -05:00
Aditya Atluri d9f0bd25be added support for lgammaf and lgamma
1. Implementation inside HIP

Change-Id: I657263b7276a57c56081d3336fef816b5f204eff
2017-03-17 18:26:10 -05:00
Sun, Peng e7689e9e6e Disable additional debug warning message
Change-Id: Ic5c374589bfad387a7c4c5346430a490e2c6e2a7
2017-03-17 15:03:03 -05:00