Aditya Atluri
f97eec0d3b
added kernel only compilation feature for hipcc
...
1. Added hipgenisa.sh file to compile kernel code to hsa code object
2. Changed hipcc to call hipgenisa.sh, making hipcc compiling the kernels
Change-Id: I976459c1ebb24343e1b1fe38b4c3a203f1adffa9
[ROCm/clr commit: 2f8b2fca6b ]
2016-08-31 13:05:57 -05:00
Aditya Atluri
df6f400a8e
added doxygen comments for hipModuleLaunchKernel
...
Change-Id: I8a52d2e62f4b7eea8e05d779b9fda49f0ac45130
[ROCm/clr commit: 6bfbf17fb6 ]
2016-08-31 10:18:46 -05:00
Rahul Garg
cf294e9b60
CUresult to hipError mapping updates
...
Change-Id: I602a70acda798a47ebbbef84b438b21c399821c3
[ROCm/clr commit: e0c5ad6ff6 ]
2016-08-30 15:55:51 +05:30
Aditya Atluri
eb6863017c
remove HIP_INIT_API from ihipSynchronize
...
Change-Id: Ibe0739efe55573c023212d9c28ba847c777e434c
[ROCm/clr commit: 7fdebfb8e3 ]
2016-08-29 21:42:22 -05:00
Aditya Atluri
ab83d52bd4
Added HIP_INIT_API for all context apis
...
Change-Id: I3dd8c068236d9572659a58683fdb59d2b9dccaf3
[ROCm/clr commit: cfac62f7e6 ]
2016-08-29 21:36:34 -05:00
Aditya Atluri
50a2e2db2a
Added doxygen comments for module apis
...
Change-Id: I1825249bf91efe7d058f9026a82ec47855759c98
[ROCm/clr commit: d87bfd425c ]
2016-08-29 21:24:19 -05:00
Ben Sander
7adfdbe991
Add guideline for HIP_INIT_API
...
Change-Id: Ib8fa1e952f31158e9435dfc37ff23db9fda6fd7e
[ROCm/clr commit: 6fd03dde79 ]
2016-08-29 18:40:23 -05:00
Ben Sander
ae336b6a6d
Update supported API list
...
Change-Id: Idbc941e4464d200aca7ac3382060edc08ab94ef2
[ROCm/clr commit: a03a71badd ]
2016-08-29 18:39:50 -05:00
Ben Sander
78cfa5f6d6
Refactor trace code for hipLaunchKernel.
...
- Use standard print functions for streams.
- Add HIP_INIT macro, for cases where we want to initialize HIP but not
log an API (ihipPreKernelLaunch).
Change-Id: If43cf8a363d918bcd3722a2e6a965d4cfa2e03e7
[ROCm/clr commit: fa9b95838a ]
2016-08-29 18:37:57 -05:00
Ben Sander
f538d2e510
remove stray printf
...
Change-Id: Ie64778a83dfe684ffaab3c31bc3d09b713f825b9
[ROCm/clr commit: eb61114739 ]
2016-08-29 18:36:30 -05:00
Aditya Atluri
76c122aecb
added docs to how to use hipModuleLaunchKernel
...
Change-Id: I626241552c69cdae56501371374ca8f2c6776c85
[ROCm/clr commit: 5dc94eb45d ]
2016-08-29 15:38:04 -05:00
Aditya Atluri
78420e4051
added sample to porting driver guide
...
Change-Id: Ia2b54311cec617e5e2a162fd31415c8caad4a374
[ROCm/clr commit: fa86d293bf ]
2016-08-29 15:25:35 -05:00
Aditya Atluri
9af7591223
added hipInit for cuInit nvcc path
...
Change-Id: I594d08d936ac5d06f16e42c63062ac5776b65a0c
[ROCm/clr commit: 5d3c72a7ff ]
2016-08-29 15:20:09 -05:00
Aditya Atluri
41a363a6b7
Changed module api to work with nvcc and hipcc path
...
- Added cuda and cudart linking for nvcc path in hipcc
- Added hipMemcpyHtoD and hipMemcpyDtoH for nvcc path
- Changed hipDeviceptr to hipDeviceptr_t
- Fixed hipMemcpy*to* API signatues
Change-Id: I6ef076b507f92502efda475c83dcdcdd462afc37
[ROCm/clr commit: fb0c432095 ]
2016-08-29 15:05:12 -05:00
Aditya Atluri
4c46364a3f
added sample for module api which works on both cuda and rocm
...
Change-Id: Id88abb3698a224177fd0531e3e16013dde6dd95e
[ROCm/clr commit: 1673bdefec ]
2016-08-29 15:00:26 -05:00
Ben Sander
f317725b7c
Doc update. Describe memcpytosymbol, threadfence_system workarounds
...
[ROCm/clr commit: d76372a049 ]
2016-08-29 13:45:20 -05:00
Aditya Atluri
25fe3f0777
Fixed nvcc path for module apis
...
Change-Id: I43c7a6bc226f4f270c37f3c4af86b3b3efd0b175
[ROCm/clr commit: 645975a05b ]
2016-08-28 22:50:44 -05:00
Aditya Atluri
9fe6924154
Cleaned up module api
...
- Moved HIP_INIT_API from internal to all public apis
- renamed hipLaunchModuleKernel to hipModuleLaunchKernel
- Changed tests according to the changed api
Change-Id: I822ff63c7c5b7dad340ece49456baf9d89428e9f
[ROCm/clr commit: 71991ed8f8 ]
2016-08-28 16:48:57 -05:00
Ben Sander
713eb5636d
Remove cutnpaste check on uninitialized hsa status
...
Change-Id: Icc0256b9b1bd01d45cca4feb1106cfe7427bcd15
[ROCm/clr commit: 46c3ff52a6 ]
2016-08-28 16:35:27 -05:00
Ben Sander
40ca19440c
Remove deprecated message from hipLaunchModuleKernel
...
Change-Id: I87675453ae4363e3340a9d1491bb00543fa8c6e0
[ROCm/clr commit: f7ee14e637 ]
2016-08-28 16:34:34 -05:00
Aditya Atluri
6a596c353a
added memset and memcpy device functions
...
- Added memcpy and memset device functions
- Added test for memcpy and memset
Change-Id: Icd21a8dd964953b86d5e92889bf1664bee647219
[ROCm/clr commit: 5e7c396bcd ]
2016-08-27 11:13:56 -05:00
Aditya Atluri
fa734c5a6a
Added explicit memory copy direction apis
...
- Fixed stale printf in context api
- Added 4 sync memcpy apis
1. hipMemcpyHtoD
2. hipMemcpyDtoH
3. hipMemcpyDtoD
4. hipMemcpyHtoH
- Added test for added apis
Change-Id: I4a9c382445b62631f8d0bcbb9a670322288b72b1
[ROCm/clr commit: f722a132a4 ]
2016-08-26 13:11:01 -05:00
Ben Sander
99340539a9
Add docs on porting driver API
...
[ROCm/clr commit: 8b8181df41 ]
2016-08-26 10:46:39 -05:00
Ben Sander
613cf84e5b
Doc update to clarify supported / unsupported features
...
[ROCm/clr commit: 8d02cf466f ]
2016-08-26 10:46:39 -05:00
Aditya Atluri
33a38a0e26
Added NVCC support and name changes
...
- Added NVCC support for module APIs
- Changed hipFunction and hipModule data types to hipFunction_t and hipModule_t
- Created new intenal ihipModuleGetFunction as it is used twice
- Changed test to match with the new data types
Change-Id: I300a1c7fd40ed7065b1b8b9de97e3a06b96ed729
[ROCm/clr commit: 74a8fb132d ]
2016-08-26 10:32:01 -05:00
Rahul Garg
c43c365811
Added logic to update primary ctx when ctx stack is empty, updated hipCtxDestroy and ctxGetCurrent functions
...
Change-Id: Ia0a8943c121bc1279788a1cfa9be59af614b04a6
[ROCm/clr commit: 452108b0e6 ]
2016-08-26 19:03:23 +05:30
Rahul Garg
5830aeb77e
Resolved errors due to hipCtxXXX APIs
...
Change-Id: Iffac0095c4352864eca622ea318d2291571b5153
[ROCm/clr commit: 77991293ba ]
2016-08-26 15:32:49 +05:30
Rahul Garg
93e5faf1ce
NVCC path support for hipCtxXXX APIs
...
Change-Id: Ic7dbfbdaee9d00c0de1363c50758e5e29a96a8b2
[ROCm/clr commit: da80bf6326 ]
2016-08-26 14:10:36 +05:30
Rahul Garg
8e09eacd0b
Addition of hipCtxEnablePeerAccess and hipCtxDisablePeerAccess functions
...
Change-Id: I381c8cbbde17eae7d9bb5d4cb1596cebf4bda039
[ROCm/clr commit: a1d401336b ]
2016-08-26 13:51:33 +05:30
Maneesh Gupta
7064683540
hipcc needs to link against supc++
...
Change-Id: Ica0949099a0bdfe2a493341dc1cd96ec93f34f11
[ROCm/clr commit: 0c051fede1 ]
2016-08-26 10:35:30 +05:30
Aditya Atluri
82deaeb581
Changed how hipEvent_t is typedefed internall
...
- Mapped hipEvent_t directly to ihipEvent_t* instead of a handle
Change-Id: I5a8bcca0ef962932e0738c03eb1fc914d23022ae
[ROCm/clr commit: 25be4fb532 ]
2016-08-25 14:34:41 -05:00
Aditya Atluri
d769b71efc
Added hipModuleGetGlobal and hipModuleLoadData
...
Change-Id: Iaec873f7d86b72911b6ad32e067a4dfe3d552fe6
[ROCm/clr commit: f22a3189a3 ]
2016-08-25 14:16:53 -05:00
Evgeny Mankov
bdb114188c
clang-hipify: code refactoring - API (Driver/Runtime/Blas) distinguishing is added.
...
[ROCm/clr commit: 5cca5b3dca ]
2016-08-25 19:36:37 +03:00
Evgeny Mankov
9768864f17
clang-hipify: code refactoring and performance improvement
...
[ROCm/clr commit: 207eb02736 ]
2016-08-24 18:51:36 +03:00
Aditya Atluri
069d132f41
changed internal structure of hipFunction and hipModule
...
Change-Id: Ifa343782e29d7e056efc47e56253311013005093
[ROCm/clr commit: af4dc556eb ]
2016-08-24 09:47:11 -05:00
Aditya Atluri
9c36b77d39
Module test correction and hipModuleUnload API
...
- Corrected the hipModule.cpp test to minimal code
- Added hipModuleUnload API
- Added hipModuleUnload API test
Change-Id: I9c40337043d7972a570b795e1bfc104bd2c4d8aa
[ROCm/clr commit: 768287f2a2 ]
2016-08-23 14:19:15 -05:00
Aditya Atluri
021cd7174a
Added module api test with gcn binary
...
Change-Id: I61c3ecc2b34168d10f1a7b15d668630eb2c69c8c
[ROCm/clr commit: 2f426aaaea ]
2016-08-23 13:50:19 -05:00
Aditya Atluri
52305186ef
Added stream synchronisation for hipLaunchModuleKernel
...
- The module kernel launch is now in sync with commands in its stream
- Moved launch kernel inside ihipStream
Change-Id: Ic00cfcf4882bf81b6203c36881a52575ea68b529
[ROCm/clr commit: c0fbe7891c ]
2016-08-22 14:17:55 -05:00
Aditya Atluri
b9f564ae19
Added nvcc path for hipComplex APIs
...
- Changed from inline to static inline for hipComplex AMD APIs
- Added NVCC path for hipComplex APIs mapped to cuComplex APIs
Change-Id: I809cf3a11b5b1c8bbc7a57c5fbcc3dc6745ccb95
[ROCm/clr commit: 98c740c51f ]
2016-08-22 10:29:46 -05:00
Rahul Garg
f1f04949ac
Added support for hipCtxSynchronize and hipCtxGetFlags,modified hipDeviceSynchronize
...
Change-Id: If7bac667a262fa8c0cb3dc93e97f2534855acd07
[ROCm/clr commit: b0ba622ed5 ]
2016-08-22 16:15:27 +05:30
Rahul Garg
ab47c624ec
Added initial draft for performance optimizations, started with unpinned memory transfers
...
Change-Id: Icbce2aec347d015bc66cc0c08f6193057bf36b4c
[ROCm/clr commit: 778bb2e78a ]
2016-08-22 11:37:37 +05:30
Aditya Atluri
b2301b8912
Added more complex apis and copyright
...
- New header which redirects to CUDA/HIP path added for hipComplex.h
- Added more complex device api including fma
- Added copyright to new files
Change-Id: Iff0dece4c438e97d0ae33efa4312975d465a6464
[ROCm/clr commit: a362489b27 ]
2016-08-19 23:02:04 -05:00
Aditya Atluri
d5cc978648
Added support for complex device functions
...
- Added complex number arithmetic operation for float and double datatypes
- TODO: make them host functions and support half
- Added new function which is not in CUDA, hipCsqabs which is square of absolute value
Change-Id: Ib96e194ad45dc64fcba29eb19ad0376542e0591d
[ROCm/clr commit: 4d278259b9 ]
2016-08-19 21:48:23 -05:00
Aditya Atluri
9571f355ac
Added support for executable and symbols for data structures
...
- symbol handle is added to hipFunction
- executable handle is added to hipModule
- This way, the APIs doesn't need to track the values
Change-Id: I7cf05329cf79fe946319d7746bd9f5503268fda4
[ROCm/clr commit: 9eaabb507c ]
2016-08-19 08:49:34 -05:00
Maneesh Gupta
9fedb8dc6d
Make it easier to switch between HIP library formats
...
Change-Id: Id16406795a81f4bb64dbcb76b9b8763ffe59aac6
[ROCm/clr commit: 8f19a51521 ]
2016-08-19 13:07:22 +05:30
Maneesh Gupta
a6f964da02
CMakeLists: use macro for appending to config file
...
Change-Id: I71ca3769b58b146f95368c2a2f6615c1eb47e121
[ROCm/clr commit: 78576ab1a8 ]
2016-08-19 12:17:00 +05:30
Evgeny Mankov
45bd4c79d7
clang-hipify: After translating any symbol forcibly include the hip header file in case it wasn’t.
...
Fixes https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/34
[ROCm/clr commit: 00b9782187 ]
2016-08-18 20:59:51 +03:00
Aditya Atluri
a0740e524c
Added hipLaunchModuleKernel and new error codes
...
- hipLaunchModuleKernel maps to cuLaunchKernel
- Whole lot of new error codes added for the use of driver api
- KernelParams arguments is not yet supported
- hipLaunchModuleKernel is a synchronous api (will change eventually)
- All the commands in a stream will wait on host when hipLaunchModuleKernel is called on it
Change-Id: Ib4a4fae1db06fbb3a81d5a5575b026aa821264ed
[ROCm/clr commit: 792811fd52 ]
2016-08-18 11:26:55 -05:00
Maneesh Gupta
cf2ea9e48f
Enable hipDynamicShared directed test
...
Change-Id: I31e7e83ecb3e15fb25b63d6bb6fa9291484c9ef5
[ROCm/clr commit: 49d8faef54 ]
2016-08-18 14:15:50 +05:30
Maneesh Gupta
34464a647f
Fix version related bug in CMakeLists.txt
...
Change-Id: I31c567575185a4e85f5f24d3f105f7cb1beed425
[ROCm/clr commit: eb90dddbc4 ]
2016-08-18 13:56:25 +05:30