Ben Sander
639d152ff8
Refactor hipHostRegister test.
...
Run all tests in one command.
Run 128 offsets.
[ROCm/hip commit: 6437f5d2b2 ]
2017-05-05 17:29:04 -05:00
Ben Sander
2e2c773643
Update streamEventTest.
...
- add checks for events across devices.
- refactor test to make sure it runs long enough to be sensitive to sync
techniques.
- add tests for DeviceSync, streamWaitEvent.
[ROCm/hip commit: 2d0f509de5 ]
2017-05-05 17:29:04 -05:00
wsttiger
3bf30acbd7
Improve hipStreamWaitEvent test.
...
- use addOne kernel, use local initializer rather than init_array.
- use addOneReverse test to add from back of array. Alternate forward and backward passes to stress dependency logic.
- check device-side dependencies.
[ROCm/hip commit: 2a253680da ]
2017-05-05 17:29:04 -05:00
Evgeny Mankov
dfbc039ebb
[HIPIFY] LLVM 3.9 support
...
3.8 and 3.9 are both supported. 3.8 is stable; 3.9 needs more testing.
[ROCm/hip commit: 05be936fd6 ]
2017-05-05 21:28:02 +03:00
Rahul Garg
0557f54200
hipMemcpy2DAsync for HIP/NVCC
...
Change-Id: I46f0057fef49bdaaac41c1df80c3e27432b8f376
[ROCm/hip commit: 1cb51d614e ]
2017-05-04 06:47:55 +05:30
Evgeny Mankov
869dfcd01f
[HIPIFY] CUDA RT memcpy functions update.
...
cudaMemcpyFromSymbol -> hipMemcpyFromSymbol
cudaMemcpyFromSymbolAsync -> hipMemcpyFromSymbolAsync
cudaMemcpy2DAsync -> hipMemcpy2DAsync
[ROCm/hip commit: 9b65358c68 ]
2017-05-03 23:05:44 +03:00
Evgeny Mankov
4841eb2bff
[HIPIFY] Blas update: add a few functions, supported by HIP.
...
cublasDaxpy -> hipblasDaxpy
cublasDgemv -> hipblasDgemv
cublasDger -> hipblasDger
cublasDgemm -> hipblasDgemm
cublasDgemmBatched -> hipblasDgemmBatched
cublasGetStream -> hipblasGetStream
cublasSetStream -> hipblasSetStream
[ROCm/hip commit: 1269897606 ]
2017-05-03 22:45:56 +03:00
Evgeny Mankov
115709ba28
[HIPIFY] HIPIFY and HIP sync with CUDA Driver API data types.
...
+ Update CUDA_Driver_API_functions_supported_by_HIP.md.
+ Final update of HIPIFY with CUDA driver data types.
[TODO] Syncing HIPIFY and HIP by CUDA Driver API functions.
[ROCm/hip commit: 70c94d7b83 ]
2017-05-03 21:59:48 +03:00
Rahul Garg
340cd36e8d
Added support for hipMemcpy2DAsync in HIP/HCC
...
Change-Id: Ia4a8306f2dc1e33a81a7195ec29aef652fcccc4b
[ROCm/hip commit: ec27c695c4 ]
2017-05-03 22:29:12 +05:30
Evgeny Mankov
d439bbd746
[HIPIFY] Rename enumConstantDecl -> enumDecl
...
Reason: avoid confusion with clang's enumConstantDecl, which is used for enum DeclRefExpr (enum constant).
[ROCm/hip commit: c7958cbb8b ]
2017-04-28 21:59:33 +03:00
Evgeny Mankov
a83172c49b
* [HIPIFY] [FIX] Replacement error: enum cudaMemcpyKind kind -> hipMemcpyKindyKind kind
...
https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/72
[Solution] [Workaround]
Offset calculation for enum VarDecl as param decl, declared with enum type specifier.
[Result]
enum cudaMemcpyKind kind -> enum hipMemcpyKind kind
[ToDo]
Test on terminal qualifiers (const, etc).
[ROCm/hip commit: eddd021996 ]
2017-04-28 21:45:36 +03:00
Evgeny Mankov
4fe32b302f
[HIPIFY] [FIX] replacement error: cudaError_t -> hipError_t_t
...
https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/71
[Solution]
getUnqualifiedType is now applied to the enumConstantDecl's type, except for ordinary enum declarations (without typedef).
[ToDo]
Find a more appropriate way to distinguish redefined enum declarations from ordinary ones.
[ROCm/hip commit: 3d88932c8d ]
2017-04-28 21:03:03 +03:00
Aditya Atluri
eb68f5d5af
fixed hipFuncSetCacheConfig on rocm path
...
Change-Id: I937a3afbf115edc94a753a0beb2230ed60a6f021
[ROCm/hip commit: a5cb2d40ec ]
2017-04-28 11:53:11 -05:00
Aditya Atluri
a565e85c08
fixed fast math expf and exp10f
...
Change-Id: I73963220f902efebb0a7404c5f8966dffb4c35ca
[ROCm/hip commit: ab2eb420e2 ]
2017-04-26 19:01:10 -05:00
Aditya Atluri
a2c405c265
added hipFuncSetCacheConfig API for nvcc path
...
Change-Id: I87fae35bc0e10a0dca5ae1c5015fe5d9e52a1d0d
[ROCm/hip commit: fc6248ce82 ]
2017-04-26 18:56:57 -05:00
Maneesh Gupta
531c0f0030
Merge branch 'amd-master' into amd-develop
...
[ROCm/hip commit: ff454e4b8e ]
2017-04-26 23:55:13 +05:30
Sun, Peng
bbcd4b3761
fix hip_complex.h header on NV path
...
Change-Id: Ia95d003ca1b284bab1c76723050e6b3b89178f65
[ROCm/hip commit: 54561c8af3 ]
2017-04-25 00:15:20 -05:00
Ben Sander
0e14f7f58e
Add host offset checking
...
[ROCm/hip commit: cd93a2f229 ]
2017-04-24 22:30:54 -05:00
Ben Sander
e0bd60a22f
Refactor hipHostRegister test.
...
- Add more testing for offsets.
- Parse cmdline options and use --tests.
[ROCm/hip commit: b7b17c3841 ]
2017-04-24 22:30:54 -05:00
Ben Sander
244e79d54d
Refactor hipHostRegister to cover misaligned cases.
...
[ROCm/hip commit: 8ba993e36c ]
2017-04-24 22:30:54 -05:00
Ben Sander
6f754878eb
Tailor pointer info for src/dst before calling HCC copy routines.
...
HCC sometimes uses the srcPtrInfo or dstPtrInfo to determine the
pointer. Make sure these use the actual pointer and not the base
of the allocation.
[ROCm/hip commit: 5a52b79782 ]
2017-04-24 22:30:54 -05:00
Ben Sander
ed7b16786a
Add test for non-page-aligned mem copies.
...
[ROCm/hip commit: ab410add5c ]
2017-04-24 22:30:54 -05:00
Ben Sander
70ed508153
Add corrected test for offsets
...
[ROCm/hip commit: c5d89d9e7f ]
2017-04-24 22:30:54 -05:00
Ben Sander
9699bcc3da
Refactor hipMemcpy test to share mem alloc for multiple copies.
...
[ROCm/hip commit: 2bc0a6030e ]
2017-04-24 22:30:54 -05:00
Ben Sander
3ef01532b9
Add negative testing for memory full condition.
...
[ROCm/hip commit: dc001ef9b4 ]
2017-04-24 22:30:54 -05:00
Ben Sander
58fd2f29ec
Fix hip debug for case where copyAgent is null (host-to-host)
...
[ROCm/hip commit: 9e41e3c6e2 ]
2017-04-24 22:30:54 -05:00
Ben Sander
ba2f2f56b8
Fix hipMalloc to return error code if allocation fails.
...
[ROCm/hip commit: 59df70662a ]
2017-04-24 22:30:54 -05:00
Aditya Atluri
2b8fbf40a8
fixed build issues with hipPointerGetAttributes
...
Change-Id: I3f5fbc05bdaef720884ba949075928752a070377
[ROCm/hip commit: 59ab3659ee ]
2017-04-24 15:31:07 -05:00
Aditya Atluri
c5840d81ad
changed arguments for hipPointerGetAttributes
...
Change-Id: Ia7a7c4722c1f7d0a23f0e5cc3dd6dea6c01c1fd8
[ROCm/hip commit: 72bcfa438b ]
2017-04-24 15:24:16 -05:00
Maneesh Gupta
f884e55aca
Merge branch 'amd-develop' into amd-master
...
Change-Id: I312fb9d1181733ef5160d1e993e2ae57ced0f6b3
[ROCm/hip commit: 88fb807af0 ]
2017-04-24 08:51:03 +05:30
Maneesh Gupta
1b446eab71
Updated release notes
...
Change-Id: Ia98aff420ea9d488924dce8fe9168cec9da301ab
[ROCm/hip commit: 3a519ee9e5 ]
2017-04-24 08:48:35 +05:30
James Edwards
63cca4cd29
Specify full path of hip libraries in link file.
...
Change-Id: I49b788f3489e7abff6b11006ff97fdfca4e5942c
[ROCm/hip commit: 4869bf5a7c ]
2017-04-22 15:54:14 -05:00
James Edwards
236c084472
Properly link hip cmake file into top level lib directory.
...
Change-Id: I2113a86ca6985f34fd0cfb091abdbce0f632cfc2
[ROCm/hip commit: 36353a560f ]
2017-04-21 22:34:26 -05:00
Evgeny Mankov
22ccefce80
[HIPIFY] [DOC] Readme.md update: Ubuntu 16.04 support
...
[ROCm/hip commit: 080dd2e0d3 ]
2017-04-22 01:01:31 +03:00
Evgeny Mankov
d4c04914d2
[HIPIFY] Initial sync HIPIFY with HIP by CUDA Driver API data types.
...
+ Introduce CUDA_Driver_API_functions_supported_by_HIP.md.
+ Initial update of HIPIFY with CUDA driver data types.
+ Initial sync HIP types against CUDA Driver and Runtime API types.
+ Typo fixes.
[ROCm/hip commit: 3c1dd246d6 ]
2017-04-21 21:46:30 +03:00
Ben Sander
84f9f6dc5c
Fix compilation error with nvcc (c++ nullptr)
...
[ROCm/hip commit: 6d5ac3fbac ]
2017-04-21 09:01:34 -05:00
Sun, Peng
f762edf621
add support of hipLaunchKernelGGL on NV path
...
Change-Id: I0aeafd80c2181873be385d985f1d8ed86a98d136
[ROCm/hip commit: f4287a29b2 ]
2017-04-19 11:00:25 -05:00
Sun, Peng
2517984a65
fix broken header in NV path
...
Change-Id: Ia3aff2a89d9ba49547f51ce03a3304dfab58ba25
[ROCm/hip commit: fa15ee3ccd ]
2017-04-19 10:47:40 -05:00
James Edwards
a4f76aa6b0
Stop RPM HIP packages from specifying /opt
...
Change-Id: Iec3c3b81eef4c8888d425eefc80b12488a8d20a1
[ROCm/hip commit: 1b6d08ada0 ]
2017-04-19 12:35:42 +05:30
James Edwards
1963e91a8e
Stop RPM HIP packages from specifying /opt
...
Change-Id: Iec3c3b81eef4c8888d425eefc80b12488a8d20a1
[ROCm/hip commit: 7699aeb3d3 ]
2017-04-17 23:58:34 -05:00
Maneesh Gupta
e3768b3f14
FindHIP: Apply remove_item on non-empty lists only
...
Change-Id: Ib7fcb992d7e1bb679d4d86676fe3d980ba204815
[ROCm/hip commit: 486716a400 ]
2017-04-18 10:21:20 +05:30
Ben Sander
dc8636a178
Update bugs - Add CreateKernel, new signature for static kernels.
...
[ROCm/hip commit: 4f363df159 ]
2017-04-16 14:22:48 -05:00
Maneesh Gupta
2137285ffc
FindHIP: Handle remove_item from empty lists
...
Change-Id: I6adf31b32edeae9e8454b1a2528064cf3985fca1
[ROCm/hip commit: 0a9feb4f61 ]
2017-04-13 14:41:11 +05:30
Maneesh Gupta
bb976eb6ad
Merge branch 'amd-develop' into amd-master
...
Change-Id: I05572d2b32f1df70b54e2efeb32c8a4d8055912d
[ROCm/hip commit: 3a56e5c09b ]
2017-04-13 12:39:36 +05:30
Maneesh Gupta
8c52b6a748
Do not rebuild cmake cache by default
...
Change-Id: Ie21e99beaa3465b54b5a6a77439c455f34de98b3
[ROCm/hip commit: bfa08cd49a ]
2017-04-13 12:38:38 +05:30
Maneesh Gupta
3e1d2dde23
dtests should ignore HIP_PATH env var
...
Change-Id: I27b1cdab6e6b799987dad3ce97b56c764b1b8867
[ROCm/hip commit: 83097e9da4 ]
2017-04-13 12:12:04 +05:30
Maneesh Gupta
f7f4fa0d23
Add hip-config.cmake to hip_hcc package
...
Because of the way hip packages are generated, the hip-targets*.cmake
files are not generated at build time for packaging purposes, whereas
the hip-config*.cmake files are.
This will be fixed in the future.
Change-Id: I5d79bc58a4f7a324ae06457130d8372ffe403830
[ROCm/hip commit: 7bb378bd4d ]
2017-04-13 10:34:33 +05:30
Ben Sander
b716a7781e
Add integer abs (initial implementation, can be optimized with OCML)
...
Change-Id: I1f568c8c0e2333af1fda4c313dc48ea0c5b6ab00
[ROCm/hip commit: 22f3b91cad ]
2017-04-11 01:16:28 +00:00
Ben Sander
1175fe9b5a
Doc update for Serialization.
...
Describe workaround for partial specialization
[ROCm/hip commit: 310c130fc6 ]
2017-04-10 19:45:18 -05:00
Sun, Peng
da363cb59d
Fix ifndef guard in hip_fp16.h
...
Change-Id: I0215556e7aa98a74e8a984e4de3fb6e8cafdfb24
[ROCm/hip commit: 71447dacad ]
2017-04-10 11:17:05 -05:00