Ben Sander
bd7a374f20
Make hipMultiThreadStreams1 test a little harsher.
...
Fail faster if synchronization rules are violated.
Run vectorAddRevers to read last elements of array first - if the
vector add kernel starts before preceding copy finishes we
will read stale data and flag the error.
Increase default array sizes, so synchronization errors more easily
exposed.
[ROCm/hip commit: 2e1fec47ab ]
2017-05-16 19:04:25 -05:00
Ben Sander
f3dcff0906
Fix HIP_TRACE_API so kernel launch only printed when requested.
...
[ROCm/hip commit: 427f8472aa ]
2017-05-16 19:04:25 -05:00
Ben Sander
a55ce5bee4
Add initial HIP_SYNC_NULL_STREAM=0 mode.
...
This eliminates host-synchronization for null stream. Instead, the
null-stream uses GPU-side events to wait for other streams.
Default is OFF pending additional testing.
Add enhanced null-stream test.
Also refine HIP_TRACE_API.
[ROCm/hip commit: 27877f8854 ]
2017-05-16 19:04:25 -05:00
Ben Sander
cfe81dfbf4
Update tests README
...
[ROCm/hip commit: c67b828a5a ]
2017-05-16 19:04:25 -05:00
Ben Sander
3107e70ea2
Doc update - split hip_debugging.md into separate file.
...
[ROCm/hip commit: 704ba30b32 ]
2017-05-16 19:04:25 -05:00
Ben Sander
7d07d804d8
split debugging into separate .md file
...
[ROCm/hip commit: db097ab392 ]
2017-05-16 19:04:25 -05:00
Ben Sander
2cf05ad54b
Add HIP_TRACE_API=4. Only display memory allocation/free apis.
...
[ROCm/hip commit: 0edab14139 ]
2017-05-16 19:04:25 -05:00
emankov
dce8786e26
[HIPIFY] *.inl extension support for batch processing
...
[ROCm/hip commit: 30000ef130 ]
2017-05-16 19:52:39 +03:00
Evgeny Mankov
1a183a71a5
[HIPIFY] cudaMallocPitch -> hipMallocPitch
...
[ROCm/hip commit: 12d8c53c90 ]
2017-05-16 18:21:25 +03:00
Rahul Garg
c6a2d65d5e
Added hipMallocPitch on HIP/NVCC path
...
Change-Id: Ie3ba7d3f95acac23805efa919531043b350a3f21
[ROCm/hip commit: 1223612331 ]
2017-05-16 07:15:13 +05:30
Evgeny Mankov
6b76c979ce
[HIPIFY] CUDA Driver API: Primary Context Management support.
...
[ROCm/hip commit: a97cb6810c ]
2017-05-15 15:35:52 +03:00
Aditya Atluri
62e5cda4af
added gfx900 to hipDeviceProp_t
...
Change-Id: I49e7a32f218926fd55f1c94c5dc2366d6c8ac4ca
[ROCm/hip commit: a43149135e ]
2017-05-12 21:43:34 -05:00
Ben Sander
be423aa034
Remove old USE_ switches no longer needed.
...
[ROCm/hip commit: 0679831384 ]
2017-05-12 16:06:03 -05:00
Ben Sander
b1b71c4c75
Add hipEventDisableSystemRelease flag.
...
[ROCm/hip commit: b306095ac2 ]
2017-05-12 16:06:03 -05:00
Evgeny Mankov
3408458f9f
[HIPIFY] Fix string routines.
...
Some Clang tooling functions return std::string, some return StringRef. Assigning of returning std::string to StringRef variables leads to garbage in it.
DEBUG build is always affected.
[ROCm/hip commit: 25d470c380 ]
2017-05-11 21:50:36 +03:00
Ben Sander
ff72a701b7
hipHostMalloc allocation are mapped to all devices by default.
...
Support hipHostMallocPortable flag.
Default flags are hipHostMallocPortable | hipHostMallocMapped.
Also:
-refactor tests to move addCount and addCountReverse into HipTest
namespace.
-test multi-GPU host memory.
[ROCm/hip commit: c3ccaa01e5 ]
2017-05-10 17:34:36 -05:00
Ben Sander
5d558861c2
Fix hipStreamWaitEvent for single GPU.
...
[ROCm/hip commit: e0c3ea15b2 ]
2017-05-10 13:23:49 -05:00
Aditya Atluri
bdce32238f
added guard against hip_runtime.h so that non-hcc compilers can use it
...
Change-Id: I3d68deda9ce8a5956e21e15a69e549d6c21e3e39
[ROCm/hip commit: a38e36ec2f ]
2017-05-09 10:14:16 -05:00
Ben Sander
ef98415d0e
Fix some typos, add additional guidance for -BSymbolic
...
[ROCm/hip commit: 687809104b ]
2017-05-05 17:29:04 -05:00
Ben Sander
639d152ff8
Refactor hipHostRegister test.
...
Run all tests in one command.
Run 128 offsets.
[ROCm/hip commit: 6437f5d2b2 ]
2017-05-05 17:29:04 -05:00
Ben Sander
2e2c773643
Update streamEventTEst.
...
- add checks for events across devices.
- refactor test to make sure it runs long enough to sensitive sync
techniques.
- add tests for DeviceSync, streamWaitEvent.
[ROCm/hip commit: 2d0f509de5 ]
2017-05-05 17:29:04 -05:00
wsttiger
3bf30acbd7
Improve hipStreamWaitEvent test.
...
- use addOne kernel, use local initializer rather than init_array.
- use addOneReverse test to add from back of array. Test alternate fwd and backward to stress dependency logic.
- check device-side dependencies.
[ROCm/hip commit: 2a253680da ]
2017-05-05 17:29:04 -05:00
Evgeny Mankov
dfbc039ebb
[HIPIFY] LLVM 3.9 support
...
3.8 and 3.9 are both supported. 3.8 is stable, 3.9 needs more testing.
[ROCm/hip commit: 05be936fd6 ]
2017-05-05 21:28:02 +03:00
Rahul Garg
0557f54200
hipMemcpy2DAsync for HIP/NVCC
...
Change-Id: I46f0057fef49bdaaac41c1df80c3e27432b8f376
[ROCm/hip commit: 1cb51d614e ]
2017-05-04 06:47:55 +05:30
Evgeny Mankov
869dfcd01f
[HIPIFY] CUDA RT memcpy functions update.
...
cudaMemcpyFromSymbol -> hipMemcpyFromSymbol
cudaMemcpyFromSymbolAsync -> hipMemcpyFromSymbolAsync
cudaMemcpy2DAsync -> hipMemcpy2DAsync
[ROCm/hip commit: 9b65358c68 ]
2017-05-03 23:05:44 +03:00
Evgeny Mankov
4841eb2bff
[HIPIFY] Blas update: add a few functions, supported by HIP.
...
cublasDaxpy -> hipblasDaxpy
cublasDgemv -> hipblasDgemv
cublasDger -> hipblasDger
cublasDgemm -> hipblasDgemm
cublasDgemmBatched -> hipblasDgemmBatched
cublasGetStream -> hipblasGetStream
cublasSetStream -> hipblasSetStream
cublasDaxpy -> hipblasDaxpy
[ROCm/hip commit: 1269897606 ]
2017-05-03 22:45:56 +03:00
Evgeny Mankov
115709ba28
[HIPIFY] HIPIFY and HIP sync with CUDA Driver API data types.
...
+ Update CUDA_Driver_API_functions_supported_by_HIP.md.
+ Final update of HIPIFY with CUDA driver data types.
[TODO] Syncing HIPIFY and HIP by CUDA Driver API functions.
[ROCm/hip commit: 70c94d7b83 ]
2017-05-03 21:59:48 +03:00
Rahul Garg
340cd36e8d
Added support for hipMemcpy2DAsync in HIP/HCC
...
Change-Id: Ia4a8306f2dc1e33a81a7195ec29aef652fcccc4b
[ROCm/hip commit: ec27c695c4 ]
2017-05-03 22:29:12 +05:30
Evgeny Mankov
d439bbd746
[HIPIFY] Rename enumConstantDecl -> enumDecl
...
Reason: not to mix up with clang's enumConstantDecl, used for enum DeclRefExpr (enum constant).
[ROCm/hip commit: c7958cbb8b ]
2017-04-28 21:59:33 +03:00
Evgeny Mankov
a83172c49b
* [HIPIFY] [FIX] Replacement error: enum cudaMemcpyKind kind -> hipMemcpyKindyKind kind
...
https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/72
[Solution] [Workaround]
Offset calculation for enum VarDecl as param decl, declared with enum type specifier.
[Result]
enum cudaMemcpyKind kind -> enum hipMemcpyKind kind
[ToDo]
Test on terminal qualifiers (const, etc).
[ROCm/hip commit: eddd021996 ]
2017-04-28 21:45:36 +03:00
Evgeny Mankov
4fe32b302f
[HIPIFY] [FIX] replacement error: cudaError_t -> hipError_t_t
...
https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/71
[Solution]
getUnqualifiedType for enumConstantDecl's type is added, except ordinary enum declarations (w/o typedef).
[ToDo]
Find more appropriate way of distinguishing redefined enum declarations and ordinary ones.
[ROCm/hip commit: 3d88932c8d ]
2017-04-28 21:03:03 +03:00
Aditya Atluri
eb68f5d5af
fixed hipFuncSetCacheConfig on rocm path
...
Change-Id: I937a3afbf115edc94a753a0beb2230ed60a6f021
[ROCm/hip commit: a5cb2d40ec ]
2017-04-28 11:53:11 -05:00
Aditya Atluri
a565e85c08
fixed fast math expf and exp10f
...
Change-Id: I73963220f902efebb0a7404c5f8966dffb4c35ca
[ROCm/hip commit: ab2eb420e2 ]
2017-04-26 19:01:10 -05:00
Aditya Atluri
a2c405c265
added hipFuncSetCacheConfig API for nvcc path
...
Change-Id: I87fae35bc0e10a0dca5ae1c5015fe5d9e52a1d0d
[ROCm/hip commit: fc6248ce82 ]
2017-04-26 18:56:57 -05:00
Maneesh Gupta
531c0f0030
Merge branch 'amd-master' into amd-develop
...
[ROCm/hip commit: ff454e4b8e ]
2017-04-26 23:55:13 +05:30
Sun, Peng
bbcd4b3761
fix hip_complex.h header on NV path
...
Change-Id: Ia95d003ca1b284bab1c76723050e6b3b89178f65
[ROCm/hip commit: 54561c8af3 ]
2017-04-25 00:15:20 -05:00
Ben Sander
0e14f7f58e
Add host offset checking
...
[ROCm/hip commit: cd93a2f229 ]
2017-04-24 22:30:54 -05:00
Ben Sander
e0bd60a22f
Refactor hipHostRegister test.
...
- Add more testing for offsets.
- Parse cmdline options and use --tests.
[ROCm/hip commit: b7b17c3841 ]
2017-04-24 22:30:54 -05:00
Ben Sander
244e79d54d
Refactor hipHostRegister to cover misaligned cases.
...
[ROCm/hip commit: 8ba993e36c ]
2017-04-24 22:30:54 -05:00
Ben Sander
6f754878eb
Tailor pointer info for src/dst before calling HCC copy routines.
...
HCC sometimes uses the srcPtrInfo or dstPtrInfo to determine the
pointer. Make sure these use the actual pointer and not the base
of the allocation.
[ROCm/hip commit: 5a52b79782 ]
2017-04-24 22:30:54 -05:00
Ben Sander
ed7b16786a
Add test for non-page-aligned mem copies.
...
[ROCm/hip commit: ab410add5c ]
2017-04-24 22:30:54 -05:00
Ben Sander
70ed508153
Add corrected test for offsets
...
[ROCm/hip commit: c5d89d9e7f ]
2017-04-24 22:30:54 -05:00
Ben Sander
9699bcc3da
Refactor hipMemcpy test to share mem alloc for multiple copies.
...
[ROCm/hip commit: 2bc0a6030e ]
2017-04-24 22:30:54 -05:00
Ben Sander
3ef01532b9
Add negative testing for memory full condition.
...
[ROCm/hip commit: dc001ef9b4 ]
2017-04-24 22:30:54 -05:00
Ben Sander
58fd2f29ec
Fix hip debug for case where copyAgent is null (host-to-host)
...
[ROCm/hip commit: 9e41e3c6e2 ]
2017-04-24 22:30:54 -05:00
Ben Sander
ba2f2f56b8
Fix hipMalloc to return error code if allocation fails.
...
[ROCm/hip commit: 59df70662a ]
2017-04-24 22:30:54 -05:00
Aditya Atluri
2b8fbf40a8
fixed build issues with hipPointerGetAttributes
...
Change-Id: I3f5fbc05bdaef720884ba949075928752a070377
[ROCm/hip commit: 59ab3659ee ]
2017-04-24 15:31:07 -05:00
Aditya Atluri
c5840d81ad
changed arguments for hipPointerGetAttributes
...
Change-Id: Ia7a7c4722c1f7d0a23f0e5cc3dd6dea6c01c1fd8
[ROCm/hip commit: 72bcfa438b ]
2017-04-24 15:24:16 -05:00
Maneesh Gupta
f884e55aca
Merge branch 'amd-develop' into amd-master
...
Change-Id: I312fb9d1181733ef5160d1e993e2ae57ced0f6b3
[ROCm/hip commit: 88fb807af0 ]
2017-04-24 08:51:03 +05:30
Maneesh Gupta
1b446eab71
Updated release notes
...
Change-Id: Ia98aff420ea9d488924dce8fe9168cec9da301ab
[ROCm/hip commit: 3a519ee9e5 ]
2017-04-24 08:48:35 +05:30