Evgeny Mankov
cbb5c63dd6
[HIPIFY] Sync more CUDA Driver API functions.
...
+ 4.14. Event Management
+ 4.15. Execution Control
ToDo: 4.16 - 4.31 modules of CUDA Driver API.
2017-06-22 21:53:32 +03:00
Aditya Atluri
fba69b1ce7
removed rm for /opt/rocm/hip/src in inline asm sample
...
Change-Id: I0c02bccd4cd35e01a8e889ea1e586ea8baf0ab90
2017-06-20 11:35:52 -05:00
Rahul Garg
0fd6b59e22
Added device side abort function in HIP/NVCC
...
Change-Id: I6ae35a72a8b9c34852619f02da1a046c8d3b2ed3
2017-06-20 09:38:56 +05:30
Rahul Garg
d9935cd089
Abort device function in HIP/HCC, need new HCC
...
Change-Id: I4195ab75e9b7b48c8b8128d6925ddc0fa5e9e009
2017-06-18 12:31:31 +05:30
Aditya Atluri
f139d5a52f
fixed float2int functions
...
Change-Id: I67be79149f06daacf0f0d131bdedabf294126248
2017-06-16 09:07:06 -05:00
Aditya Atluri
34e14bb02d
removed bad copy constructor
...
Change-Id: I661991d9d43941a61848b0b8e9879c0bfa811b40
2017-06-16 09:02:26 -05:00
Rahul Garg
3f25611692
Arguments validation in hipDeviceGetPCIBusId
...
Change-Id: I89770517c3ac94e4bf476344d27c18f03cfcde08
2017-06-15 00:21:47 +05:30
Evgeny Mankov
094f1b1f2f
[HIPIFY] Sync HIPIFY with HIP by CUDA Driver API functions.
...
+ 4.12. Unified Addressing
+ 4.13. Stream Management
ToDo: 4.14 - 4.31 modules of CUDA Driver API.
2017-06-14 19:55:55 +03:00
Sun, Peng
9bce2af76f
Additional GGL make_kernel_functor_* macros, contributed by Alex
...
Change-Id: I01aabb7d2b5418fcefb1bbf78eb5d1888dbc5c96
2017-06-14 11:10:52 -05:00
Rahul Garg
85708089d1
Validity check of input arguments in Ipc Mem APIs
...
Change-Id: Ia48e949d19f354f10c7e44cc2457fd4154bf6d76
2017-06-14 15:18:57 +05:30
Maneesh Gupta
4fa2090b78
Bump HIP base version to 1.2
...
Change-Id: I8ecc164afed4383f78579ed86a5c8c11a73b0780
2017-06-14 09:45:46 +05:30
Sandeep Kumar
01842faa1c
Add peer2peer bandwidth and latency test
...
Change-Id: I6d88e4aa9f6e64096af16579eebef4740734203e
2017-06-14 09:44:56 +05:30
Evgeny Mankov
82b37fe481
[HIPIFY] Initial sync HIPIFY with HIP by CUDA Driver API functions.
...
+ CUDA_Driver_API_functions_supported_by_HIP.md update.
+ Initial update of HIPIFY with CUDA driver API functions:
1.Error Handling, 2.Initialization, 3.Version Management, 5-6.Device Management, 7.Primary Context, 8-9.Context, 10.Module Management, 11.Memory Management.
+ Sync HIP functions against CUDA Driver and Runtime API functions.
+ Typo fixes.
ToDo: 12-30 modules of CUDA Driver API.
2017-06-13 20:25:11 +03:00
Rahul Garg
0efd737767
Input args NULL check in hipChooseDevice
...
Change-Id: I1a7b8cded2f81d739645bbf3dab2f04bb9c3c796
2017-06-13 13:35:50 +05:30
Rahul Garg
ce6e455677
Update P2P test for using memcpy and kernel tests
...
Change-Id: Ib0f8fc9425e6e85fd11d7d02395c52bc713dcb37
2017-06-12 11:53:25 +05:30
Maneesh Gupta
6174e69f87
Updated RELEASE.md
...
Change-Id: Ic451612555c66f3ed7131514fc97fcc41091370a
2017-06-12 11:20:28 +05:30
Maneesh Gupta
15a3464630
Update directed tests README.md
...
Change-Id: I395245454d376508f04e5a4a62c8933895cb3867
2017-06-12 11:19:55 +05:30
Patrick Flick
821c238bad
fix typo
2017-06-12 10:15:27 +05:30
Maneesh Gupta
d2b90ad93c
Merge branch hipify-updates into amd-develop
...
Change-Id: I13d8750027a2a8787e4eb2e1ed525cf69d14b805
2017-06-12 10:10:19 +05:30
Maneesh Gupta
c5366a55f1
Initial implementation of hipify-cmakefile
...
Change-Id: Id365da9f887b5c3409639f000b430d093fd4f6b3
2017-06-12 09:57:17 +05:30
Sun, Peng
682dda4418
Fix error related to undefined reference of __get_dynamicgroupbaseptr().
...
Change-Id: I14951e1725e35dd5f5e53805f81cdb58661f59f2
2017-06-08 19:24:32 -05:00
Sun, Peng
5450021f93
Add clang version guard so the hip_fp16.h header won't be picked up by gcc
...
Change-Id: Ia21335a455bc93210901b44bc8c76a7f4a385b55
2017-06-08 19:24:32 -05:00
Ben Sander
9bfc7b0e13
Use amHostCoherentFlag. Requires new HCC version.
2017-06-07 09:06:40 -05:00
Maneesh Gupta
ff4fae7d20
hip_hcc package: add libstdc++-static as a rpm dependency
...
Change-Id: I83a79353492a6be3d788b7c0ce4a8f3aa740d9d9
2017-06-07 15:50:28 +05:30
Maneesh Gupta
ff8ade59aa
hipMemcpy-size test: reduce max size to make it work correctly on nvcc path
...
Change-Id: I9ce9f5a9e141ffd8ddf961269010b33358e02771
2017-06-07 15:25:54 +05:30
Maneesh Gupta
a50f5ca0ac
hipDeviceMemcpy test: make it functional on nvcc path
...
Change-Id: Id10c79b48747ed701adbd0a233c53cd60cfa743b
2017-06-07 15:24:44 +05:30
Maneesh Gupta
a7dc938ec0
p2p_copy_coherency test: gracefully handle single gpu case
...
Change-Id: I216663f67ef58c673136332635dab8b57079b909
2017-06-07 15:23:37 +05:30
Ben Sander
c2baa4f6e6
Enable HCC_OPT_FLUSH=1.
...
Requires appropriate HCC with this support :
commit 38e392b517a46a09a3b1c8f388e6a0db3741c510
2017-06-07 00:15:05 -05:00
Sun, Peng
3b6a863eef
Improve HIP kernel names, attributes and codegen, contributed by Alex Voicu
...
Change-Id: I2cafbdc5a98e26c7f4fad84739c915e7dc09993c
2017-06-05 11:39:00 -05:00
Ben Sander
344b6cb0c0
Enable HIP_SYNC_NULL_STREAM=0 optimization.
2017-06-05 08:50:41 -05:00
Ben Sander
823281dcba
Fix HIP_SYNC_NULL_STREAM=0 mode.
...
- Fix null-stream sync
- hipStreamDestroy of null stream returns hipErrorInvalidResourceHandle
- Update documentation.
- Add tests for null stream sync, hipEventElapsedTime.
- Rename internal enum hipEventStatusRecorded to hipEventStatusComplete
- refactor hipStreamWaitEvent to streamline control-flow
2017-06-05 08:50:22 -05:00
Ben Sander
863b7c3f56
Update tests.
...
Fix some NVCC issues.
Add hipStreamSync2, record_event tests.
2017-06-04 20:18:37 -05:00
Ben Sander
15f54fb943
Update tests, add p2p coherency test.
2017-06-03 17:11:34 -05:00
Aditya Atluri
fdcc223842
added half data type and vector destructors
...
1. Added half data types to hip_fp16.h
2. Added destructor to vector data types
Change-Id: Id5ae76a663bb90a4bde2839ec79c58fbaee5072f
2017-06-02 11:19:33 -05:00
emankov
c5f9758f4b
[HIPIFY] annotation
2017-06-02 16:33:48 +03:00
emankov
e7779650e9
[HIPIFY] rename legacy hipify perl script and its usage to hipify-perl
2017-06-02 16:30:43 +03:00
Evgeny Mankov
ee85243bcd
[HIPIFY] All CUDA 8.0.44 API functions update
...
(for both Driver and Runtime APIs)
1) P2P
cuDeviceGetP2PAttribute cudaDeviceGetP2PAttribute
2) Memory Mngmnt
cuMemPrefetchAsync cudaMemPrefetchAsync
cuMemAdvise cudaMemAdvise
cuMemRangeGetAttribute cudaMemRangeGetAttribute
cuMemRangeGetAttributes cudaMemRangeGetAttributes
3) Streams (Driver API only, no analogues in Runtime API)
cuStreamWaitValue32
cuStreamWaitValue32
cuStreamWriteValue32
4) Texture Reference Mngmnt (Driver API only, no analogues in Runtime API)
cuTexRefSetBorderColor
cuTexRefGetBorderColor
2017-06-01 21:08:33 +03:00
Siu Chi Chan
969931b1ce
fix atomicCAS:remove load for the return value after CAS
2017-05-31 15:20:19 -04:00
Evgeny Mankov
463c026976
[HIP] [HIPIFY] CUDA Driver API 8.0.44 JIT options support.
2017-05-31 18:55:29 +03:00
Maneesh Gupta
06ee0d3704
Fix hipMemoryAllocate test for single GPU
...
Change-Id: If121c18ab490ba125dc689ffc08a8839fd280c38
2017-05-31 10:16:57 +05:30
Maneesh Gupta
2145e94049
Disable rcbrtf, scalblnf, scalbnf in single precision device test
...
Change-Id: I8a250a64a0cb05132d022a11d9766ced9cdf11a7
2017-05-31 10:16:19 +05:30
Maneesh Gupta
da19087ae2
Disable rcbrt, scalbln and scalbn double precision device test
...
Change-Id: I46bd895701c46d3592b553090cafba99e41a2e2d
2017-05-31 10:15:41 +05:30
Sandeep Kumar
f6b98854ba
Add readme for inline asm and unroll cookbook samples
...
Change-Id: I71b7a5652c3dad181c5df60ab0dd1b81d79f1bfb
2017-05-31 09:25:50 +05:30
Sandeep Kumar
c964a5f208
Add inline asm hip directed tests for v_add and v_mac
...
Change-Id: Ie5ace2e42d5da89b16e040537df2bb13d3883c6d
2017-05-31 09:25:40 +05:30
Sandeep Kumar
5696eaf842
Add unroll and inline asm cookbook samples
...
Change-Id: Ie5a0fbb01b7fca82959090d89299533d49e092f1
2017-05-31 09:25:35 +05:30
Sandeep Kumar
e104c2e3bf
Print msg for single gpu
...
Change-Id: I2d23c73542add8973990ba96592016726994422e
2017-05-31 09:25:17 +05:30
Ben Sander
6cc5dc0326
Set event->_stream on hipHccModuleLaunchKernel path if start/stop used
...
Ensure _stream is always non-null in recorded events.
Fixes isDefaultStream fault.
2017-05-30 21:55:46 -05:00
Evgeny Mankov
997ed19bb8
[HIPIFY] Add the rest CUDA Runtime API 8.0.44 Data structures.
...
+ sync with corresponding CUDA Driver API Data structures.
P.S.
There is no any new changes in CUDA Runtime API 8.0.61 Data structures since 8.0.44.
2017-05-30 19:45:59 +03:00
Evgeny Mankov
a020eb76dd
[HIPIFY] Add the rest CUDA Driver API 8.0.44 Data structures.
...
+ Memory advise values
+ Memory Range Attributes
+ P2P Attributes
P.S.
There is no any new changes in CUDA Driver API 8.0.61 Data structures since 8.0.44.
2017-05-30 18:29:14 +03:00
Evgeny Mankov
ef86f943ac
[HIPIFY] Add more CUDA Driver API 8.0.44 Data structures.
2017-05-30 17:58:13 +03:00