lthakur
|
a05ac35ab1
|
HIP test case for 1D texture fetch (#424)
|
2018-05-29 14:08:01 +05:30 |
|
Maneesh Gupta
|
08779ae5d5
|
Merge pull request #473 from gargrahul/add_tex1d_nvcc
Add 1d texture types for NVCC path
|
2018-05-29 13:54:34 +05:30 |
|
Rahul Garg
|
024f77ce61
|
Add 1d texture types for NVCC path
|
2018-05-28 15:02:06 +05:30 |
|
Maneesh Gupta
|
323a6226b0
|
Merge pull request #464 from gargrahul/fix_memcpy2d_pinned_mem_case
Fixed memcpy2D for pinned memory case using 2D kernel
|
2018-05-22 10:42:28 +05:30 |
|
Maneesh Gupta
|
df3bb9fc32
|
Merge pull request #445 from ROCm-Developer-Tools/feature_func_attributes
Add support for the hipFuncGetAttributes interface.
|
2018-05-22 09:37:41 +05:30 |
|
Maneesh Gupta
|
22af1f6875
|
Merge pull request #462 from ROCm-Developer-Tools/mangupta-hipMemcpy-fix
hipMemcpy returns success if sizeBytes is 0.
|
2018-05-22 06:34:22 +05:30 |
|
Maneesh Gupta
|
2cb59db654
|
Merge pull request #418 from pfultz2/host-device-targets
Add host and device targets
|
2018-05-22 06:33:56 +05:30 |
|
Rahul Garg
|
f47a8236d7
|
Fixed memcpy2D for pinned memory case using 2D kernel
|
2018-05-21 22:14:45 +05:30 |
|
Evgeny Mankov
|
ba9d1ec2fd
|
Merge pull request #463 from emankov/cuDNN
[HIPIFY][DNN] support of cuDNN 7.1.3 - finishing
|
2018-05-21 18:35:16 +03:00 |
|
Evgeny Mankov
|
a5df3a484c
|
[HIPIFY][DNN] support of cuDNN 7.1.3 - finishing
|
2018-05-21 18:31:20 +03:00 |
|
Maneesh Gupta
|
0180a82963
|
hipMemcpy returns success if sizeBytes is 0.
Fixes SWDEV-153754 & SWDEV-154178.
|
2018-05-21 15:38:44 +05:30 |
|
Maneesh Gupta
|
8ea218cd1b
|
Merge pull request #460 from mangupta/fix_nvcc_tests
Disable incomplete unit tests that don't work on nvcc path
|
2018-05-21 12:05:12 +05:30 |
|
Maneesh Gupta
|
333cc86f49
|
Merge pull request #459 from mangupta/add_malloc3d_nvcc
Add hipMalloc3D to nvcc detail
|
2018-05-21 12:04:52 +05:30 |
|
Maneesh Gupta
|
305592d622
|
Disable incomplete unit tests that don't work on nvcc path
Change-Id: If5823ec96a3b2497a08c46ab802c5a0158271053
|
2018-05-21 11:35:03 +05:30 |
|
Maneesh Gupta
|
661561eead
|
Add hipMalloc3D to nvcc detail
Change-Id: I8a5654066ed1504e3b05eddbbdebf05fd52aa149
|
2018-05-21 11:33:09 +05:30 |
|
Maneesh Gupta
|
883241495d
|
Merge pull request #456 from founta/master
defined hipPitchedPtr when using hipcc with nvcc
|
2018-05-21 10:53:39 +05:30 |
|
Maneesh Gupta
|
cac3f1c7cd
|
Merge pull request #455 from ROCm-Developer-Tools/magic
Change HIP fat binary magic number
|
2018-05-21 09:52:03 +05:30 |
|
Maneesh Gupta
|
343083b807
|
Merge pull request #454 from ROCm-Developer-Tools/hip-clang-hipcc
Let hipcc support hip-clang
|
2018-05-21 09:51:42 +05:30 |
|
Siu Chi Chan
|
76fc1cce43
|
Merge pull request #458 from gargrahul/fix_memcpy2dasync_pinned
Fix for memcpy2DAsync for pinned host memory case
|
2018-05-18 15:58:55 -04:00 |
|
Alex Voicu
|
cd6c979c27
|
Update hip_module.cpp
Typo.
|
2018-05-18 17:50:45 +01:00 |
|
Rahul Garg
|
afe62e7030
|
Fix for memcpy2DAsync for pinned host memory case
|
2018-05-18 21:09:50 +05:30 |
|
founta
|
1a108ef7f3
|
defined hipPitchedPtr
Added a define for hipPitchedPtr to resolve a compiler error
|
2018-05-18 09:11:50 -04:00 |
|
Maneesh Gupta
|
03ac8e6a92
|
Merge pull request #433 from gargrahul/add_hipmemset3d
Added hipMemset3D
|
2018-05-18 14:54:15 +05:30 |
|
Maneesh Gupta
|
dcb4477f9f
|
Merge pull request #440 from yxsamliu/assert2
Add __assert_fail, __device_trap and hipErrorAssert for clang
|
2018-05-18 14:13:27 +05:30 |
|
Maneesh Gupta
|
ac7713fa34
|
Merge pull request #448 from 949f45ac/master
Provide correct __mul64hi and __umul64hi builtins, using code from ROCm-Device-Libs
|
2018-05-18 13:18:16 +05:30 |
|
Maneesh Gupta
|
67d45164fa
|
Merge pull request #444 from aaronenyeshi/vg20-initial
initial gfx906 support
|
2018-05-18 13:18:07 +05:30 |
|
Maneesh Gupta
|
223bf5094c
|
Merge pull request #437 from scchan/mbcnt_intrinsics
add intrinsics mbcnt_lo, mbcnt_hi, lane_id
|
2018-05-18 13:17:57 +05:30 |
|
Yaxun (Sam) Liu
|
d079463887
|
Change HIP fat binary magic number
|
2018-05-17 17:04:51 -04:00 |
|
Yaxun (Sam) Liu
|
f4d79a1615
|
Let hipcc support hip-clang
|
2018-05-17 14:40:15 -04:00 |
|
949f45ac
|
8303bfdffd
|
Reinstate accidentally deleted uchar2Holder
|
2018-05-17 10:55:45 +02:00 |
|
Maneesh Gupta
|
0e44ca7d0f
|
Merge pull request #451 from gargrahul/fix_memcpy2d_for_1d_case
Fixed hipMemcpy2D to handle 1D memcpy case
|
2018-05-17 07:42:47 +05:30 |
|
Maneesh Gupta
|
e197752c8b
|
Merge pull request #452 from gargrahul/fix_hipCommander_makefile
Fix hipCommander Makefile
|
2018-05-17 07:25:27 +05:30 |
|
Rahul Garg
|
dc4d305c25
|
Fix hipCommander Makefile
|
2018-05-16 15:01:32 +05:30 |
|
Rahul Garg
|
8f010ac68e
|
Fixed hipMemcpy2D to handle 1D memcpy case
|
2018-05-16 11:07:10 +05:30 |
|
Alex Voicu
|
5325b6535e
|
Update hip_module.cpp
|
2018-05-14 17:15:36 +01:00 |
|
Evgeny Mankov
|
a69b4c3a06
|
Merge pull request #449 from emankov/cuDNN
[HIPIFY][DNN] support of cuDNN 7.1.3 - continuation 2
|
2018-05-14 16:30:34 +03:00 |
|
Evgeny Mankov
|
b0fd0c310d
|
[HIPIFY][DNN] support of cuDNN 7.1.3 - continuation 2
- not finished yet.
- based on https://github.com/ROCmSoftwarePlatform/hipDNN.
- testing on https://github.com/baidu-research/DeepBench - almost pass, except cusparse (not supported yet).
- started testing of examples from libcudnn7-dev_7.1.3.16-1+cuda8.0_amd64 package.
|
2018-05-14 16:23:59 +03:00 |
|
949f45ac
|
79480d7cbd
|
Provide correct __mul64hi and __umul64hi builtins, using code from ROCm-Device-Libs
|
2018-05-14 08:34:56 +02:00 |
|
Aaron Enye Shi
|
848a24b524
|
Fix hipMathFunction for gfx906
|
2018-05-11 10:53:07 -04:00 |
|
Alex Voicu
|
1ba8a35dba
|
Don't use magic constants, they're evil.
Also clarify that the register count cannot be queried at the moment.
|
2018-05-11 11:31:46 +01:00 |
|
Alex Voicu
|
13274ce559
|
Add support for the hipFuncGetAttributes interface.
|
2018-05-11 03:35:10 +01:00 |
|
Siu Chi Chan
|
b898049412
|
initial gfx906 support
|
2018-05-10 19:28:00 +00:00 |
|
Evgeny Mankov
|
ace018501d
|
Merge pull request #443 from emankov/cuDNN
[HIPIFY][DNN] support of cuDNN 7.1.3 - continuation
|
2018-05-10 19:42:05 +03:00 |
|
Evgeny Mankov
|
dffe1802be
|
[HIPIFY][DNN] support of cuDNN 7.1.3 - continuation
- not finished yet.
- based on https://github.com/ROCmSoftwarePlatform/hipDNN.
- testing on https://github.com/baidu-research/DeepBench.
|
2018-05-10 17:36:51 +03:00 |
|
Yaxun (Sam) Liu
|
19f3ed6f62
|
Fix warning about inlined function is not defined
|
2018-05-08 16:38:50 -04:00 |
|
Yaxun (Sam) Liu
|
7672b44c79
|
Add __assert_fail, __device_trap and hipErrorAssert for clang
|
2018-05-08 15:42:27 -04:00 |
|
Evgeny Mankov
|
fea366cc89
|
Merge pull request #438 from emankov/hipBLAS
[HIPIFY][Blas] Sync with CUDA 9.1
|
2018-05-08 20:50:39 +03:00 |
|
Siu Chi Chan
|
b285145966
|
add intrinsics mbcnt_lo, mbcnt_hi, lane_id
|
2018-05-08 13:43:53 -04:00 |
|
Evgeny Mankov
|
e5ba9668fc
|
[HIPIFY][Blas] Sync with CUDA 9.1
|
2018-05-08 20:42:30 +03:00 |
|
Evgeny Mankov
|
6559e57b24
|
Merge pull request #435 from emankov/hipBLAS
[HIPIFY] Sync with hipBLAS
|
2018-05-08 19:22:32 +03:00 |
|