Commit graph

2491 commits

Author SHA1 Message Date
lthakur a05ac35ab1 HIP test case for 1D texture fetch (#424) 2018-05-29 14:08:01 +05:30
Maneesh Gupta 08779ae5d5 Merge pull request #473 from gargrahul/add_tex1d_nvcc
Add 1d texture types for NVCC path
2018-05-29 13:54:34 +05:30
Rahul Garg 024f77ce61 Add 1d texture types for NVCC path 2018-05-28 15:02:06 +05:30
Maneesh Gupta 323a6226b0 Merge pull request #464 from gargrahul/fix_memcpy2d_pinned_mem_case
Fixed memcpy2D for pinned memory case using 2D kernel
2018-05-22 10:42:28 +05:30
Maneesh Gupta df3bb9fc32 Merge pull request #445 from ROCm-Developer-Tools/feature_func_attributes
Add support for the hipFuncGetAttributes interface.
2018-05-22 09:37:41 +05:30
Maneesh Gupta 22af1f6875 Merge pull request #462 from ROCm-Developer-Tools/mangupta-hipMemcpy-fix
hipMemcpy returns success if sizeBytes is 0.
2018-05-22 06:34:22 +05:30
Maneesh Gupta 2cb59db654 Merge pull request #418 from pfultz2/host-device-targets
Add host and device targets
2018-05-22 06:33:56 +05:30
Rahul Garg f47a8236d7 Fixed memcpy2D for pinned memory case using 2D kernel 2018-05-21 22:14:45 +05:30
Evgeny Mankov ba9d1ec2fd Merge pull request #463 from emankov/cuDNN
[HIPIFY][DNN] support of cuDNN 7.1.3 - finishing
2018-05-21 18:35:16 +03:00
Evgeny Mankov a5df3a484c [HIPIFY][DNN] support of cuDNN 7.1.3 - finishing 2018-05-21 18:31:20 +03:00
Maneesh Gupta 0180a82963 hipMemcpy returns success if sizeBytes is 0.
Fixes SWDEV-153754 & SWDEV-154178.
2018-05-21 15:38:44 +05:30
Maneesh Gupta 8ea218cd1b Merge pull request #460 from mangupta/fix_nvcc_tests
Disable incomplete unit tests that don't work on nvcc path
2018-05-21 12:05:12 +05:30
Maneesh Gupta 333cc86f49 Merge pull request #459 from mangupta/add_malloc3d_nvcc
Add hipMalloc3D to nvcc detail
2018-05-21 12:04:52 +05:30
Maneesh Gupta 305592d622 Disable incomplete unit tests that don't work on nvcc path
Change-Id: If5823ec96a3b2497a08c46ab802c5a0158271053
2018-05-21 11:35:03 +05:30
Maneesh Gupta 661561eead Add hipMalloc3D to nvcc detail
Change-Id: I8a5654066ed1504e3b05eddbbdebf05fd52aa149
2018-05-21 11:33:09 +05:30
Maneesh Gupta 883241495d Merge pull request #456 from founta/master
defined hipPitchedPtr when using hipcc with nvcc
2018-05-21 10:53:39 +05:30
Maneesh Gupta cac3f1c7cd Merge pull request #455 from ROCm-Developer-Tools/magic
Change HIP fat binary magic number
2018-05-21 09:52:03 +05:30
Maneesh Gupta 343083b807 Merge pull request #454 from ROCm-Developer-Tools/hip-clang-hipcc
Let hipcc suport hip-clang
2018-05-21 09:51:42 +05:30
Siu Chi Chan 76fc1cce43 Merge pull request #458 from gargrahul/fix_memcpy2dasync_pinned
Fix for memcpy2DAsync for pinned host memory case
2018-05-18 15:58:55 -04:00
Alex Voicu cd6c979c27 Update hip_module.cpp
Typo.
2018-05-18 17:50:45 +01:00
Rahul Garg afe62e7030 Fix for memcpy2DAsync for pinned host memory case 2018-05-18 21:09:50 +05:30
founta 1a108ef7f3 defined hipPitchedPtr
Added a define for hipPitchedPtr to resolve a compiler error
2018-05-18 09:11:50 -04:00
Maneesh Gupta 03ac8e6a92 Merge pull request #433 from gargrahul/add_hipmemset3d
Added hipMemset3D
2018-05-18 14:54:15 +05:30
Maneesh Gupta dcb4477f9f Merge pull request #440 from yxsamliu/assert2
Add __assert_fail, __device_trap and hipErrorAssert for clang
2018-05-18 14:13:27 +05:30
Maneesh Gupta ac7713fa34 Merge pull request #448 from 949f45ac/master
Provide correct __mul64hi and __umul64hi builtins, using code from ROCm-Device-Libs
2018-05-18 13:18:16 +05:30
Maneesh Gupta 67d45164fa Merge pull request #444 from aaronenyeshi/vg20-initial
initial gfx906 support
2018-05-18 13:18:07 +05:30
Maneesh Gupta 223bf5094c Merge pull request #437 from scchan/mbcnt_intrinsics
add intrinsics mbcnt_lo, mbcnt_hi, lane_id
2018-05-18 13:17:57 +05:30
Yaxun (Sam) Liu d079463887 Change HIP fat binary magic number 2018-05-17 17:04:51 -04:00
Yaxun (Sam) Liu f4d79a1615 Let hipcc suport hip-clang 2018-05-17 14:40:15 -04:00
949f45ac 8303bfdffd Reinstate accidentally deleted uchar2Holder 2018-05-17 10:55:45 +02:00
Maneesh Gupta 0e44ca7d0f Merge pull request #451 from gargrahul/fix_memcpy2d_for_1d_case
Fixed hipMemcpy2D to handle 1D memcpy case
2018-05-17 07:42:47 +05:30
Maneesh Gupta e197752c8b Merge pull request #452 from gargrahul/fix_hipCommander_makefile
Fix hipCommander Makefile
2018-05-17 07:25:27 +05:30
Rahul Garg dc4d305c25 Fix hipCommander Makefile 2018-05-16 15:01:32 +05:30
Rahul Garg 8f010ac68e Fixed hipMemcpy2D to handle 1D memcpy case 2018-05-16 11:07:10 +05:30
Alex Voicu 5325b6535e Update hip_module.cpp 2018-05-14 17:15:36 +01:00
Evgeny Mankov a69b4c3a06 Merge pull request #449 from emankov/cuDNN
[HIPIFY][DNN] support of cuDNN 7.1.3 - continuation 2
2018-05-14 16:30:34 +03:00
Evgeny Mankov b0fd0c310d [HIPIFY][DNN] support of cuDNN 7.1.3 - continuation 2
- not finished yet.
- based on https://github.com/ROCmSoftwarePlatform/hipDNN.
- testing on https://github.com/baidu-research/DeepBench - almost pass, except cusparse (not supported yet).
- started testing of examples from libcudnn7-dev_7.1.3.16-1+cuda8.0_amd64 package.
2018-05-14 16:23:59 +03:00
949f45ac 79480d7cbd Provide correct __mul64hi and __umul64hi builtins, using code from ROCm-Device-Libs 2018-05-14 08:34:56 +02:00
Aaron Enye Shi 848a24b524 Fix hipMathFunction for gfx906 2018-05-11 10:53:07 -04:00
Alex Voicu 1ba8a35dba Don't use magic constants, they're evil.
Also clarify that the register count cannot be queried at the moment.
2018-05-11 11:31:46 +01:00
Alex Voicu 13274ce559 Add support for the hipFuncGetAttributes interface. 2018-05-11 03:35:10 +01:00
Siu Chi Chan b898049412 initial gfx906 support 2018-05-10 19:28:00 +00:00
Evgeny Mankov ace018501d Merge pull request #443 from emankov/cuDNN
[HIPIFY][DNN] support of cuDNN 7.1.3 - continuation
2018-05-10 19:42:05 +03:00
Evgeny Mankov dffe1802be [HIPIFY][DNN] support of cuDNN 7.1.3 - continuation
- not finished yet.
- based on https://github.com/ROCmSoftwarePlatform/hipDNN.
- testing on https://github.com/baidu-research/DeepBench.
2018-05-10 17:36:51 +03:00
Yaxun (Sam) Liu 19f3ed6f62 Fix warning about inlined function is not defined 2018-05-08 16:38:50 -04:00
Yaxun (Sam) Liu 7672b44c79 Add __assert_fail, __device_trap and hipErrorAssert for clang 2018-05-08 15:42:27 -04:00
Evgeny Mankov fea366cc89 Merge pull request #438 from emankov/hipBLAS
[HIPIFY][Blas] Sync with CUDA 9.1
2018-05-08 20:50:39 +03:00
Siu Chi Chan b285145966 add intrinsics mbcnt_lo, mbcnt_hi, lane_id 2018-05-08 13:43:53 -04:00
Evgeny Mankov e5ba9668fc [HIPIFY][Blas] Sync with CUDA 9.1 2018-05-08 20:42:30 +03:00
Evgeny Mankov 6559e57b24 Merge pull request #435 from emankov/hipBLAS
[HIPIFY] Sync with hipBLAS
2018-05-08 19:22:32 +03:00