Граф коммитов

2472 Коммитов

Автор SHA1 Сообщение Дата
Maneesh Gupta 661561eead Add hipMalloc3D to nvcc detail
Change-Id: I8a5654066ed1504e3b05eddbbdebf05fd52aa149
2018-05-21 11:33:09 +05:30
Maneesh Gupta 883241495d Merge pull request #456 from founta/master
defined hipPitchedPtr when using hipcc with nvcc
2018-05-21 10:53:39 +05:30
Maneesh Gupta cac3f1c7cd Merge pull request #455 from ROCm-Developer-Tools/magic
Change HIP fat binary magic number
2018-05-21 09:52:03 +05:30
Maneesh Gupta 343083b807 Merge pull request #454 from ROCm-Developer-Tools/hip-clang-hipcc
Let hipcc suport hip-clang
2018-05-21 09:51:42 +05:30
Siu Chi Chan 76fc1cce43 Merge pull request #458 from gargrahul/fix_memcpy2dasync_pinned
Fix for memcpy2DAsync for pinned host memory case
2018-05-18 15:58:55 -04:00
Rahul Garg afe62e7030 Fix for memcpy2DAsync for pinned host memory case 2018-05-18 21:09:50 +05:30
founta 1a108ef7f3 defined hipPitchedPtr
Added a define for hipPitchedPtr to resolve a compiler error
2018-05-18 09:11:50 -04:00
Maneesh Gupta 03ac8e6a92 Merge pull request #433 from gargrahul/add_hipmemset3d
Added hipMemset3D
2018-05-18 14:54:15 +05:30
Maneesh Gupta dcb4477f9f Merge pull request #440 from yxsamliu/assert2
Add __assert_fail, __device_trap and hipErrorAssert for clang
2018-05-18 14:13:27 +05:30
Maneesh Gupta ac7713fa34 Merge pull request #448 from 949f45ac/master
Provide correct __mul64hi and __umul64hi builtins, using code from ROCm-Device-Libs
2018-05-18 13:18:16 +05:30
Maneesh Gupta 67d45164fa Merge pull request #444 from aaronenyeshi/vg20-initial
initial gfx906 support
2018-05-18 13:18:07 +05:30
Maneesh Gupta 223bf5094c Merge pull request #437 from scchan/mbcnt_intrinsics
add intrinsics mbcnt_lo, mbcnt_hi, lane_id
2018-05-18 13:17:57 +05:30
Yaxun (Sam) Liu d079463887 Change HIP fat binary magic number 2018-05-17 17:04:51 -04:00
Yaxun (Sam) Liu f4d79a1615 Let hipcc suport hip-clang 2018-05-17 14:40:15 -04:00
949f45ac 8303bfdffd Reinstate accidentally deleted uchar2Holder 2018-05-17 10:55:45 +02:00
Maneesh Gupta 0e44ca7d0f Merge pull request #451 from gargrahul/fix_memcpy2d_for_1d_case
Fixed hipMemcpy2D to handle 1D memcpy case
2018-05-17 07:42:47 +05:30
Maneesh Gupta e197752c8b Merge pull request #452 from gargrahul/fix_hipCommander_makefile
Fix hipCommander Makefile
2018-05-17 07:25:27 +05:30
Rahul Garg dc4d305c25 Fix hipCommander Makefile 2018-05-16 15:01:32 +05:30
Rahul Garg 8f010ac68e Fixed hipMemcpy2D to handle 1D memcpy case 2018-05-16 11:07:10 +05:30
Evgeny Mankov a69b4c3a06 Merge pull request #449 from emankov/cuDNN
[HIPIFY][DNN] support of cuDNN 7.1.3 - continuation 2
2018-05-14 16:30:34 +03:00
Evgeny Mankov b0fd0c310d [HIPIFY][DNN] support of cuDNN 7.1.3 - continuation 2
- not finished yet.
- based on https://github.com/ROCmSoftwarePlatform/hipDNN.
- testing on https://github.com/baidu-research/DeepBench - almost pass, except cusparse (not supported yet).
- started testing of examples from libcudnn7-dev_7.1.3.16-1+cuda8.0_amd64 package.
2018-05-14 16:23:59 +03:00
949f45ac 79480d7cbd Provide correct __mul64hi and __umul64hi builtins, using code from ROCm-Device-Libs 2018-05-14 08:34:56 +02:00
Aaron Enye Shi 848a24b524 Fix hipMathFunction for gfx906 2018-05-11 10:53:07 -04:00
Siu Chi Chan b898049412 initial gfx906 support 2018-05-10 19:28:00 +00:00
Evgeny Mankov ace018501d Merge pull request #443 from emankov/cuDNN
[HIPIFY][DNN] support of cuDNN 7.1.3 - continuation
2018-05-10 19:42:05 +03:00
Evgeny Mankov dffe1802be [HIPIFY][DNN] support of cuDNN 7.1.3 - continuation
- not finished yet.
- based on https://github.com/ROCmSoftwarePlatform/hipDNN.
- testing on https://github.com/baidu-research/DeepBench.
2018-05-10 17:36:51 +03:00
Yaxun (Sam) Liu 19f3ed6f62 Fix warning about inlined function is not defined 2018-05-08 16:38:50 -04:00
Yaxun (Sam) Liu 7672b44c79 Add __assert_fail, __device_trap and hipErrorAssert for clang 2018-05-08 15:42:27 -04:00
Evgeny Mankov fea366cc89 Merge pull request #438 from emankov/hipBLAS
[HIPIFY][Blas] Sync with CUDA 9.1
2018-05-08 20:50:39 +03:00
Siu Chi Chan b285145966 add intrinsics mbcnt_lo, mbcnt_hi, lane_id 2018-05-08 13:43:53 -04:00
Evgeny Mankov e5ba9668fc [HIPIFY][Blas] Sync with CUDA 9.1 2018-05-08 20:42:30 +03:00
Evgeny Mankov 6559e57b24 Merge pull request #435 from emankov/hipBLAS
[HIPIFY] Sync with hipBLAS
2018-05-08 19:22:32 +03:00
Evgeny Mankov 7681775662 [HIPIFY] Sync with hipBLAS 2018-05-08 19:20:47 +03:00
Maneesh Gupta 3095f67281 Merge pull request #432 from moosichu/patch-1
Add space between `###` and `Notes` in hip_terms
2018-05-08 12:30:27 +05:30
Rahul Garg da302c3e93 Added hipMemset3D 2018-05-07 10:24:30 +05:30
Tom Maenan Read Cutting 9d76f5839e Add space between ### and Notes in hip_terms
Makes `Notes` an H3 heading.
2018-05-05 13:30:11 +01:00
Evgeny Mankov 991f817441 Merge pull request #431 from emankov/master
[HIPIFY][test] Undo commit "Apply .clangformat to all repo source files"
2018-05-04 22:23:19 +03:00
emankov 21b79cd467 [HIPIFY][test] Undo commit "Apply .clangformat to all repo source files"
Commit broke tests due to code and comments formatting changes, thus FileCheck fails on checks, which are in comments.
2018-05-04 22:23:16 +03:00
emankov 01f146e1bc Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP 2018-05-04 21:04:34 +03:00
Evgeny Mankov dbe1583f28 Merge pull request #429 from emankov/master
[HIPIFY][test] add cuDNN test
2018-05-04 12:40:48 +03:00
Evgeny Mankov a56b480c5e [HIPIFY][test] add cuDNN test 2018-05-04 12:37:15 +03:00
Evgeny Mankov 9073bee7a7 Merge pull request #428 from emankov/docs
[HIPIFY][doc] Readme.md update
2018-05-04 11:06:59 +03:00
Evgeny Mankov 054c3f71f0 [HIPIFY][doc] Readme.md update
+ supported CUDA version to LLVM version correspondence table is added.
+ Test section is rewritten.
+ Windows support is added.
2018-05-04 10:50:18 +03:00
Evgeny Mankov 49024a5a55 Merge pull request #423 from emankov/cuDNN
[HIPIFY] Initial cuDNN support
2018-05-04 10:01:47 +03:00
Evgeny Mankov fe421c89b2 [HIPIFY] Initial cuDNN support
- based on https://github.com/ROCmSoftwarePlatform/hipDNN.
- lit testing was supplemented with CUDA_DNN_ROOT_DIR for cuDNN testing.
- single cuDNN test was added.
2018-05-03 11:33:40 +03:00
Evgeny Mankov e1d1835798 Merge pull request #417 from emankov/master
[HIPIFY] Sync with HIP (Execution Control, Surfaces, Memory)
2018-05-03 10:45:51 +03:00
Maneesh Gupta 67cb81c1d1 Merge pull request #422 from luckynikki/NULL-FIXES
Null checks added for hipmallocpitch and hipmemcpy apis
2018-05-03 10:14:40 +05:30
Lakhan Singh 6411ca1f6d Null checks added for hipmallocpitch and hipmemcpy apis 2018-05-03 09:27:50 +05:30
emankov 2569972dde [HIPIFY] Initial cuDNN support
- based on https://github.com/ROCmSoftwarePlatform/hipDNN.
- lit testing was supplemented with CUDA_DNN_ROOT_DIR for cudnn testing.
- single cuDNN test was added.
2018-05-02 22:11:05 +03:00
Maneesh Gupta beb41510ab Merge pull request #421 from gargrahul/fix_tex3d_nvcc
Fix texture 3D for HIP/NVCC
2018-05-02 13:49:00 +05:30