Commit graph

2497 Commits

Author SHA1 Message Date
Alex Voicu 0f4a135e5f Add missing alias half / half2 aliases 2018-05-26 12:10:50 +01:00
Alex Voicu 544adec793 Missing bits. 2018-05-25 20:15:04 +01:00
Alex Voicu 25ac508214 Missing bits. 2018-05-25 20:12:21 +01:00
Alex Voicu 40dad93426 Move converting constructor from _Float16 under macro guard. Refactor. 2018-05-25 19:46:41 +01:00
Alex Voicu 59a046dd2d Update hipTestHalf to actually test behaviour. Add missing hipHostfree. 2018-05-24 13:55:30 +01:00
Alex Voicu f2a86f3e1c Remove vestigial inline LLVMIR. 2018-05-24 12:46:14 +01:00
Alex Voicu a8a9f3bdc5 Merge branch 'feature_use_Float16' of https://github.com/ROCm-Developer-Tools/HIP into feature_use_Float16 2018-05-23 17:58:13 +01:00
Alex Voicu ecefdd6541 Missing commit. 2018-05-23 17:57:47 +01:00
Alex Voicu 36e805cf76 Re-factor half support to match CUDA whilst exploiting native support. 2018-05-23 17:57:09 +01:00
Maneesh Gupta 7042fe6067 Merge pull request #464 from gargrahul/fix_memcpy2d_pinned_mem_case
Fixed memcpy2D for pinned memory case using 2D kernel
2018-05-22 10:42:28 +05:30
Maneesh Gupta 85342d73b5 Merge pull request #445 from ROCm-Developer-Tools/feature_func_attributes
Add support for the hipFuncGetAttributes interface.
2018-05-22 09:37:41 +05:30
Maneesh Gupta 37cdae8cd4 Merge pull request #462 from ROCm-Developer-Tools/mangupta-hipMemcpy-fix
hipMemcpy returns success if sizeBytes is 0.
2018-05-22 06:34:22 +05:30
Maneesh Gupta 197cac144d Merge pull request #418 from pfultz2/host-device-targets
Add host and device targets
2018-05-22 06:33:56 +05:30
Rahul Garg 40fb44dbe6 Fixed memcpy2D for pinned memory case using 2D kernel 2018-05-21 22:14:45 +05:30
Evgeny Mankov c3d5500acb Merge pull request #463 from emankov/cuDNN
[HIPIFY][DNN] support of cuDNN 7.1.3 - finishing
2018-05-21 18:35:16 +03:00
Evgeny Mankov 5585072e63 [HIPIFY][DNN] support of cuDNN 7.1.3 - finishing 2018-05-21 18:31:20 +03:00
Maneesh Gupta 66d05e6fc3 hipMemcpy returns success if sizeBytes is 0.
Fixes SWDEV-153754 & SWDEV-154178.
2018-05-21 15:38:44 +05:30
Maneesh Gupta 212bf6b6b3 Merge pull request #460 from mangupta/fix_nvcc_tests
Disable incomplete unit tests that don't work on nvcc path
2018-05-21 12:05:12 +05:30
Maneesh Gupta 845cfb4fe5 Merge pull request #459 from mangupta/add_malloc3d_nvcc
Add hipMalloc3D to nvcc detail
2018-05-21 12:04:52 +05:30
Maneesh Gupta 92d8e05aa0 Disable incomplete unit tests that don't work on nvcc path
Change-Id: If5823ec96a3b2497a08c46ab802c5a0158271053
2018-05-21 11:35:03 +05:30
Maneesh Gupta 5133299804 Add hipMalloc3D to nvcc detail
Change-Id: I8a5654066ed1504e3b05eddbbdebf05fd52aa149
2018-05-21 11:33:09 +05:30
Maneesh Gupta 817a6857f2 Merge pull request #456 from founta/master
defined hipPitchedPtr when using hipcc with nvcc
2018-05-21 10:53:39 +05:30
Maneesh Gupta cfb7be414b Merge pull request #455 from ROCm-Developer-Tools/magic
Change HIP fat binary magic number
2018-05-21 09:52:03 +05:30
Maneesh Gupta 43ac6a61e7 Merge pull request #454 from ROCm-Developer-Tools/hip-clang-hipcc
Let hipcc suport hip-clang
2018-05-21 09:51:42 +05:30
Siu Chi Chan 86b56068b5 Merge pull request #458 from gargrahul/fix_memcpy2dasync_pinned
Fix for memcpy2DAsync for pinned host memory case
2018-05-18 15:58:55 -04:00
Alex Voicu 43fca684c8 Update hip_module.cpp
Typo.
2018-05-18 17:50:45 +01:00
Rahul Garg 4f5bdb071c Fix for memcpy2DAsync for pinned host memory case 2018-05-18 21:09:50 +05:30
founta 5c5d87b0a3 defined hipPitchedPtr
Added a define for hipPitchedPtr to resolve a compiler error
2018-05-18 09:11:50 -04:00
Maneesh Gupta 1c93e11cdf Merge pull request #433 from gargrahul/add_hipmemset3d
Added hipMemset3D
2018-05-18 14:54:15 +05:30
Maneesh Gupta 43f7705278 Merge pull request #440 from yxsamliu/assert2
Add __assert_fail, __device_trap and hipErrorAssert for clang
2018-05-18 14:13:27 +05:30
Maneesh Gupta bb1f53ac44 Merge pull request #448 from 949f45ac/master
Provide correct __mul64hi and __umul64hi builtins, using code from ROCm-Device-Libs
2018-05-18 13:18:16 +05:30
Maneesh Gupta 3638bb5b1c Merge pull request #444 from aaronenyeshi/vg20-initial
initial gfx906 support
2018-05-18 13:18:07 +05:30
Maneesh Gupta 5e0828e045 Merge pull request #437 from scchan/mbcnt_intrinsics
add intrinsics mbcnt_lo, mbcnt_hi, lane_id
2018-05-18 13:17:57 +05:30
Yaxun (Sam) Liu 2f9bce3652 Change HIP fat binary magic number 2018-05-17 17:04:51 -04:00
Yaxun (Sam) Liu 55f01cbf36 Let hipcc suport hip-clang 2018-05-17 14:40:15 -04:00
949f45ac 7bf8402d1d Reinstate accidentally deleted uchar2Holder 2018-05-17 10:55:45 +02:00
Maneesh Gupta 89b79e226b Merge pull request #451 from gargrahul/fix_memcpy2d_for_1d_case
Fixed hipMemcpy2D to handle 1D memcpy case
2018-05-17 07:42:47 +05:30
Maneesh Gupta 1ab38e321b Merge pull request #452 from gargrahul/fix_hipCommander_makefile
Fix hipCommander Makefile
2018-05-17 07:25:27 +05:30
Rahul Garg 9707e9f563 Fix hipCommander Makefile 2018-05-16 15:01:32 +05:30
Rahul Garg 8413fb51e1 Fixed hipMemcpy2D to handle 1D memcpy case 2018-05-16 11:07:10 +05:30
Alex Voicu 40a22d235e Update hip_module.cpp 2018-05-14 17:15:36 +01:00
Evgeny Mankov 317f67fac2 Merge pull request #449 from emankov/cuDNN
[HIPIFY][DNN] support of cuDNN 7.1.3 - continuation 2
2018-05-14 16:30:34 +03:00
Evgeny Mankov 1108a925ca [HIPIFY][DNN] support of cuDNN 7.1.3 - continuation 2
- not finished yet.
- based on https://github.com/ROCmSoftwarePlatform/hipDNN.
- testing on https://github.com/baidu-research/DeepBench - almost pass, except cusparse (not supported yet).
- started testing of examples from libcudnn7-dev_7.1.3.16-1+cuda8.0_amd64 package.
2018-05-14 16:23:59 +03:00
949f45ac 9210263727 Provide correct __mul64hi and __umul64hi builtins, using code from ROCm-Device-Libs 2018-05-14 08:34:56 +02:00
Aaron Enye Shi 4488f9f7a7 Fix hipMathFunction for gfx906 2018-05-11 10:53:07 -04:00
Alex Voicu eded014abc Don't use magic constants, they're evil.
Also clarify that the register count cannot be queried at the moment.
2018-05-11 11:31:46 +01:00
Alex Voicu bf9529aaa8 Add support for the hipFuncGetAttributes interface. 2018-05-11 03:35:10 +01:00
Siu Chi Chan 368affcea4 initial gfx906 support 2018-05-10 19:28:00 +00:00
Evgeny Mankov e3568a744d Merge pull request #443 from emankov/cuDNN
[HIPIFY][DNN] support of cuDNN 7.1.3 - continuation
2018-05-10 19:42:05 +03:00
Evgeny Mankov b31df9cc60 [HIPIFY][DNN] support of cuDNN 7.1.3 - continuation
- not finished yet.
- based on https://github.com/ROCmSoftwarePlatform/hipDNN.
- testing on https://github.com/baidu-research/DeepBench.
2018-05-10 17:36:51 +03:00