Rahul Garg
4ff059d641
Clean up and fix remaining bytes copy
2018-05-24 23:30:27 +05:30
foreman
da7593ae40
P4 to Git Change 1559149 by skudchad@skudchad_test2_win_opencl on 2018/05/24 11:54:02
...
SWDEV-145570 - [HIP] - Implement hipMemcpy2DToArray.
ReviewBoardURL = http://ocltc.amd.com/reviews/r/14953/diff/
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#29 edit
2018-05-24 12:01:44 -04:00
Alex Voicu
9948b5961e
Update hipTestHalf to actually test behaviour. Add missing hipHostfree.
2018-05-24 13:55:30 +01:00
Alex Voicu
9c7fbdb597
Remove vestigial inline LLVMIR.
2018-05-24 12:46:14 +01:00
Rahul Garg
981e56a68f
Fix memcpy2d kernel dims
2018-05-24 17:00:12 +05:30
Rahul Garg
dc179e0c33
Correct remaining bytes in copy 2d kernel
2018-05-24 08:27:24 +05:30
foreman
0fe5f87cba
P4 to Git Change 1558526 by skudchad@skudchad_test2_win_opencl on 2018/05/23 13:34:33
...
SWDEV-145570 - [HIP] Implement hipPointerGetAttributes.
ReviewBoardURL = http://ocltc.amd.com/reviews/r/14938/diff/
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#28 edit
2018-05-23 13:40:52 -04:00
Alex Voicu
fe6ef584a7
Merge branch 'feature_use_Float16' of https://github.com/ROCm-Developer-Tools/HIP into feature_use_Float16
2018-05-23 17:58:13 +01:00
Alex Voicu
6f819f226b
Missing commit.
2018-05-23 17:57:47 +01:00
Alex Voicu
1dba835e32
Re-factor half support to match CUDA whilst exploiting native support.
2018-05-23 17:57:09 +01:00
Rahul Garg
9a76d5b94c
Optimize memcpy2D kernel use
2018-05-23 14:43:47 +05:30
Jenkins
b4592823cc
Merge 'master' into 'amd-master'
...
Change-Id: I40bfec7bdb863b483485ff5d7ccc6973d9b5f357
2018-05-22 04:09:35 -05:00
Maneesh Gupta
323a6226b0
Merge pull request #464 from gargrahul/fix_memcpy2d_pinned_mem_case
...
Fixed memcpy2D for pinned memory case using 2D kernel
2018-05-22 10:42:28 +05:30
Maneesh Gupta
df3bb9fc32
Merge pull request #445 from ROCm-Developer-Tools/feature_func_attributes
...
Add support for the hipFuncGetAttributes interface.
2018-05-22 09:37:41 +05:30
Maneesh Gupta
22af1f6875
Merge pull request #462 from ROCm-Developer-Tools/mangupta-hipMemcpy-fix
...
hipMemcpy returns success if sizeBytes is 0.
2018-05-22 06:34:22 +05:30
Maneesh Gupta
2cb59db654
Merge pull request #418 from pfultz2/host-device-targets
...
Add host and device targets
2018-05-22 06:33:56 +05:30
foreman
f165295c66
P4 to Git Change 1557352 by cpaquot@cpaquot-ocl-lc-lnx on 2018/05/21 19:53:00
...
SWDEV-145570 - [HIP] Sync streams in hipFree. hipTestHalf passes now.
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#27 edit
2018-05-21 20:02:17 -04:00
Rahul Garg
f47a8236d7
Fixed memcpy2D for pinned memory case using 2D kernel
2018-05-21 22:14:45 +05:30
Evgeny Mankov
ba9d1ec2fd
Merge pull request #463 from emankov/cuDNN
...
[HIPIFY][DNN] support of cuDNN 7.1.3 - finishing
2018-05-21 18:35:16 +03:00
Evgeny Mankov
a5df3a484c
[HIPIFY][DNN] support of cuDNN 7.1.3 - finishing
2018-05-21 18:31:20 +03:00
Maneesh Gupta
0180a82963
hipMemcpy returns success if sizeBytes is 0.
...
Fixes SWDEV-153754 & SWDEV-154178.
2018-05-21 15:38:44 +05:30
Jenkins
f93faad3d2
Merge 'master' into 'amd-master'
...
Change-Id: I44d884a9927410d3fafe0d43becdbd3819b544a2
2018-05-21 04:09:40 -05:00
Maneesh Gupta
8ea218cd1b
Merge pull request #460 from mangupta/fix_nvcc_tests
...
Disable incomplete unit tests that don't work on nvcc path
2018-05-21 12:05:12 +05:30
Maneesh Gupta
333cc86f49
Merge pull request #459 from mangupta/add_malloc3d_nvcc
...
Add hipMalloc3D to nvcc detail
2018-05-21 12:04:52 +05:30
Maneesh Gupta
305592d622
Disable incomplete unit tests that don't work on nvcc path
...
Change-Id: If5823ec96a3b2497a08c46ab802c5a0158271053
2018-05-21 11:35:03 +05:30
Maneesh Gupta
661561eead
Add hipMalloc3D to nvcc detail
...
Change-Id: I8a5654066ed1504e3b05eddbbdebf05fd52aa149
2018-05-21 11:33:09 +05:30
Maneesh Gupta
883241495d
Merge pull request #456 from founta/master
...
defined hipPitchedPtr when using hipcc with nvcc
2018-05-21 10:53:39 +05:30
Maneesh Gupta
cac3f1c7cd
Merge pull request #455 from ROCm-Developer-Tools/magic
...
Change HIP fat binary magic number
2018-05-21 09:52:03 +05:30
Maneesh Gupta
343083b807
Merge pull request #454 from ROCm-Developer-Tools/hip-clang-hipcc
...
Let hipcc support hip-clang
2018-05-21 09:51:42 +05:30
Siu Chi Chan
76fc1cce43
Merge pull request #458 from gargrahul/fix_memcpy2dasync_pinned
...
Fix for memcpy2DAsync for pinned host memory case
2018-05-18 15:58:55 -04:00
foreman
303df5dd2e
P4 to Git Change 1556942 by yaxunl@yaxunl-lc8 on 2018/05/18 14:25:12
...
SWDEV-145570 - [HIP] Change fat binary magic number and clang-offload-bundler target name to match clang
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#11 edit
2018-05-18 14:34:14 -04:00
Alex Voicu
cd6c979c27
Update hip_module.cpp
...
Typo.
2018-05-18 17:50:45 +01:00
Rahul Garg
afe62e7030
Fix for memcpy2DAsync for pinned host memory case
2018-05-18 21:09:50 +05:30
founta
1a108ef7f3
defined hipPitchedPtr
...
Added a define for hipPitchedPtr to resolve a compiler error
2018-05-18 09:11:50 -04:00
Maneesh Gupta
03ac8e6a92
Merge pull request #433 from gargrahul/add_hipmemset3d
...
Added hipMemset3D
2018-05-18 14:54:15 +05:30
Jenkins
bad4563250
Merge 'master' into 'amd-master'
...
Change-Id: I34bbe8311825d881c414767ca7bbbb7a0aafc2b8
2018-05-18 04:09:38 -05:00
Maneesh Gupta
dcb4477f9f
Merge pull request #440 from yxsamliu/assert2
...
Add __assert_fail, __device_trap and hipErrorAssert for clang
2018-05-18 14:13:27 +05:30
Maneesh Gupta
ac7713fa34
Merge pull request #448 from 949f45ac/master
...
Provide correct __mul64hi and __umul64hi builtins, using code from ROCm-Device-Libs
2018-05-18 13:18:16 +05:30
Maneesh Gupta
67d45164fa
Merge pull request #444 from aaronenyeshi/vg20-initial
...
initial gfx906 support
2018-05-18 13:18:07 +05:30
Maneesh Gupta
223bf5094c
Merge pull request #437 from scchan/mbcnt_intrinsics
...
add intrinsics mbcnt_lo, mbcnt_hi, lane_id
2018-05-18 13:17:57 +05:30
Yaxun (Sam) Liu
d079463887
Change HIP fat binary magic number
2018-05-17 17:04:51 -04:00
Yaxun (Sam) Liu
f4d79a1615
Let hipcc support hip-clang
2018-05-17 14:40:15 -04:00
949f45ac
8303bfdffd
Reinstate accidentally deleted uchar2Holder
2018-05-17 10:55:45 +02:00
Maneesh Gupta
0e44ca7d0f
Merge pull request #451 from gargrahul/fix_memcpy2d_for_1d_case
...
Fixed hipMemcpy2D to handle 1D memcpy case
2018-05-17 07:42:47 +05:30
Maneesh Gupta
e197752c8b
Merge pull request #452 from gargrahul/fix_hipCommander_makefile
...
Fix hipCommander Makefile
2018-05-17 07:25:27 +05:30
foreman
b1ab722a25
P4 to Git Change 1555866 by cpaquot@cpaquot-ocl-lc-lnx on 2018/05/16 16:27:00
...
SWDEV-145570 - [HIP] Store HIP mem flags inside amd::Buffer's flags
Use the 16 upper bits of amd::Buffer's flags field instead of adding a new field.
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#26 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#86 edit
2018-05-16 16:35:53 -04:00
Rahul Garg
dc4d305c25
Fix hipCommander Makefile
2018-05-16 15:01:32 +05:30
Rahul Garg
8f010ac68e
Fixed hipMemcpy2D to handle 1D memcpy case
2018-05-16 11:07:10 +05:30
foreman
bb008fbf94
P4 to Git Change 1555197 by cpaquot@cpaquot-ocl-lc-lnx on 2018/05/15 16:26:41
...
SWDEV-145570 - [HIP] Fixed a typo, hipStreamGetFlags test passes now
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_stream.cpp#10 edit
2018-05-15 16:36:25 -04:00
foreman
da00b9270a
P4 to Git Change 1555193 by cpaquot@cpaquot-ocl-lc-lnx on 2018/05/15 16:19:50
...
SWDEV-145570 - [HIP] Implemented events
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.cpp#4 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.hpp#2 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#25 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#9 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_stream.cpp#9 edit
2018-05-15 16:26:16 -04:00