Rahul Garg
|
d3621a2500
|
Map hipArray alloc flags on NVCC
|
2018-06-01 17:28:43 +05:30 |
|
Maneesh Gupta
|
df450c6680
|
Merge pull request #484 from gargrahul/fix_malloc_hiphostreg
Fix memcpy2D for malloc+ hostRegister
|
2018-06-01 16:53:25 +05:30 |
|
Maneesh Gupta
|
68c36f9125
|
Merge pull request #447 from ROCm-Developer-Tools/feature_native_vector_types
Switch over to using native vector types, for better codegen. Remove noise.
|
2018-06-01 13:58:07 +05:30 |
|
Maneesh Gupta
|
bdf2645713
|
Merge pull request #466 from ROCm-Developer-Tools/feature_use_Float16
Feature use _Float16 and match CUDA __half behaviour.
|
2018-06-01 13:50:12 +05:30 |
|
Yaxun (Sam) Liu
|
98491cd759
|
Update INSTALL.md
|
2018-05-31 23:55:42 -04:00 |
|
foreman
|
bad00db6ea
|
P4 to Git Change 1562507 by skudchad@skudchad_test2_win_opencl on 2018/05/31 18:10:39
SWDEV-145570 - [HIP] - Fix HIP build
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_stream.cpp#11 edit
|
2018-05-31 18:18:00 -04:00 |
|
Alex Voicu
|
ab4b2a650b
|
Re-sync with upstream. Add integer abs.
|
2018-05-31 16:38:00 +01:00 |
|
Alex Voicu
|
b9b1e0b703
|
Merge branch 'feature_use_Float16' of https://github.com/ROCm-Developer-Tools/HIP into feature_use_Float16
|
2018-05-31 15:27:31 +01:00 |
|
Alex Voicu
|
0108819e2a
|
Add missing interop with volatile. Fix unit tests.
|
2018-05-31 15:27:12 +01:00 |
|
Jenkins
|
ae056231b4
|
Merge 'master' into 'amd-master'
Change-Id: I7fc7c2321bc4d89c69271a648f8450deb0c53e02
|
2018-05-31 04:09:40 -05:00 |
|
Rahul Garg
|
8d6357669d
|
Fix memcpy2D for malloc+ hostRegister
|
2018-05-31 13:14:27 +05:30 |
|
Maneesh Gupta
|
449ddba0aa
|
Merge pull request #480 from yxsamliu/add-fun
Add more function declarations for hip-clang
|
2018-05-31 09:27:54 +05:30 |
|
Maneesh Gupta
|
3f87d4efb2
|
Merge pull request #481 from gargrahul/fix_texobj1dfetch_test
Fixed texture obj 1Dfetch test
|
2018-05-31 09:14:31 +05:30 |
|
Maneesh Gupta
|
bbdbcd1fe7
|
Merge pull request #479 from yxsamliu/fix-hipcc
Drop --amdgpu-target= options for hip-clang
|
2018-05-31 09:12:36 +05:30 |
|
Alex Voicu
|
59db16fd36
|
Switch to using ROCDL directly, as opposed to via HC. Add missing bits.
|
2018-05-31 03:17:26 +01:00 |
|
Yaxun (Sam) Liu
|
b7641cf835
|
Fix __syncthreads for hip-clang
|
2018-05-30 16:33:18 -04:00 |
|
Rahul Garg
|
5ae6d93526
|
Fixed texture obj 1Dfetch test
|
2018-05-30 21:54:51 +05:30 |
|
Yaxun (Sam) Liu
|
6b8218e5a0
|
Add more function declarations for hip-clang
|
2018-05-30 06:51:03 -04:00 |
|
Yaxun (Sam) Liu
|
9008a3ec56
|
Drop --amdgpu-target= options for hip-clang
They are replaced by --cuda-gpu-arch= options elsewhere
|
2018-05-30 06:47:44 -04:00 |
|
Maneesh Gupta
|
a1fea02c6e
|
Merge pull request #469 from yxsamliu/fun
Add math function declaration for hip-clang
|
2018-05-30 15:01:27 +05:30 |
|
Jenkins
|
d060430380
|
Merge 'master' into 'amd-master'
Change-Id: If682eb69168da413136c4e5c110468ac658139b7
|
2018-05-30 04:09:39 -05:00 |
|
Maneesh Gupta
|
59d849a643
|
Merge pull request #477 from mangupta/hipfuncgetattr_nvcc
Add signature for hipFuncGetAttributes to nvcc_details
|
2018-05-30 12:29:03 +05:30 |
|
Maneesh Gupta
|
0cec23b172
|
Merge pull request #478 from gargrahul/fix_tex1dfetch_test
Fixed tex1dFetch test
|
2018-05-30 12:13:47 +05:30 |
|
Rahul Garg
|
e8f3381997
|
Fixed tex1dFetch test
|
2018-05-30 11:42:01 +05:30 |
|
Maneesh Gupta
|
8b93cd67a6
|
[dtests] Fix nvcc path build errors in hipFuncGetAttributes.tst
Change-Id: I000bed69162897f7b6edd733ed6e9acc93beb0ed
|
2018-05-30 11:26:11 +05:30 |
|
Maneesh Gupta
|
7fd92d471b
|
Add signature for hipFuncGetAttributes to nvcc_details
Change-Id: I06d5b50028fcfe3ea9d81f749b02be6b094c0977
|
2018-05-30 11:24:48 +05:30 |
|
Maneesh Gupta
|
c20a3368da
|
Merge pull request #476 from gargrahul/fix_bindtexture_nvcc
Fix hipBindTexture on NVCC path
|
2018-05-30 11:21:27 +05:30 |
|
Rahul Garg
|
d67b614b22
|
Fix hipBindTexture on NVCC path
|
2018-05-30 10:59:07 +05:30 |
|
Maneesh Gupta
|
156e93a0b2
|
Merge pull request #475 from gargrahul/fix_tex2d_tests
Fix tex2D tests for result value
|
2018-05-30 08:34:51 +05:30 |
|
Maneesh Gupta
|
57fb96013c
|
Merge pull request #472 from Jorghi12/patch-3
Adding double/long int signatures for abs
|
2018-05-30 08:32:14 +05:30 |
|
Maneesh Gupta
|
c8aca45856
|
Merge pull request #471 from Jorghi12/patch-4
Adding double/long int signatures for abs
|
2018-05-30 08:31:32 +05:30 |
|
Maneesh Gupta
|
591b7797d9
|
Merge pull request #465 from gargrahul/optimize_memcpy2d
Optimize memcpy2D kernel use
|
2018-05-30 08:31:16 +05:30 |
|
Rahul Garg
|
7f7a7041a5
|
Fix runTest return type
|
2018-05-29 16:18:14 +05:30 |
|
Rahul Garg
|
ca156449e2
|
Fix tex2D tests for result value
|
2018-05-29 16:03:37 +05:30 |
|
lthakur
|
a05ac35ab1
|
HIP test case for 1D texture fetch (#424)
|
2018-05-29 14:08:01 +05:30 |
|
Maneesh Gupta
|
08779ae5d5
|
Merge pull request #473 from gargrahul/add_tex1d_nvcc
Add 1d texture types for NVCC path
|
2018-05-29 13:54:34 +05:30 |
|
Alex Voicu
|
75390c7083
|
GCC only supports power-of-2 component counts, so round up to next square.
|
2018-05-29 01:22:26 +01:00 |
|
Rahul Garg
|
024f77ce61
|
Add 1d texture types for NVCC path
|
2018-05-28 15:02:06 +05:30 |
|
Jorghi12
|
6e2b9d0547
|
Update math_functions.h
CUDA also has a function named labs.
|
2018-05-26 16:22:10 -04:00 |
|
Jorghi12
|
ec2edb2c92
|
Update math_functions.cpp
CUDA also has a function named labs.
|
2018-05-26 16:21:14 -04:00 |
|
Alex Voicu
|
98395ad38c
|
Add missing alias half / half2 aliases
|
2018-05-26 12:10:50 +01:00 |
|
Jorghi12
|
4f6904b5c7
|
Adding double/long int signatures for abs
Adding overloads for abs that are found in cuda's math_functions.
|
2018-05-26 00:41:24 -04:00 |
|
Jorghi12
|
4383d6c6de
|
Adding double/long int signatures for abs
Adding overloads for abs that are found in cuda's math_functions.
|
2018-05-26 00:40:14 -04:00 |
|
Yaxun (Sam) Liu
|
b7801f1fe3
|
Add more function declarations to hip-clang
|
2018-05-25 16:18:46 -04:00 |
|
Alex Voicu
|
06e0b11106
|
Missing bits.
|
2018-05-25 20:15:04 +01:00 |
|
Alex Voicu
|
32388cbf28
|
Missing bits.
|
2018-05-25 20:12:21 +01:00 |
|
Alex Voicu
|
0eec115bcd
|
Move converting constructor from _Float16 under macro guard. Refactor.
|
2018-05-25 19:46:41 +01:00 |
|
Yaxun (Sam) Liu
|
f9693a9210
|
Add math function declaration for hip-clang
|
2018-05-25 12:22:16 -04:00 |
|
foreman
|
cadb15165e
|
P4 to Git Change 1559366 by gandryey@gera-w8 on 2018/05/24 18:06:45
SWDEV-79445 - OCL generic changes and code clean-up
- Combine validateMemory() and arguments capture() under a single function. Rename validateMemory() in NDRangeKernelCommand class to captureAndValidate()
http://ocltc.amd.com/reviews/r/14964/
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_execute.cpp#26 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.cpp#87 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.hpp#90 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/kernel.cpp#28 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/kernel.hpp#22 edit
|
2018-05-24 18:12:49 -04:00 |
|
Rahul Garg
|
d8cb47242b
|
Use 64x4 grid dims
|
2018-05-24 23:51:52 +05:30 |
|