Commit Graph

6428 Commits

Author SHA1 Message Date
Rahul Garg d3621a2500 Map hipArray alloc flags on NVCC 2018-06-01 17:28:43 +05:30
Maneesh Gupta df450c6680 Merge pull request #484 from gargrahul/fix_malloc_hiphostreg
Fix memcpy2D for malloc+ hostRegister
2018-06-01 16:53:25 +05:30
Maneesh Gupta 68c36f9125 Merge pull request #447 from ROCm-Developer-Tools/feature_native_vector_types
Switch over to using native vector types, for better codegen. Remove noise.
2018-06-01 13:58:07 +05:30
Maneesh Gupta bdf2645713 Merge pull request #466 from ROCm-Developer-Tools/feature_use_Float16
Feature use _Float16 and match CUDA __half behaviour.
2018-06-01 13:50:12 +05:30
Yaxun (Sam) Liu 98491cd759 Update INSTALL.md 2018-05-31 23:55:42 -04:00
foreman bad00db6ea P4 to Git Change 1562507 by skudchad@skudchad_test2_win_opencl on 2018/05/31 18:10:39
SWDEV-145570 - [HIP] - Fix HIP build

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_stream.cpp#11 edit
2018-05-31 18:18:00 -04:00
Alex Voicu ab4b2a650b Re-sync with upstream. Add integer abs. 2018-05-31 16:38:00 +01:00
Alex Voicu b9b1e0b703 Merge branch 'feature_use_Float16' of https://github.com/ROCm-Developer-Tools/HIP into feature_use_Float16 2018-05-31 15:27:31 +01:00
Alex Voicu 0108819e2a Add missing interop with volatile. Fix unit tests. 2018-05-31 15:27:12 +01:00
Jenkins ae056231b4 Merge 'master' into 'amd-master'
Change-Id: I7fc7c2321bc4d89c69271a648f8450deb0c53e02
2018-05-31 04:09:40 -05:00
Rahul Garg 8d6357669d Fix memcpy2D for malloc+ hostRegister 2018-05-31 13:14:27 +05:30
Maneesh Gupta 449ddba0aa Merge pull request #480 from yxsamliu/add-fun
Add more function declarations for hip-clang
2018-05-31 09:27:54 +05:30
Maneesh Gupta 3f87d4efb2 Merge pull request #481 from gargrahul/fix_texobj1dfetch_test
Fixed texture obj 1Dfetch test
2018-05-31 09:14:31 +05:30
Maneesh Gupta bbdbcd1fe7 Merge pull request #479 from yxsamliu/fix-hipcc
Drop --amdgpu-target= options for hip-clang
2018-05-31 09:12:36 +05:30
Alex Voicu 59db16fd36 Switch to using ROCDL directly, as opposed to via HC. Add missing bits. 2018-05-31 03:17:26 +01:00
Yaxun (Sam) Liu b7641cf835 Fix __syncthreads for hip-clang 2018-05-30 16:33:18 -04:00
Rahul Garg 5ae6d93526 Fixed texture obj 1Dfetch test 2018-05-30 21:54:51 +05:30
Yaxun (Sam) Liu 6b8218e5a0 Add more function declarations for hip-clang 2018-05-30 06:51:03 -04:00
Yaxun (Sam) Liu 9008a3ec56 Drop --amdgpu-target= options for hip-clang
They are replaced by --cuda-gpu-arch= options elsewhere
2018-05-30 06:47:44 -04:00
Maneesh Gupta a1fea02c6e Merge pull request #469 from yxsamliu/fun
Add math function declaration for hip-clang
2018-05-30 15:01:27 +05:30
Jenkins d060430380 Merge 'master' into 'amd-master'
Change-Id: If682eb69168da413136c4e5c110468ac658139b7
2018-05-30 04:09:39 -05:00
Maneesh Gupta 59d849a643 Merge pull request #477 from mangupta/hipfuncgetattr_nvcc
Add signature for hipFuncGetAttributes to nvcc_details
2018-05-30 12:29:03 +05:30
Maneesh Gupta 0cec23b172 Merge pull request #478 from gargrahul/fix_tex1dfetch_test
Fixed tex1dFetch test
2018-05-30 12:13:47 +05:30
Rahul Garg e8f3381997 Fixed tex1dFetch test 2018-05-30 11:42:01 +05:30
Maneesh Gupta 8b93cd67a6 [dtests] Fix nvcc path build errors in hipFuncGetAttributes.tst
Change-Id: I000bed69162897f7b6edd733ed6e9acc93beb0ed
2018-05-30 11:26:11 +05:30
Maneesh Gupta 7fd92d471b Add signature for hipFuncGetAttributes to nvcc_details
Change-Id: I06d5b50028fcfe3ea9d81f749b02be6b094c0977
2018-05-30 11:24:48 +05:30
Maneesh Gupta c20a3368da Merge pull request #476 from gargrahul/fix_bindtexture_nvcc
Fix hipBindTexture on NVCC path
2018-05-30 11:21:27 +05:30
Rahul Garg d67b614b22 Fix hipBindTexture on NVCC path 2018-05-30 10:59:07 +05:30
Maneesh Gupta 156e93a0b2 Merge pull request #475 from gargrahul/fix_tex2d_tests
Fix tex2D tests for result value
2018-05-30 08:34:51 +05:30
Maneesh Gupta 57fb96013c Merge pull request #472 from Jorghi12/patch-3
Adding double/long int signatures for abs
2018-05-30 08:32:14 +05:30
Maneesh Gupta c8aca45856 Merge pull request #471 from Jorghi12/patch-4
Adding double/long int signatures for abs
2018-05-30 08:31:32 +05:30
Maneesh Gupta 591b7797d9 Merge pull request #465 from gargrahul/optimize_memcpy2d
Optimize memcpy2D kernel use
2018-05-30 08:31:16 +05:30
Rahul Garg 7f7a7041a5 Fix runTest return type 2018-05-29 16:18:14 +05:30
Rahul Garg ca156449e2 Fix tex2D tests for result value 2018-05-29 16:03:37 +05:30
lthakur a05ac35ab1 HIP test case for 1D texture fetch (#424) 2018-05-29 14:08:01 +05:30
Maneesh Gupta 08779ae5d5 Merge pull request #473 from gargrahul/add_tex1d_nvcc
Add 1d texture types for NVCC path
2018-05-29 13:54:34 +05:30
Alex Voicu 75390c7083 GCC only supports power-of-2 component counts, so round up to next square. 2018-05-29 01:22:26 +01:00
Rahul Garg 024f77ce61 Add 1d texture types for NVCC path 2018-05-28 15:02:06 +05:30
Jorghi12 6e2b9d0547 Update math_functions.h
CUDA also has a function named labs.
2018-05-26 16:22:10 -04:00
Jorghi12 ec2edb2c92 Update math_functions.cpp
CUDA also has a function named labs.
2018-05-26 16:21:14 -04:00
Alex Voicu 98395ad38c Add missing alias half / half2 aliases 2018-05-26 12:10:50 +01:00
Jorghi12 4f6904b5c7 Adding double/long int signatures for abs
Adding overloads for abs that are found in cuda's math_functions.
2018-05-26 00:41:24 -04:00
Jorghi12 4383d6c6de Adding double/long int signatures for abs
Adding overloads for abs that are found in cuda's math_functions.
2018-05-26 00:40:14 -04:00
Yaxun (Sam) Liu b7801f1fe3 Add more function declarations to hip-clang 2018-05-25 16:18:46 -04:00
Alex Voicu 06e0b11106 Missing bits. 2018-05-25 20:15:04 +01:00
Alex Voicu 32388cbf28 Missing bits. 2018-05-25 20:12:21 +01:00
Alex Voicu 0eec115bcd Move converting constructor from _Float16 under macro guard. Refactor. 2018-05-25 19:46:41 +01:00
Yaxun (Sam) Liu f9693a9210 Add math function declaration for hip-clang 2018-05-25 12:22:16 -04:00
foreman cadb15165e P4 to Git Change 1559366 by gandryey@gera-w8 on 2018/05/24 18:06:45
SWDEV-79445 - OCL generic changes and code clean-up
	- Combine validateMemory() and arguments capture() under a single function. Rename validateMemory() in NDRangeKernelCommand class to captureAndValidate()

	http://ocltc.amd.com/reviews/r/14964/

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_execute.cpp#26 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.cpp#87 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.hpp#90 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/kernel.cpp#28 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/kernel.hpp#22 edit
2018-05-24 18:12:49 -04:00
Rahul Garg d8cb47242b Use 64x4 grid dims 2018-05-24 23:51:52 +05:30