Граф коммитов

186 Коммитов

Автор SHA1 Сообщение Дата
Jack Chung bf4ba9e09e Squashed commit of the following:
commit 9548493fa754b3bf5c31cbdc2211db1e73e8c07c
Author: Jack Chung <whchung@gmail.com>
Date:   Mon May 23 11:57:23 2016 +0800

    Rename hipExternShared test to hipDynamicShared

    Change-Id: I180d9d539420fb69cfc121eceaa7db9da03483b2

commit 827081f8244a38f010789d556db0c4ff7b6422d8
Author: Jack Chung <whchung@gmail.com>
Date:   Mon May 23 11:56:27 2016 +0800

    Rename HIP_DECLARE_EXTERN_SHARED to HIP_DYNAMIC_SHARED

    Change-Id: I22362d179812ac547e0f11ba4e2bb999050e08ae

commit 4c277228ed41af187739610fa17eab1fb144c947
Author: Jack Chung <whchung@gmail.com>
Date:   Thu May 19 17:49:52 2016 +0800

    Adopt new interface to get dynamic LDS in hc.hpp

    Change-Id: I47b433b714633a4c97df87c40a0b1d3386429a00

commit 5a36117d777064113a528dc47b42e8c8413baa97
Author: Jack Chung <whchung@gmail.com>
Date:   Thu May 19 11:29:24 2016 +0800

    Add test patterns for regular expression to match "extern __shared__"

    These test patterns should better be saved as an individual test case, but I'm
    not familiar with HIP test structures so I leave them as comments in hipify as
    of now.

    Change-Id: I7fee89c89b9e73de2133357a226ec0c769733531

commit 1b26284168c7f5339f63338fd0149bed5d994656
Author: Jack Chung <whchung@gmail.com>
Date:   Thu May 19 11:25:23 2016 +0800

    Add one HIP unit test to use HIP_DECLARE_EXTERN_SHARED

    Change-Id: I4d9907815920693a74ea9d575fe26e7c67636109

commit 77b816ee5972b13d829d5bbcf06fbfd07acea2af
Author: Jack Chung <whchung@gmail.com>
Date:   Wed May 18 19:18:59 2016 +0800

    Adopt HIP_ prefix for DECLARE_EXTERN_SHARED macro

    Change-Id: I555ded16b449b67d2e20904013d86fe1ded6a2be

commit ef0997939c3578a9ae11621bf21c0416f04d2622
Author: Jack Chung <whchung@gmail.com>
Date:   Wed May 18 17:42:04 2016 +0800

    Modify hipify to support converting extern __shared__ to DECLARE_EXTERN_SHARED macro

    Added regular expression to search & replace extern __shared__ declarations to
    DECLARE_EXTERN_SHARED macro.

    Limitation:
    - Won't work if "extern __shared__" is declared at global scope

    Sample Usages:
    extern __shared__ double foo[];
    extern __shared__ unsigned int foo[];
    extern volatile __shared__ double foo[];
    extern volatile __shared__ unsigned int sdata[];
    extern __shared__ volatile unsigned int sdata[];
    extern __shared__ T s[];
    extern __shared__ T::type s[];
    extern __shared__ blah<T>::type s[];
    extern __shared__ typename mapper<Float>::type s_data[];
    extern __attribute__((used)) __shared__ typename mapper<Float>::type s_data[];

    Change-Id: I2be0b7039adeddb789f5a2b067d403a43fdc3e26

commit 93ff268724493aedfacdcd5a5aa9a100f4ebaed0
Author: Jack Chung <whchung@gmail.com>
Date:   Wed May 18 15:13:09 2016 +0800

    Introduce DECLARE_EXTERN_SHARED macro to encapsulate "extern __shared__" decls

    Change-Id: I93b2d37c763195b0ca9fd0afee78605a1e3272db

commit cff9c95412de343cc6405158b5acc4f1029267ff
Author: Jack Chung <whchung@gmail.com>
Date:   Wed May 18 12:53:54 2016 +0800

    Add __get_dynamic_groupbaseptr() to point to dynamic LDS

    Change-Id: I97b548d8a691488057617c551a8f331cad7afc77

Change-Id: I84e7875b76fa1f59e860e19c93bd4209cdd1fd2c
2016-06-05 06:20:44 -04:00
Maneesh Gupta 0452d7c1af Add hipSetDeviceFlags implementation for NVCC
Change-Id: Id19e965950b728cfe0a514e6cdf65a191d063c10
2016-05-20 12:39:02 +05:30
Rahul Garg 381dc93b96 Support for Atomic inc and dec in HIP
Change-Id: I783e4917cece5cc379894f0d293382315fbfa8b0
2016-05-12 11:10:48 +05:30
Jack Chung 453d07daf4 Fix math functions so they can be used on host codes as well
Add __host__ for some math functions to fix the following tests:

- hipSinglePrecisionMathHost
- hipDobulePrecisionMathHost
2016-05-06 08:09:52 +08:00
Ben Sander 89df2f4e2f Merge branch 'privatestaging' into grid_launch 2016-05-02 18:38:20 -05:00
Ben Sander 4b1d3d2cba enable hipHostRegister 2016-04-28 17:54:03 -05:00
Ben Sander db7877320f remove kalmar_defines.h 2016-04-28 17:40:17 -05:00
Ben Sander 558975c046 remove hacks to disable math or LDG functions. 2016-04-28 14:14:22 -05:00
Ben Sander 9026146951 Properly pair #if/#endif 2016-04-28 13:43:32 -05:00
Ben Sander efaa278cbf merge fix 2016-04-28 13:24:40 -05:00
Ben Sander bae6462e3b hipcc uses -stdlib=stdlibc++ by default.
hipcc switches to -stdlib=libc++ if user requests it.
hack to disable LDG.
2016-04-28 13:17:49 -05:00
Jack Chung 6756b8f0bc Fix compilation error in hipSinglePrecisionMathHost test 2016-04-27 17:46:27 -05:00
Aditya Atluri cac8110a4f changed to guard from hc.hpp 2016-04-27 17:46:27 -05:00
bwicakso 77d4b41118 Merge remote-tracking branch 'refs/remotes/origin/privatestaging' into kernel_synchronization 2016-04-25 14:43:28 -05:00
bwicakso c976e6bcd3 Update with original 2016-04-25 14:42:35 -05:00
bwicakso f0974e5867 Merge remote-tracking branch 'refs/remotes/origin/privatestaging' into kernel_synchronization 2016-04-25 13:57:28 -05:00
bwicakso a080650e18 Merge from original 2016-04-25 13:55:44 -05:00
Aditya Atluri ec23aba6f9 Fixed git spills 2016-04-22 09:19:05 -05:00
Aditya Atluri 4dac4789ff added workweek to hip_ldg 2016-04-22 09:14:26 -05:00
Maneesh Gupta ffdf6ab23b Merge branch 'release_0.84.00' into privatestaging
Conflicts:
	include/hcc_detail/hip_runtime.h
	src/hip_hcc.cpp
2016-04-22 10:55:58 +05:30
Aditya Atluri 9ca8584ec0 reorg make_datatype feature 2016-04-21 14:51:19 -05:00
Aditya Atluri ba59ea87ab added full data type support for __ldg 2016-04-21 11:17:26 -05:00
bwicakso df98fd8531 Fix for kernel synchronization
The completion future of a particular kernel is lost if there are
multiple kernels in the stream. This can cause a racing condition where
the signal associated with the unreferenced completion_future might get
released by hcc runtime.
2016-04-20 15:51:39 -05:00
Aditya Atluri de7952cd06 added copyright for ldg 2016-04-20 12:28:02 -05:00
Aditya Atluri 805b268ad4 added support for __ldg 2016-04-20 12:25:40 -05:00
pensun 883a3fe831 update API trace information for hipLaunchKernel 2016-04-20 09:57:55 -05:00
pensun 2ddade96d7 Merge branch 'release_0.84.00' of https://github.com/AMDComputeLibraries/HIP-privatestaging into release_0.84.00 2016-04-20 09:48:52 -05:00
pensun 7f37fc4ec7 update hipLaunchKernel API trace information 2016-04-20 09:48:35 -05:00
Ben Sander 453615ed57 Fix hipDeviceReset synchronization 2016-04-19 11:56:12 -05:00
Jack Chung 07efbcf0d0 Fix compilation error in hipSinglePrecisionMathHost test 2016-04-19 23:17:09 +08:00
Maneesh Gupta bc4f388b69 Prefix HIP includes with hip/ 2016-04-19 15:02:12 +05:30
Ben Sander 30233cd2e0 Add experimental notes around p2p 2016-04-18 23:52:42 -05:00
Ben Sander 3bcefdaa2d Add USE_HCC_LOCK_API.
Default off, if set will compile assuming HCC lock APIs exist.
2016-04-18 23:50:52 -05:00
Ben Sander e020d68309 Fixes for P2P and hipDeviceReset
- devicereset would lose track of default stream and thus subsequent
  synchronization calls might not actually sychronize.
- Also deviceReset now correctly frees streams.
- fix waits in P2P staging copy - first phase (Device0-to-Staging) must
  wait for second phase (Staging to Device1) to finish draining the
  buffer.
2016-04-18 20:49:33 -05:00
Ben Sander 65abde6626 Move HIP_HCC define to CMake 2016-04-17 07:40:04 -05:00
Ben Sander 49cc5aec91 Merge branch 'privatestaging' into p2p
Conflicts:
	include/hcc_detail/hip_hcc.h
	src/hip_hcc.cpp
2016-04-17 06:46:52 -05:00
Aditya Atluri 846432a1c1 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging 2016-04-16 17:10:32 -05:00
Aditya Atluri 8dc1bdcbe6 Corrected Memcpydefault 2016-04-16 17:10:13 -05:00
Ben Sander dcabc9dbf7 P2P Update.
- add P2P staging buffer copy.
- If copy device does not have sufficient access permissions, fall back
  to staging buffer.
- improve docs for which copy device is used.
2016-04-16 10:18:56 -05:00
streamhsa 76f2670cef Add __ldg 2016-04-16 21:12:09 +08:00
Maneesh Gupta dc4c174a54 Bump min required hcc to workweek 16155 2016-04-16 14:49:10 +05:30
Aditya Atluri a0aef11da7 changed to guard from hc.hpp 2016-04-15 11:21:45 -05:00
Aditya Atluri 2e80de6dcb improved error returns for hipHostRegister/UnRegister functions 2016-04-15 10:42:31 -05:00
pensun bdd26bd1f1 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging 2016-04-14 09:02:16 -05:00
Aditya Atluri facc7a39be Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging 2016-04-14 09:17:30 -05:00
Aditya Atluri 573224e55f Enabling make_* device code 2016-04-14 09:17:13 -05:00
pensun 234727c801 Add missing cudaMemsetAsync transformation 2016-04-14 09:02:02 -05:00
Maneesh Gupta 011a0335d4 Add hip soft-link to include for migrating to new include format 2016-04-14 16:37:27 +05:30
Jack Chung 210ba67b84 Add missing USE_PEER_TO_PEER macro definition 2016-04-13 13:56:26 +08:00
Aditya Atluri 4dfe27e8b5 Removed pragma once 2016-04-12 09:22:03 -05:00