Commit Graph

1657 Commits

Author SHA1 Message Date
Kent Knox ff920bc06e Adding initial set Jenkinsfile and dockerfiles
For continuous integration enablement


[ROCm/clr commit: 3ee99588f4]
2017-06-26 13:18:15 -05:00
Maneesh Gupta a11bc9fe4a Updated RELEASE.md
Change-Id: Ic451612555c66f3ed7131514fc97fcc41091370a


[ROCm/clr commit: 6174e69f87]
2017-06-12 11:20:28 +05:30
Maneesh Gupta 4760a21b55 Update directed tests README.md
Change-Id: I395245454d376508f04e5a4a62c8933895cb3867


[ROCm/clr commit: 15a3464630]
2017-06-12 11:19:55 +05:30
Patrick Flick 2f1e4e84a2 fix typo
[ROCm/clr commit: 821c238bad]
2017-06-12 10:15:27 +05:30
Maneesh Gupta 56a827a8ee Merge branch hipify-updates into amd-develop
Change-Id: I13d8750027a2a8787e4eb2e1ed525cf69d14b805


[ROCm/clr commit: d2b90ad93c]
2017-06-12 10:10:19 +05:30
Maneesh Gupta cdd3846478 Initial implementation of hipify-cmakefile
Change-Id: Id365da9f887b5c3409639f000b430d093fd4f6b3


[ROCm/clr commit: c5366a55f1]
2017-06-12 09:57:17 +05:30
Sun, Peng 14e51d052e Fix error related to undefined reference of __get_dynamicgroupbaseptr().
Change-Id: I14951e1725e35dd5f5e53805f81cdb58661f59f2


[ROCm/clr commit: 682dda4418]
2017-06-08 19:24:32 -05:00
Sun, Peng 65ead764d3 Add clang version guard so the hip_fp16.h header won't be picked up by gcc
Change-Id: Ia21335a455bc93210901b44bc8c76a7f4a385b55


[ROCm/clr commit: 5450021f93]
2017-06-08 19:24:32 -05:00
Ben Sander 2408d0cf79 Use amHostCoherentFlag. Requires new HCC version.
[ROCm/clr commit: 9bfc7b0e13]
2017-06-07 09:06:40 -05:00
Maneesh Gupta aeb85a76ef hip_hcc package: add libstdc++-static as a rpm dependency
Change-Id: I83a79353492a6be3d788b7c0ce4a8f3aa740d9d9


[ROCm/clr commit: ff4fae7d20]
2017-06-07 15:50:28 +05:30
Maneesh Gupta 57bfd56a0d hipMemcpy-size test: reduce max size to make it work correctly on nvcc path
Change-Id: I9ce9f5a9e141ffd8ddf961269010b33358e02771


[ROCm/clr commit: ff8ade59aa]
2017-06-07 15:25:54 +05:30
Maneesh Gupta 949fbad6e2 hipDeviceMemcpy test: make it functional on nvcc path
Change-Id: Id10c79b48747ed701adbd0a233c53cd60cfa743b


[ROCm/clr commit: a50f5ca0ac]
2017-06-07 15:24:44 +05:30
Maneesh Gupta e23be58d91 p2p_copy_coherency test: gracefully handle single gpu case
Change-Id: I216663f67ef58c673136332635dab8b57079b909


[ROCm/clr commit: a7dc938ec0]
2017-06-07 15:23:37 +05:30
Ben Sander daae691cdb Enable HCC_OPT_FLUSH=1.
Requires an appropriate HCC with this support:
commit 38e392b517a46a09a3b1c8f388e6a0db3741c510


[ROCm/clr commit: c2baa4f6e6]
2017-06-07 00:15:05 -05:00
Sun, Peng 450a26e5d4 Improve HIP kernel names, attributes and codegen, contributed by Alex Voicu
Change-Id: I2cafbdc5a98e26c7f4fad84739c915e7dc09993c


[ROCm/clr commit: 3b6a863eef]
2017-06-05 11:39:00 -05:00
Ben Sander d1fe7f1683 Enable HIP_SYNC_NULL_STREAM=0 optimization.
[ROCm/clr commit: 344b6cb0c0]
2017-06-05 08:50:41 -05:00
Ben Sander 7237ed04f3 Fix HIP_SYNC_NULL_STREAM=0 mode.
- Fix null-stream sync
- hipStreamDestroy of null stream returns hipErrorInvalidResourceHandle
- Update documentation.
- Add tests for null stream sync, hipEventElapsedTime.
- Rename internal enum hipEventStatusRecorded to hipEventStatusComplete
- refactor hipStreamWaitEvent to streamline control-flow


[ROCm/clr commit: 823281dcba]
2017-06-05 08:50:22 -05:00
Ben Sander be21cd1a91 Update tests.
Fix some NVCC issues.
Add hipStreamSync2, record_event tests.


[ROCm/clr commit: 863b7c3f56]
2017-06-04 20:18:37 -05:00
Ben Sander 6aaeed821d Update tests, add p2p coherency test.
[ROCm/clr commit: 15f54fb943]
2017-06-03 17:11:34 -05:00
Aditya Atluri 97fa7aeef6 added half data type and vector destructors
1. Added half data types to hip_fp16.h
2. Added destructor to vector data types

Change-Id: Id5ae76a663bb90a4bde2839ec79c58fbaee5072f


[ROCm/clr commit: fdcc223842]
2017-06-02 11:19:33 -05:00
emankov 6235e4bc7f [HIPIFY] annotation
[ROCm/clr commit: c5f9758f4b]
2017-06-02 16:33:48 +03:00
emankov ef444588e1 [HIPIFY] rename legacy hipify perl script and its usage to hipify-perl
[ROCm/clr commit: e7779650e9]
2017-06-02 16:30:43 +03:00
Evgeny Mankov b30b1acc5c [HIPIFY] All CUDA 8.0.44 API functions update
(for both Driver and Runtime APIs)

1) P2P
cuDeviceGetP2PAttribute   cudaDeviceGetP2PAttribute

2) Memory Mngmnt
cuMemPrefetchAsync        cudaMemPrefetchAsync
cuMemAdvise               cudaMemAdvise
cuMemRangeGetAttribute    cudaMemRangeGetAttribute
cuMemRangeGetAttributes   cudaMemRangeGetAttributes

3) Streams (Driver API only, no analogues in Runtime API)
cuStreamWaitValue32
cuStreamWaitValue32
cuStreamWriteValue32

4) Texture Reference Mngmnt (Driver API only, no analogues in Runtime API)
cuTexRefSetBorderColor
cuTexRefGetBorderColor


[ROCm/clr commit: ee85243bcd]
2017-06-01 21:08:33 +03:00
Siu Chi Chan 8514cf513a fix atomicCAS:remove load for the return value after CAS
[ROCm/clr commit: 969931b1ce]
2017-05-31 15:20:19 -04:00
Evgeny Mankov afbf55a9dc [HIP] [HIPIFY] CUDA Driver API 8.0.44 JIT options support.
[ROCm/clr commit: 463c026976]
2017-05-31 18:55:29 +03:00
Maneesh Gupta 0e4e17db27 Fix hipMemoryAllocate test for single GPU
Change-Id: If121c18ab490ba125dc689ffc08a8839fd280c38


[ROCm/clr commit: 06ee0d3704]
2017-05-31 10:16:57 +05:30
Maneesh Gupta 2985fa3814 Disable rcbrtf, scalblnf, scalbnf in single precision device test
Change-Id: I8a250a64a0cb05132d022a11d9766ced9cdf11a7


[ROCm/clr commit: 2145e94049]
2017-05-31 10:16:19 +05:30
Maneesh Gupta 13896d6fb9 Disable rcbrt, scalbln and scalbn double precision device test
Change-Id: I46bd895701c46d3592b553090cafba99e41a2e2d


[ROCm/clr commit: da19087ae2]
2017-05-31 10:15:41 +05:30
Sandeep Kumar ac8089e773 Add readme for inline asm and unroll cookbook samples
Change-Id: I71b7a5652c3dad181c5df60ab0dd1b81d79f1bfb


[ROCm/clr commit: f6b98854ba]
2017-05-31 09:25:50 +05:30
Sandeep Kumar c3167f463d Add inline asm hip directed tests for v_add and v_mac
Change-Id: Ie5ace2e42d5da89b16e040537df2bb13d3883c6d


[ROCm/clr commit: c964a5f208]
2017-05-31 09:25:40 +05:30
Sandeep Kumar be31ebb8a7 Add unroll and inline asm cookbook samples
Change-Id: Ie5a0fbb01b7fca82959090d89299533d49e092f1


[ROCm/clr commit: 5696eaf842]
2017-05-31 09:25:35 +05:30
Sandeep Kumar b22fdeb171 Print msg for single gpu
Change-Id: I2d23c73542add8973990ba96592016726994422e


[ROCm/clr commit: e104c2e3bf]
2017-05-31 09:25:17 +05:30
Ben Sander 81354999e8 Set event->_stream on hipHccModuleLaunchKernel path if start/stop used
Ensure _stream is always non-null in recorded events.
Fixes isDefaultStream fault.


[ROCm/clr commit: 6cc5dc0326]
2017-05-30 21:55:46 -05:00
Evgeny Mankov 54b3c90964 [HIPIFY] Add the rest CUDA Runtime API 8.0.44 Data structures.
+ sync with corresponding CUDA Driver API Data structures.

P.S.
There are no new changes in CUDA Runtime API 8.0.61 Data structures since 8.0.44.


[ROCm/clr commit: 997ed19bb8]
2017-05-30 19:45:59 +03:00
Evgeny Mankov 306dca2c78 [HIPIFY] Add the rest CUDA Driver API 8.0.44 Data structures.
+ Memory advise values
+ Memory Range Attributes
+ P2P Attributes

P.S.
There are no new changes in CUDA Driver API 8.0.61 Data structures since 8.0.44.


[ROCm/clr commit: a020eb76dd]
2017-05-30 18:29:14 +03:00
Evgeny Mankov ffb0d43b07 [HIPIFY] Add more CUDA Driver API 8.0.44 Data structures.
[ROCm/clr commit: ef86f943ac]
2017-05-30 17:58:13 +03:00
Maneesh Gupta 06cdafe311 Disable normcdfinvf on __host__
Change-Id: If7bfc9826a09eb9b7675ea2a417b9418759b7912


[ROCm/clr commit: 445012d451]
2017-05-30 15:45:22 +05:30
Ben Sander 0ca3262f0a Add event controls for release fences.
Env var : HIP_EVENT_SYS_RELEASE
Event allocation flags : hipEventReleaseToDevice, hipEventReleaseToSystem
   (remove hipEventDisableSystemRelease)

Update test for new functionality.


[ROCm/clr commit: 942ec0eff8]
2017-05-27 16:02:34 -05:00
Ben Sander d6e8f5bbdc Cleanup hipEvent. (Intermediate checkpoint)
Support hipEventDisableSystemRelease flag.
Update test.
Remove stray printf


[ROCm/clr commit: c8178c6838]
2017-05-27 16:02:34 -05:00
Ben Sander 9442c6dd2d Updates so hip compiles on CUDA.
[ROCm/clr commit: 8dc968f036]
2017-05-27 15:55:07 -05:00
Ben Sander d9587ae2f0 Add isDefaultStream() accessor.
Fix code that checked for stream==nullptr after stream had been
resolved to a "true stream".


[ROCm/clr commit: b2b620c12b]
2017-05-26 13:46:48 -05:00
Siu Chi Chan 6c3a05ac5b fix hip_fast_dsqrt* to call a double fp sqrt function
[ROCm/clr commit: a3595d2e8c]
2017-05-25 23:15:30 -04:00
Evgeny Mankov ed54e3d0ee [FIX] [HIPIFY] Add matchers for function return types.
https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/73

Examples (https://github.com/thrust/thrust/blob/master/thrust/system/cuda/detail/trivial_copy.inl):

template<typename System1,
         typename System2>
cudaStream_t cuda_memcpy_stream(const thrust::cpp::execution_policy<System1> &,
                                const thrust::cuda::execution_policy<System2> &exec)

template<typename System1,
         typename System2>
cudaMemcpyKind cuda_memcpy_kind(const thrust::cuda::execution_policy<System1> &,
                                const thrust::cpp::execution_policy<System2> &)


[ROCm/clr commit: a19ecab3f2]
2017-05-24 18:25:40 +03:00
Ben Sander b4363ffcba Remove HIP_NUM_KERNELS_INFLIGHT. (redundant with HCC controls)
[ROCm/clr commit: 35212632e7]
2017-05-24 01:03:28 -05:00
Ben Sander d302498787 Add hipHostMallocCoherent, hipHostMallocNonCoherent
Provide per-allocation control over coherent/non-coherent mem.
These override the default HIP_COHERENT_HOST_ALLOC setting.


[ROCm/clr commit: dda70ae514]
2017-05-24 00:48:10 -05:00
Ben Sander ae983e1b09 Remove HIP_MAX_QUEUES (replaced with HCC_MAX_QUEUES)
[ROCm/clr commit: d43d57d39c]
2017-05-23 23:48:01 -05:00
Ben Sander 59e07db865 Expand test to cover copy followed by event sync
[ROCm/clr commit: 92bd54d7b3]
2017-05-23 23:15:45 -05:00
Ben Sander 2e8625a208 Use accelerator_scope for create_marker and create_blocking_marker.
As optimization when system-scope is not needed.


[ROCm/clr commit: 2d5b3359c6]
2017-05-23 23:15:45 -05:00
Ben Sander 0cde8e5db4 Fix trace category for hipHostMalloc
[ROCm/clr commit: ca07615c37]
2017-05-23 23:15:45 -05:00
Evgeny Mankov 9e7a50b1e0 [FIX] [HIPIFY] Matcher for new operator is missing.
https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/80

Example from CUDA 8.0.44 sample (CUDASamples\0_Simple\matrixMulDrv\matrixMulDrv.cpp):
    CUjit_option *jitOptions = new CUjit_option[jitNumOptions];
where CUjit_option is enum, should be:
    hipJitOption *jitOptions = new hipJitOption[jitNumOptions];


[ROCm/clr commit: 21d74f09b9]
2017-05-23 19:45:38 +03:00