Commit graph

71 Commits

Author SHA1 Message Date
Maneesh Gupta e2d97e19bc Enable cospi,rsqrt,sinpi tests for HCC newer than 16073 2016-02-22 15:13:23 +05:30
streamhsa 005155b7b2 Resolve issues for hip_popc and hip_ballot on nvcc 2016-02-19 20:18:03 +08:00
Evgeny Mankov 376fb0d8ad Support for the following device properties is added to legacy hipify.pl: hipDeviceAttributeConcurrentKernels, hipDeviceAttributeMemoryClockRate, and hipDeviceAttributeMemoryBusWidth. 2016-02-19 13:36:37 +03:00
Evgeny Mankov d4b15399f5 Guard #ifdef USE_ROCR_20 is added for ROCR_20 device properties (memoryClockRate, memoryBusWidth).
USE_ROCR_20 is not defined by default.
To add ROCR_20 support, HIP has to be compiled as follows: make CXX_DEFINES+=-DUSE_ROCR_20
2016-02-19 13:27:03 +03:00
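A minimal sketch of how a compile-time guard like the one described in d4b15399f5 can behave. This is not the HIP source: the struct `DevicePropsSketch`, the function `queryPropsSketch`, the flag `kRocr20Enabled`, and the placeholder values are all hypothetical and only illustrate that the two properties stay at their defaults unless the build passes `-DUSE_ROCR_20`.

```cpp
// Hypothetical stand-in for the two guarded fields of hipDeviceProp_t.
struct DevicePropsSketch {
    int memoryClockRate = 0;  // left at 0 when USE_ROCR_20 is not defined
    int memoryBusWidth  = 0;  // left at 0 when USE_ROCR_20 is not defined
};

// Hypothetical fill routine. A real runtime would query the device;
// the constants below are placeholders, not measured values.
inline DevicePropsSketch queryPropsSketch() {
    DevicePropsSketch p;
#ifdef USE_ROCR_20
    p.memoryClockRate = 500000;  // placeholder
    p.memoryBusWidth  = 512;     // placeholder
#endif
    return p;
}

// Compile-time flag mirroring the guard, handy for logging or tests.
#ifdef USE_ROCR_20
constexpr bool kRocr20Enabled = true;
#else
constexpr bool kRocr20Enabled = false;
#endif
```

Compiling this sketch with `-DUSE_ROCR_20` fills both fields; compiling it without the define leaves them at zero, which matches the "not defined by default" behavior stated in the commit message.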
Evgeny Mankov 14ec340746 Formatting, no functional changes. 2016-02-18 18:54:19 +03:00
Evgeny Mankov da8169dd89 Device property memoryBusWidth implementation.
+ Device property memoryBusWidth is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryBusWidth is added to hipDeviceAttribute_t struct.
+ Tests update.
2016-02-18 18:15:01 +03:00
Ben Sander 2447067f27 Update release notes 2016-02-18 21:07:14 -06:00
Ben Sander ad20273a1d Search multiple dirs. 2016-02-18 21:07:14 -06:00
Ben Sander 8c3436e927 Update doxygen HTML 2016-02-18 21:02:39 -06:00
Ben Sander 3496398651 Update doxygen HTML 2016-02-18 20:43:03 -06:00
Evgeny Mankov 8aace64dce Device property memoryClockRate implementation.
+ Device property memoryClockRate is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryClockRate is added to hipDeviceAttribute_t struct.
+ Tests update.
+ Rename hipDevAttrConcurrentKernels to hipDeviceAttributeConcurrentKernels.
2016-02-18 17:25:28 +03:00
Evgeny Mankov 859208d6f0 hipInfo sample update with new Device Properties. 2016-02-18 15:08:55 +03:00
Evgeny Mankov d4bd94e9a0 Attribute hipDevAttrConcurrentKernels for obtaining Device property concurrentKernels is added. 2016-02-18 14:34:18 +03:00
Evgeny Mankov 763aa5ea5a Formatting, no functional changes. 2016-02-15 13:16:05 +03:00
Maneesh Gupta c82511258c Documented supported fastmath functions 2016-02-12 14:21:58 +05:30
Maneesh Gupta 2659e70d48 Updated integer intrinsics documentation 2016-02-12 13:58:35 +05:30
Evgeny Mankov 460b501cbb Fix typo: maxThreadsPerMultiProcessor -> MaxSharedMemoryPerMultiprocessor
Device property MaxSharedMemoryPerMultiprocessor set equal to totalGlobalMem (HIP path).
Reason: MaxSharedMemoryPerMultiprocessor should be the same as the group memory size. Group memory will not be paged out, so the physical memory size = total shared memory size = group region size. NVCC path remains untouched: CUDA's device property MaxSharedMemoryPerMultiprocessor is reported.

hipify is updated as well.
2016-02-12 01:29:20 +03:00
Evgeny Mankov 1025341300 Device property maxThreadsPerMultiProcessor set equal to totalGlobalMem (HIP path).
Reason: maxThreadsPerMultiProcessor should be the same as the group memory size. Group memory will not be paged out, so the physical memory size = total shared memory size = group region size.

NVCC path remains untouched: CUDA's device property maxThreadsPerMultiProcessor is reported.
2016-02-12 00:04:14 +03:00
Evgeny Mankov 658e9f0484 BDFID (BusID/DeviceID/FunctionID) support.
Except FunctionID (or DomainID in CUDA) support, because cudaDeviceProp::pciDomainID is not reported by CUDA.
2016-02-11 22:26:01 +03:00
sunway513 fe1000df17 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging 2016-02-11 22:22:47 +05:30
sunway513 c7cbcfa2e9 Add reminder to keep ROCR runtime on the system library path 2016-02-11 22:22:00 +05:30
Maneesh Gupta ed2d86f3a9 Updated readme for test 2016-02-11 13:06:58 +05:30
Evgeny Mankov 3139c72756 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging 2016-02-10 17:21:53 +03:00
Evgeny Mankov d9a94191f2 Formatting, no functional changes 2016-02-10 17:21:18 +03:00
streamhsa 90add185fd Remove test for atomicInc and atomicDec 2016-02-10 21:02:52 +08:00
streamhsa 56f1832e70 Updated readme for test 2016-02-10 20:05:59 +08:00
streamhsa 2f8d56e903 Resolved test issues 2016-02-10 20:01:16 +08:00
gargrahul 51f46d9ddf Removed atomicInc and atomicDec support from HIP 2016-02-10 04:29:55 +05:30
Evgeny Mankov 4d4ca3ef3f Device property concurrentKernels is added to hipDeviceProp_t struct.
For HCC path concurrentKernels is set to true since all ROCR hardware supports this feature.
For NVCC path concurrentKernels is obtained from CUDA's device property cudaDeviceProp::concurrentKernels.
2016-02-09 17:10:35 +03:00
Maneesh Gupta f8bfc7f54c which_hip -> hipconfig 2016-02-09 11:51:26 +05:30
Maneesh Gupta 0df78ac9bf Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging 2016-02-09 10:57:46 +05:30
Maneesh Gupta f6e7abd710 Move HIP_DEVICE_COMPILE defines to hip_common.h 2016-02-09 10:57:20 +05:30
streamhsa 310023e273 Rename test hipInfo as hipGetDeviceAttribute 2016-02-09 13:19:32 +08:00
Ben Sander ce2fc0f7fe Test fixes:
- Remove reference to missing test.
- Add hipMemset back.
- Parse --gpu option to specify default starting GPU.
2016-02-08 22:55:23 -06:00
Ben Sander 2ecb345a67 minor doc touchup 2016-02-08 22:11:11 -06:00
Ben Sander c482a3f456 in HIPCHECK, only run command once even if error occurs 2016-02-08 21:45:49 -06:00
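The kind of fix described in c482a3f456 can be sketched as follows. An error-check macro that mentions its argument twice (once in the comparison, once when printing) runs the command twice on failure; storing the result in a local avoids that. The names `StatusSketch`, `failingCallSketch`, and `CHECK_ONCE_SKETCH` are hypothetical and are not the actual HIPCHECK definition.

```cpp
#include <cstdio>

// Hypothetical status type standing in for a runtime error code.
enum StatusSketch { kOk = 0, kFail = 1 };

static int g_calls  = 0;  // how many times the command actually ran
static int g_errors = 0;  // how many failures the macro reported

// Hypothetical command that always fails and counts its invocations.
inline StatusSketch failingCallSketch() {
    ++g_calls;
    return kFail;
}

// The command is evaluated exactly once and its result is stored,
// so the error path reuses the stored status instead of re-running it.
#define CHECK_ONCE_SKETCH(cmd)                                            \
    do {                                                                  \
        StatusSketch status_ = (cmd);                                     \
        if (status_ != kOk) {                                             \
            ++g_errors;                                                   \
            std::fprintf(stderr, "error %d at %s:%d\n",                   \
                         static_cast<int>(status_), __FILE__, __LINE__);  \
        }                                                                 \
    } while (0)
```

After `CHECK_ONCE_SKETCH(failingCallSketch());` the call counter is 1, whereas a macro that expanded `cmd` in both the test and the message would leave it at 2.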
Ben Sander a06e0d9050 Doc update 2016-02-08 21:44:55 -06:00
Ben Sander 39c5f0f610 Add hcc-config info to --full 2016-02-08 21:44:55 -06:00
Ben Sander fdeb477822 iScript cleanup, add --full 2016-02-08 21:44:55 -06:00
Ben Sander 26854bb31c Fix HIP_PLATFORM detection 2016-02-05 07:15:46 -06:00
Ben Sander 9aec91a3b7 Fix getdeviceattr compilation for NVCC 2016-02-04 16:26:33 -06:00
Sam Kolton afe45964ae Implementation of hipDeviceGetAttribute() 2016-02-04 17:39:27 +03:00
Ben Sander 2faf1dfe6e Merge branch 'master' into privatestaging 2016-02-03 09:39:19 -06:00
Peng Sun d4835c7416 Fix all TODO-doc 2016-02-02 21:29:09 -06:00
Peng Sun b20e02ae58 Finish all TODO for error code 2016-02-02 17:39:46 -06:00
scchan 63a6bce3d9 add inline attribute to shfl functions 2016-02-02 12:53:17 -06:00
Ben Sander 714bfcbff6 Merge branch 'master' of https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP 2016-02-02 10:05:44 -06:00
Ben Sander 182296ce59 Remove warning on ballot/any/all and pop/clz.
Since these are supported in HIP no reason to emit warnings.
2016-02-02 10:02:48 -06:00
streamhsa b14c890851 Adjusted the value of __any as per CUDA -sandeep 2016-02-02 15:25:42 +05:30
streamhsa a7c0be6e4b ADDED Support for __ffs() and __ffsll() having signed input -sandeep 2016-02-02 15:05:46 +05:30
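For reference, CUDA documents `__ffs()` and `__ffsll()` as taking signed integers and returning the 1-based position of the least significant set bit, or 0 when the input is 0. A host-side sketch of those semantics is shown here; `ffsSketch` and `ffsllSketch` are hypothetical names and this is not the HIP device implementation. Casting to unsigned first keeps the shifts well defined for negative inputs.

```cpp
#include <cstdint>

// 1-based index of the least significant set bit of a signed 32-bit value;
// returns 0 when no bit is set.
inline int ffsSketch(std::int32_t x) {
    std::uint32_t u = static_cast<std::uint32_t>(x);  // reinterpret the bit pattern
    if (u == 0) return 0;
    int pos = 1;
    while ((u & 1u) == 0) {
        u >>= 1;
        ++pos;
    }
    return pos;
}

// Same semantics for a signed 64-bit value.
inline int ffsllSketch(std::int64_t x) {
    std::uint64_t u = static_cast<std::uint64_t>(x);
    if (u == 0) return 0;
    int pos = 1;
    while ((u & 1u) == 0) {
        u >>= 1;
        ++pos;
    }
    return pos;
}
```

With these definitions, an input of -1 yields 1 (bit 0 is set) and the most negative 32-bit value yields 32, since only the sign bit is set.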