Maneesh Gupta
e2d97e19bc
Enable cospi,rsqrt,sinpi tests for HCC newer than 16073
2016-02-22 15:13:23 +05:30
streamhsa
005155b7b2
Resolve issues for hip_popc and hip_ballot on nvcc
2016-02-19 20:18:03 +08:00
Evgeny Mankov
376fb0d8ad
A support of the following device properties is added to legacy hipify.pl: hipDeviceAttributeConcurrentKernels, hipDeviceAttributeMemoryClockRate & hipDeviceAttributeMemoryBusWidth.
2016-02-19 13:36:37 +03:00
Evgeny Mankov
d4b15399f5
Guard #ifdef USE_ROCR_20 is added for ROCR_20 device properties (memoryClockRate, memoryBusWidth)
...
By default isn't defined.
To add ROCR_20 support HIP have to be compiled as follows: make CXX_DEFINES+=-DUSE_ROCR_20
2016-02-19 13:27:03 +03:00
Evgeny Mankov
14ec340746
Formatting, no functional changes.
2016-02-18 18:54:19 +03:00
Evgeny Mankov
da8169dd89
Device property memoryBusWidth implementation.
...
+ Device property memoryBusWidth is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryBusWidth is added to hipDeviceAttribute_t struct.
+ Tests update.
2016-02-18 18:15:01 +03:00
Ben Sander
2447067f27
Update release notes
2016-02-18 21:07:14 -06:00
Ben Sander
ad20273a1d
Search multiple dirs.
2016-02-18 21:07:14 -06:00
Ben Sander
8c3436e927
Update doxygen HTML
2016-02-18 21:02:39 -06:00
Ben Sander
3496398651
Update doxygen HTML
2016-02-18 20:43:03 -06:00
Evgeny Mankov
8aace64dce
Device property memoryClockRate implementation.
...
+ Device property memoryClockRate is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryClockRate is added to hipDeviceAttribute_t struct.
+ Tests update.
+ Rename hipDevAttrConcurrentKernels to hipDeviceAttributeConcurrentKernels.
2016-02-18 17:25:28 +03:00
Evgeny Mankov
859208d6f0
hipInfo sample update with new Device Properties.
2016-02-18 15:08:55 +03:00
Evgeny Mankov
d4bd94e9a0
Attribute hipDevAttrConcurrentKernels for obtaining Device property concurrentKernels is added.
2016-02-18 14:34:18 +03:00
Evgeny Mankov
763aa5ea5a
Formatting, no functional changes.
2016-02-15 13:16:05 +03:00
Maneesh Gupta
c82511258c
Documented supported fastmath functions
2016-02-12 14:21:58 +05:30
Maneesh Gupta
2659e70d48
Updated integer intrinsics documentation
2016-02-12 13:58:35 +05:30
Evgeny Mankov
460b501cbb
Fix typo: maxThreadsPerMultiProcessor -> MaxSharedMemoryPerMultiprocessor
...
Device property MaxSharedMemoryPerMultiprocessor set equal to totalGlobalMem (HIP path).
Reason: MaxSharedMemoryPerMultiprocessor should be as the same as group memory size. Group memory will not be paged out, so, the physical memory size = total shared memory size = group region size. NVCC path remains untouched: CUDA's device property MaxSharedMemoryPerMultiprocessor is reported.
hipify is updated as well.
2016-02-12 01:29:20 +03:00
Evgeny Mankov
1025341300
Device property maxThreadsPerMultiProcessor set equal to totalGlobalMem (HIP path).
...
Reason: maxThreadsPerMultiProcessor should be as the same as group memory size. Group memory will not be paged out, so, the physical memory size = total shared memory size = group region size.
NVCC path remains untouched: CUDA's device property maxThreadsPerMultiProcessor is reported.
2016-02-12 00:04:14 +03:00
Evgeny Mankov
658e9f0484
BDFID (BusID/DeviceID/FunctionID) support.
...
Except FunctionID (or DomainID in CUDA) support, because cudaDeviceProp::pciDomainID is not reported by CUDA.
2016-02-11 22:26:01 +03:00
sunway513
fe1000df17
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-02-11 22:22:47 +05:30
sunway513
c7cbcfa2e9
Add reminder to keep ROCR runtime on the system library path
2016-02-11 22:22:00 +05:30
Maneesh Gupta
ed2d86f3a9
Updated readme for test
2016-02-11 13:06:58 +05:30
Evgeny Mankov
3139c72756
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-02-10 17:21:53 +03:00
Evgeny Mankov
d9a94191f2
Formatting, no functional changes
2016-02-10 17:21:18 +03:00
streamhsa
90add185fd
Remove test for atomicInc and atomicDec
2016-02-10 21:02:52 +08:00
streamhsa
56f1832e70
Updated readme for test
2016-02-10 20:05:59 +08:00
streamhsa
2f8d56e903
Resolved test issues
2016-02-10 20:01:16 +08:00
gargrahul
51f46d9ddf
Removed atomicInc and atomicDec support from HIP
2016-02-10 04:29:55 +05:30
Evgeny Mankov
4d4ca3ef3f
Device property concurrentKernels is added to hipDeviceProp_t struct.
...
For HCC path concurrentKernels is set to true since all ROCR hardware supports this feature.
For NVCC path concurrentKernels is obtained from CUDA's device property cudaDeviceProp::concurrentKernels.
2016-02-09 17:10:35 +03:00
Maneesh Gupta
f8bfc7f54c
which_hip -> hipconfig
2016-02-09 11:51:26 +05:30
Maneesh Gupta
0df78ac9bf
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-02-09 10:57:46 +05:30
Maneesh Gupta
f6e7abd710
Move HIP_DEVICE_COMPILE defines to hip_common.h
2016-02-09 10:57:20 +05:30
streamhsa
310023e273
Rename test hipInfo as hipGetDeviceAttribute
2016-02-09 13:19:32 +08:00
Ben Sander
ce2fc0f7fe
Test fixes:
...
- Remove reference to missing test.
- Add hipMemset back.
- Parse --gpu option to specify default starting GPU.
2016-02-08 22:55:23 -06:00
Ben Sander
2ecb345a67
minor doc touchup
2016-02-08 22:11:11 -06:00
Ben Sander
c482a3f456
in HIPCHECK, only run command once even if error occurs
2016-02-08 21:45:49 -06:00
Ben Sander
a06e0d9050
Doc update
2016-02-08 21:44:55 -06:00
Ben Sander
39c5f0f610
Add hcc-config info to --full
2016-02-08 21:44:55 -06:00
Ben Sander
fdeb477822
iScript cleanup, add --full
2016-02-08 21:44:55 -06:00
Ben Sander
26854bb31c
Fix HIP_PLATFORM detection
2016-02-05 07:15:46 -06:00
Ben Sander
9aec91a3b7
Fix getdeviceattr compilation for NVCC
2016-02-04 16:26:33 -06:00
Sam Kolton
afe45964ae
Implementation of hipDeviceGetAttribute()
2016-02-04 17:39:27 +03:00
Ben Sander
2faf1dfe6e
Merge branch 'master' into privatestaging
2016-02-03 09:39:19 -06:00
Peng Sun
d4835c7416
Fix all TODO-doc
2016-02-02 21:29:09 -06:00
Peng Sun
b20e02ae58
Finish all TODO for error code
2016-02-02 17:39:46 -06:00
scchan
63a6bce3d9
add inline attribute to shfl functions
2016-02-02 12:53:17 -06:00
Ben Sander
714bfcbff6
Merge branch 'master' of https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP
2016-02-02 10:05:44 -06:00
Ben Sander
182296ce59
Remove warning on ballot/any/all and pop/clz.
...
Since these are supported in HIP no reason to emit warnings.
2016-02-02 10:02:48 -06:00
streamhsa
b14c890851
Adjusted the value of __any as per CUDA -sandeep
2016-02-02 15:25:42 +05:30
streamhsa
a7c0be6e4b
ADDED Support for __ffs() and __ffsll() having signed input -sandeep
2016-02-02 15:05:46 +05:30