Evgeny Mankov
6add51ef8c
Fix typo: maxThreadsPerMultiProcessor -> MaxSharedMemoryPerMultiprocessor
...
Device property MaxSharedMemoryPerMultiprocessor set equal to totalGlobalMem (HIP path).
Reason: MaxSharedMemoryPerMultiprocessor should be as the same as group memory size. Group memory will not be paged out, so, the physical memory size = total shared memory size = group region size. NVCC path remains untouched: CUDA's device property MaxSharedMemoryPerMultiprocessor is reported.
hipify is updated as well.
[ROCm/clr commit: 460b501cbb ]
2016-02-12 01:29:20 +03:00
Evgeny Mankov
735d4738ad
Device property maxThreadsPerMultiProcessor set equal to totalGlobalMem (HIP path).
...
Reason: maxThreadsPerMultiProcessor should be as the same as group memory size. Group memory will not be paged out, so, the physical memory size = total shared memory size = group region size.
NVCC path remains untouched: CUDA's device property maxThreadsPerMultiProcessor is reported.
[ROCm/clr commit: 1025341300 ]
2016-02-12 00:04:14 +03:00
Evgeny Mankov
a8b7647f8b
BDFID (BusID/DeviceID/FunctionID) support.
...
Except FunctionID (or DomainID in CUDA) support, because cudaDeviceProp::pciDomainID is not reported by CUDA.
[ROCm/clr commit: 658e9f0484 ]
2016-02-11 22:26:01 +03:00
sunway513
6bfdfc34a0
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/clr commit: fe1000df17 ]
2016-02-11 22:22:47 +05:30
sunway513
38cc074f08
Add reminder to keep ROCR runtime on the system library path
...
[ROCm/clr commit: c7cbcfa2e9 ]
2016-02-11 22:22:00 +05:30
Maneesh Gupta
f826c7aaae
Updated readme for test
...
[ROCm/clr commit: ed2d86f3a9 ]
2016-02-11 13:06:58 +05:30
Evgeny Mankov
cedd1c0947
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/clr commit: 3139c72756 ]
2016-02-10 17:21:53 +03:00
Evgeny Mankov
2478fc078f
Formatting, no functional changes
...
[ROCm/clr commit: d9a94191f2 ]
2016-02-10 17:21:18 +03:00
streamhsa
03c2768897
Remove test for atomicInc and atomicDec
...
[ROCm/clr commit: 90add185fd ]
2016-02-10 21:02:52 +08:00
streamhsa
688a9a19a5
Updated readme for test
...
[ROCm/clr commit: 56f1832e70 ]
2016-02-10 20:05:59 +08:00
streamhsa
5d857b2bc3
Resolved test issues
...
[ROCm/clr commit: 2f8d56e903 ]
2016-02-10 20:01:16 +08:00
gargrahul
91a5b0aa77
Removed atomicInc and atomicDec support from HIP
...
[ROCm/clr commit: 51f46d9ddf ]
2016-02-10 04:29:55 +05:30
Evgeny Mankov
9f596e0aab
Device property concurrentKernels is added to hipDeviceProp_t struct.
...
For HCC path concurrentKernels is set to true since all ROCR hardware supports this feature.
For NVCC path concurrentKernels is obtained from CUDA's device property cudaDeviceProp::concurrentKernels.
[ROCm/clr commit: 4d4ca3ef3f ]
2016-02-09 17:10:35 +03:00
Maneesh Gupta
4df8743f84
which_hip -> hipconfig
...
[ROCm/clr commit: f8bfc7f54c ]
2016-02-09 11:51:26 +05:30
Maneesh Gupta
978aac7fe0
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/clr commit: 0df78ac9bf ]
2016-02-09 10:57:46 +05:30
Maneesh Gupta
8442259da0
Move HIP_DEVICE_COMPILE defines to hip_common.h
...
[ROCm/clr commit: f6e7abd710 ]
2016-02-09 10:57:20 +05:30
streamhsa
71d3c9f306
Rename test hipInfo as hipGetDeviceAttribute
...
[ROCm/clr commit: 310023e273 ]
2016-02-09 13:19:32 +08:00
Ben Sander
d258c7fdcd
Test fixes:
...
- Remove reference to missing test.
- Add hipMemset back.
- Parse --gpu option to specify default starting GPU.
[ROCm/clr commit: ce2fc0f7fe ]
2016-02-08 22:55:23 -06:00
Ben Sander
3b04ce4e81
minor doc touchup
...
[ROCm/clr commit: 2ecb345a67 ]
2016-02-08 22:11:11 -06:00
Ben Sander
6fe93da014
in HIPCHECK, only run command once even if error occurs
...
[ROCm/clr commit: c482a3f456 ]
2016-02-08 21:45:49 -06:00
Ben Sander
8b291201de
Doc update
...
[ROCm/clr commit: a06e0d9050 ]
2016-02-08 21:44:55 -06:00
Ben Sander
3ce8ed9f4c
Add hcc-config info to --full
...
[ROCm/clr commit: 39c5f0f610 ]
2016-02-08 21:44:55 -06:00
Ben Sander
c94c562f26
iScript cleanup, add --full
...
[ROCm/clr commit: fdeb477822 ]
2016-02-08 21:44:55 -06:00
Ben Sander
a1bbf9aa3d
Fix HIP_PLATFORM detection
...
[ROCm/clr commit: 26854bb31c ]
2016-02-05 07:15:46 -06:00
Ben Sander
0f1752e720
Fix getdeviceattr compilation for NVCC
...
[ROCm/clr commit: 9aec91a3b7 ]
2016-02-04 16:26:33 -06:00
Sam Kolton
136baccbe5
Implementation of hipDeviceGetAttribute()
...
[ROCm/clr commit: afe45964ae ]
2016-02-04 17:39:27 +03:00
Ben Sander
054598a1b3
Merge branch 'master' into privatestaging
...
[ROCm/clr commit: 2faf1dfe6e ]
2016-02-03 09:39:19 -06:00
Peng Sun
b9367aedec
Fix all TODO-doc
...
[ROCm/clr commit: d4835c7416 ]
2016-02-02 21:29:09 -06:00
Peng Sun
4afe96bf21
Finish all TODO for error code
...
[ROCm/clr commit: b20e02ae58 ]
2016-02-02 17:39:46 -06:00
scchan
a9745be3f4
add inline attribute to shfl functions
...
[ROCm/clr commit: 63a6bce3d9 ]
2016-02-02 12:53:17 -06:00
Ben Sander
0d62f7767a
Merge branch 'master' of https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP
...
[ROCm/clr commit: 714bfcbff6 ]
2016-02-02 10:05:44 -06:00
Ben Sander
71bfa20508
Remove warning on ballot/any/all and pop/clz.
...
Since these are supported in HIP no reason to emit warnings.
[ROCm/clr commit: 182296ce59 ]
2016-02-02 10:02:48 -06:00
streamhsa
c832718f42
Adjusted the value of __any as per CUDA -sandeep
...
[ROCm/clr commit: b14c890851 ]
2016-02-02 15:25:42 +05:30
streamhsa
fa391f72c9
ADDED Support for __ffs() and __ffsll() having signed input -sandeep
...
[ROCm/clr commit: a7c0be6e4b ]
2016-02-02 15:05:46 +05:30
streamhsa
75643c8c00
Added test for ballot and removing HIP_FUNCTION from hipSampleAtomicsTest.cpp -sandeep
...
[ROCm/clr commit: e5a491f3c8 ]
2016-02-02 14:50:55 +05:30
Jack Chung
627c9622c8
Merge branch 'privatestaging' of github.com:AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/clr commit: e62a9e1bb7 ]
2016-02-02 16:28:02 +08:00
Jack Chung
58e2e13f04
Suppress linker warnings in case HCC distribution contains OpenCL/SPIR symbols
...
[ROCm/clr commit: 8df395a006 ]
2016-02-02 16:27:42 +08:00
scchan
4ca3c297be
adding shfl, shfl_up, shfl_down, shfl_xor intrinsics
...
[ROCm/clr commit: ed0a4fc1b7 ]
2016-02-01 23:55:31 -06:00
Ben Sander
d1c1274f69
Merge pull request #10 from SethosII/patch-1
...
Update hip_faq.md based on Sethosll review. Closes #10
[ROCm/clr commit: 0c03868e90 ]
2016-02-01 22:01:17 -06:00
Maneesh Gupta
b5634c1e26
Add double and integer intrinsics to test
...
[ROCm/clr commit: 405ee35a04 ]
2016-02-01 16:00:45 +05:30
Jack Chung
3e57a5abae
Disable sincosf which has trouble on hcc now.
...
[ROCm/clr commit: 518ef58652 ]
2016-02-01 17:42:37 +08:00
Maneesh Gupta
8213a76631
Split math function tests into several smaller tests
...
[ROCm/clr commit: 97fb876c6a ]
2016-02-01 14:36:50 +05:30
Maneesh Gupta
ea04770e88
Disable testing of unsupported single precision intrinsics
...
[ROCm/clr commit: 8d01a1db15 ]
2016-02-01 14:34:28 +05:30
Maneesh Gupta
79b6e4bda6
Add few more single precision intrinsics to hcc_detail/hip_runtime.h
...
[ROCm/clr commit: 4b94638bd3 ]
2016-02-01 14:29:50 +05:30
Maneesh Gupta
3f648ea239
Restrict using namespace hc::precise_math to device only
...
[ROCm/clr commit: c5990d5651 ]
2016-02-01 14:26:50 +05:30
Maneesh Gupta
7152009198
Remove redundant #define __HCC__ in hcc_detail/hip_runtime.h
...
[ROCm/clr commit: 0f4fe765c4 ]
2016-02-01 14:24:41 +05:30
Paul Jähne
25db6b5cde
Update hip_faq.md
...
changed Cuda to CUDA for consistency
changed NVIDIA to Nvidia for consistency
corrected apostrophes
corrected section "What hardware does HIP support?"
[ROCm/clr commit: 2872d8dfec ]
2016-01-31 13:29:03 +01:00
sunway513
90f385fc55
Fix some typos and incorrect namings in comments
...
[ROCm/clr commit: 04aa623569 ]
2016-01-28 13:17:44 -06:00
sunway513
d89011badf
Fix @file and @brief tag on header files
...
[ROCm/clr commit: f531ab50e5 ]
2016-01-28 10:59:21 -06:00
Ben Sander
94863d2bee
Fix typo in hipStreamWaitEvent. Fixes#9
...
[ROCm/clr commit: fd1a2721c2 ]
2016-01-28 09:51:11 -06:00