Ben Sander
2dbafb89ce
Merge pull request #108 from adityaatluri/enum-fix
...
fixed device selection during compilation to use rocm_agent_enumerator
2017-07-21 16:42:48 -05:00
Aditya Atluri
8e3e104313
fixed device selection during compilation to use rocm_agent_enumerator
...
1. Changed hipcc to use rocm_agent_enumerator
2. Changed square sample test to use device variable
2017-07-21 15:50:12 -05:00
Maneesh Gupta
b897d5599f
Merge branch 'amd-develop'
2017-07-06 12:16:47 +05:30
Maneesh Gupta
8252ae785b
GPUOpen-ProfessionalCompute-Tools -> ROCm-Developer-Tools
...
Change-Id: I9f5b29dd1097385acecb0c672770d8adca2fdcf7
2017-07-05 11:44:44 +05:30
Maneesh Gupta
465bc42a12
Merge branch 'roc-1.6.x' into master
...
Change-Id: I367a3940a0a9e5658abc28a7dc2bfb9cf4167dc8
2017-06-30 09:59:30 +05:30
Aditya Atluri
98905a7272
automate gcnarch detection
...
Change-Id: Ibbad22db136f7f5e2be84c82e9169298a144cc77
2017-06-29 12:01:40 -05:00
Aditya Atluri
a491a49f98
removed rm for /opt/rocm/hip/src in inline asm sample
...
Change-Id: I0c02bccd4cd35e01a8e889ea1e586ea8baf0ab90
2017-06-20 11:35:52 -05:00
Sandeep Kumar
5c530e7c32
Add peer2peer bandwidth and latency test
...
Change-Id: I6d88e4aa9f6e64096af16579eebef4740734203e
2017-06-14 09:44:56 +05:30
Sandeep Kumar
7c6b0384bb
Add readme for inline asm and unroll cookbook samples
...
Change-Id: I71b7a5652c3dad181c5df60ab0dd1b81d79f1bfb
2017-05-31 09:25:50 +05:30
Sandeep Kumar
83472bfa78
Add unroll and inline asm cookbook samples
...
Change-Id: Ie5a0fbb01b7fca82959090d89299533d49e092f1
2017-05-31 09:25:35 +05:30
Sandeep Kumar
3bc6df2044
Print msg for single gpu
...
Change-Id: I2d23c73542add8973990ba96592016726994422e
2017-05-31 09:25:17 +05:30
Maneesh Gupta
fa64db5171
Merge branch 'rocm-rel-1.5'
...
Change-Id: Ib2318f9c0d01a1bc8be2fcb172a3075e82851877
2017-05-02 09:06:49 +05:30
Maneesh Gupta
b8fd2f159a
Merge branch 'amd-develop' into amd-master
...
Change-Id: I312fb9d1181733ef5160d1e993e2ae57ced0f6b3
(cherry picked from commit 88fb807af0)
2017-04-25 00:01:30 -04:00
Ben Sander
2335bcdd03
Fix compilation error with nvcc (c++ nullptr)
2017-04-21 09:01:34 -05:00
Maneesh Gupta
9fcc03e2b6
Merge branch 'amd-develop' into amd-master
...
Change-Id: I53d5a8916d769c4f0fe60d2ee3b240551da80b4f
(cherry picked from commit 01c523f6c9)
2017-04-07 11:10:59 -05:00
Maneesh Gupta
ad280696c6
Fix build issues with bit_extract sample
...
Change-Id: I628b3c83a16f7adf0ab8ca60aecde8c073c34fd9
2017-04-07 15:24:10 +05:30
Maneesh Gupta
59db2f453f
Fix build issues in hipCommander sample
...
- Remove -stdlib=libstdc++ from Makefile
- Removed deleted HIP header file from includes
Change-Id: Ia189396bee19fc52b679259df56c6c6e2bafb6fe
2017-04-07 14:54:03 +05:30
Aditya Atluri
8e2b7147a5
added module api sample which uses hipHccModuleLaunchKernel
...
Change-Id: I7bce60b4480a3b5ff7ed69c3256078ded65a0945
2017-03-31 14:30:40 -05:00
Aditya Atluri
93a0b55616
added debug support for HIP sample
...
Change-Id: Ia7265234082039b68114f7421f4dbcb7149d4d2b
2017-03-31 14:13:46 -05:00
Aditya Atluri
8bc80debe4
Fixed bit_extract
...
Change-Id: I92d7b7a302e3fa0db84889fb5dc6b612e6a53c73
2017-03-31 13:35:26 -05:00
Aditya Atluri
7735b454a1
added new api hipHccModuleLaunchKernel
...
1. hipHccModuleLaunchKernel is the same as hipModuleLaunchKernel, with the OpenCL work-item model
2. Added copyright
3. Fixed header naming
Change-Id: I6a7c35a3566e2f8d3f5056613e34193775d4b236
2017-03-31 12:11:34 -05:00
Sun, Peng
c4c4d95db6
revert workaround for square sample and update doc on GGL
...
Change-Id: I731c68ca4111e7dc2e45bef51c4cad2c23fc81f8
2017-03-21 10:26:09 -05:00
pensun
323807d02b
Initial integration with Alex' Generic Grid Launch
...
Change-Id: I559afb80e9e39ec0d119bb3bf3b85ef9e448caf6
2017-03-17 14:59:34 -05:00
Aditya Atluri
219343027f
Added default module launch api functionality
...
1. Calls such as hipModuleLaunchKernel(..., kernelParams, nullptr); work with this commit
2. Added headers AMDGPUPTNote.h, AMDGPURuntimeMetadata.h to do code object meta data parsing
3. Changed CMake to look at llvm link libraries
4. HIP developer should set env variable LLVM_HOME to remove link errors
5. HIP depends on installed LLVM (not source, not build)
6. Added sample to test out the feature
7. Right now HCC does not support embedding metadata in code object. Use clang opencl
8. Changed HIPCC to read LLVM_HOME env var
9. New argument to CMake should be given -DLLVM_HOME=<where llvm 5.0 is installed>
Change-Id: Iba38194aa872d97cc2c90a8e5ff746c48055c868
2017-03-17 13:11:34 -05:00
Maneesh Gupta
b9d4c0be78
Merge branch 'amd-develop' into amd-master
...
Change-Id: I8921e67e352e35e4c496e78a797fb309279ab7d0
2017-03-14 15:57:53 +05:30
Maneesh Gupta
8d6cb1f5a3
4_shfl and 5_2dshfl samples are unsupported on gfx701
...
Change-Id: I81eb880350f25e89573ba14c62b549c6c43f8c91
2017-03-14 15:56:18 +05:30
Maneesh Gupta
bc07b0da7c
Merge branch 'amd-master' into amd-develop
2017-03-14 13:44:41 +05:30
Ben Sander
705ab93664
Refactor registered memory calls.
2017-03-11 09:18:27 -06:00
Ben Sander
b25691cb87
Add first step to a "registered" mode in hipBusBandwidth.
2017-03-11 09:18:27 -06:00
Rahul Garg
5eb39f1c6b
Fix for HCSWAP-128, make 5_2dshfl cookbook sample only for fiji
...
Change-Id: I8869c28151bca1bd47a053a2808e93a801d16d00
2017-03-10 10:29:52 +05:30
Aditya Atluri
a8fb90d9a9
make 4_shfl cookbook sample only for fiji
...
1. __shfl is not supported on hawaii gfx701
Change-Id: Iac09f5d30ee0674b8f58a6e74ec5c49b02be32ad
2017-03-09 08:52:50 -06:00
Aditya Atluri
af22699ec6
added new field to hipDeviceProp_t structure gcnArch.
...
1. It is an integer containing gfx values 701, 801, 802, 803
2. On NV path, it is zero
Change-Id: I2b4c7f48981d0214d8c6b1905d2cc85b16203419
2017-03-07 11:24:32 -06:00
Maneesh Gupta
518967319c
Merge branch 'amd-develop' into amd-master
...
Change-Id: I0e856db61fa4a50e190bd1d4c464ceb4a709b550
2017-02-23 11:19:23 +05:30
Aditya Atluri
b5e3f59377
removed hipblas samples as it is not yet supported
...
Change-Id: I354b710e652ce0d0413d670530ceb8b70f4993d5
2017-02-17 08:51:02 -06:00
Rahul Garg
dc2f5fcc7d
Command scripts for latency measurements
...
Change-Id: I8c28765a09fb0358447367939de524b12699a317
2017-02-07 15:03:46 +05:30
Ben Sander
0390b12175
Doc update - describe debug techniques
...
Also tweak sample to remove unneeded HIP_KERNEL_NAME.
Comment update
2017-01-19 12:40:45 -06:00
Rahul Garg
ba8fe1675f
Fixed hipcommander default execution for HCSWAP-106
...
Change-Id: I9fbd10dfaeeb4928b2ec23ceed131b5200a658f9
2017-01-19 15:04:32 +05:30
Maneesh Gupta
008499801b
Merge branch 'amd-develop' into amd-master
...
Change-Id: I77fa88b460be549bfcf9e18d3212e732ffc045f5
2016-12-19 16:20:38 +05:30
Ben Sander
cee24a20f2
Print limits on CUDA devices
2016-12-16 08:55:11 -06:00
Ben Sander
054fc61f6e
remove TODO file
2016-12-15 14:42:52 -06:00
Maneesh Gupta
cc83c44714
Merge branch 'amd-develop' into amd-master
...
Change-Id: I52830df409da1f021c32ea569d4ae671aeb57b03
2016-12-15 16:25:33 +05:30
Sandeep Kumar
5ae4e8bd67
Fixes in Makefile of couple of samples
...
- modified Makefile for hipblas_saxpy to replace hcblas.so with hipblas.so as part of HCSWAP-100
- Resolved missing separator issue in peer2peer cookbook Makefile
Change-Id: I678fea267eee1481f02da09379339ed78d3f95f2
2016-12-14 05:58:34 -05:00
Sandeep Kumar
70716bee42
Fixes in Makefile of couple of samples
...
- modified Makefile for hipblas_saxpy to replace hcblas.so with hipblas.so as part of HCSWAP-100
- Resolved missing separator issue in peer2peer cookbook Makefile
Change-Id: I678fea267eee1481f02da09379339ed78d3f95f2
2016-12-14 16:27:14 +05:30
Maneesh Gupta
effe8e1437
Merge branch 'amd-develop' into amd-master
...
Change-Id: Ie5aa8e70607936f63cf4c763298140e83a375a68
2016-11-28 09:18:26 +05:30
Ben Sander
c590cd6865
Add more debug info
2016-11-26 08:56:02 -06:00
Sandeep Kumar
5e86e5f565
fix_format
...
Change-Id: I34e265de434263a11654e5deba044c3f21e86578
2016-11-18 14:34:14 +05:30
Maneesh Gupta
b1144c429d
Merge branch 'amd-develop' into amd-master
...
Change-Id: I32d41081ac065f2c50531dc2e420802d765665e2
2016-11-14 06:12:03 +05:30
Sandeep Kumar
8829e2626c
Add p2p for cookbook
...
Change-Id: Id2e77ab31123ef95885d665efe34bc0d4596733a
(cherry picked from commit 6fbd0352713ca36e399b1ed4f17c486207a53875)
2016-11-14 06:10:36 +05:30
Maneesh Gupta
aaf1547bff
hcc_dialects/Makefile: use clamp-config
...
Change-Id: I86df82f75b75125825e22d0545209a19386d9936
2016-11-10 11:31:50 +05:30
Ben Sander
997dbf0d6a
Update gitignore for some common output files
...
Change-Id: I9cd60f042af4dba07fe0fdbd2ee442936ff8c7bd
2016-11-06 04:26:15 -06:00