Rahul Garg
b030eb2c8a
Use 2X for bidir p2p memory bandwidth calc
...
[ROCm/hip-tests commit: f19c685f88 ]
2017-10-23 21:57:20 +05:30
Ben Sander
f9cf19daf4
Use 2X for bidir memory bandwidth calc
...
[ROCm/hip-tests commit: 9fef6f860c ]
2017-10-21 07:47:32 -05:00
Sandeep Kumar
4ddfe042c6
Add more info for inline asm in hip kernel guide and cookbook readme
...
[ROCm/hip-tests commit: 451f36a42a ]
2017-09-13 12:57:37 +05:30
Maneesh Gupta
0055532b17
Merge pull request #159 from bensander/hipDispatchLatency
...
Refactor dispatch latency test and fix several bugs.
[ROCm/hip-tests commit: 5efad49773 ]
2017-08-22 14:49:14 +05:30
Ben Sander
a1d0dda301
Refactor dispatch latency test and fix several bugs.
...
[ROCm/hip-tests commit: 6ac55d2b34 ]
2017-08-17 08:46:58 -05:00
Ben Sander
7227025a61
Merge pull request #108 from adityaatluri/enum-fix
...
fixed device selection during compilation to use rocm_agent_enumerator
[ROCm/hip-tests commit: 2dbafb89ce ]
2017-07-21 16:42:48 -05:00
Aditya Atluri
4255c3cb44
fixed device selection during compilation to use rocm_agent_enumerator
...
1. Changed hipcc to use rocm_agent_enumerator
2. Changed square sample test to use device variable
[ROCm/hip-tests commit: 8e3e104313 ]
2017-07-21 15:50:12 -05:00
Maneesh Gupta
6f65cca633
Merge branch 'amd-develop'
...
[ROCm/hip-tests commit: b897d5599f ]
2017-07-06 12:16:47 +05:30
Maneesh Gupta
6442d0652f
GPUOpen-ProfessionalCompute-Tools -> ROCm-Developer-Tools
...
Change-Id: I9f5b29dd1097385acecb0c672770d8adca2fdcf7
[ROCm/hip-tests commit: 8252ae785b ]
2017-07-05 11:44:44 +05:30
Maneesh Gupta
1ded0b85f5
Merge branch 'roc-1.6.x' into master
...
Change-Id: I367a3940a0a9e5658abc28a7dc2bfb9cf4167dc8
[ROCm/hip-tests commit: 465bc42a12 ]
2017-06-30 09:59:30 +05:30
Aditya Atluri
7a20b9afe1
automate gcnarch detection
...
Change-Id: Ibbad22db136f7f5e2be84c82e9169298a144cc77
[ROCm/hip-tests commit: 98905a7272 ]
2017-06-29 12:01:40 -05:00
Aditya Atluri
faa7417a8c
removed rm for /opt/rocm/hip/src in inline asm sample
...
Change-Id: I0c02bccd4cd35e01a8e889ea1e586ea8baf0ab90
[ROCm/hip-tests commit: a491a49f98 ]
2017-06-20 11:35:52 -05:00
Sandeep Kumar
ef65747f38
Add peer2peer bandwidth and latency test
...
Change-Id: I6d88e4aa9f6e64096af16579eebef4740734203e
[ROCm/hip-tests commit: 5c530e7c32 ]
2017-06-14 09:44:56 +05:30
Sandeep Kumar
a51e393772
Add readme for inline asm and unroll cookbook samples
...
Change-Id: I71b7a5652c3dad181c5df60ab0dd1b81d79f1bfb
[ROCm/hip-tests commit: 7c6b0384bb ]
2017-05-31 09:25:50 +05:30
Sandeep Kumar
78789054f0
Add unroll and inline asm cookbook samples
...
Change-Id: Ie5a0fbb01b7fca82959090d89299533d49e092f1
[ROCm/hip-tests commit: 83472bfa78 ]
2017-05-31 09:25:35 +05:30
Sandeep Kumar
a485c75155
Print msg for single gpu
...
Change-Id: I2d23c73542add8973990ba96592016726994422e
[ROCm/hip-tests commit: 3bc6df2044 ]
2017-05-31 09:25:17 +05:30
Maneesh Gupta
b1d90234a8
Merge branch 'rocm-rel-1.5'
...
Change-Id: Ib2318f9c0d01a1bc8be2fcb172a3075e82851877
[ROCm/hip-tests commit: fa64db5171 ]
2017-05-02 09:06:49 +05:30
Maneesh Gupta
182d5261b3
Merge branch 'amd-develop' into amd-master
...
Change-Id: I312fb9d1181733ef5160d1e993e2ae57ced0f6b3
(cherry picked from commit 88fb807af0 )
[ROCm/hip-tests commit: b8fd2f159a ]
2017-04-25 00:01:30 -04:00
Ben Sander
681ba1245d
Fix compilation error with nvcc (c++ nullptr)
...
[ROCm/hip-tests commit: 2335bcdd03 ]
2017-04-21 09:01:34 -05:00
Maneesh Gupta
b6dc347329
Merge branch 'amd-develop' into amd-master
...
Change-Id: I53d5a8916d769c4f0fe60d2ee3b240551da80b4f
(cherry picked from commit 01c523f6c9 )
[ROCm/hip-tests commit: 9fcc03e2b6 ]
2017-04-07 11:10:59 -05:00
Maneesh Gupta
0b1fade1d2
Fix build issues with bit_extract sample
...
Change-Id: I628b3c83a16f7adf0ab8ca60aecde8c073c34fd9
[ROCm/hip-tests commit: ad280696c6 ]
2017-04-07 15:24:10 +05:30
Maneesh Gupta
00822d98ff
Fix build issues in hipCommander sample
...
- Remove -stdlib=libstdc++ from Makefile
- Removed deleted HIP header file fom includes
Change-Id: Ia189396bee19fc52b679259df56c6c6e2bafb6fe
[ROCm/hip-tests commit: 59db2f453f ]
2017-04-07 14:54:03 +05:30
Aditya Atluri
b41e69b99c
added module api sample which uses hipHccModuleLaunchKernel
...
Change-Id: I7bce60b4480a3b5ff7ed69c3256078ded65a0945
[ROCm/hip-tests commit: 8e2b7147a5 ]
2017-03-31 14:30:40 -05:00
Aditya Atluri
8f495da889
added debug support for HIP sample
...
Change-Id: Ia7265234082039b68114f7421f4dbcb7149d4d2b
[ROCm/hip-tests commit: 93a0b55616 ]
2017-03-31 14:13:46 -05:00
Aditya Atluri
ae52620393
Fixed bit_extract
...
Change-Id: I92d7b7a302e3fa0db84889fb5dc6b612e6a53c73
[ROCm/hip-tests commit: 8bc80debe4 ]
2017-03-31 13:35:26 -05:00
Aditya Atluri
60636db8ee
added new api hipHccModuleLaunchKernel
...
1. hipHccModuleLaunchKernel is same as hipModuleLaunchKernel with OpenCL workitem model
2. Added copy right
3. Fixed header naming
Change-Id: I6a7c35a3566e2f8d3f5056613e34193775d4b236
[ROCm/hip-tests commit: 7735b454a1 ]
2017-03-31 12:11:34 -05:00
Sun, Peng
d59dbb3fd6
revert workaround for square sample and update doc on GGL
...
Change-Id: I731c68ca4111e7dc2e45bef51c4cad2c23fc81f8
[ROCm/hip-tests commit: c4c4d95db6 ]
2017-03-21 10:26:09 -05:00
pensun
d33885889d
Initial integration with Alex' Generic Grid Launch
...
Change-Id: I559afb80e9e39ec0d119bb3bf3b85ef9e448caf6
[ROCm/hip-tests commit: 323807d02b ]
2017-03-17 14:59:34 -05:00
Aditya Atluri
1ec7582820
Added default module launch api functionality
...
1. As in hipModuleLaunchKernel(..., kernelParams, nullptr); works with this commit
2. Added headers AMDGPUPTNote.h, AMDGPURuntimeMetadata.h to do code object meta data parsing
3. Changed CMake to look at llvm link libraries
4. HIP developer should set env variable LLVM_HOME to remove link errors
5. HIP depends on installed LLVM (not source, not build)
6. Added sample to test out the feature
7. Right now HCC does not support embedding metadata in code object. Use clang opencl
8. Changed HIPCC to read LLVM_HOME env var
9. New argument to CMake should be given -DLLVM_HOME=<where llvm 5.0 is installed>
Change-Id: Iba38194aa872d97cc2c90a8e5ff746c48055c868
[ROCm/hip-tests commit: 219343027f ]
2017-03-17 13:11:34 -05:00
Maneesh Gupta
c8321725f3
Merge branch 'amd-develop' into amd-master
...
Change-Id: I8921e67e352e35e4c496e78a797fb309279ab7d0
[ROCm/hip-tests commit: b9d4c0be78 ]
2017-03-14 15:57:53 +05:30
Maneesh Gupta
ea06138990
4_shfl and 5_2dshfl samples are unsupported on gfx701
...
Change-Id: I81eb880350f25e89573ba14c62b549c6c43f8c91
[ROCm/hip-tests commit: 8d6cb1f5a3 ]
2017-03-14 15:56:18 +05:30
Maneesh Gupta
fba01ebd3b
Merge branch 'amd-master' into amd-develop
...
[ROCm/hip-tests commit: bc07b0da7c ]
2017-03-14 13:44:41 +05:30
Ben Sander
11730ab32b
Refactor registered memory calls.
...
[ROCm/hip-tests commit: 705ab93664 ]
2017-03-11 09:18:27 -06:00
Ben Sander
7dd5113bdf
Add first step to a "registerd" mode in hipBusBandwidth.
...
[ROCm/hip-tests commit: b25691cb87 ]
2017-03-11 09:18:27 -06:00
Rahul Garg
80820f3615
Fix for HCSWAP-128, make 5_2dshfl cookbook sample only for fiji
...
Change-Id: I8869c28151bca1bd47a053a2808e93a801d16d00
[ROCm/hip-tests commit: 5eb39f1c6b ]
2017-03-10 10:29:52 +05:30
Aditya Atluri
e6d7dc9067
make 4_shfl cookbook sample only for fiji
...
1. __shfl is not supported on hawaii gfx701
Change-Id: Iac09f5d30ee0674b8f58a6e74ec5c49b02be32ad
[ROCm/hip-tests commit: a8fb90d9a9 ]
2017-03-09 08:52:50 -06:00
Aditya Atluri
e2b0e9c8ea
added new field to hipDeviceProp_t structure gcnArch.
...
1. It is an integer containing gfx values 701, 801, 802, 803
2. On NV path, it is zero
Change-Id: I2b4c7f48981d0214d8c6b1905d2cc85b16203419
[ROCm/hip-tests commit: af22699ec6 ]
2017-03-07 11:24:32 -06:00
Maneesh Gupta
874a182a6c
Merge branch 'amd-develop' into amd-master
...
Change-Id: I0e856db61fa4a50e190bd1d4c464ceb4a709b550
[ROCm/hip-tests commit: 518967319c ]
2017-02-23 11:19:23 +05:30
Aditya Atluri
8e522c123c
removed hipblas samples as it is not yet supported
...
Change-Id: I354b710e652ce0d0413d670530ceb8b70f4993d5
[ROCm/hip-tests commit: b5e3f59377 ]
2017-02-17 08:51:02 -06:00
Rahul Garg
78c27a5c51
Command scripts for latency measurements
...
Change-Id: I8c28765a09fb0358447367939de524b12699a317
[ROCm/hip-tests commit: dc2f5fcc7d ]
2017-02-07 15:03:46 +05:30
Ben Sander
09ede45f88
Doc update - describe debug techniques
...
Also tweak sample to remove unneeded HIP_KERNEL_NAME.
Comment update
[ROCm/hip-tests commit: 0390b12175 ]
2017-01-19 12:40:45 -06:00
Rahul Garg
b9ba4f6ca1
Fixed hipcommander default execution for HCSWAP-106
...
Change-Id: I9fbd10dfaeeb4928b2ec23ceed131b5200a658f9
[ROCm/hip-tests commit: ba8fe1675f ]
2017-01-19 15:04:32 +05:30
Maneesh Gupta
ee0831f8b2
Merge branch 'amd-develop' into amd-master
...
Change-Id: I77fa88b460be549bfcf9e18d3212e732ffc045f5
[ROCm/hip-tests commit: 008499801b ]
2016-12-19 16:20:38 +05:30
Ben Sander
257d60a385
Print limits on CUDA devices
...
[ROCm/hip-tests commit: cee24a20f2 ]
2016-12-16 08:55:11 -06:00
Ben Sander
7ace03b518
remove TODO file
...
[ROCm/hip-tests commit: 054fc61f6e ]
2016-12-15 14:42:52 -06:00
Maneesh Gupta
889363ee97
Merge branch 'amd-develop' into amd-master
...
Change-Id: I52830df409da1f021c32ea569d4ae671aeb57b03
[ROCm/hip-tests commit: cc83c44714 ]
2016-12-15 16:25:33 +05:30
Sandeep Kumar
9ab3b01ba6
Fixes in Makefile of couple of samples
...
- modified Makefile for hipblas_saxpy to replaced hcblas.so with hipblas.so as part of HCSWAP-100
- Resolved missing separator issue in peer2peer cookbook Makefile
Change-Id: I678fea267eee1481f02da09379339ed78d3f95f2
[ROCm/hip-tests commit: 5ae4e8bd67 ]
2016-12-14 05:58:34 -05:00
Sandeep Kumar
7ba0720f71
Fixes in Makefile of couple of samples
...
- modified Makefile for hipblas_saxpy to replaced hcblas.so with hipblas.so as part of HCSWAP-100
- Resolved missing separator issue in peer2peer cookbook Makefile
Change-Id: I678fea267eee1481f02da09379339ed78d3f95f2
[ROCm/hip-tests commit: 70716bee42 ]
2016-12-14 16:27:14 +05:30
Maneesh Gupta
0a01c0325f
Merge branch 'amd-develop' into amd-master
...
Change-Id: Ie5aa8e70607936f63cf4c763298140e83a375a68
[ROCm/hip-tests commit: effe8e1437 ]
2016-11-28 09:18:26 +05:30
Ben Sander
83ab39c044
Add more debug info
...
[ROCm/hip-tests commit: c590cd6865 ]
2016-11-26 08:56:02 -06:00