Aditya Atluri
d041291293
Fixed offline kernel compilation
...
1. Removed vcpy_isa.ptx as it should be generated during make
2. Made argument padding specific to hcc path
3. Renamed --gencodeobject to --genco
4. Changed Makefile to work on both nvcc and hcc path
Change-Id: Ifd053d541085d9ce4fd37bc21b07674786c7163e
[ROCm/hip commit: cc8ce4d58c ]
2016-09-01 10:39:14 -05:00
Maneesh Gupta
c99371e4f0
Fixed module_api/Makefile to set flags based on HIP_PLATFORM
...
Change-Id: I2fa9a556e0c4f25f4963ecef1d25eb922f9af1b9
[ROCm/hip commit: 811df250e6 ]
2016-09-01 15:11:12 +05:30
Maneesh Gupta
34ed6ed4cc
module_api/Makefile: Use gencodeobject instead of genisa
...
Change-Id: I7e3523810f5603ad727b1fda7ff2d0dc53ec72d7
[ROCm/hip commit: 13d1721431 ]
2016-09-01 12:10:31 +05:30
Aditya Atluri
87d3977451
added sample for how-to-use pre-compiled kernels1. Corrected the exit output of kernel compilation by hipcc
...
2. Added sample which loads/run kernel binary during runtime?
Change-Id: I26ccaca1f844fee317592e26c9e654ce548b96a8
[ROCm/hip commit: d5a6e22c59 ]
2016-08-31 13:56:07 -05:00
Maneesh Gupta
f4ff7eb4f6
Rename 2_Advanced to 7_Advanced
...
Change-Id: I51e5fa7f4c1dbf467f2d7182ec69d12d5fe548d0
[ROCm/hip commit: 4803ff12f1 ]
2016-08-18 12:40:30 +05:30
Maneesh Gupta
0ec10f571c
Add simple hipblas saxpy sample
...
Change-Id: I67ae83e1e5397d5191a3c644aba068f06ff97830
[ROCm/hip commit: d192976b00 ]
2016-08-12 13:50:22 +05:30
Maneesh Gupta
a4a8d105db
hipDispatchLatency: reduce iterations to 5120
...
Change-Id: I94ae4993ff5058cf15f9487a5a528fc24c1ad5fa
[ROCm/hip commit: 506d4086a9 ]
2016-06-13 14:23:51 +05:30
Maneesh Gupta
5592d53ff5
Fix bit_extract sample
...
Change-Id: I933f932bac26d9a9469d5d069973af166e11cbcd
[ROCm/hip commit: bd31d333e6 ]
2016-05-20 01:06:08 -04:00
Maneesh Gupta
d1e6c121f3
Fix square.cu to use cudaError_t instead of hipError_t
...
Change-Id: If3314910d1c03122741d3e0a45e14a4412c473b3
[ROCm/hip commit: 3f83673b04 ]
2016-05-12 10:13:07 +05:30
Maneesh Gupta
fc73044bf2
hcc_dialects report PASSED when passed
...
[ROCm/hip commit: 6181988232 ]
2016-05-03 14:32:59 +05:30
Maneesh Gupta
d253806e64
bit_extract reports PASSED when passed
...
[ROCm/hip commit: cb6a5d9421 ]
2016-05-03 14:19:25 +05:30
Maneesh Gupta
b8cd64dec9
Fix makefiles in samples
...
[ROCm/hip commit: bcaefb81fc ]
2016-04-18 10:15:35 +05:30
Maneesh Gupta
32e35b671d
Replace /opt/hcc -> /opt/rocm/hcc and /opt/hsa -> /opt/rocm/hsa
...
[ROCm/hip commit: 5a31bad821 ]
2016-04-15 12:56:31 +05:30
Ben Sander
05a43de72a
Fix HIP_PATH, CHECK macro in samples.
...
[ROCm/hip commit: 8bbe32a708 ]
2016-04-13 17:37:39 -05:00
Ben Sander
0618c39648
add hcc dialects sample
...
[ROCm/hip commit: 624b2f35ff ]
2016-04-13 17:32:38 -05:00
Ben Sander
775e484517
fix peer query order
...
[ROCm/hip commit: e4d1863ce8 ]
2016-04-11 07:58:59 -05:00
Ben Sander
c964e3c75a
P2p checkpoint.
...
- set USE_PEER_TO_PEER=3 (requires HCC "am_memtracker_update_peers")
- when enabling peer, turn it on for previously allocated memory.
- hipDeviceCanAccessPeer is no longer self-ware (self does not qualify
as a peer)
- device peerlist always includes self, so when we call allow_access
we never remove self access.
- hipDeviceReset() removes old peer mappings.
[ROCm/hip commit: 83f0de7806 ]
2016-04-11 07:58:59 -05:00
Ben Sander
80e2e37c51
Remove stray debug msgs, hipInfo don't print self as peer.
...
[ROCm/hip commit: d89539d40f ]
2016-04-11 07:58:58 -05:00
Ben Sander
c09e4adc03
Use HIP_PATH if set else use relative ../...
...
[ROCm/hip commit: 40e72dcd4a ]
2016-04-11 07:58:58 -05:00
Ben Sander
85f22fe7db
Print peers in hipConfig.
...
Also include peer APIs in vim hilighting.
[ROCm/hip commit: 0ac41ad143 ]
2016-04-11 07:58:58 -05:00
Maneesh Gupta
5f1f0b34d0
Remove deprecated KERNELBEGIN and KERNELEND from bit_extract sample
...
[ROCm/hip commit: 70f8236ac5 ]
2016-04-04 14:47:02 +05:30
streamhsa
1d55744e91
change makefile for samples
...
[ROCm/hip commit: d0f0bf5c8e ]
2016-03-29 16:02:09 +08:00
Aditya Atluri
ad91545586
Logging dispatch latency through database util
...
[ROCm/hip commit: 78407ea40a ]
2016-03-23 11:39:57 -05:00
Ben Sander
2313b4b7b5
Only include activity logger if CodeXL installed.
...
Fix hipHostMalloc in hipBusBandwidth.
[ROCm/hip commit: 3a5f964c4f ]
2016-03-22 09:27:10 -05:00
Ben Sander
b319209299
hipHostRegister and hipHostMalloc refactor.
...
Note hipHostMalloc (not hipHostAlloc or hipMallocHost).
- the hipHost* is used for all HIP APIs dealing with Host memory.
(including hipHostMalloc, hipHostFree, hipHostRegister,
hipHostUnregister, hipHostGetFlags, hipHostGetDevicePointer).
- hipMallocHost is consistent with "hipMalloc" for allocating device
memory. Enumerations hipHostMalloc* also used as optional
flags parm to hipHostMalloc.
[ROCm/hip commit: ab910efb96 ]
2016-03-22 02:30:10 -05:00
Ben Sander
5297376252
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
Conflicts:
src/hip_hcc.cpp
[ROCm/hip commit: 1de63bfeea ]
2016-03-19 03:22:09 -05:00
Ben Sander
b6de799f47
Add beastperiteration and onesize for testing.
...
onesize allows running tests at one specific size.
[ROCm/hip commit: 7ff5b16d2a ]
2016-03-19 02:43:04 -05:00
Ben Sander
79d0d442d3
Improve formatting - line up cols
...
[ROCm/hip commit: 85fce5f21e ]
2016-03-18 23:43:04 -05:00
Ben Sander
5996a5c0a1
Print Pinned or Unpinned in result summary
...
[ROCm/hip commit: c2102847a4 ]
2016-03-18 21:28:29 -05:00
Ben Sander
8a47efb22a
Supported --aliged mode. Add results check for H2D and D2H.
...
[ROCm/hip commit: 618556eaf9 ]
2016-03-18 03:09:52 -05:00
Aditya Atluri
637fca1cb6
corrected first and second kernel dispatch
...
[ROCm/hip commit: e23bd0a23e ]
2016-03-15 14:22:00 -05:00
Aditya Atluri
2223b41232
Added single kernel launch to sample
...
[ROCm/hip commit: 862817626b ]
2016-03-15 21:05:15 -05:00
Aditya Atluri
b202017891
added performance metrics for kernel dispatch
...
[ROCm/hip commit: 31d8f60e56 ]
2016-03-15 12:37:24 -05:00
Aditya Atluri
83cafe7d94
v2 deprecating hipMallocHost with hipHostAlloc
...
[ROCm/hip commit: 58fa0524b6 ]
2016-03-15 13:39:15 -05:00
Ben Sander
bef8dc7bcc
print device config info
...
[ROCm/hip commit: 70c5f5e3f5 ]
2016-03-14 23:02:49 -05:00
Ben Sander
0abf5db89e
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
Conflicts:
src/hip_hcc.cpp
tests/src/CMakeLists.txt
[ROCm/hip commit: e1617b9604 ]
2016-03-14 15:01:26 -05:00
Ben Sander
10ca8ed91d
Add Bidir copy test and help.
...
[ROCm/hip commit: 5606bee076 ]
2016-03-14 14:39:23 -05:00
Ben Sander
3427873d7c
refactor, add support for speccing xfers in bytes
...
[ROCm/hip commit: ac6ed35ba0 ]
2016-03-13 09:41:06 -05:00
Aditya Atluri
f1b8758919
corrected hipDeviceGetProperties to hipGetDeviceProperties - not docs
...
[ROCm/hip commit: d3ba2b9782 ]
2016-03-06 08:31:04 -06:00
Ben Sander
1ac07d2b87
fixes for titan platform
...
[ROCm/hip commit: ff66ef0779 ]
2016-02-26 05:25:30 -06:00
Ben Sander
8a2bcf2da3
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/hip commit: c300ffe458 ]
2016-02-26 06:15:09 -06:00
Ben Sander
5ca4914e0e
Merge branch 'memtracker' into privatestaging
...
Conflicts:
src/hip_hcc.cpp
[ROCm/hip commit: 4adab7b7ef ]
2016-02-25 19:38:46 -06:00
Evgeny Mankov
82900a1888
Attribute hipDeviceAttributeIsMultiGpuBoard for obtaining Device property isMultiGpuBoard is added.
...
On HIP path property obtaining done through hsa_iterate_agents and counting the devices of HSA_DEVICE_TYPE_GPU type.
P.S.
On multi-boards systems it might be problems with detection what board a GPU plugged into (not tested).
[ROCm/hip commit: 57e212606d ]
2016-02-25 23:44:39 +03:00
Evgeny Mankov
c3a600c63b
Guard #ifdef USE_ROCR_20 is added for ROCR_20 device properties (memoryClockRate, memoryBusWidth)
...
By default isn't defined.
To add ROCR_20 support HIP have to be compiled as follows: make CXX_DEFINES+=-DUSE_ROCR_20
[ROCm/hip commit: 833c9e52ad ]
2016-02-19 13:27:03 +03:00
Evgeny Mankov
4fcd9f2542
Device property memoryBusWidth implementation.
...
+ Device property memoryBusWidth is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryBusWidth is added to hipDeviceAttribute_t struct.
+ Tests update.
[ROCm/hip commit: 1c19dbb807 ]
2016-02-18 18:15:01 +03:00
Evgeny Mankov
a0cc7134e3
Device property memoryClockRate implementation.
...
+ Device property memoryClockRate is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryClockRate is added to hipDeviceAttribute_t struct.
+ Tests update.
+ Rename hipDevAttrConcurrentKernels to hipDeviceAttributeConcurrentKernels.
[ROCm/hip commit: 5ea8543d2e ]
2016-02-18 17:25:28 +03:00
Evgeny Mankov
801ae992d6
hipInfo sample update with new Device Properties.
...
[ROCm/hip commit: 5b05a9fef1 ]
2016-02-18 15:08:55 +03:00
Evgeny Mankov
189e652a5e
Formatting, no functional changes.
...
[ROCm/hip commit: 072d649d8d ]
2016-02-15 13:16:05 +03:00
Ben Sander
565d0fb968
Enable -O3, style points on array size
...
[ROCm/hip commit: 928996fec7 ]
2016-02-13 03:17:42 -06:00
Ben Sander
7a633dc4b8
Result formatting
...
[ROCm/hip commit: c3720c19a8 ]
2016-02-13 01:14:01 -06:00