Maneesh Gupta
506d4086a9
hipDispatchLatency: reduce iterations to 5120
...
Change-Id: I94ae4993ff5058cf15f9487a5a528fc24c1ad5fa
2016-06-13 14:23:51 +05:30
Maneesh Gupta
bd31d333e6
Fix bit_extract sample
...
Change-Id: I933f932bac26d9a9469d5d069973af166e11cbcd
2016-05-20 01:06:08 -04:00
Maneesh Gupta
3f83673b04
Fix square.cu to use cudaError_t instead of hipError_t
...
Change-Id: If3314910d1c03122741d3e0a45e14a4412c473b3
2016-05-12 10:13:07 +05:30
Maneesh Gupta
6181988232
hcc_dialects report PASSED when passed
2016-05-03 14:32:59 +05:30
Maneesh Gupta
cb6a5d9421
bit_extract reports PASSED when passed
2016-05-03 14:19:25 +05:30
Maneesh Gupta
bcaefb81fc
Fix makefiles in samples
2016-04-18 10:15:35 +05:30
Maneesh Gupta
5a31bad821
Replace /opt/hcc -> /opt/rocm/hcc and /opt/hsa -> /opt/rocm/hsa
2016-04-15 12:56:31 +05:30
Ben Sander
8bbe32a708
Fix HIP_PATH, CHECK macro in samples.
2016-04-13 17:37:39 -05:00
Ben Sander
624b2f35ff
add hcc dialects sample
2016-04-13 17:32:38 -05:00
Ben Sander
e4d1863ce8
fix peer query order
2016-04-11 07:58:59 -05:00
Ben Sander
83f0de7806
P2p checkpoint.
...
- set USE_PEER_TO_PEER=3 (requires HCC "am_memtracker_update_peers")
- when enabling peer, turn it on for previously allocated memory.
- hipDeviceCanAccessPeer is no longer self-ware (self does not qualify
as a peer)
- device peerlist always includes self, so when we call allow_access
we never remove self access.
- hipDeviceReset() removes old peer mappings.
2016-04-11 07:58:59 -05:00
Ben Sander
d89539d40f
Remove stray debug msgs, hipInfo don't print self as peer.
2016-04-11 07:58:58 -05:00
Ben Sander
40e72dcd4a
Use HIP_PATH if set else use relative ../...
2016-04-11 07:58:58 -05:00
Ben Sander
0ac41ad143
Print peers in hipConfig.
...
Also include peer APIs in vim hilighting.
2016-04-11 07:58:58 -05:00
Maneesh Gupta
70f8236ac5
Remove deprecated KERNELBEGIN and KERNELEND from bit_extract sample
2016-04-04 14:47:02 +05:30
streamhsa
d0f0bf5c8e
change makefile for samples
2016-03-29 16:02:09 +08:00
Aditya Atluri
78407ea40a
Logging dispatch latency through database util
2016-03-23 11:39:57 -05:00
Ben Sander
3a5f964c4f
Only include activity logger if CodeXL installed.
...
Fix hipHostMalloc in hipBusBandwidth.
2016-03-22 09:27:10 -05:00
Ben Sander
ab910efb96
hipHostRegister and hipHostMalloc refactor.
...
Note hipHostMalloc (not hipHostAlloc or hipMallocHost).
- the hipHost* is used for all HIP APIs dealing with Host memory.
(including hipHostMalloc, hipHostFree, hipHostRegister,
hipHostUnregister, hipHostGetFlags, hipHostGetDevicePointer).
- hipMallocHost is consistent with "hipMalloc" for allocating device
memory. Enumerations hipHostMalloc* also used as optional
flags parm to hipHostMalloc.
2016-03-22 02:30:10 -05:00
Ben Sander
1de63bfeea
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
Conflicts:
src/hip_hcc.cpp
2016-03-19 03:22:09 -05:00
Ben Sander
7ff5b16d2a
Add beastperiteration and onesize for testing.
...
onesize allows running tests at one specific size.
2016-03-19 02:43:04 -05:00
Ben Sander
85fce5f21e
Improve formatting - line up cols
2016-03-18 23:43:04 -05:00
Ben Sander
c2102847a4
Print Pinned or Unpinned in result summary
2016-03-18 21:28:29 -05:00
Ben Sander
618556eaf9
Supported --aliged mode. Add results check for H2D and D2H.
2016-03-18 03:09:52 -05:00
Aditya Atluri
e23bd0a23e
corrected first and second kernel dispatch
2016-03-15 14:22:00 -05:00
Aditya Atluri
862817626b
Added single kernel launch to sample
2016-03-15 21:05:15 -05:00
Aditya Atluri
31d8f60e56
added performance metrics for kernel dispatch
2016-03-15 12:37:24 -05:00
Aditya Atluri
58fa0524b6
v2 deprecating hipMallocHost with hipHostAlloc
2016-03-15 13:39:15 -05:00
Ben Sander
70c5f5e3f5
print device config info
2016-03-14 23:02:49 -05:00
Ben Sander
e1617b9604
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
Conflicts:
src/hip_hcc.cpp
tests/src/CMakeLists.txt
2016-03-14 15:01:26 -05:00
Ben Sander
5606bee076
Add Bidir copy test and help.
2016-03-14 14:39:23 -05:00
Ben Sander
ac6ed35ba0
refactor, add support for speccing xfers in bytes
2016-03-13 09:41:06 -05:00
Aditya Atluri
d3ba2b9782
corrected hipDeviceGetProperties to hipGetDeviceProperties - not docs
2016-03-06 08:31:04 -06:00
Ben Sander
ff66ef0779
fixes for titan platform
2016-02-26 05:25:30 -06:00
Ben Sander
c300ffe458
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
2016-02-26 06:15:09 -06:00
Ben Sander
4adab7b7ef
Merge branch 'memtracker' into privatestaging
...
Conflicts:
src/hip_hcc.cpp
2016-02-25 19:38:46 -06:00
Evgeny Mankov
57e212606d
Attribute hipDeviceAttributeIsMultiGpuBoard for obtaining Device property isMultiGpuBoard is added.
...
On HIP path property obtaining done through hsa_iterate_agents and counting the devices of HSA_DEVICE_TYPE_GPU type.
P.S.
On multi-boards systems it might be problems with detection what board a GPU plugged into (not tested).
2016-02-25 23:44:39 +03:00
Evgeny Mankov
833c9e52ad
Guard #ifdef USE_ROCR_20 is added for ROCR_20 device properties (memoryClockRate, memoryBusWidth)
...
By default isn't defined.
To add ROCR_20 support HIP have to be compiled as follows: make CXX_DEFINES+=-DUSE_ROCR_20
2016-02-19 13:27:03 +03:00
Evgeny Mankov
1c19dbb807
Device property memoryBusWidth implementation.
...
+ Device property memoryBusWidth is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryBusWidth is added to hipDeviceAttribute_t struct.
+ Tests update.
2016-02-18 18:15:01 +03:00
Evgeny Mankov
5ea8543d2e
Device property memoryClockRate implementation.
...
+ Device property memoryClockRate is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryClockRate is added to hipDeviceAttribute_t struct.
+ Tests update.
+ Rename hipDevAttrConcurrentKernels to hipDeviceAttributeConcurrentKernels.
2016-02-18 17:25:28 +03:00
Evgeny Mankov
5b05a9fef1
hipInfo sample update with new Device Properties.
2016-02-18 15:08:55 +03:00
Evgeny Mankov
072d649d8d
Formatting, no functional changes.
2016-02-15 13:16:05 +03:00
Ben Sander
928996fec7
Enable -O3, style points on array size
2016-02-13 03:17:42 -06:00
Ben Sander
c3720c19a8
Result formatting
2016-02-13 01:14:01 -06:00
Ben Sander
bcb5953d6e
Add D2H test
2016-02-12 22:47:26 -06:00
Ben Sander
559db057d5
Add D2H test
2016-02-12 22:46:34 -06:00
Ben Sander
f3fd6476eb
Add Bus Bandwidth test, leveraged from SHOC.
2016-02-12 21:30:43 -06:00
Ben Sander
317566c1b6
Update links in docs to GPUOpen and to Doxygen
2016-01-27 00:23:47 -06:00
Aditya Avinash Atluri
1d74e7c05f
Update README.md
2016-01-26 10:43:41 -05:00
Aditya Avinash Atluri
2d57a3dd0b
Corrected compilation error
2016-01-26 10:40:06 -05:00