Aditya Atluri
bf3d3a9745
Revert "Fixed passing unpinned memory to async memcpy"
...
This reverts commit c272c53808 .
[ROCm/hip commit: 45965aaa2d ]
2016-03-08 03:17:27 -06:00
Aditya Atluri
c272c53808
Fixed passing unpinned memory to async memcpy
...
[ROCm/hip commit: 216af71480 ]
2016-03-08 03:14:49 -06:00
Aditya Atluri
91dbc3114d
added feature for hipHostGetFlags for CUDA and HIP
...
[ROCm/hip commit: 75952029d6 ]
2016-03-06 12:17:30 -06:00
Aditya Atluri
5346c57411
added flags support to be used later
...
[ROCm/hip commit: 1f3695b224 ]
2016-03-06 08:40:33 -06:00
Aditya Atluri
f1b8758919
corrected hipDeviceGetProperties to hipGetDeviceProperties - not docs
...
[ROCm/hip commit: d3ba2b9782 ]
2016-03-06 08:31:04 -06:00
Aditya Atluri
3c91a6d0a7
Added hipHostAlloc with hipHostAllocMapped flag
...
[ROCm/hip commit: 3aa764d5eb ]
2016-03-05 15:57:56 -06:00
Aditya Atluri
52049a5927
v2: Fixed implementation of hipMemcpyDefault flag
...
[ROCm/hip commit: bd23279271 ]
2016-03-04 06:00:04 -06:00
Aditya Atluri
cc9a9cc5db
Added support for hipMemcpyDefault
...
[ROCm/hip commit: c154e1f4e4 ]
2016-03-03 10:30:06 -06:00
Aditya Avinash Atluri
12e7f550b7
Update hip_hcc.cpp
...
[ROCm/hip commit: 40eefc1cde ]
2016-03-03 13:59:43 -06:00
Aditya Avinash Atluri
057bcc73f1
Fix output of hipPointerGetAttributes
...
The output of hipPointerGetAttributes is fixed to match CUDA counterpart.
[ROCm/hip commit: b6e34a44b0 ]
2016-03-03 13:58:18 -06:00
Aditya Atluri
0ed3d943f1
Initialize hip when single kernel is called
...
[ROCm/hip commit: ce7ae41d42 ]
2016-03-02 08:08:45 -06:00
Aditya Avinash Atluri
5214b314a9
H2H Async memcpy fix
...
In this change, the cpu memcpy will wait until all the commands in the current stream are done.
Note that, it only waits on current stream. But not on other streams.
[ROCm/hip commit: 180bc32db0 ]
2016-02-29 12:49:50 -06:00
Ben Sander
5fc32bc1e8
Copy dependency bug fixes and test modes.
...
Add dependency for host-to-host copy.
Add debug mode for HIP_DISABLE_HW_COPY_DEP and
HIP_DISABLE_HW_KERNEL_DEP - setting these to -1 now ignores
all dependencies.
[ROCm/hip commit: ba9ad6be80 ]
2016-02-28 21:19:49 -06:00
Ben Sander
cefe65e24a
touchup
...
[ROCm/hip commit: af22d056e0 ]
2016-02-28 21:08:53 -06:00
pensun
b9421830a4
Test cases for HIP_VISIBLE_DEVICES/CUDA_VISIBLE_DEVICES.
...
hipEnvVar is the base test case, to be called by hipEnvVarDriver
at the run time.
Test case includes tests for normal use case of the environment
variable, invalid value/sequence and use CUDA_VISIBLE_DEVICES as a
alternative.
[ROCm/hip commit: 39b44cb484 ]
2016-02-27 14:48:00 -06:00
pensun
1154872b0d
improve the HIP_VISIBLE_DEVICES implementation
...
[ROCm/hip commit: 1f606261c1 ]
2016-02-27 14:14:08 -06:00
pensun
a71c5a6cf0
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/hip commit: 07e56d4666 ]
2016-02-27 04:25:28 -06:00
Aditya Avinash Atluri
f2dfb87abf
Merge pull request #4 from AMDComputeLibraries/memtracker
...
hipGetPointerAttrib behavioral changes
[ROCm/hip commit: ecadb1623c ]
2016-02-27 10:51:23 -06:00
Ben Sander
66d6dbf686
disable rocrv2, properly
...
[ROCm/hip commit: ea09557e1b ]
2016-02-27 03:31:30 -06:00
Aditya Avinash Atluri
e7631f28e5
Corrected hipPointerGetAttribute
...
Made hipPointerGetAttribute work same as cudaPointerGetAttribute for HCC
[ROCm/hip commit: 66aa7f2f8a ]
2016-02-26 18:50:40 -06:00
pensun
c864a3f0fd
relsove conflicts
...
[ROCm/hip commit: 57f60b34fb ]
2016-02-26 09:57:40 -06:00
pensun
dc3710cee4
fix compiling error
...
[ROCm/hip commit: ee7ac16396 ]
2016-02-26 09:50:00 -06:00
Ben Sander
1ac07d2b87
fixes for titan platform
...
[ROCm/hip commit: ff66ef0779 ]
2016-02-26 05:25:30 -06:00
Ben Sander
b46a90852e
Disable ROCR_V2
...
[ROCm/hip commit: 6e0ccdfb95 ]
2016-02-26 23:34:45 -06:00
Ben Sander
193dbe4632
Merge branch 'memtracker' into privatestaging
...
Conflicts:
include/nvcc_detail/hip_runtime_api.h
[ROCm/hip commit: 369e0d7b5b ]
2016-02-26 06:17:05 -06:00
Ben Sander
8a2bcf2da3
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
[ROCm/hip commit: c300ffe458 ]
2016-02-26 06:15:09 -06:00
Ben Sander
a10f2d1a93
Merge branch 'memtracker' of https://github.com/AMDComputeLibraries/HIP-privatestaging into memtracker
...
Conflicts:
tests/src/hipMemcpy.cpp
[ROCm/hip commit: d319299ddb ]
2016-02-25 23:22:51 -06:00
Ben Sander
5ca4914e0e
Merge branch 'memtracker' into privatestaging
...
Conflicts:
src/hip_hcc.cpp
[ROCm/hip commit: 4adab7b7ef ]
2016-02-25 19:38:46 -06:00
Ben Sander
ae365b4ae2
Improve memory copy and commands switching
...
- Add chicken bits to use host-side dependency management.
- Add optional PinInPlace path for unpinned copies
- Synchronize before pinned memcpy path.
- Add mutex to protect two threads launching to same stream.
[ROCm/hip commit: 8b64c0dc62 ]
2016-02-25 19:19:49 -06:00
Evgeny Mankov
82900a1888
Attribute hipDeviceAttributeIsMultiGpuBoard for obtaining Device property isMultiGpuBoard is added.
...
On HIP path property obtaining done through hsa_iterate_agents and counting the devices of HSA_DEVICE_TYPE_GPU type.
P.S.
On multi-boards systems it might be problems with detection what board a GPU plugged into (not tested).
[ROCm/hip commit: 57e212606d ]
2016-02-25 23:44:39 +03:00
Ben Sander
60552f5133
Add tests for multi-threaded streams
...
[ROCm/hip commit: 7090f5c3f9 ]
2016-02-23 12:08:22 -06:00
Ben Sander
1888acb5f3
Sync review.
...
- add calls to ihipInit missing from some routines.
- sync before draining a stream.
[ROCm/hip commit: 3886d494f4 ]
2016-02-23 04:07:11 -06:00
Ben Sander
f0c734e256
Improve async copy implementation.
...
- Add device-side signal waits when transitioning between command classes
(Kernel, H2D copy, D2H copy).
- Support waiting in staged memory copies as well.
- Add several chicken bits to control implementation:
- HIP_DISABLE_ENQ_BARRIER
- HIP_DISABLE_BIDIR_MEMCPY
- HIP_ONESHOT_COPY_DEP
- Refactor signal pool to support efficient deallocation based on
signsequnm.
- Deallocate copy signals on eventSynchronize.
- Improve copy tests, add pingpong.
[ROCm/hip commit: 549b18ce77 ]
2016-02-22 23:15:24 -06:00
Ben Sander
23b257bca4
Merge branch 'memtracker' of https://github.com/AMDComputeLibraries/HIP-privatestaging into memtracker
...
[ROCm/hip commit: 0a98db4b5f ]
2016-02-22 08:33:47 -06:00
gargrahul
ccd1ed0a97
Update for shared atomics support
...
[ROCm/hip commit: a2fbf06129 ]
2016-02-22 16:21:52 +05:30
Ben Sander
ebf2700936
Track last command to a stream.
...
Passing simple tests.
[ROCm/hip commit: d33d806a5b ]
2016-02-20 11:02:07 -06:00
Evgeny Mankov
c3a600c63b
Guard #ifdef USE_ROCR_20 is added for ROCR_20 device properties (memoryClockRate, memoryBusWidth)
...
By default isn't defined.
To add ROCR_20 support HIP have to be compiled as follows: make CXX_DEFINES+=-DUSE_ROCR_20
[ROCm/hip commit: 833c9e52ad ]
2016-02-19 13:27:03 +03:00
Evgeny Mankov
68621c6702
Formatting, no functional changes.
...
[ROCm/hip commit: fbdeee39cd ]
2016-02-18 18:54:19 +03:00
Evgeny Mankov
4fcd9f2542
Device property memoryBusWidth implementation.
...
+ Device property memoryBusWidth is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryBusWidth is added to hipDeviceAttribute_t struct.
+ Tests update.
[ROCm/hip commit: 1c19dbb807 ]
2016-02-18 18:15:01 +03:00
Evgeny Mankov
a0cc7134e3
Device property memoryClockRate implementation.
...
+ Device property memoryClockRate is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryClockRate is added to hipDeviceAttribute_t struct.
+ Tests update.
+ Rename hipDevAttrConcurrentKernels to hipDeviceAttributeConcurrentKernels.
[ROCm/hip commit: 5ea8543d2e ]
2016-02-18 17:25:28 +03:00
Evgeny Mankov
8c1a0d1924
Attribute hipDevAttrConcurrentKernels for obtaining Device property concurrentKernels is added.
...
[ROCm/hip commit: 2b6fda77ca ]
2016-02-18 14:34:18 +03:00
Ben Sander
4e0ba06044
Enable Tracker and ROCR by default, verify with HCC
...
[ROCm/hip commit: c6f8883b0d ]
2016-02-17 23:03:37 -06:00
Ben Sander
a2d8f9d98e
Remove HIP-local AM tracker (now in HCC)
...
[ROCm/hip commit: d653782d9d ]
2016-02-17 21:33:32 -06:00
Ben Sander
5ab45e9be7
USE_AM_TRACKER=0 works
...
[ROCm/hip commit: 44f40e171a ]
2016-02-17 21:23:36 -06:00
pensun
b3d70ca271
1. Bug fix
...
2. passed initial tests on different sets of
HIP_VISIBLE_DEVICES: (0),(1),(0,1),(1,2),(2,3),(1,2,3),(2,3,4),(1,5,2,3)
and achieved expected choice of GPU devices at the runtime.
3. Passed HIP test suite.
[ROCm/hip commit: 8aa4bfce57 ]
2016-02-17 09:32:50 -06:00
pensun
e5ff38e421
Implementation of HIP_VISIBLE_DEVICES in runtime
...
[ROCm/hip commit: c1e120fb1b ]
2016-02-17 06:59:18 -06:00
Ben Sander
bb8a1fe72d
more work on async copies
...
[ROCm/hip commit: 59379ffb44 ]
2016-02-17 00:59:12 -06:00
pensun
66df1ff930
modify to add remove invalid devices numbers
...
[ROCm/hip commit: 060439b6ab ]
2016-02-16 10:00:05 -06:00
pensun
8c230eab73
Implement to read HIP_VISIBLE_DEVICES to internal global variable
...
[ROCm/hip commit: d40cbef2af ]
2016-02-16 07:39:04 -06:00
Ben Sander
512163b889
Add per-stream pool for hsa_signals.
...
[ROCm/hip commit: caef9b5ced ]
2016-02-16 01:59:13 -06:00