hipGraphAddHostNode and hipGraphAddMemcpyNode1D are not part of the HIP
graph implementation, hence they should not be exported.
Change-Id: Ia3d0188895fb4f64ebfa23d4a0a09c44438f878d
For hipRTC on Windows, add macro __HIPCC_RTC__ to allow
online compilation of with device functions excluding standard
C/C++ headers, system headers, and host HIP APIs.
Change-Id: I1d91f042baf1359856ec83ab7030dc58785e0334
On StreamBegincapture captures the parameters passed to APIs and respective node will be created and added to graph
All parameters are passed to STREAM_CAPTURE macro, it checks if stream in capture mode and redirects the call to the capture function and returns
Updated hipStream and hipEvent with capture parameters
Added handling for hipStreamBeginCapture & hipStreamEndCapture
Change-Id: Ic8926a7b4336c2cc81f0b3a9a224aa392c474134
Currently LiquidFlash cannot be supported from Github Enterprise,
hence we need to be able to build with out it.
Allow this by setting -DWITH_LIQUID_FLASH=0.
Change-Id: If59afe45c02a1623639360dbcc3927360f16ebd3
Selector indices are as follows (the upper 16-bits of the selector are not used): selector[0] = s<2:0> selector[1] = s<6:4> selector[2] = s<10:8> selector[3] = s<14:12>
Change-Id: Ibf76c6ec2374f1f5b9bba8bd9dbd73660f830eea
Windows doesn't have headers unistd.h, sys/wait.h, so
remove them for _WIN64. Also, Windows doesn't have rand_r
so define it. Fixes builds for HIP tests:
d2dMemCpyWithPinnedHostMemory, hipMemset3DFunctional,
hipModuleLoadMultProcessOnMultGPU,
hipModuleLoadUnloadStress, hipModuleNegative,
hipStreamACb_ThrdBehaviour, hipMemcpyNegativeMThrdMSize,
hipStreamACb_MStrm_Mgpu, StreamAddCallback,
hipStreamACb_StrmSyncTiming, hipStreamACb_MultiCalls,
hipMallocConcurrency, hipMemset3DRegressMultiThread,
and hipSetGetDevice.
Change-Id: Id2f98d664716026c590d879e464ba467d4c071be
Fix wrong mixing of current device and stream device in
ihipModuleLaunchKernel() and hipLaunchCooperativeKernel().
Fix missing hipSetDevice() in hipMemcpyWithStream* tests.
Change-Id: I09333bb40d239bb42c832df5ea16d17eeaeff5e7
hipAddressModeClamp corresponds to CL_ADDRESS_CLAMP_TO_EDGE.
This change fixes incorrect sampling near texture edges.
https://github.com/ROCm-Developer-Tools/HIP/pull/2229
Change-Id: Ia2af9f3cb582a74af7b1f65bad1ed0310eab2628
Change-Id: I5daeacd9dd5c6ce7f914d6e6e45dd41fb2a675a5
hipMemRangeGetAttributes was returning hipErrorInvalidValue due to improper
mapping of the arguments to cudaMemRangeGetAttributes.
If the width is 0, hipMemset2D or 3D APIs should return success
doing nothing
Change-Id: I97f8825d090a73ee63145e52d2b6d2cc0eddade3
Change-Id: I9d62fee70929ad0c79b947aa4cbbc8ab898fb64a
HIP supports emitting two types of static libraries. One type
will export host functions and is compatible with host linkers.
The second type exports device functions, but is generated with
ar manually. Also, add a README with steps on how to run these
samples with Makefile or CMake.
Change-Id: I1be15c2884583b370092bc8e4bf04f726f8f5a27
Remove hipSimpleIpcMem.cpp as the IPC APIs are expected to work
with different processes and not the same process. Also, modify
the hipMultiProcIpcMem.cpp to test the offset scenarios
Change-Id: Ia161fc9ab39e27f22d8ef7268e0d7669ece5cdce
Change-Id: Ic3cb79ef4aca40b4a504483eb81fd83bb8a201d9
[dtest] Tests for hipMemcpy2DFromArray
and hipMemcpy2DFromArrayAsync APIs
1. Negative scenarios for both the APIs
2. Functional size check scenarios for both APIs
3. Invoke the API from Peer Device Context
4. Simple scenarios for both APIs
5. Pinned Host Memory from same & multi GPU
Change-Id: I80c84ec3f9722d18683d31be65caa1bd64054da6
This test is incorrect about _Float16 overloading. Need to keep the test
passing for both old and new compiler until the compiler change is in
mainline.
Change-Id: I2f9be8957282fd0579c74f1e9f23e65c4484f3a2
Add concurrentManagedAccess detection in hipMallocManaged test.
Skip test when device doesn't support concurrentManagedAccess.
Change-Id: Ie54046feef3baba857a7068972ec1fc1a60c2dfd
Early notification allows CPU command state to be processed earlier.
Otherwise state update will be delayed until the actual possible
wait.
Change-Id: I0e9d6c2ad88e3723eab2f6dc9a20f288f6c1ca0c
HIP headers use few structure names as X, Y, and Z. This causes
compilation issues when the apps use similar names as macros.
Renamed the struct names to use reserved names such as
__X, __Y and __Z
Change-Id: I59416c3734f274e853c87d4856b7e616f6cee5f5