Alex Voicu
0755f1fc26
Modify the set component of the memcpy test (unclear why there is a memset component to begin with).
2017-11-21 17:52:01 +00:00
Alex Voicu
fb1021cc0a
This corrects how addresses are formed for symbols which reside in shared objects. For this case, the .value component of an ELF symbol holds the offset from the base VA where the shared object was loaded. Thus, to correctly obtain the VA of the object refered by the symbol, we must add the offset to the VA where the shared object is loaded. We were already doing this correctly for symbols denoting functions, but we were incorrect for those denoting objects.
2017-11-21 13:15:13 +00:00
Alex Voicu
f8c1c1b38e
Refactor the __device__ versions of memset and memcpy to be less awkward i.e. not return nullptr as opposed to the destination pointer (it can only be assumed it was done for maximum confusion) and actually unroll as they claim to. Change all of the {to, from}Symbol functions to use hipModuleGetGlobal, as opposed to hc::accelerator::get_symbol_address which is no longer valid with module based dispatch.
2017-11-21 02:40:34 +00:00
Alex Voicu
24f8a93ff7
Clean-up some remaining noise in program_state.cpp.
2017-11-20 22:41:46 +00:00
Alex Voicu
27bc487635
Correct ill-formed merge in earlier commit and adjust for differences with the new CUDA natural indexing mechanism.
2017-11-20 16:33:52 +00:00
Alex Voicu
30d90dab38
Re-sync with upstream.
2017-11-20 15:34:50 +00:00
Ben Sander
a1e0c96689
Merge pull request #264 from pzins/missing_end_marker
...
Fix missing MARKER_END
2017-11-20 06:08:01 -06:00
Maneesh Gupta
4477d3d314
Merge pull request #266 from gargrahul/fix_half2_gfx900
...
Fixed half2 issue on gfx900
2017-11-20 07:28:41 +05:30
Maneesh Gupta
29c0ab8401
Merge pull request #265 from phani544/nvccTests
...
[nvccTests]Enabled inline_asm_vadd on nvcc
2017-11-20 07:28:29 +05:30
Ben Sander
aeadc3f18f
Fix test on cuda
2017-11-19 15:31:02 -06:00
Ben Sander
a43262e699
Merge branch 'feature_natural_indexing' of https://github.com/AlexVlx/HIP
2017-11-19 15:25:17 -06:00
Ben Sander
fc34fd6f03
Temporarily disable P2P on nvidia (fails on dual GPU)
2017-11-19 15:21:37 -06:00
Alex Voicu
0cc921f103
This implements the trivial change needed to move back from the hip{Something}_{x, y, z} macros to the natural CUDA syntax of Something.{x, y, z}. This is contained in lines 384-404 in hip_runtime.h. All of the other changes have to do with changing unit tests to use this syntax. The macros are retained for backwards compatibility.
2017-11-19 01:54:12 +00:00
Alex Voicu
45ac5b70df
Merge remote-tracking branch 'origin/master' into feature_use_module_based_dispatch_instead_of_pfe
2017-11-16 23:20:15 +00:00
Rahul Garg
cfc9f87465
Fixed test case for GFX900
2017-11-16 09:34:52 +05:30
Rahul Garg
20947f80fb
Fixed half2 issue on gfx900
2017-11-15 18:52:59 +05:30
Phaneendr-kumar Lanka
9fa3ed4f6f
[nvccTests]Enabled inline_asm_vadd on nvcc
2017-11-14 16:37:59 +05:30
Pierre
e60a95d7dd
Fix missing MARKER_END
...
Logging status of hipCtxSynchronize was missing
Test if hip profiling is active for MARKER_END in ihipPostLaunchKernel
Add MARKER_END after the completion of a kernel launched through
the "grid launch"
2017-11-13 16:13:19 -05:00
Maneesh Gupta
7f3653af7a
Merge pull request #261 from gargrahul/fix_module_api_sample
...
Fix module_api sample
2017-11-13 11:55:54 +05:30
Rahul Garg
e434a23be5
Fix module_api sample
2017-11-13 08:56:39 +05:30
Alex Voicu
bdd2d6d602
Add omitted changes in CMakeLists.txt.
2017-11-10 01:20:50 +00:00
Alex Voicu
8c62d0fbc4
Update new tests so as to make them work with new variadic based launch mechanisms.
2017-11-10 00:14:34 +00:00
Alex Voicu
1518611976
Merge remote-tracking branch 'origin/master' into feature_use_module_based_dispatch_instead_of_pfe
2017-11-09 23:43:07 +00:00
Maneesh Gupta
925f97fac7
Merge pull request #250 from AlexVlx/feature_add_agent_global_support
...
Support for agent globals
2017-11-09 07:52:09 +05:30
Alex Voicu
f7919a8321
Merge remote-tracking branch 'origin/master' into feature_use_module_based_dispatch_instead_of_pfe
...
# Conflicts:
# tests/src/runtimeApi/stream/hipStreamSync2.cpp
2017-11-08 10:26:30 +00:00
Maneesh Gupta
6f1298d9f6
Merge pull request #255 from AlexVlx/remove_some_trailing_whitespace
...
Clean up trailing whitespace so as to reduce noise in #246 .
2017-11-08 10:29:19 +05:30
Maneesh Gupta
86f5934beb
Merge pull request #253 from mangupta/hit_add_cmakecmd
...
[hit] Add support for cmake commands as part of test infra
2017-11-08 09:50:29 +05:30
Alex Voicu
0ad1308efd
Clean up trailing whitespace so as to reduce noise in #246 .
2017-11-08 00:08:55 +00:00
Ben Sander
7b534885e6
Merge pull request #254 from bensander/event_thread_safe
...
Event thread safe
2017-11-07 17:59:51 +01:00
Maneesh Gupta
52c8c4b03c
Merge pull request #252 from phani544/newTests2
...
More tests for memory,stream & error APIs
2017-11-07 10:38:18 +05:30
Phaneendr-kumar Lanka
2a1d7db371
identation change in hipPeekAtLastError
2017-11-07 09:51:32 +05:30
Alex Voicu
8854dec897
Merge remote-tracking branch 'origin/master' into feature_use_module_based_dispatch_instead_of_pfe
2017-11-07 00:01:22 +00:00
Ben Sander
e71456cc6c
Check for null event in hipEventElapsedTime
2017-11-06 23:49:31 +00:00
Ben Sander
b5911acce2
hipStreamWaitEvent returns success if event created but not recorded
2017-11-06 23:49:31 +00:00
Ben Sander
955cfbfdc7
Make hipEvent_t thread safe.
...
Support re-recording of same event by different threads.
- Add criticalData structure to hipEvent_t, similar to mechanism used
for streams, contexts, device. Events are always locked
after streams to avoid deadlock.
- ihipEvent_t::locked_copyCrit can be used to copy critical state
including marker. The critical state in the event can then
be re-recorded.
- refactor hipEventElapsedTime. Remmove stale debug code, native signal
refs.
2017-11-06 23:49:25 +00:00
Maneesh Gupta
f92648e447
[hit] Add support for cmake commands as part of test infra
...
Cmake supports several builtin command-line tools. These tools can
now be specified as part of the HIT block. These commands will be
run during cmake configure step.
Change-Id: I32466c94b2fe1ecdc30249755fc027102295617d
2017-11-06 16:53:57 +05:30
Phaneendr-kumar Lanka
e18f4239b6
Modified tests as per review
2017-11-06 15:13:36 +05:30
Phaneendr-kumar Lanka
7427abba04
More tests for memory,stream & error APIs
2017-11-06 11:31:52 +05:30
Maneesh Gupta
dbb6b2bab8
Merge pull request #251 from ROCm-Developer-Tools/fix_event_state
...
Set event state AFTER it is recorded.
2017-11-06 07:28:11 +05:30
Maneesh Gupta
958ca62c63
Merge pull request #249 from bensander/warn_event
...
Add HIP_DB=warn + message if sync on dangerous event.
2017-11-06 07:25:40 +05:30
Maneesh Gupta
544538fd88
Merge pull request #239 from phani544/newTests
...
[newTests]Adding tests for device APIs
2017-11-06 07:22:13 +05:30
Ben Sander
1c65bb1684
Set event state AFTER it is recorded.
2017-11-05 10:33:18 -06:00
Alex Voicu
394886903c
Merge remote-tracking branch 'origin/master' into feature_use_module_based_dispatch_instead_of_pfe
2017-11-03 10:57:12 +00:00
Alex Voicu
17753cbd92
Merge remote-tracking branch 'origin/master' into feature_use_module_based_dispatch_instead_of_pfe
...
# Conflicts:
# src/hip_module.cpp
2017-11-03 10:53:39 +00:00
Phaneendr-kumar Lanka
fa32a1924a
[newTests]modified the test
2017-11-03 14:52:19 +05:30
Phaneendr-kumar Lanka
19946ca43d
[newTests]Modified the test as per review comments
2017-11-03 10:39:47 +05:30
Maneesh Gupta
afd3a6786d
Merge pull request #248 from mangupta/SWDEV-134531
...
Remove libsupc++ dependency from HIP
2017-11-03 08:01:59 +05:30
Alex Voicu
d12cf0da7d
This introduces correct support for agent global variables, and implements hipModuleGetGlobal as an actual equivalent for cuModuleGetGlobal.
2017-11-03 01:44:48 +00:00
Ben Sander
3c024a9cc7
Merge pull request #247 from AlexVlx/feature_cleanup_tests
...
Test cleanup
2017-11-02 11:31:10 +01:00
Maneesh Gupta
473208368e
Remove libsupc++ dependency from HIP
...
Change-Id: Ib843f99dfb3057b7c28029250c2449732ea7f17f
2017-11-02 12:29:09 +05:30