Gráfico de commits

2108 Commits

Autor SHA1 Mensaje Fecha
Alex Voicu 0755f1fc26 Modify the set component of the memcpy test (unclear why there is a memset component to begin with). 2017-11-21 17:52:01 +00:00
Alex Voicu fb1021cc0a This corrects how addresses are formed for symbols which reside in shared objects. For this case, the .value component of an ELF symbol holds the offset from the base VA where the shared object was loaded. Thus, to correctly obtain the VA of the object refered by the symbol, we must add the offset to the VA where the shared object is loaded. We were already doing this correctly for symbols denoting functions, but we were incorrect for those denoting objects. 2017-11-21 13:15:13 +00:00
Alex Voicu f8c1c1b38e Refactor the __device__ versions of memset and memcpy to be less awkward i.e. not return nullptr as opposed to the destination pointer (it can only be assumed it was done for maximum confusion) and actually unroll as they claim to. Change all of the {to, from}Symbol functions to use hipModuleGetGlobal, as opposed to hc::accelerator::get_symbol_address which is no longer valid with module based dispatch. 2017-11-21 02:40:34 +00:00
Alex Voicu 24f8a93ff7 Clean-up some remaining noise in program_state.cpp. 2017-11-20 22:41:46 +00:00
Alex Voicu 27bc487635 Correct ill-formed merge in earlier commit and adjust for differences with the new CUDA natural indexing mechanism. 2017-11-20 16:33:52 +00:00
Alex Voicu 30d90dab38 Re-sync with upstream. 2017-11-20 15:34:50 +00:00
Ben Sander a1e0c96689 Merge pull request #264 from pzins/missing_end_marker
Fix missing MARKER_END
2017-11-20 06:08:01 -06:00
Maneesh Gupta 4477d3d314 Merge pull request #266 from gargrahul/fix_half2_gfx900
Fixed half2 issue on gfx900
2017-11-20 07:28:41 +05:30
Maneesh Gupta 29c0ab8401 Merge pull request #265 from phani544/nvccTests
[nvccTests]Enabled inline_asm_vadd on nvcc
2017-11-20 07:28:29 +05:30
Ben Sander aeadc3f18f Fix test on cuda 2017-11-19 15:31:02 -06:00
Ben Sander a43262e699 Merge branch 'feature_natural_indexing' of https://github.com/AlexVlx/HIP 2017-11-19 15:25:17 -06:00
Ben Sander fc34fd6f03 Temporarily disable P2P on nvidia (fails on dual GPU) 2017-11-19 15:21:37 -06:00
Alex Voicu 0cc921f103 This implements the trivial change needed to move back from the hip{Something}_{x, y, z} macros to the natural CUDA syntax of Something.{x, y, z}. This is contained in lines 384-404 in hip_runtime.h. All of the other changes have to do with changing unit tests to use this syntax. The macros are retained for backwards compatibility. 2017-11-19 01:54:12 +00:00
Alex Voicu 45ac5b70df Merge remote-tracking branch 'origin/master' into feature_use_module_based_dispatch_instead_of_pfe 2017-11-16 23:20:15 +00:00
Rahul Garg cfc9f87465 Fixed test case for GFX900 2017-11-16 09:34:52 +05:30
Rahul Garg 20947f80fb Fixed half2 issue on gfx900 2017-11-15 18:52:59 +05:30
Phaneendr-kumar Lanka 9fa3ed4f6f [nvccTests]Enabled inline_asm_vadd on nvcc 2017-11-14 16:37:59 +05:30
Pierre e60a95d7dd Fix missing MARKER_END
Logging status of hipCtxSynchronize was missing
Test if hip profiling is active for MARKER_END in ihipPostLaunchKernel
Add MARKER_END after the completion of a kernel launched through
the "grid launch"
2017-11-13 16:13:19 -05:00
Maneesh Gupta 7f3653af7a Merge pull request #261 from gargrahul/fix_module_api_sample
Fix module_api sample
2017-11-13 11:55:54 +05:30
Rahul Garg e434a23be5 Fix module_api sample 2017-11-13 08:56:39 +05:30
Alex Voicu bdd2d6d602 Add omitted changes in CMakeLists.txt. 2017-11-10 01:20:50 +00:00
Alex Voicu 8c62d0fbc4 Update new tests so as to make them work with new variadic based launch mechanisms. 2017-11-10 00:14:34 +00:00
Alex Voicu 1518611976 Merge remote-tracking branch 'origin/master' into feature_use_module_based_dispatch_instead_of_pfe 2017-11-09 23:43:07 +00:00
Maneesh Gupta 925f97fac7 Merge pull request #250 from AlexVlx/feature_add_agent_global_support
Support for agent globals
2017-11-09 07:52:09 +05:30
Alex Voicu f7919a8321 Merge remote-tracking branch 'origin/master' into feature_use_module_based_dispatch_instead_of_pfe
# Conflicts:
#	tests/src/runtimeApi/stream/hipStreamSync2.cpp
2017-11-08 10:26:30 +00:00
Maneesh Gupta 6f1298d9f6 Merge pull request #255 from AlexVlx/remove_some_trailing_whitespace
Clean up trailing whitespace so as to reduce noise in #246.
2017-11-08 10:29:19 +05:30
Maneesh Gupta 86f5934beb Merge pull request #253 from mangupta/hit_add_cmakecmd
[hit] Add support for cmake commands as part of test infra
2017-11-08 09:50:29 +05:30
Alex Voicu 0ad1308efd Clean up trailing whitespace so as to reduce noise in #246. 2017-11-08 00:08:55 +00:00
Ben Sander 7b534885e6 Merge pull request #254 from bensander/event_thread_safe
Event thread safe
2017-11-07 17:59:51 +01:00
Maneesh Gupta 52c8c4b03c Merge pull request #252 from phani544/newTests2
More tests for memory,stream & error APIs
2017-11-07 10:38:18 +05:30
Phaneendr-kumar Lanka 2a1d7db371 identation change in hipPeekAtLastError 2017-11-07 09:51:32 +05:30
Alex Voicu 8854dec897 Merge remote-tracking branch 'origin/master' into feature_use_module_based_dispatch_instead_of_pfe 2017-11-07 00:01:22 +00:00
Ben Sander e71456cc6c Check for null event in hipEventElapsedTime 2017-11-06 23:49:31 +00:00
Ben Sander b5911acce2 hipStreamWaitEvent returns success if event created but not recorded 2017-11-06 23:49:31 +00:00
Ben Sander 955cfbfdc7 Make hipEvent_t thread safe.
Support re-recording of same event by different threads.

- Add criticalData structure to hipEvent_t, similar to mechanism used
  for streams, contexts, device.  Events are always locked
  after streams to avoid deadlock.
- ihipEvent_t::locked_copyCrit can be used to copy critical state
  including marker.  The critical state in the event can then
  be re-recorded.
- refactor hipEventElapsedTime.  Remmove stale debug code, native signal
  refs.
2017-11-06 23:49:25 +00:00
Maneesh Gupta f92648e447 [hit] Add support for cmake commands as part of test infra
Cmake supports several builtin command-line tools. These tools can
now be specified as part of the HIT block. These commands will be
run during cmake configure step.

Change-Id: I32466c94b2fe1ecdc30249755fc027102295617d
2017-11-06 16:53:57 +05:30
Phaneendr-kumar Lanka e18f4239b6 Modified tests as per review 2017-11-06 15:13:36 +05:30
Phaneendr-kumar Lanka 7427abba04 More tests for memory,stream & error APIs 2017-11-06 11:31:52 +05:30
Maneesh Gupta dbb6b2bab8 Merge pull request #251 from ROCm-Developer-Tools/fix_event_state
Set event state AFTER it is recorded.
2017-11-06 07:28:11 +05:30
Maneesh Gupta 958ca62c63 Merge pull request #249 from bensander/warn_event
Add HIP_DB=warn + message if sync on dangerous event.
2017-11-06 07:25:40 +05:30
Maneesh Gupta 544538fd88 Merge pull request #239 from phani544/newTests
[newTests]Adding tests for device APIs
2017-11-06 07:22:13 +05:30
Ben Sander 1c65bb1684 Set event state AFTER it is recorded. 2017-11-05 10:33:18 -06:00
Alex Voicu 394886903c Merge remote-tracking branch 'origin/master' into feature_use_module_based_dispatch_instead_of_pfe 2017-11-03 10:57:12 +00:00
Alex Voicu 17753cbd92 Merge remote-tracking branch 'origin/master' into feature_use_module_based_dispatch_instead_of_pfe
# Conflicts:
#	src/hip_module.cpp
2017-11-03 10:53:39 +00:00
Phaneendr-kumar Lanka fa32a1924a [newTests]modified the test 2017-11-03 14:52:19 +05:30
Phaneendr-kumar Lanka 19946ca43d [newTests]Modified the test as per review comments 2017-11-03 10:39:47 +05:30
Maneesh Gupta afd3a6786d Merge pull request #248 from mangupta/SWDEV-134531
Remove libsupc++ dependency from HIP
2017-11-03 08:01:59 +05:30
Alex Voicu d12cf0da7d This introduces correct support for agent global variables, and implements hipModuleGetGlobal as an actual equivalent for cuModuleGetGlobal. 2017-11-03 01:44:48 +00:00
Ben Sander 3c024a9cc7 Merge pull request #247 from AlexVlx/feature_cleanup_tests
Test cleanup
2017-11-02 11:31:10 +01:00
Maneesh Gupta 473208368e Remove libsupc++ dependency from HIP
Change-Id: Ib843f99dfb3057b7c28029250c2449732ea7f17f
2017-11-02 12:29:09 +05:30