Граф коммитов

2117 Коммитов

Автор SHA1 Сообщение Дата
Ben Sander 2b2bc08c10 Merge pull request #275 from mangupta/bump_version_1.5
Bumped HIP version to 1.5 and updated summary in HIP release notes

[ROCm/clr commit: 2f542c46c8]
2017-11-29 15:06:02 -06:00
Maneesh Gupta 98ec873ede Bumped HIP version to 1.5 and updated summary in HIP release notes
Change-Id: I4b612da88236674fac2c585ab6891bdd495511b0


[ROCm/clr commit: 2d5e0274ef]
2017-11-29 13:56:17 +05:30
Ben Sander 0de96b6e3e Merge pull request #256 from gargrahul/texture_driver_api_support
Texture driver APIs support

[ROCm/clr commit: e93a24bdbe]
2017-11-27 13:52:39 -06:00
Rahul Garg 37ceb9a1a8 Added note in Release notes for texture driver APIs
[ROCm/clr commit: 6c18df987d]
2017-11-27 22:06:50 +05:30
Evgeny Mankov 2df33086a8 Merge pull request #262 from ChrisKitching/frontendaction
[HIPIFY] Mostly fix preprocessor-or-template induced issues

[ROCm/clr commit: b25d199111]
2017-11-27 17:30:11 +03:00
Rahul Garg 896c146fef Porting guides update for texture APIs usage
[ROCm/clr commit: 3e9a4cfdd1]
2017-11-24 12:00:55 +05:30
Ben Sander 84802e5d68 Merge pull request #273 from mangupta/swdev-129574
Fix float2int rounding functions

[ROCm/clr commit: 77de4b96cf]
2017-11-23 12:04:36 -06:00
Maneesh Gupta 05963822e7 Fix float2int rounding functions
Change-Id: I67943859a6344c5eec0eaa23418c9b802ef72468


[ROCm/clr commit: 822743845c]
2017-11-23 09:57:24 +05:30
Evgeny Mankov d7e1419d8b Merge pull request #263 from ChrisKitching/headers
[HIPIFY] Add hipify mappings for all CUDA headers that have HIP equivalents

[ROCm/clr commit: 6bddac1cfa]
2017-11-22 21:24:21 +03:00
Rahul Garg 212afdf56b Fixed review comments
[ROCm/clr commit: 38029f2849]
2017-11-21 21:19:06 +05:30
Rahul Garg dc2c56c43d Changed function hipMemcpy_2D to hipMemcpyParam2D
[ROCm/clr commit: 24307fe5c4]
2017-11-21 12:36:24 +05:30
Ben Sander fdb4ea06f7 Merge pull request #264 from pzins/missing_end_marker
Fix missing MARKER_END

[ROCm/clr commit: a1e0c96689]
2017-11-20 06:08:01 -06:00
Rahul Garg 9cee9705dc -Moved coGlobals in hipModule class (takes care of multi module case)
-Used mutex scope for updating coGlobals


[ROCm/clr commit: eff09fcd86]
2017-11-20 16:23:18 +05:30
Maneesh Gupta 02cbb93ec1 Merge pull request #266 from gargrahul/fix_half2_gfx900
Fixed half2 issue on gfx900

[ROCm/clr commit: 4477d3d314]
2017-11-20 07:28:41 +05:30
Maneesh Gupta 9498ff3cae Merge pull request #265 from phani544/nvccTests
[nvccTests]Enabled inline_asm_vadd on nvcc

[ROCm/clr commit: 29c0ab8401]
2017-11-20 07:28:29 +05:30
Ben Sander 809f575305 Fix test on cuda
[ROCm/clr commit: aeadc3f18f]
2017-11-19 15:31:02 -06:00
Ben Sander 80021d3c68 Merge branch 'feature_natural_indexing' of https://github.com/AlexVlx/HIP
[ROCm/clr commit: a43262e699]
2017-11-19 15:25:17 -06:00
Ben Sander 6b1fb439f6 Temporarily disable P2P on nvidia (fails on dual GPU)
[ROCm/clr commit: fc34fd6f03]
2017-11-19 15:21:37 -06:00
Rahul Garg 7453c8b25c Update hipModuleGetTexRef API
[ROCm/clr commit: 76f174b536]
2017-11-19 22:10:46 +05:30
Alex Voicu af0ae5421c This implements the trivial change needed to move back from the hip{Something}_{x, y, z} macros to the natural CUDA syntax of Something.{x, y, z}. This is contained in lines 384-404 in hip_runtime.h. All of the other changes have to do with changing unit tests to use this syntax. The macros are retained for backwards compatibility.
[ROCm/clr commit: 0cc921f103]
2017-11-19 01:54:12 +00:00
Rahul Garg b0461a2176 Fixed test case for GFX900
[ROCm/clr commit: cfc9f87465]
2017-11-16 09:34:52 +05:30
Rahul Garg c3ba309562 Fixed half2 issue on gfx900
[ROCm/clr commit: 20947f80fb]
2017-11-15 18:52:59 +05:30
Rahul Garg 0f8296b0c2 Removed redundant desc variable
[ROCm/clr commit: e2e499461a]
2017-11-15 18:28:27 +05:30
Rahul Garg a0f838a75e -Fixed texture driver API sample
-Added hipTexRefSetAddress and hipTexRefSetAddress2D APIs


[ROCm/clr commit: edbfeeb141]
2017-11-15 18:23:28 +05:30
Phaneendr-kumar Lanka 3bfe30e22c [nvccTests]Enabled inline_asm_vadd on nvcc
[ROCm/clr commit: 9fa3ed4f6f]
2017-11-14 16:37:59 +05:30
Rahul Garg c154921e10 Texture code reorganized
[ROCm/clr commit: e536eb038d]
2017-11-14 11:09:35 +05:30
Pierre 75a1e24519 Fix missing MARKER_END
Logging status of hipCtxSynchronize was missing
Test if hip profiling is active for MARKER_END in ihipPostLaunchKernel
Add MARKER_END after the completion of a kernel launched through
the "grid launch"


[ROCm/clr commit: e60a95d7dd]
2017-11-13 16:13:19 -05:00
Chris Kitching c3dc74be1b Add an explicit check for proper rewriting of CUDA includes
[ROCm/clr commit: 642af94492]
2017-11-13 21:02:42 +00:00
Chris Kitching 98034c4e3f Add a preprocessor conditional to one of the tests
Hurrah, we can cope with ifdefs now (except for kernel launches)


[ROCm/clr commit: e8c1e7bb11]
2017-11-13 20:58:55 +00:00
Chris Kitching 83e450e2c7 Use proper clang diagnostics for printing warnings
Much pretty. Very wow

This gives users all the usual power when it comes to manipulating
clang diagnostics. People can pass -Werror can have hipify fail if
it doesn't completely translate a file, for example. Much nicer
than reinventing the wheel.


[ROCm/clr commit: 506deca07f]
2017-11-13 20:58:55 +00:00
Chris Kitching 842c8c28d7 Use a custom FrontendAction to simplify identifier translation
Most of what hipify does is really just replacing CUDA idenitifers
with HIP ones. CUDA function calls, preprocessor macro calls,
enum references, types, etc.

This is problematic: calls/types/enum-refs require name resolution
for the AST matcher to work. This fails in the presence of code
deleted by the preprocessor, and in two-pass template compilation.

Instead, we can simply hook the lexer and have it rewrite the
identifiers for us.

This approach means identifier transformations will work correctly
regardless of where they appear (and we get to delete lots of code)

- Fixes #260
- Helps a bit with #207 - it will still fail to translate kernel
calls in preprocessor-ignored code, but everything except kerel
launches should translate correctly now, even in
preprocessor-deleted code.


[ROCm/clr commit: 7a441aef10]
2017-11-13 20:58:54 +00:00
Chris Kitching e00c92d828 Add hipify mappings for all CUDA headers that have HIP equivalents
I'm particularly running into issues with `device_types.h` in real
CUDA code...


[ROCm/clr commit: 9014c89f04]
2017-11-13 17:20:07 +00:00
Chris Kitching 41d7a05974 Add a test that exposes #260
[ROCm/clr commit: 0afc6b2e95]
2017-11-13 16:18:15 +00:00
Chris Kitching 5ced4544eb Add a couple of missing CHECK directives to concurrentKernels.cu
[ROCm/clr commit: 4353fc06d7]
2017-11-13 16:17:19 +00:00
Maneesh Gupta a857b650ad Merge pull request #261 from gargrahul/fix_module_api_sample
Fix module_api sample

[ROCm/clr commit: 7f3653af7a]
2017-11-13 11:55:54 +05:30
Rahul Garg 4ff4a71867 Fix module_api sample
[ROCm/clr commit: e434a23be5]
2017-11-13 08:56:39 +05:30
Rahul Garg 8069290ca1 Added texture 2D driver API usage example
[ROCm/clr commit: 3711136e46]
2017-11-09 22:35:29 +05:30
Rahul Garg bb8c31c198 Texture driver APIs support
[ROCm/clr commit: 1851c153f6]
2017-11-09 22:10:55 +05:30
Maneesh Gupta 0b86003f38 Merge pull request #250 from AlexVlx/feature_add_agent_global_support
Support for agent globals

[ROCm/clr commit: 925f97fac7]
2017-11-09 07:52:09 +05:30
Maneesh Gupta 907a265bc1 Merge pull request #255 from AlexVlx/remove_some_trailing_whitespace
Clean up trailing whitespace so as to reduce noise in #246.

[ROCm/clr commit: 6f1298d9f6]
2017-11-08 10:29:19 +05:30
Maneesh Gupta 9fe4f363ef Merge pull request #253 from mangupta/hit_add_cmakecmd
[hit] Add support for cmake commands as part of test infra

[ROCm/clr commit: 86f5934beb]
2017-11-08 09:50:29 +05:30
Alex Voicu 142213a52b Clean up trailing whitespace so as to reduce noise in #246.
[ROCm/clr commit: 0ad1308efd]
2017-11-08 00:08:55 +00:00
Ben Sander 6f7992387b Merge pull request #254 from bensander/event_thread_safe
Event thread safe

[ROCm/clr commit: 7b534885e6]
2017-11-07 17:59:51 +01:00
Maneesh Gupta 15a6a84a81 Merge pull request #252 from phani544/newTests2
More tests for memory,stream & error APIs

[ROCm/clr commit: 52c8c4b03c]
2017-11-07 10:38:18 +05:30
Phaneendr-kumar Lanka fc1af184a4 identation change in hipPeekAtLastError
[ROCm/clr commit: 2a1d7db371]
2017-11-07 09:51:32 +05:30
Ben Sander 2c95d48f87 Check for null event in hipEventElapsedTime
[ROCm/clr commit: e71456cc6c]
2017-11-06 23:49:31 +00:00
Ben Sander f761859993 hipStreamWaitEvent returns success if event created but not recorded
[ROCm/clr commit: b5911acce2]
2017-11-06 23:49:31 +00:00
Ben Sander 379b3fc7a0 Make hipEvent_t thread safe.
Support re-recording of same event by different threads.

- Add criticalData structure to hipEvent_t, similar to mechanism used
  for streams, contexts, device.  Events are always locked
  after streams to avoid deadlock.
- ihipEvent_t::locked_copyCrit can be used to copy critical state
  including marker.  The critical state in the event can then
  be re-recorded.
- refactor hipEventElapsedTime.  Remmove stale debug code, native signal
  refs.


[ROCm/clr commit: 955cfbfdc7]
2017-11-06 23:49:25 +00:00
Maneesh Gupta f00dd1ad73 [hit] Add support for cmake commands as part of test infra
Cmake supports several builtin command-line tools. These tools can
now be specified as part of the HIT block. These commands will be
run during cmake configure step.

Change-Id: I32466c94b2fe1ecdc30249755fc027102295617d


[ROCm/clr commit: f92648e447]
2017-11-06 16:53:57 +05:30
Phaneendr-kumar Lanka b36eb26ef0 Modified tests as per review
[ROCm/clr commit: e18f4239b6]
2017-11-06 15:13:36 +05:30