Ben Sander
2b2bc08c10
Merge pull request #275 from mangupta/bump_version_1.5
...
Bumped HIP version to 1.5 and updated summary in HIP release notes
[ROCm/clr commit: 2f542c46c8 ]
2017-11-29 15:06:02 -06:00
Maneesh Gupta
98ec873ede
Bumped HIP version to 1.5 and updated summary in HIP release notes
...
Change-Id: I4b612da88236674fac2c585ab6891bdd495511b0
[ROCm/clr commit: 2d5e0274ef ]
2017-11-29 13:56:17 +05:30
Ben Sander
0de96b6e3e
Merge pull request #256 from gargrahul/texture_driver_api_support
...
Texture driver APIs support
[ROCm/clr commit: e93a24bdbe ]
2017-11-27 13:52:39 -06:00
Rahul Garg
37ceb9a1a8
Added note in Release notes for texture driver APIs
...
[ROCm/clr commit: 6c18df987d ]
2017-11-27 22:06:50 +05:30
Evgeny Mankov
2df33086a8
Merge pull request #262 from ChrisKitching/frontendaction
...
[HIPIFY] Mostly fix preprocessor-or-template induced issues
[ROCm/clr commit: b25d199111 ]
2017-11-27 17:30:11 +03:00
Rahul Garg
896c146fef
Porting guides update for texture APIs usage
...
[ROCm/clr commit: 3e9a4cfdd1 ]
2017-11-24 12:00:55 +05:30
Ben Sander
84802e5d68
Merge pull request #273 from mangupta/swdev-129574
...
Fix float2int rounding functions
[ROCm/clr commit: 77de4b96cf ]
2017-11-23 12:04:36 -06:00
Maneesh Gupta
05963822e7
Fix float2int rounding functions
...
Change-Id: I67943859a6344c5eec0eaa23418c9b802ef72468
[ROCm/clr commit: 822743845c ]
2017-11-23 09:57:24 +05:30
Evgeny Mankov
d7e1419d8b
Merge pull request #263 from ChrisKitching/headers
...
[HIPIFY] Add hipify mappings for all CUDA headers that have HIP equivalents
[ROCm/clr commit: 6bddac1cfa ]
2017-11-22 21:24:21 +03:00
Rahul Garg
212afdf56b
Fixed review comments
...
[ROCm/clr commit: 38029f2849 ]
2017-11-21 21:19:06 +05:30
Rahul Garg
dc2c56c43d
Changed function hipMemcpy_2D to hipMemcpyParam2D
...
[ROCm/clr commit: 24307fe5c4 ]
2017-11-21 12:36:24 +05:30
Ben Sander
fdb4ea06f7
Merge pull request #264 from pzins/missing_end_marker
...
Fix missing MARKER_END
[ROCm/clr commit: a1e0c96689 ]
2017-11-20 06:08:01 -06:00
Rahul Garg
9cee9705dc
-Moved coGlobals in hipModule class (takes care of multi module case)
...
-Used mutex scope for updating coGlobals
[ROCm/clr commit: eff09fcd86 ]
2017-11-20 16:23:18 +05:30
Maneesh Gupta
02cbb93ec1
Merge pull request #266 from gargrahul/fix_half2_gfx900
...
Fixed half2 issue on gfx900
[ROCm/clr commit: 4477d3d314 ]
2017-11-20 07:28:41 +05:30
Maneesh Gupta
9498ff3cae
Merge pull request #265 from phani544/nvccTests
...
[nvccTests]Enabled inline_asm_vadd on nvcc
[ROCm/clr commit: 29c0ab8401 ]
2017-11-20 07:28:29 +05:30
Ben Sander
809f575305
Fix test on cuda
...
[ROCm/clr commit: aeadc3f18f ]
2017-11-19 15:31:02 -06:00
Ben Sander
80021d3c68
Merge branch 'feature_natural_indexing' of https://github.com/AlexVlx/HIP
...
[ROCm/clr commit: a43262e699 ]
2017-11-19 15:25:17 -06:00
Ben Sander
6b1fb439f6
Temporarily disable P2P on nvidia (fails on dual GPU)
...
[ROCm/clr commit: fc34fd6f03 ]
2017-11-19 15:21:37 -06:00
Rahul Garg
7453c8b25c
Update hipModuleGetTexRef API
...
[ROCm/clr commit: 76f174b536 ]
2017-11-19 22:10:46 +05:30
Alex Voicu
af0ae5421c
This implements the trivial change needed to move back from the hip{Something}_{x, y, z} macros to the natural CUDA syntax of Something.{x, y, z}. This is contained in lines 384-404 in hip_runtime.h. All of the other changes have to do with changing unit tests to use this syntax. The macros are retained for backwards compatibility.
...
[ROCm/clr commit: 0cc921f103 ]
2017-11-19 01:54:12 +00:00
Rahul Garg
b0461a2176
Fixed test case for GFX900
...
[ROCm/clr commit: cfc9f87465 ]
2017-11-16 09:34:52 +05:30
Rahul Garg
c3ba309562
Fixed half2 issue on gfx900
...
[ROCm/clr commit: 20947f80fb ]
2017-11-15 18:52:59 +05:30
Rahul Garg
0f8296b0c2
Removed redundant desc variable
...
[ROCm/clr commit: e2e499461a ]
2017-11-15 18:28:27 +05:30
Rahul Garg
a0f838a75e
-Fixed texture driver API sample
...
-Added hipTexRefSetAddress and hipTexRefSetAddress2D APIs
[ROCm/clr commit: edbfeeb141 ]
2017-11-15 18:23:28 +05:30
Phaneendr-kumar Lanka
3bfe30e22c
[nvccTests]Enabled inline_asm_vadd on nvcc
...
[ROCm/clr commit: 9fa3ed4f6f ]
2017-11-14 16:37:59 +05:30
Rahul Garg
c154921e10
Texture code reorganized
...
[ROCm/clr commit: e536eb038d ]
2017-11-14 11:09:35 +05:30
Pierre
75a1e24519
Fix missing MARKER_END
...
Logging status of hipCtxSynchronize was missing
Test if hip profiling is active for MARKER_END in ihipPostLaunchKernel
Add MARKER_END after the completion of a kernel launched through
the "grid launch"
[ROCm/clr commit: e60a95d7dd ]
2017-11-13 16:13:19 -05:00
Chris Kitching
c3dc74be1b
Add an explicit check for proper rewriting of CUDA includes
...
[ROCm/clr commit: 642af94492 ]
2017-11-13 21:02:42 +00:00
Chris Kitching
98034c4e3f
Add a preprocessor conditional to one of the tests
...
Hurrah, we can cope with ifdefs now (except for kernel launches)
[ROCm/clr commit: e8c1e7bb11 ]
2017-11-13 20:58:55 +00:00
Chris Kitching
83e450e2c7
Use proper clang diagnostics for printing warnings
...
Much pretty. Very wow
This gives users all the usual power when it comes to manipulating
clang diagnostics. People can pass -Werror can have hipify fail if
it doesn't completely translate a file, for example. Much nicer
than reinventing the wheel.
[ROCm/clr commit: 506deca07f ]
2017-11-13 20:58:55 +00:00
Chris Kitching
842c8c28d7
Use a custom FrontendAction to simplify identifier translation
...
Most of what hipify does is really just replacing CUDA idenitifers
with HIP ones. CUDA function calls, preprocessor macro calls,
enum references, types, etc.
This is problematic: calls/types/enum-refs require name resolution
for the AST matcher to work. This fails in the presence of code
deleted by the preprocessor, and in two-pass template compilation.
Instead, we can simply hook the lexer and have it rewrite the
identifiers for us.
This approach means identifier transformations will work correctly
regardless of where they appear (and we get to delete lots of code)
- Fixes #260
- Helps a bit with #207 - it will still fail to translate kernel
calls in preprocessor-ignored code, but everything except kerel
launches should translate correctly now, even in
preprocessor-deleted code.
[ROCm/clr commit: 7a441aef10 ]
2017-11-13 20:58:54 +00:00
Chris Kitching
e00c92d828
Add hipify mappings for all CUDA headers that have HIP equivalents
...
I'm particularly running into issues with `device_types.h` in real
CUDA code...
[ROCm/clr commit: 9014c89f04 ]
2017-11-13 17:20:07 +00:00
Chris Kitching
41d7a05974
Add a test that exposes #260
...
[ROCm/clr commit: 0afc6b2e95 ]
2017-11-13 16:18:15 +00:00
Chris Kitching
5ced4544eb
Add a couple of missing CHECK directives to concurrentKernels.cu
...
[ROCm/clr commit: 4353fc06d7 ]
2017-11-13 16:17:19 +00:00
Maneesh Gupta
a857b650ad
Merge pull request #261 from gargrahul/fix_module_api_sample
...
Fix module_api sample
[ROCm/clr commit: 7f3653af7a ]
2017-11-13 11:55:54 +05:30
Rahul Garg
4ff4a71867
Fix module_api sample
...
[ROCm/clr commit: e434a23be5 ]
2017-11-13 08:56:39 +05:30
Rahul Garg
8069290ca1
Added texture 2D driver API usage example
...
[ROCm/clr commit: 3711136e46 ]
2017-11-09 22:35:29 +05:30
Rahul Garg
bb8c31c198
Texture driver APIs support
...
[ROCm/clr commit: 1851c153f6 ]
2017-11-09 22:10:55 +05:30
Maneesh Gupta
0b86003f38
Merge pull request #250 from AlexVlx/feature_add_agent_global_support
...
Support for agent globals
[ROCm/clr commit: 925f97fac7 ]
2017-11-09 07:52:09 +05:30
Maneesh Gupta
907a265bc1
Merge pull request #255 from AlexVlx/remove_some_trailing_whitespace
...
Clean up trailing whitespace so as to reduce noise in #246 .
[ROCm/clr commit: 6f1298d9f6 ]
2017-11-08 10:29:19 +05:30
Maneesh Gupta
9fe4f363ef
Merge pull request #253 from mangupta/hit_add_cmakecmd
...
[hit] Add support for cmake commands as part of test infra
[ROCm/clr commit: 86f5934beb ]
2017-11-08 09:50:29 +05:30
Alex Voicu
142213a52b
Clean up trailing whitespace so as to reduce noise in #246 .
...
[ROCm/clr commit: 0ad1308efd ]
2017-11-08 00:08:55 +00:00
Ben Sander
6f7992387b
Merge pull request #254 from bensander/event_thread_safe
...
Event thread safe
[ROCm/clr commit: 7b534885e6 ]
2017-11-07 17:59:51 +01:00
Maneesh Gupta
15a6a84a81
Merge pull request #252 from phani544/newTests2
...
More tests for memory,stream & error APIs
[ROCm/clr commit: 52c8c4b03c ]
2017-11-07 10:38:18 +05:30
Phaneendr-kumar Lanka
fc1af184a4
identation change in hipPeekAtLastError
...
[ROCm/clr commit: 2a1d7db371 ]
2017-11-07 09:51:32 +05:30
Ben Sander
2c95d48f87
Check for null event in hipEventElapsedTime
...
[ROCm/clr commit: e71456cc6c ]
2017-11-06 23:49:31 +00:00
Ben Sander
f761859993
hipStreamWaitEvent returns success if event created but not recorded
...
[ROCm/clr commit: b5911acce2 ]
2017-11-06 23:49:31 +00:00
Ben Sander
379b3fc7a0
Make hipEvent_t thread safe.
...
Support re-recording of same event by different threads.
- Add criticalData structure to hipEvent_t, similar to mechanism used
for streams, contexts, device. Events are always locked
after streams to avoid deadlock.
- ihipEvent_t::locked_copyCrit can be used to copy critical state
including marker. The critical state in the event can then
be re-recorded.
- refactor hipEventElapsedTime. Remmove stale debug code, native signal
refs.
[ROCm/clr commit: 955cfbfdc7 ]
2017-11-06 23:49:25 +00:00
Maneesh Gupta
f00dd1ad73
[hit] Add support for cmake commands as part of test infra
...
Cmake supports several builtin command-line tools. These tools can
now be specified as part of the HIT block. These commands will be
run during cmake configure step.
Change-Id: I32466c94b2fe1ecdc30249755fc027102295617d
[ROCm/clr commit: f92648e447 ]
2017-11-06 16:53:57 +05:30
Phaneendr-kumar Lanka
b36eb26ef0
Modified tests as per review
...
[ROCm/clr commit: e18f4239b6 ]
2017-11-06 15:13:36 +05:30