Wykres commitów

1999 Commity

Autor SHA1 Wiadomość Data
Evgeny Mankov aead033e41 Merge pull request #235 from ChrisKitching/preprocessorEnhancements
[HIPIFY] Handle unconditional preprocessor directives far better

[ROCm/hip commit: a865ebfe10]
2017-10-27 21:21:20 +03:00
Ben Sander ad9a636b90 Merge pull request #198 from AlexVlx/feature_support_globals_for_module_api
Feature support globals for module api

[ROCm/hip commit: f288f24e95]
2017-10-27 01:53:34 +02:00
Ben Sander 37bd264cd4 Merge pull request #218 from ChrisKitching/nodiscard
Add [[nodiscard]] attribute to hipError_t in C++17 mode

[ROCm/hip commit: e97f675397]
2017-10-26 22:48:54 +02:00
Ben Sander c8fc8122b9 Merge pull request #223 from bensander/2x_bidir
Use 2X for bidir memory bandwidth calc

[ROCm/hip commit: 772fe865fc]
2017-10-26 21:49:06 +02:00
Chris Kitching c15f6bdf5c Greatly enhance handling of macros in kernel launches
All but the most contrived use of macros is now properly handled -
have a look at the new testcases this commit adds. You can have
macros in kernel calls, macros spanning chunks of your arguments,
the call, call parameters, or callee can all be macros or
partially macros.


[ROCm/hip commit: 094b2b9b05]
2017-10-26 17:28:46 +01:00
Chris Kitching d0acfd5bde Simplify how kernel launch expressions get translated
It seems like there was a lot of machinery here that is no longer
needed now we have hipLaunchKernelGGL (which doesn't require us
to insert an extra argument into kernel functions). We no longer
need to waste cycles scanning the AST for callees.

We can literally just do "Take the callee expression, and dump
it into the first argument of hipLaunchKernelGGL()".


[ROCm/hip commit: eff86d975b]
2017-10-26 17:28:30 +01:00
Chris Kitching 59713c2459 Deduplicate preprocessor code
There's three functions here that all do the same thing...

There was also logic that looks for numeric literals and works
backwards to find the macro name from which they are expanded.
I previously introduced code that rewrites macro references at
expand-time in the `MacroExpands` callback, so that code is no
longer doing anything useful.


[ROCm/hip commit: fd911e1839]
2017-10-26 17:28:30 +01:00
Chris Kitching ca913bb196 Rewrite _all_ CUDA macro identifiers in the preprocessor
Calls to macros that were themselves CUDA API calls were often
being missed - this applies the identifier transform to macro
names at the callsites, too.


[ROCm/hip commit: d1e26b2e7e]
2017-10-26 17:27:56 +01:00
Chris Kitching cbf786a8fd Don't special-case source locations for calls in macros
The source location for a call that's inside a macro body will,
by default, point into the macro definition itself. The original
logic was causing macro invocations to be overwritten, as I
explain here:
https://github.com/ROCm-Developer-Tools/HIP/issues/207#issuecomment-337521851

The existing PPCallbacks code is correctly rewriting macro
definitions, so the practical effect of this change is that AST
rewrites on code that's expanded from macros are no-ops.

It might be a performance optimisation to put a short-circiut at
the top of the AST callbacks to abort when faced with code that
was expanded from macros.

It might yet prove wise to do absolutely everything at lex-time...


[ROCm/hip commit: 4a794ed8c0]
2017-10-26 17:26:37 +01:00
Chris Kitching 32cbe68a93 Prefer early-return to deep nesting
A chain of 7 closing braces is never a great sign :D

In the process it became apparant that the unsupported flag
was being silently ignored, causing users to be left with cuda
API calls in their programs with no warning given. This has been
rectified for consistency.


[ROCm/hip commit: 35a892bc77]
2017-10-26 17:26:37 +01:00
Evgeny Mankov 99d43e1e6a Merge pull request #213 from ChrisKitching/simplify
[HIPIFY] Simplify (and accelerate) hipification of CUDA type identifiers

[ROCm/hip commit: 3bacb69e20]
2017-10-26 19:17:01 +03:00
Ben Sander 79126196ce Merge pull request #232 from kknox/update-cuda-version
Update container to newer cuda driver and sdk 9.0

[ROCm/hip commit: c0ef6d7b62]
2017-10-25 23:45:24 +02:00
Kent Knox f6b4db7dd5 Update container to newer cuda driver and sdk 9.0
[ROCm/hip commit: 160d92af20]
2017-10-25 16:14:32 -05:00
Chris Kitching 641f7ed8f1 Don't use now-defunct cmake variable in lit test config
[ROCm/hip commit: 59071b895e]
2017-10-24 20:52:51 +01:00
Chris Kitching 5092f3f184 Refactor cudaCall to prefer early return to deep nesting
Sorry for the invasive refactor, but this was making reasoning
about this function more difficult.


[ROCm/hip commit: 778d6827f9]
2017-10-24 20:38:49 +01:00
Chris Kitching 22e7c4ebfc Split the giant lookup table into 3 smaller ones
Instead of having a single, enormous LUT for all CUDA names, let's
have separate ones for different types of entity. We often know
that we're looking at a typename, or a function name, or a macro
name - so we can be more efficient (and resilient to name
collisions) by having smaller lookup tables for each of those
classes of entity).

Here we start that off by having three LUTs:
- Header names
- Type names
- Everything else

Future work could usefully split "everything else" into:
- enum values
- macro names
- function names
- everything else

It's worth noting that the "needs new matcher" todos I delete here
were actually resolved with the previous commit. It no longer
naively searches for things that start with "cu*" - it will find
exactly those things that are present in our lookup tables.


[ROCm/hip commit: 695a1eb059]
2017-10-24 20:38:49 +01:00
Chris Kitching 1cf0c75c49 One matcher for type expressions to rule them all
Previously, there were different AST matchers for each
language construct that contains a type reference, and custom
logic to perform the transformation within each of those
structures.

Since the transformation in all such cases was only replacing
CUDA types with hip ones, we can instead use an AST matcher
that finds and updates the type references directly.
This simplifies the program considerably, and it won't fail
when it finds a language feature (or complicated type expression)
that nobody wrote custom logic for yet.


[ROCm/hip commit: a1d8340314]
2017-10-24 20:38:49 +01:00
Chris Kitching 0c0ca79c1c Make control flow less insane
`while(false)` is certainly a bold choice.


[ROCm/hip commit: 182d346356]
2017-10-24 20:38:49 +01:00
Chris Kitching aa9fa15bdf Use a cmake glob for collecting hipify sources
Should make breaking this monstrosity into multiple files a bit
easier...


[ROCm/hip commit: be34a017ef]
2017-10-24 20:38:48 +01:00
Chris Kitching 9eff10c00d Move giant lookup table into another translation unit
Also, rewrote it as a constant variable instead of a function
that imperatively fills a map. It's shorter, faster the compile,
and (depending on how badly the compiler screws it up) maybe
faster to run.

And, of course, it starts breaking up that giant .cpp file.


[ROCm/hip commit: 4ab29113a1]
2017-10-24 20:38:25 +01:00
Evgeny Mankov 90c72a6bd5 Merge pull request #230 from emankov/master
[HIPIFY][fix] cmake: NO_DEFAULT_PATH is strongly needed in find_package for LLVM

[ROCm/hip commit: fea0b6ef2f]
2017-10-24 20:44:56 +03:00
Evgeny Mankov 0b94701dac Merge pull request #224 from ChrisKitching/tests
[HIPIFY] Make the automated tests more useful

[ROCm/hip commit: 15c3030d95]
2017-10-24 20:12:36 +03:00
Evgeny Mankov 713c4ae3f4 Merge branch 'master' into tests
[ROCm/hip commit: 3f4d435d31]
2017-10-24 20:03:51 +03:00
Evgeny Mankov 558682ae64 Merge pull request #227 from ChrisKitching/clang-silly
Tweak some version numbers in clang version compatibility checks

[ROCm/hip commit: 061fe7a192]
2017-10-24 17:16:07 +03:00
Evgeny Mankov 31b3c14f05 [HIPIFY][fix] cmake: NO_DEFAULT_PATH is strongly needed in find_package for LLVM
Otherwise LLVM will be searched in system folders.


[ROCm/hip commit: 07a6d3ec06]
2017-10-24 16:35:10 +03:00
Rahul Garg bdbc4ce11c Example showing globals use with module APIs
[ROCm/hip commit: d638cd4fc1]
2017-10-24 18:12:25 +05:30
Maneesh Gupta ccb91e68d7 Merge pull request #225 from emankov/master
[HIPIFY] cmake: fix standalone build

[ROCm/hip commit: 5b6c5ca55e]
2017-10-24 17:06:51 +05:30
Evgeny Mankov 6a8e6693ad [HIPIFY][fix] cmake: do not build hipify-clang if not asked
+ warn "hipify-clang will not be built" if HIPIFY_CLANG_LLVM_DIR is not specified.
+ fix typo in previous commit .


[ROCm/hip commit: 5a9d4e3435]
2017-10-24 14:16:05 +03:00
emankov 83014ac728 [HIPIFY] cmake: simplify build
[ROCm/hip commit: 062127c5ea]
2017-10-24 10:51:11 +03:00
Chris Kitching 71cc7740ff Tweak some version numbers in clang version compatibility checks
Apparently a couple of those APIs changed in clang 5, not 4.

Drat.


[ROCm/hip commit: 0207a29f78]
2017-10-24 01:45:23 +01:00
Evgeny Mankov 241e72b02f [HIPIFY] cmake: fix standalone build
[ROCm/hip commit: ffd22ec756]
2017-10-23 21:16:13 +03:00
Rahul Garg 6a5da9f665 Use 2X for bidir p2p memory bandwidth calc
[ROCm/hip commit: 08c37f3296]
2017-10-23 21:57:20 +05:30
Chris Kitching cb581e6b4e Add concurentKernels.cu to the testsuite
[ROCm/hip commit: ac42fa53ce]
2017-10-23 13:39:37 +01:00
Chris Kitching 44946f5b7a Add the CUDA samples include dir to the path for tests
Means we get to easily steal CUDA examples for tests


[ROCm/hip commit: 71aa7e267a]
2017-10-23 13:39:37 +01:00
Chris Kitching 8144287158 Add cudaRegister.cu lit test
[ROCm/hip commit: 33f88fe7a1]
2017-10-23 13:39:37 +01:00
Chris Kitching ed3ed08e68 Add square.cu to lit testsuite
[ROCm/hip commit: 2faf2800a0]
2017-10-23 13:39:37 +01:00
Chris Kitching e32e9ffa6f Introduce a test runner script to simplify invocation
... And to use a standard, highly amusing trick for making
coloured output work.


[ROCm/hip commit: 80120aed22]
2017-10-23 13:39:37 +01:00
Chris Kitching 90cda6a00a Adapt lit test for the hipLaunchKernelGGL changes from before...
[ROCm/hip commit: 711f495041]
2017-10-23 13:39:37 +01:00
Chris Kitching 641f09e39d Migrate lit test to using FileCheck, so failures are readable
It seems the test is already broken, but look how awesome the
error message is now:

/home/chris/HIP/tests/hipify-clang/axpy.cu:31:12: error: expected string not found in input
 // CHECK: hipLaunchKernel(HIP_KERNEL_NAME(axpy), dim3(1), dim3(kDataLen), 0, 0, a, device_x, device_y);
           ^
<stdin>:31:2: note: scanning from here
 //
 ^
<stdin>:33:2: note: possible intended match here
 hipLaunchKernelGGL(axpy, dim3(1), dim3(kDataLen), 0, 0, a, device_x, device_y);
 ^


[ROCm/hip commit: 66cc45afc5]
2017-10-23 13:39:37 +01:00
Chris Kitching 866f3a06b9 Look for FileCheck for running lit tests, too
Use of grep in `lit` RUN lines is deprecated:
https://llvm.org/docs/TestingGuide.html#writing-new-regression-tests

Using grep leads to really unhelpful failure output (it literally
just says "the test failed"). FileCheck is much more helpful, and
distributed with LLVM on most distros anyway, so this extra
dependency shouldn't prove problematic.


[ROCm/hip commit: 640afd4186]
2017-10-23 13:39:36 +01:00
Chris Kitching ddcdfa839c Propagate the CUDA toolkit directory into the lit tests
Allows the tests to actually run... :D


[ROCm/hip commit: efa814e381]
2017-10-23 13:39:36 +01:00
Chris Kitching 971b55301c Remove dependency on nonexistent lit target
This just... isn't a target. Maybe you were trying to do a file
dependency? But that's handled implicitly by the find_program
call anyway, so all this is doing is generating a cmake warning


[ROCm/hip commit: 76ccda4205]
2017-10-23 13:39:36 +01:00
Maneesh Gupta f52f6e53bb Make elfio headers private
Change-Id: I3ba174bb46e84a75380207d93a0da6fe3703689e


[ROCm/hip commit: b792f9f507]
2017-10-23 10:24:36 +05:30
Ben Sander f47c40d2ff Use 2X for bidir memory bandwidth calc
[ROCm/hip commit: 0d1ad06458]
2017-10-21 07:47:32 -05:00
Evgeny Mankov d79f1b4747 Merge pull request #219 from ChrisKitching/newClang
[HIPIFY] Migrate hipify-clang to a newer version of clang

[ROCm/hip commit: d1ba399eb1]
2017-10-21 11:59:46 +03:00
Ben Sander 584fb96850 Merge pull request #214 from scchan/reinit_printf_buffer
hipDeviceReset(): make sure to reinitialize the printf buffer in hcc RT

[ROCm/hip commit: 0e3d824e8d]
2017-10-20 11:04:37 -05:00
Chris Kitching ecbe19828e Avoid a double-free of HipifyPPCallbacks instance
This bug was present all along, but something changed in the order
of de-initialisation performed by llvm that makes it actually
crash now.

The constructor of HipifyPPCallbacks gives:
```
std::unique_ptr<HipifyPPCallbacks>(this)
```
to the LLVM Preprocessor instance. The Preprocessor instance
subsequently frees the HipifyPPCallbacks, which is then freed
again when we leave the stack frame at line 4340.

So: let's leak the HipifyPPCallbacks onto the heap, and leave
the LLVM Preprocessor object responsible for tidying it up.


[ROCm/hip commit: f5b273fc4f]
2017-10-19 17:09:32 +01:00
Chris Kitching dc869ef474 Be agnostic to the new getReplacements() API
See comment


[ROCm/hip commit: 6718519025]
2017-10-19 17:08:55 +01:00
Chris Kitching c08f105982 Use inline initialisers to set default field values
A trivial cleanup that helps in a moment..


[ROCm/hip commit: 24fc459f69]
2017-10-19 17:08:55 +01:00
Chris Kitching 2584387bfc Omit now-removed Filename string arg from handleBeginSource
[ROCm/hip commit: 7ae6a10c99]
2017-10-19 17:08:55 +01:00