Aaron Enye Shi
2ae4e21bb9
Fix mixed dot for Jenkins rocm_head
...
[ROCm/hip commit: f31aa946e7 ]
2019-01-23 16:39:25 +00:00
Aaron Enye Shi
b1d61f076f
Add tests for dot functions
...
[ROCm/hip commit: 1d70f4217e ]
2019-01-22 22:32:28 +00:00
Aaron Enye Shi
7e22b05339
Use correct OCKL native vector types
...
[ROCm/hip commit: 712a4cbdc2 ]
2019-01-22 22:31:19 +00:00
Aaron Enye Shi
67d0de2dba
Allow hip-clang to use amd_mixed_dot as well
...
[ROCm/hip commit: 8bf588bdb1 ]
2019-01-22 16:21:16 +00:00
Aaron Enye Shi
cb5b6f92bb
Add backwards compatibility to dot funcs
...
[ROCm/hip commit: 22f7570774 ]
2019-01-14 23:14:49 +00:00
Aaron Enye Shi
914b23b6ac
Move fp16 dot functions to hip_fp16
...
[ROCm/hip commit: f0b114ee1c ]
2019-01-14 21:50:32 +00:00
Aaron Enye Shi
d36be16360
Add Dot functions as amd_mixed_dot function
...
Introduce the dot functions that are available in the device library. Forward their prototypes and add a HIP API that exposes them.
[ROCm/hip commit: aae1f4f96e ]
2019-01-11 23:31:02 +00:00
Evgeny Mankov
e3a888f672
Merge pull request #861 from emankov/master
...
[HIPIFY][cmake] Add header files to the project explicitly
[ROCm/hip commit: ebf6a8b8fe ]
2019-01-11 16:45:32 +03:00
Maneesh Gupta
bbd8cadd00
Merge pull request #858 from lmoriche/Windows
...
Fix compilation issues with MSVC
[ROCm/hip commit: eb618d2140 ]
2019-01-11 12:36:07 +05:30
Maneesh Gupta
b45debc7ba
Merge pull request #857 from ROCm-Developer-Tools/feature_ensure_lazy_init_initialises
...
Ensure that static structures are populated
[ROCm/hip commit: 5826ed04c1 ]
2019-01-11 12:35:52 +05:30
Maneesh Gupta
7eaa983304
Merge pull request #859 from LucasCampos/README-typo
...
Fix typo in README.md
[ROCm/hip commit: b186c5d69d ]
2019-01-11 07:54:19 +05:30
Evgeny Mankov
1e63082cba
[HIPIFY][cmake] Add header files to the project explicitly
...
[ROCm/hip commit: bd39d852f5 ]
2019-01-10 19:45:43 +03:00
Evgeny Mankov
0d211ebeb8
Merge pull request #860 from emankov/master
...
[HIPIFY] Introduce '--' option - a separator between hipify-clang and…
[ROCm/hip commit: 8990ddd1a7 ]
2019-01-10 19:18:04 +03:00
Evgeny Mankov
3792d264c0
[HIPIFY] Introduce '--' option - a separator between hipify-clang and clang options
...
+ Append '--' to the end of the hipify-clang options if it is absent.
[Reason]
+ Spare the user from having to specify a JSON compilation database.
Without the separator, the following error is reported:
Error while trying to load a compilation database:
Could not auto-detect compilation database for file "test.cu"
Hipification is still performed correctly without a JSON database, despite the error.
[ROCm/hip commit: aa4c59ae0c ]
2019-01-10 18:27:30 +03:00
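The behaviour described in the commit above (append '--' when the user did not supply it) can be sketched as follows. This is a minimal illustration, not hipify-clang's actual code: the helper name `ensure_separator` and the use of a `std::vector<std::string>` argument list are assumptions made for the example. Clang-based tools treat everything after '--' as compiler options, so they stop searching for a compilation database once the separator is present.

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Hypothetical helper (not taken from hipify-clang): returns a copy of the
// argument list that is guaranteed to contain the "--" separator.
std::vector<std::string> ensure_separator(std::vector<std::string> args) {
    const std::string separator = "--";
    const bool has_separator =
        std::find(args.begin(), args.end(), separator) != args.end();
    if (!has_separator) {
        // Nothing follows the separator, so no extra clang options are added.
        args.push_back(separator);
    }
    return args;
}
```

An argument list that already contains '--' is returned unchanged, so options the user placed after the separator are preserved.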
Evgeny Mankov
36d1e34641
Merge pull request #856 from emankov/master
...
[HIPIFY] Introduce CUDA installation path option '--cuda-path'
[ROCm/hip commit: fe4d1d7f2c ]
2019-01-10 18:13:56 +03:00
Evgeny Mankov
b932304bfc
[HIPIFY] The cuda-path option may be used with '--' as well as with a single dash '-'
...
+ Update README.md
[ROCm/hip commit: 0206fadc14 ]
2019-01-10 18:01:33 +03:00
Lucas Campos
2642ab4291
Fix typo in README.md
...
[ROCm/hip commit: 9b32663b0e ]
2019-01-10 09:45:23 +01:00
Laurent Morichetti
0597dc63df
Fix compilation issues with MSVC
...
[ROCm/hip commit: c9f552968d ]
2019-01-09 09:32:50 -08:00
Alex Voicu
e2911291dd
Ensure that static structures are populated
...
[ROCm/hip commit: 0b57f50855 ]
2019-01-09 17:21:53 +00:00
Evgeny Mankov
6acede94f5
[HIPIFY] Introduce CUDA installation path option '-cuda-path'
...
Mirrors clang's '--cuda-path' option.
[Reason]
When no other clang options are given, setting '-cuda-path' removes the need to specify the separator '--' before clang's '--cuda-path'.
+ Tests and scripts are updated accordingly.
[ROCm/hip commit: fbfe005e4e ]
2019-01-09 20:18:36 +03:00
Evgeny Mankov
ea0ed81ca7
Merge pull request #855 from emankov/master
...
[HIPIFY][cmake][win] Add cmake versioning depending on Visual Studio version
[ROCm/hip commit: 0e750d727f ]
2019-01-09 18:29:01 +03:00
Evgeny Mankov
9314f882f3
[HIPIFY][cmake][win] Add cmake versioning depending on Visual Studio version
...
[ROCm/hip commit: 96f8bd7e2b ]
2019-01-09 16:44:12 +03:00
Evgeny Mankov
5fcf040c3d
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP
...
[ROCm/hip commit: cf7addbc73 ]
2019-01-09 16:39:48 +03:00
Evgeny Mankov
1c0b4b6c13
Merge pull request #854 from emankov/master
...
[HIPIFY][cmake] Add cmake versioning depending on CUDA version
[ROCm/hip commit: fb0877564e ]
2019-01-09 16:39:30 +03:00
Evgeny Mankov
643f84476c
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP
...
[ROCm/hip commit: f5effc0d23 ]
2019-01-09 14:03:40 +03:00
Evgeny Mankov
e816f3e98a
[HIPIFY][cmake] Add cmake versioning depending on CUDA version
...
[ROCm/hip commit: 8fb8ca02c8 ]
2019-01-09 13:59:48 +03:00
Maneesh Gupta
ec245b61e9
Merge pull request #853 from ROCm-Developer-Tools/hip-module-hash-fix
...
Introduce hash key to HIP module implementation
[ROCm/hip commit: 36039d6765 ]
2019-01-09 16:28:50 +05:30
Wen-Heng (Jack) Chung
2f9021afd3
Introduce hash key to HIP module implementation
...
A hash calculated with the FNV-1a algorithm is introduced in ihipModule_t, the
internal representation of hipModule_t. The hash is used by HIP module APIs such as
- read_agent_global_from_module
to determine whether the agent-scope globals for a module have already been iterated.
This commit fixes an issue where applications that load and unload modules
frequently would occasionally fail. Investigation showed that the old
implementation of read_agent_global_from_module used hipModule_t as the key,
which is not robust enough: hipModule_t instances are allocated dynamically, so
one memory address may be reused by multiple hipModule_t instances. The fix is
to introduce a uniquely identifiable hash for the code object associated with
the HIP module. That is the rationale behind this commit.
[ROCm/hip commit: b4d658a48f ]
2019-01-08 17:33:40 +00:00
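To illustrate the idea in the commit above, here is a minimal sketch of FNV-1a used as a content-based key. The 64-bit variant, the `SeenCodeObjects` class, and the use of `std::string` to hold the code object bytes are all assumptions made for the example; the commit does not state the hash width or show how ihipModule_t stores the value.

```cpp
#include <cstdint>
#include <string>
#include <unordered_set>

// 64-bit FNV-1a: XOR each byte into the hash, then multiply by the FNV prime.
std::uint64_t fnv1a_64(const std::string& bytes) {
    std::uint64_t hash = 14695981039346656037ULL;  // FNV-1a 64-bit offset basis
    for (unsigned char byte : bytes) {
        hash ^= byte;
        hash *= 1099511628211ULL;  // FNV 64-bit prime
    }
    return hash;
}

// Hypothetical tracker (not HIP's implementation): remembers which code
// objects have been processed, keyed by content hash rather than by the
// address of a dynamically allocated module object.
class SeenCodeObjects {
public:
    // Returns true the first time a given code object content is seen.
    bool mark_seen(const std::string& code_object) {
        return seen_.insert(fnv1a_64(code_object)).second;
    }

private:
    std::unordered_set<std::uint64_t> seen_;
};
```

Keying on the content hash means that a freed module whose address is later reused by a different module no longer looks like one that was already processed.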
Maneesh Gupta
685b3b2167
Merge pull request #845 from mangupta/cleanup-install-markdown
...
Update INSTALL.md
[ROCm/hip commit: 7637504776 ]
2019-01-07 10:04:36 +05:30
Maneesh Gupta
419f970c01
Merge pull request #701 from ssahasra/stream-tests
...
introduce tests for asynchronous streams
[ROCm/hip commit: 87d0674a1a ]
2019-01-07 10:04:29 +05:30
Maneesh Gupta
cf7c7f2db2
Update hipAsynchronousStreams.cpp
...
This test needs to be excluded on the nvcc platform.
[ROCm/hip commit: 580c808400 ]
2019-01-07 09:26:21 +05:30
Evgeny Mankov
16751e3961
Merge pull request #852 from emankov/master
...
[HIPIFY][fix][#851] make_absolute only in case of relative path
[ROCm/hip commit: b060b5295b ]
2019-01-06 18:25:05 +03:00
emankov
57b553324b
[HIPIFY][fix][#851] make_absolute only in case of relative path
...
Otherwise return the input path.
[ROCm/hip commit: bff9e1ebe2 ]
2019-01-06 17:58:37 +03:00
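The rule in the commit above (make a path absolute only when it is relative, otherwise return the input) can be sketched as below. This is an illustration under assumptions: it uses C++17 `std::filesystem` as a stand-in rather than whatever path utilities hipify-clang itself relies on, and the function name `absolute_if_relative` is hypothetical.

```cpp
#include <filesystem>

namespace fs = std::filesystem;

// Hypothetical helper (not hipify-clang's code): absolute paths are returned
// untouched; relative paths are resolved against the current directory.
fs::path absolute_if_relative(const fs::path& input) {
    if (input.is_absolute()) {
        return input;  // Otherwise return the input path, as the commit says.
    }
    return fs::absolute(input);
}
```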
Evgeny Mankov
78fa7b9f9d
Merge pull request #850 from emankov/master
...
[HIPIFY][fix][#849] Create temporary file based on source file name only
[ROCm/hip commit: 3fc5b8b16e ]
2019-01-05 15:16:14 +03:00
emankov
60adc94cad
[HIPIFY][fix][#849] Create temporary file based on source file name only
...
[ROCm/hip commit: 4a475a9db6 ]
2019-01-05 14:52:15 +03:00
Evgeny Mankov
1b84b2ef4a
Merge pull request #848 from emankov/master
...
[HIPIFY][cmake] Change min supported version of cmake to 3.12.3
[ROCm/hip commit: 2dbac57f51 ]
2019-01-04 19:12:14 +03:00
emankov
3fa069c87d
[HIPIFY][cmake] Change min supported version of cmake to 3.12.3
...
[Reason]
CMake has supported CUDA 10 since version 3.12.3.
[ROCm/hip commit: 8a54ef365b ]
2019-01-04 19:10:11 +03:00
Evgeny Mankov
010fafd6cd
Merge pull request #847 from emankov/master
...
[HIPIFY][cmake][win] check for Visual Studio version
[ROCm/hip commit: 394a3fcc52 ]
2019-01-04 18:57:39 +03:00
emankov
e371a45400
[HIPIFY][cmake][win] check for Visual Studio version
...
hipify-clang can be built with Visual Studio 14 2015 or higher.
[Reason]
hipify-clang uses C++14 features that are not supported even by the latest VS 2013 or by the Platform Toolset "Visual C++ Compiler Nov 2013 CTP".
[ROCm/hip commit: cd15a85663 ]
2019-01-04 18:52:28 +03:00
Sameer Sahasrabuddhe
d1a7eb0cb5
introduce test for asynchronous streams
...
This test is identical to the HCC tests under
tests/Unit/OCKL/AsyncStreams, but only for synchronous operation.
[ROCm/hip commit: 8fe2a4b167 ]
2019-01-04 05:51:50 -05:00
Maneesh Gupta
98b7c724dc
Update INSTALL.md
...
- Improved install instructions for AMD-hcc and NVIDIA-nvcc.
- Removed redundant section related to HCC-Native-GCN-ISA.
Change-Id: I33a9e71d95df51dbdcade4a48834840146bdbe6f
[ROCm/hip commit: b8c78f90a6 ]
2019-01-04 11:04:47 +05:30
Evgeny Mankov
8a339547d6
Merge pull request #844 from emankov/master
...
[HIPIFY][tests] CUDA 7.0 is also supported
[ROCm/hip commit: 1d30bf5ccc ]
2019-01-04 01:34:14 +03:00
emankov
74f77c9f29
[HIPIFY][tests] CUDA 7.0 is also supported
...
+ Exclude 1 test for CUDA 7.0
+ Update README.md
[ROCm/hip commit: 9abcad9407 ]
2019-01-04 01:32:30 +03:00
Evgeny Mankov
497d6c37c9
Merge pull request #843 from emankov/master
...
[HIPIFY][tests] Exclude CUDA 8.0 (9.0) tests if CUDA < 8.0 (9.0)
[ROCm/hip commit: d6f3af9a4f ]
2019-01-03 18:08:48 +03:00
emankov
15949403b4
[HIPIFY][tests] Exclude CUDA 8.0 (9.0) tests if CUDA < 8.0 (9.0)
...
[ROCm/hip commit: 65155c7159 ]
2019-01-03 18:06:49 +03:00
Evgeny Mankov
6622aad859
Merge pull request #842 from emankov/master
...
[HIPIFY][tests] Reduce cudaRegister.cu test
[ROCm/hip commit: fbee9932c7 ]
2019-01-03 17:19:11 +03:00
emankov
8883153c05
[HIPIFY][tests] Reduce cudaRegister.cu test
...
[ROCm/hip commit: 7fb0f4ee2a ]
2019-01-03 17:17:21 +03:00
Maneesh Gupta
72fd96c3da
Merge pull request #841 from ROCm-Developer-Tools/rebase-pr827-to-master
...
Fix issues with prior iteration of 731
[ROCm/hip commit: 8c5e018e16 ]
2019-01-01 20:22:57 +05:30
Alex Voicu
587745b841
Hook into the creaky lazy-reinit machinery. Try to minimise race-risk.
...
[ROCm/hip commit: ec14daa7ce ]
2019-01-01 11:01:16 +05:30
Alex Voicu
e127990e23
More blobs, more problems.
...
[ROCm/hip commit: 340674ceb6 ]
2019-01-01 11:01:07 +05:30