Maneesh Gupta
95fe4b770f
Merge pull request #863 from aaronenyeshi/add-dot-instructions
...
Add Dot functions as amd_mixed_dot function
2019-01-29 06:01:01 +05:30
Evgeny Mankov
4519acd459
Merge pull request #883 from emankov/master
...
[HIPIFY] CUDA Runtime API functions total revise (up to CUDA 10.0)
2019-01-25 19:17:15 +03:00
Aaron Enye Shi
2ab6fd374c
Fix mixed dot for Jenkins rocm_2_0
2019-01-24 22:48:35 +00:00
Aaron Enye Shi
77de87d0cd
Exclude mixed dot functions from nvcc path
2019-01-24 21:00:45 +00:00
Evgeny Mankov
229b34fc79
[HIPIFY] CUDA Runtime API functions total revise (up to CUDA 10.0)
...
+ for all CUDA versions starting from 7.0
+ add missing functions
+ sync with HIP
+ update CUDA_Runtime_API_functions_supported_by_HIP.md
+ update affected tests
+ fix typos
+ formatting, annotating
2019-01-24 14:05:36 +03:00
Maneesh Gupta
791db4d760
Merge pull request #882 from eshcherb/roctracer-hip-frontend-190123
...
enabling PROF_API pkgconfig
2019-01-24 12:10:52 +05:30
Evgeny
6fd5946273
PROF_API pkgconfig roctracer-proto module lookup
2019-01-23 23:40:35 -06:00
Evgeny
eb183d4e7b
enabling PROF_API pkgconfig
2019-01-23 22:05:50 -06:00
Maneesh Gupta
b5b5ba1b36
Merge pull request #877 from mangupta/ci_enable_rocm_21
...
[ci] Add rocm-2.1.x to CI testing infrastructure
2019-01-23 22:48:31 +05:30
Aaron Enye Shi
a014668d3a
Fix mixed dot for Jenkins rocm_head
2019-01-23 16:39:25 +00:00
Maneesh Gupta
a7f60b2aa7
[ci] Add rocm-2.1.x to CI testing infrastructure
...
Change-Id: I4c8025b7745605f4ba01000d609211801176d4e0
2019-01-23 13:12:44 +05:30
Aaron Enye Shi
bbe5a0381f
Add tests for dot functions
2019-01-22 22:32:28 +00:00
Aaron Enye Shi
b63597bb95
Use correct OCKL native vector types
2019-01-22 22:31:19 +00:00
Evgeny Mankov
fac08442da
Merge pull request #875 from emankov/master
...
[HIPIFY][tests] Exclude one more test if CUDA == 7.0, which is of CUDA 7.5
2019-01-22 22:29:10 +03:00
Evgeny Mankov
994ee4bab0
[HIPIFY][tests] Exclude one more test if CUDA == 7.0, which is of CUDA 7.5
2019-01-22 22:26:43 +03:00
Aaron Enye Shi
b0e0ec3c67
Allow hip-clang to use amd_mixed_dot as well
2019-01-22 16:21:16 +00:00
Maneesh Gupta
2c610dde0b
Merge pull request #873 from gargrahul/fix_p2p_host_memory_access
...
Fixed issue of GPU device losing access to host pinned memory
2019-01-22 10:53:18 +05:30
Maneesh Gupta
325a5a2606
Merge pull request #872 from yxsamliu/chrono
...
Use chrono instead of sys/time.h in test
2019-01-22 10:53:07 +05:30
Rahul Garg
f8b399d3b5
Fixed issue of GPU device losing access to host pinned memory
2019-01-22 06:25:43 +05:30
Yaxun Sam Liu
f49aed237b
Fix get_time in tests for nvcc
2019-01-21 14:07:50 -05:00
Yaxun Sam Liu
75a67c216f
Use chrono instead of sys/time.h in test
...
sys/time.h is not available on Windows. Use C++11 <chrono> instead so that the tests
compile on Windows.
2019-01-20 00:05:27 -05:00
Maneesh Gupta
b86b994917
Merge pull request #871 from mangupta/roctracer_graceful_disable
...
Don't break cmake configure if roctracer headers not found
2019-01-18 16:11:11 +05:30
Maneesh Gupta
3403527cfd
Don't break cmake configure if roctracer headers not found
...
Change-Id: I35e49b52353ab24680836e397ab50136922f8641
2019-01-18 13:23:57 +05:30
Maneesh Gupta
c67082ebc2
Merge pull request #870 from eshcherb/roctracer-hip-frontend-190116
...
fixing HSA_INIT_API cid args
2019-01-17 12:13:22 +05:30
Evgeny
47625cb8fd
fixing HSA_INIT_API cid args
2019-01-16 23:45:44 -06:00
Evgeny Mankov
95f66adaf3
Merge pull request #868 from emankov/master
...
[HIPIFY][fix][#867 ] Relative paths routines fixes and improvements
2019-01-16 20:34:32 +03:00
Evgeny Mankov
7bbf284636
[HIPIFY][fix][ #867 ] Relative paths routines fixes and improvements
...
+ Always check for existence any file or dir, first of all, because real_path and make_absolute differ: real_path additionally performs access on file/dir.
+ Relative path routines in source files implemented similarly to dirs.
+ Error handling improvements.
2019-01-16 19:30:09 +03:00
Maneesh Gupta
01f52762fc
Merge pull request #864 from mangupta/swdev-157382
...
Switch to faster ocml variants for trig instrinsics
2019-01-16 10:17:20 +05:30
Maneesh Gupta
f74816efe6
Map more instrincis to ocml native
...
Change-Id: I54e55d5902931bafdabd0956e4e8c1d7b39f7173
2019-01-15 19:40:37 +05:30
Evgeny Mankov
dd22278232
Merge pull request #866 from emankov/testing
...
[HIPIFY] Introduce '-D' and '-I' options as hipify-clang's ones
2019-01-15 14:48:13 +03:00
Aaron Enye Shi
bdb84f3d9c
Add backwards compatibility to dot funcs
2019-01-14 23:14:49 +00:00
Aaron Enye Shi
7ba7d9a0f6
Move fp16 dot functions to hip_fp16
2019-01-14 21:50:32 +00:00
Evgeny Mankov
e5e93a1088
[HIPIFY] rename variables as requested in review
2019-01-14 21:19:34 +03:00
Evgeny Mankov
4b8fdda651
[HIPIFY] Introduce '-D' option as hipify-clang's one
...
+ '-D' might be set as hipify-clang option (before separator '--' or without specifying separator at all);
+ '-D' as a clang option might be specified as well (after options separator '--');
+ All defines specified as hipify-clang defines (before options separator '--') will be set after clang's ones and will redefine the same clang's ones.
2019-01-14 19:34:42 +03:00
Evgeny Mankov
d5281a771a
[HIPIFY] Introduce '-I' option as hipify-clang's one
...
+ '-I' might be set as hipify-clang option (before separator '--' or without specifying separator at all);
+ '-I' as a clang option might be specified as well (after options separator '--').
2019-01-14 18:23:54 +03:00
Maneesh Gupta
54e4463d16
Switch to faster ocml variants for trig instrinsics
...
Change-Id: If62821e2fe1b0da91ad1b8c5580ebf1a009405e9
2019-01-14 13:01:57 +05:30
Aaron Enye Shi
84a028b9e1
Add Dot functions as amd_mixed_dot function
...
Introduce the Dot functions which are available in the device library. Forward those prototypes, and introduce HIP API to expose the usage of the dot functions.
2019-01-11 23:31:02 +00:00
Evgeny Mankov
a075eb8658
Merge pull request #861 from emankov/master
...
[HIPIFY][cmake] Add header files to the project explicitly
2019-01-11 16:45:32 +03:00
Maneesh Gupta
7def56e23e
Merge pull request #858 from lmoriche/Windows
...
Fix compilation issues with MSVC
2019-01-11 12:36:07 +05:30
Maneesh Gupta
bc2f51a86f
Merge pull request #857 from ROCm-Developer-Tools/feature_ensure_lazy_init_initialises
...
Ensure that static structures are populated
2019-01-11 12:35:52 +05:30
Maneesh Gupta
6320a98498
Merge pull request #859 from LucasCampos/README-typo
...
Fix typo in README.md
2019-01-11 07:54:19 +05:30
Evgeny Mankov
b053cc0a21
[HIPIFY][cmake] Add header files to the project explicitly
2019-01-10 19:45:43 +03:00
Evgeny Mankov
dc9f9e9c35
Merge pull request #860 from emankov/master
...
[HIPIFY] Introduce '--' option - a separator between hipify-clang and…
2019-01-10 19:18:04 +03:00
Evgeny Mankov
63b7fc5714
[HIPIFY] Introduce '--' option - a separator between hipify-clang and clang options
...
+ Append '--' to the end of hipify-clang options in case of its absence.
[Reason]
+ Avoid the necessity for the user in JSON compilation database specification.
Otherwise:
Error while trying to load a compilation database:
Could not auto-detect compilation database for file "test.cu"
Though the hipification will be performed correctly without JSON database.
2019-01-10 18:27:30 +03:00
Evgeny Mankov
0661f69bff
Merge pull request #856 from emankov/master
...
[HIPIFY] Introduce CUDA installation path option '--cuda-path'
2019-01-10 18:13:56 +03:00
Evgeny Mankov
9b6af0a81d
[HIPIFY] option cuda-path might be used with '--' as well as with a single dash: '-'
...
+ Update README.md
2019-01-10 18:01:33 +03:00
Lucas Campos
ae4c2ee64c
Fix typo in README.md
2019-01-10 09:45:23 +01:00
Laurent Morichetti
167bbffc4f
Fix compilation issues with MSVC
2019-01-09 09:32:50 -08:00
Alex Voicu
1c815170da
Ensure that static structures are populated
2019-01-09 17:21:53 +00:00
Evgeny Mankov
c4268855e3
[HIPIFY] Introduce CUDA installation path option '-cuda-path'
...
Repeats clang's '--cuda-path' option.
[Reason]
In case of absence of any other clang's options setting '-cuda-path' allows not to specify separator '--' before clang's '--cuda-path'.
+ Tests and scripts are updated accordingly.
2019-01-09 20:18:36 +03:00