Commit Graph

3255 Commits

Author SHA1 Message Date
Maneesh Gupta 95fe4b770f Merge pull request #863 from aaronenyeshi/add-dot-instructions
Add Dot functions as amd_mixed_dot function
2019-01-29 06:01:01 +05:30
Evgeny Mankov 4519acd459 Merge pull request #883 from emankov/master
[HIPIFY] CUDA Runtime API functions total revise (up to CUDA 10.0)
2019-01-25 19:17:15 +03:00
Aaron Enye Shi 2ab6fd374c Fix mixed dot for Jenkins rocm_2_0 2019-01-24 22:48:35 +00:00
Aaron Enye Shi 77de87d0cd Exclude mixed dot functions from nvcc path 2019-01-24 21:00:45 +00:00
Evgeny Mankov 229b34fc79 [HIPIFY] CUDA Runtime API functions total revise (up to CUDA 10.0)
+ for all CUDA versions starting from 7.0
+ add missing functions
+ sync with HIP
+ update CUDA_Runtime_API_functions_supported_by_HIP.md
+ update affected tests
+ fix typos
+ formatting, annotating
2019-01-24 14:05:36 +03:00
Maneesh Gupta 791db4d760 Merge pull request #882 from eshcherb/roctracer-hip-frontend-190123
enabling PROF_API pkgconfig
2019-01-24 12:10:52 +05:30
Evgeny 6fd5946273 PROF_API pkgconfig roctracer-proto module lookup 2019-01-23 23:40:35 -06:00
Evgeny eb183d4e7b enabling PROF_API pkgconfig 2019-01-23 22:05:50 -06:00
Maneesh Gupta b5b5ba1b36 Merge pull request #877 from mangupta/ci_enable_rocm_21
[ci] Add rocm-2.1.x to CI testing infrastructure
2019-01-23 22:48:31 +05:30
Aaron Enye Shi a014668d3a Fix mixed dot for Jenkins rocm_head 2019-01-23 16:39:25 +00:00
Maneesh Gupta a7f60b2aa7 [ci] Add rocm-2.1.x to CI testing infrastructure
Change-Id: I4c8025b7745605f4ba01000d609211801176d4e0
2019-01-23 13:12:44 +05:30
Aaron Enye Shi bbe5a0381f Add tests for dot functions 2019-01-22 22:32:28 +00:00
Aaron Enye Shi b63597bb95 Use correct OCKL native vector types 2019-01-22 22:31:19 +00:00
Evgeny Mankov fac08442da Merge pull request #875 from emankov/master
[HIPIFY][tests] Exclude one more test if CUDA == 7.0, which is of CUDA 7.5
2019-01-22 22:29:10 +03:00
Evgeny Mankov 994ee4bab0 [HIPIFY][tests] Exclude one more test if CUDA == 7.0, which is of CUDA 7.5 2019-01-22 22:26:43 +03:00
Aaron Enye Shi b0e0ec3c67 Allow hip-clang to use amd_mixed_dot as well 2019-01-22 16:21:16 +00:00
Maneesh Gupta 2c610dde0b Merge pull request #873 from gargrahul/fix_p2p_host_memory_access
Fixed issue of GPU device losing access to host pinned memory
2019-01-22 10:53:18 +05:30
Maneesh Gupta 325a5a2606 Merge pull request #872 from yxsamliu/chrono
Use chrono instead of sys/time.h in test
2019-01-22 10:53:07 +05:30
Rahul Garg f8b399d3b5 Fixed issue of GPU device losing access to host pinned memory 2019-01-22 06:25:43 +05:30
Yaxun Sam Liu f49aed237b Fix get_time in tests for nvcc 2019-01-21 14:07:50 -05:00
Yaxun Sam Liu 75a67c216f Use chrono instead of sys/time.h in test
sys/time.h is not available on Windows. Use C++11 <chrono> instead so that the tests
compile on Windows.
2019-01-20 00:05:27 -05:00
Maneesh Gupta b86b994917 Merge pull request #871 from mangupta/roctracer_graceful_disable
Don't break cmake configure if roctracer headers not found
2019-01-18 16:11:11 +05:30
Maneesh Gupta 3403527cfd Don't break cmake configure if roctracer headers not found
Change-Id: I35e49b52353ab24680836e397ab50136922f8641
2019-01-18 13:23:57 +05:30
Maneesh Gupta c67082ebc2 Merge pull request #870 from eshcherb/roctracer-hip-frontend-190116
fixing HSA_INIT_API cid args
2019-01-17 12:13:22 +05:30
Evgeny 47625cb8fd fixing HSA_INIT_API cid args 2019-01-16 23:45:44 -06:00
Evgeny Mankov 95f66adaf3 Merge pull request #868 from emankov/master
[HIPIFY][fix][#867] Relative paths routines fixes and improvements
2019-01-16 20:34:32 +03:00
Evgeny Mankov 7bbf284636 [HIPIFY][fix][#867] Relative paths routines fixes and improvements
+ Always check for existence any file or dir, first of all, because real_path and make_absolute differ: real_path additionally performs access on file/dir.
+ Relative path routines in source files implemented similarly to dirs.
+ Error handling improvements.
2019-01-16 19:30:09 +03:00
Maneesh Gupta 01f52762fc Merge pull request #864 from mangupta/swdev-157382
Switch to faster ocml variants for trig instrinsics
2019-01-16 10:17:20 +05:30
Maneesh Gupta f74816efe6 Map more instrincis to ocml native
Change-Id: I54e55d5902931bafdabd0956e4e8c1d7b39f7173
2019-01-15 19:40:37 +05:30
Evgeny Mankov dd22278232 Merge pull request #866 from emankov/testing
[HIPIFY] Introduce '-D' and '-I' options as hipify-clang's ones
2019-01-15 14:48:13 +03:00
Aaron Enye Shi bdb84f3d9c Add backwards compatibility to dot funcs 2019-01-14 23:14:49 +00:00
Aaron Enye Shi 7ba7d9a0f6 Move fp16 dot functions to hip_fp16 2019-01-14 21:50:32 +00:00
Evgeny Mankov e5e93a1088 [HIPIFY] rename variables as requested in review 2019-01-14 21:19:34 +03:00
Evgeny Mankov 4b8fdda651 [HIPIFY] Introduce '-D' option as hipify-clang's one
+ '-D' might be set as hipify-clang option (before separator '--' or without specifying separator at all);
+ '-D' as a clang option might be specified as well (after options separator '--');
+ All defines specified as hipify-clang defines (before options separator '--') will be set after clang's ones and will redefine the same clang's ones.
2019-01-14 19:34:42 +03:00
Evgeny Mankov d5281a771a [HIPIFY] Introduce '-I' option as hipify-clang's one
+ '-I' might be set as hipify-clang option (before separator '--' or without specifying separator at all);
+ '-I' as a clang option might be specified as well (after options separator '--').
2019-01-14 18:23:54 +03:00
Maneesh Gupta 54e4463d16 Switch to faster ocml variants for trig instrinsics
Change-Id: If62821e2fe1b0da91ad1b8c5580ebf1a009405e9
2019-01-14 13:01:57 +05:30
Aaron Enye Shi 84a028b9e1 Add Dot functions as amd_mixed_dot function
Introduce the Dot functions which are available in the device library. Forward those prototypes, and introduce HIP API to expose the usage of the dot functions.
2019-01-11 23:31:02 +00:00
Evgeny Mankov a075eb8658 Merge pull request #861 from emankov/master
[HIPIFY][cmake] Add header files to the project explicitly
2019-01-11 16:45:32 +03:00
Maneesh Gupta 7def56e23e Merge pull request #858 from lmoriche/Windows
Fix compilation issues with MSVC
2019-01-11 12:36:07 +05:30
Maneesh Gupta bc2f51a86f Merge pull request #857 from ROCm-Developer-Tools/feature_ensure_lazy_init_initialises
Ensure that static structures are populated
2019-01-11 12:35:52 +05:30
Maneesh Gupta 6320a98498 Merge pull request #859 from LucasCampos/README-typo
Fix typo in README.md
2019-01-11 07:54:19 +05:30
Evgeny Mankov b053cc0a21 [HIPIFY][cmake] Add header files to the project explicitly 2019-01-10 19:45:43 +03:00
Evgeny Mankov dc9f9e9c35 Merge pull request #860 from emankov/master
[HIPIFY] Introduce '--' option - a separator between hipify-clang and…
2019-01-10 19:18:04 +03:00
Evgeny Mankov 63b7fc5714 [HIPIFY] Introduce '--' option - a separator between hipify-clang and clang options
+ Append '--' to the end of hipify-clang options in case of its absence.

[Reason]
+ Avoid the necessity for the user in JSON compilation database specification.
  Otherwise:
  Error while trying to load a compilation database:
  Could not auto-detect compilation database for file "test.cu"

  Though the hipification will be performed correctly without JSON database.
2019-01-10 18:27:30 +03:00
Evgeny Mankov 0661f69bff Merge pull request #856 from emankov/master
[HIPIFY] Introduce CUDA installation path option '--cuda-path'
2019-01-10 18:13:56 +03:00
Evgeny Mankov 9b6af0a81d [HIPIFY] option cuda-path might be used with '--' as well as with a single dash: '-'
+ Update README.md
2019-01-10 18:01:33 +03:00
Lucas Campos ae4c2ee64c Fix typo in README.md 2019-01-10 09:45:23 +01:00
Laurent Morichetti 167bbffc4f Fix compilation issues with MSVC 2019-01-09 09:32:50 -08:00
Alex Voicu 1c815170da Ensure that static structures are populated 2019-01-09 17:21:53 +00:00
Evgeny Mankov c4268855e3 [HIPIFY] Introduce CUDA installation path option '-cuda-path'
Repeats clang's '--cuda-path' option.

[Reason]
In case of absence of any other clang's options setting '-cuda-path' allows not to specify separator '--' before clang's '--cuda-path'.

+ Tests and scripts are updated accordingly.
2019-01-09 20:18:36 +03:00