Commit Graph

6428 Commits

Author SHA1 Message Date
Weixing Zhang 4264a4ce56 [HIP Texture] The GPU virtual address for texture memory needs to be
aligned.

In hcc_am, a bigger buffer will be allocated for alignment purpose
and _unalignedDevicePointer is added in struct AmPointerInfo for
original allocated address.
2017-08-08 11:18:00 -04:00
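
The over-allocation approach described in commit 4264a4ce56 can be illustrated with a minimal sketch. This is not the hcc_am code: `allocate_aligned` and `raw_alloc` are hypothetical names, addresses are modelled as plain integers, and the alignment value is an assumption, since the commit message does not state one.

```python
def allocate_aligned(raw_alloc, size, alignment):
    """Return (aligned_address, unaligned_address) for a buffer of `size` bytes.

    `raw_alloc` is a hypothetical allocator that takes a byte count and
    returns an integer address. The buffer is over-allocated by
    `alignment - 1` bytes so an aligned address always fits inside it.
    """
    if alignment <= 0 or alignment & (alignment - 1):
        raise ValueError("alignment must be a positive power of two")
    # Original address: kept so the buffer can be freed later, which is the
    # role the commit gives to _unalignedDevicePointer.
    unaligned = raw_alloc(size + alignment - 1)
    # Round up to the next multiple of `alignment`.
    aligned = (unaligned + alignment - 1) & ~(alignment - 1)
    return aligned, unaligned


# Usage with a fake allocator that always returns address 0x1003.
aligned, unaligned = allocate_aligned(lambda nbytes: 0x1003, 100, 256)
```

The caller uses the aligned address for the texture and passes the stored unaligned address back to the allocator on free.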
Maneesh Gupta 8d6acec135 Bump min hcc_workweek required for named kernel dispatch to 17312
Change-Id: I8c7b58306b279ed113d03260e4bc6086bb8b4e68
2017-08-08 11:08:55 +05:30
Rahul Garg eb25bd72db docs update for memcpytosymbol support 2017-08-08 09:13:12 +05:30
Rahul Garg 3e84cf4aba Updated context management logic:
1) hipSetDevice sets a flag so that next call to hipCtxGetCurrent returns primary context on current device
2) hipCtxGetCurrent returns primary context on current device if TLS context stack is empty
3) hipCtxPopCurrent falls back to primary context on current device as default
4) hipCtxPushCurrent, hipCtxSetCurrent and hipCtxCreate reset the flag set in hipSetDevice
2017-08-08 07:02:22 +05:30
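
The four rules in commit 3e84cf4aba can be modelled with a small state machine. The sketch below is a plain Python model, not the HIP runtime: the class and method names are hypothetical, contexts are represented as simple values, and two points are assumptions because the commit message leaves them open: the flag stays set until rule 4 resets it, and a pop on an empty stack returns the primary context.

```python
class ContextModel:
    """Toy model of the context rules; one instance stands in for one thread."""

    def __init__(self):
        self.current_device = 0
        self.stack = []           # stands in for the TLS context stack
        self.use_primary = False  # flag set by set_device (rule 1)

    def primary(self, device):
        # A primary context is modelled as a tagged tuple.
        return ("primary", device)

    def set_device(self, device):       # models hipSetDevice
        self.current_device = device
        self.use_primary = True         # rule 1

    def get_current(self):              # models hipCtxGetCurrent
        if self.use_primary or not self.stack:   # rules 1 and 2
            return self.primary(self.current_device)
        return self.stack[-1]

    def push_current(self, ctx):        # models hipCtxPushCurrent
        self.stack.append(ctx)
        self.use_primary = False        # rule 4

    def set_current(self, ctx):         # models hipCtxSetCurrent
        if self.stack:
            self.stack[-1] = ctx
        else:
            self.stack.append(ctx)
        self.use_primary = False        # rule 4

    def create(self, name):             # models hipCtxCreate
        ctx = ("ctx", name)
        self.push_current(ctx)          # also resets the flag (rule 4)
        return ctx

    def pop_current(self):              # models hipCtxPopCurrent
        if self.stack:
            return self.stack.pop()
        return self.primary(self.current_device)  # rule 3 (assumed reading)


m = ContextModel()
m.push_current("ctxA")
m.set_device(1)
current = m.get_current()   # primary context of device 1, not "ctxA"
```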
Rahul Garg 2f617bac6d Disable failing test p2p_copy_coherency 2017-08-02 14:47:13 +05:30
Siu Chi Chan 96e79f97e2 implement __threadfence_system 2017-08-02 08:50:18 +00:00
Jenkins a827869a4a Merge 'master' into 'amd-master'
Change-Id: Id400e2e3c0fa393ca934634dee46496aca8ade21
2017-08-01 04:11:20 -05:00
Ben Sander 6d6c325626 Merge pull request #136 from mangupta/update_install_markdown
[doc] Add details of HIP build time dependencies to INSTALL.md
2017-07-31 11:05:30 -05:00
Ben Sander 0fcd27586b Merge pull request #128 from mangupta/cleanup_amdgpu_target_logic
[hipcc] Cleanup amdgpu target logic
2017-07-31 11:05:06 -05:00
Maneesh Gupta e9c995959b [doc] Add details of HIP build time dependencies to INSTALL.md
Change-Id: Iee87d1345aba716d0ce49f7afb45ba073a16be88
2017-07-31 14:57:32 +05:30
Jenkins cc77fb5087 Merge 'master' into 'amd-master'
Change-Id: I510b01084b5aad8382044fe828a099896fa12aff
2017-07-31 04:11:17 -05:00
Maneesh Gupta fce1d0d7f0 Merge pull request #134 from gargrahul/fix_hipMemcpy2DAsync
fix hipMemcpy2DAsync
2017-07-31 10:27:18 +05:30
Maneesh Gupta 48573a037e Merge pull request #135 from bensander/fix_tracing
Some fixes to tracing.
2017-07-31 10:24:41 +05:30
Maneesh Gupta 85ff6e3ef4 [hipcc] Fix amdgpu target selection logic
- Refactoring introduced a bug when user does not specify any target via
--amdgpu-target, but has an invalid target specified in
HCC_AMDGPU_TARGET. In this case the selection logic was defaulting to
gfx803.
- Removed defaulting to any specific target if rocm_agent_enumerator
  fails. hipcc will report this and die if linking was required.

Change-Id: I76131867049fef92331807dd19a926406dcc1d02
2017-07-31 10:11:19 +05:30
Ben Sander ecbb494a58 Some fixes to tracing. 2017-07-28 22:13:43 -05:00
Rahul Garg 0053e4db03 fix hipMemcpy2DAsync 2017-07-29 06:50:56 +05:30
Ben Sander 98f09debbb Merge pull request #119 from emankov/master
[HIPIFY] CUDA RT Textures support
2017-07-28 12:12:41 -05:00
Maneesh Gupta 830e85060b [hipcc] Cleanup amdgpu target logic
Existing logic has a bug. If user specifies targetA via commandline
options, while enumerator returns targetB, hipcc will create a fatbin
containing targets targetA and targetB. enumerator should only be used
when no target is specified by user (commandline or env var).

Change-Id: I6da857f86860c0e671b5988cd858644a08f723b9
2017-07-28 16:18:15 +05:30
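
The rule stated in commit 830e85060b, that the enumerator is consulted only when the user specifies no target, can be sketched as follows. This is an illustrative Python model rather than the hipcc script: `select_targets` and its parameters are hypothetical, and giving the command line precedence over the environment variable is an assumption the commit message does not spell out.

```python
def select_targets(cli_targets, env_targets, enumerate_agents):
    """Pick the GPU targets to build for.

    cli_targets:      targets given on the command line (may be empty)
    env_targets:      targets taken from the environment variable (may be empty)
    enumerate_agents: callable returning the targets detected on this machine
    """
    # User-specified targets win; the enumerator is not consulted at all.
    user_targets = list(cli_targets) or list(env_targets)
    if user_targets:
        return user_targets
    # No user choice: fall back to detection. An empty result is returned
    # as-is instead of substituting a default target.
    return list(enumerate_agents())


# The buggy behaviour would have produced ["targetA", "targetB"] here.
targets = select_targets(["targetA"], [], lambda: ["targetB"])
```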
Aditya Atluri a3bc662c06 fixed device selection during compilation to use rocm_agent_enumerator
1. Changed hipcc to use rocm_agent_enumerator
2. Changed square sample test to use device variable
2017-07-28 10:43:11 +05:30
Maneesh Gupta c25b9c6786 Re-enable frexp(f) device math function
Change-Id: I53c022b8ddf38cd17ddb42eba457b9020db66395
2017-07-28 10:42:51 +05:30
Maneesh Gupta 375ec84bc6 Merge pull request #127 from bensander/pass_module_kname
Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile
2017-07-28 10:19:15 +05:30
Ben Sander 4980a6d3ab Add workweek check to make sure we have a new enough compiler 2017-07-27 23:00:58 -05:00
Maneesh Gupta fcc294ed41 Merge pull request #122 from bensander/enable_async_null_stream
Set HIP_SYNC_NULL_STREAM=0.
2017-07-28 09:15:56 +05:30
Maneesh Gupta 821f8b9e02 Merge pull request #121 from syifan/master
Allow hip cmake to link libraries
2017-07-28 09:12:15 +05:30
Ben Sander 3a4dfc0f85 Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile 2017-07-27 22:00:15 -05:00
Evgeny Mankov d614f53ad7 Update Cuda2Hip.cpp 2017-07-28 01:45:24 +03:00
Evgeny Mankov bcca169631 Update CUDA_Runtime_API_functions_supported_by_HIP.md 2017-07-28 01:41:14 +03:00
Evgeny Mankov 36a9ccacb6 Update CUDA_Runtime_API_functions_supported_by_HIP.md 2017-07-28 01:38:50 +03:00
Ben Sander 9cede50942 Merge pull request #110 from sunway513/fix_hipEnvVar_test
Fix hipEnvVar test
2017-07-27 13:03:43 -05:00
sunway513 ed0d6cf745 Change to use hipDeviceGetPCIBusId api for hipEnvVarDriver test 2017-07-27 17:41:49 +00:00
Ben Sander 77fb9893b4 Set HIP_SYNC_NULL_STREAM=0.
Optimizes null stream synchronization so it uses GPU-side dependency
resolution. Requires HCC __hcc_workweek__ > 17300.
2017-07-27 11:11:54 -05:00
Yifan Sun 0aaa800109 Allow hip cmake to link libraries 2017-07-27 11:46:18 -04:00
Evgeny Mankov ffcf5d32a0 [HIPIFY] CUDA RT Textures support 2017-07-27 18:39:40 +03:00
Maneesh Gupta 526fcb1223 Merge pull request #116 from bensander/enable_coh_host
Make host memory allocations coherent by default.
2017-07-27 10:45:32 +05:30
Ben Sander 4f93b99f23 Merge pull request #100 from weixingzhang/texture
HIP Texture Support
2017-07-26 19:28:15 -05:00
Ben Sander bcdc6681fa Merge pull request #114 from emankov/master
[HIPIFY] CUDA Stream Callback support
2017-07-26 19:27:05 -05:00
Ben Sander 50546c7866 Merge pull request #113 from mangupta/avoid_rebuild_cache
[cmake] No longer rebuild cmake cache on each build
2017-07-26 19:24:43 -05:00
Ben Sander 5589c59605 Merge pull request #115 from bensander/README
Update release notes.
2017-07-26 19:23:26 -05:00
Ben Sander 8b8e97ff28 Make host memory allocations coherent by default.
Associated change is to optimize event recording so it uses
agent-scope release (since it was only using system-scope release
to support non-coherent host mem).

Flags and environment variables exist to obtain previous behavior
if desired.  Options are documented in new performance guide.
2017-07-26 19:20:34 -05:00
Ben Sander d9d9a8620f Update release notes. 2017-07-26 19:03:32 -05:00
Evgeny Mankov 368aebc4bc [HIPIFY] CUDA Stream Callback support 2017-07-26 19:29:50 +03:00
Maneesh Gupta e63c8494fd [cmake] No longer rebuild cmake cache on each build
Cmake cache was being rebuilt on each build. This was being done
to update HIP_VERSION, HCC_VERSION, .hipInfo and .hipVersion.
However, rebuilding cache also re-runs HIT parser which is slow.
Removing the cache rebuild should speed up the build. But user
needs to explicitly rebuild the cache in case HIP_VERSION or
HCC_VERSION changes by calling "make rebuild_cache"

Change-Id: Ia5476eb7105aa614239c4dc7968c37f5e6cb0b29
2017-07-26 13:51:32 +05:30
Evgeny Mankov 85e6593a1b [HIPIFY] Finally finished syncing with CUDA 8.0.61 Driver and Runtime… (#105)
[hipify-clang] Finally finished syncing with CUDA 8.0.61 Driver and Runtime API (including missing data types, D3D, OpenGL, VDPAU and EGL interop).

    + All the Modules are supported now:
	1) 4.1 – 4.31 from CUDA 8.0.61 Driver API
	2) CUDA_Driver_API_functions_supported_by_HIP.md updated accordingly
	3) 4.1 – 4.31 from CUDA 8.0.61 Runtime API
	4) CUDA_Runtime_API_functions_supported_by_HIP.md updated accordingly
    + Typo fixes
    + Annotating
2017-07-26 09:38:03 +05:30
Maneesh Gupta f420af64d0 Merge pull request #111 from bensander/master
Enable HCC_OPT_FLUSH=1 (if HCC compiler new enough)
2017-07-25 12:02:33 +05:30
Ben Sander 67f5d2de3d Enable HCC_OPT_FLUSH=1 (if HCC compiler new enough) 2017-07-24 18:57:19 -05:00
sunway513 6f359dfbe6 Update Jenkinsfile to run all the built tests 2017-07-24 19:02:00 +00:00
sunway513 1275ceed67 Re-enable hipEnvVarDriver in Jenkinsfile 2017-07-24 18:53:46 +00:00
sunway513 81bc82d166 Fix indentation, binary path and add wait on popen calls to hipEnvVar test 2017-07-24 17:16:13 +00:00
Maneesh Gupta bb5128a0ab Merge pull request #109 from mangupta/disable_failing_tests
[ci] Exclude sporadically failing test in CI only
2017-07-24 20:22:45 +05:30
Maneesh Gupta 4b8931f717 [ci] Exclude sporadically failing test in CI only
Change-Id: If76bc89d39aa35f5c246cbcd41a997b580832450
2017-07-24 15:16:41 +05:30