1831 Commits

Author SHA1 Message Date
wsttiger ef8c6a67fc Modify header such that it's compatible with strict C compiler
[ROCm/clr commit: 8302138d57]
2017-08-15 13:23:09 -05:00
wsttiger 5ba2f6ea57 Add missing CUDA tags
[ROCm/clr commit: 7933b3292d]
2017-08-15 13:22:20 -05:00
Maneesh Gupta b37cc92f36 Merge pull request #138 from scchan/threadfence_system
implement __threadfence_system

[ROCm/clr commit: 2bd79ab9f5]
2017-08-10 12:51:05 +05:30
Maneesh Gupta a8dc5dc11b [dtests] Disable hip_threadfence_system.tst on nvcc
The test needs some more modifications before it is ready for building
on nvcc path. Disabling it for now so that the pull request can be
merged.

Change-Id: I19a894fcda7b2159f86a4e4e95a409c5655d3760


[ROCm/clr commit: 0d537f9966]
2017-08-10 12:05:58 +05:30
Ben Sander b17e0394f5 Merge pull request #146 from gargrahul/docs_update_memcpysymbol
docs update for memcpytosymbol support

[ROCm/clr commit: e810da61f3]
2017-08-09 17:48:48 -05:00
Ben Sander 34775b1296 Merge pull request #145 from gargrahul/context_mgmt_changes
Updated context management logic:

[ROCm/clr commit: dfc87a85bd]
2017-08-09 17:47:45 -05:00
Ben Sander 086cd96e56 Merge pull request #139 from gargrahul/disable_p2pcoherency_test
Disable failing test p2p_copy_coherency

[ROCm/clr commit: 2a3ba24776]
2017-08-09 16:54:33 -05:00
Maneesh Gupta 97aabed5f4 Merge pull request #147 from mangupta/bump_workweek
Bump min hcc_workweek required for named kernel dispatch to 17312

[ROCm/clr commit: 077c5a75af]
2017-08-09 14:10:07 +05:30
Maneesh Gupta 51c48e4cef Bump min hcc_workweek required for named kernel dispatch to 17312
Change-Id: I8c7b58306b279ed113d03260e4bc6086bb8b4e68


[ROCm/clr commit: dabd6d2bc6]
2017-08-08 11:08:55 +05:30
Rahul Garg 93d30119d0 docs update for memcpytosymbol support
[ROCm/clr commit: a8512b7597]
2017-08-08 09:13:12 +05:30
Rahul Garg 687fac97dc Updated context management logic:
1) hipSetDevice sets a flag so that the next call to hipCtxGetCurrent returns the primary context on the current device
2) hipCtxGetCurrent returns the primary context on the current device if the TLS context stack is empty
3) hipCtxPopCurrent falls back to the primary context on the current device as the default
4) hipCtxPushCurrent, hipCtxSetCurrent and hipCtxCreate reset the flag set in hipSetDevice


[ROCm/clr commit: c4e9323877]
2017-08-08 07:02:22 +05:30
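
The four context-management rules in commit 687fac97dc can be read as a small state machine. The following is a minimal sketch of one possible reading, written as a toy Python model rather than the HIP runtime code. The class name `ContextModel`, its method names, and the string context handles are all hypothetical. It assumes the flag from rule 1 stays set until one of the calls in rule 4 clears it, and that rule 3 means a pop on an empty stack yields the primary context.

```python
class ContextModel:
    """Toy model of the four rules; not the HIP implementation."""

    def __init__(self, num_devices=1):
        # One hypothetical primary context handle per device.
        self.primary = [f"primary-ctx-{d}" for d in range(num_devices)]
        self.current_device = 0
        self.stack = []           # stands in for the per-thread (TLS) context stack
        self.use_primary = False  # the flag described in rule 1

    def set_device(self, device):
        # Rule 1: selecting a device makes the primary context the next "current" one.
        self.current_device = device
        self.use_primary = True

    def get_current(self):
        # Rules 1 and 2: primary context if the flag is set or the stack is empty.
        if self.use_primary or not self.stack:
            return self.primary[self.current_device]
        return self.stack[-1]

    def pop_current(self):
        # Rule 3 (assumed reading): fall back to the primary context on an empty stack.
        if self.stack:
            return self.stack.pop()
        return self.primary[self.current_device]

    def push_current(self, ctx):
        # Rule 4: an explicit push clears the flag.
        self.stack.append(ctx)
        self.use_primary = False

    def set_current(self, ctx):
        # Rule 4: replace the top of the stack (or start one) and clear the flag.
        if self.stack:
            self.stack[-1] = ctx
        else:
            self.stack.append(ctx)
        self.use_primary = False

    def create(self, name):
        # Rule 4: a newly created context becomes current and clears the flag.
        self.push_current(name)
        return name
```

In this model, calling `set_device(1)` after pushing a user context makes `get_current()` report the primary context of device 1 until the next push, set, or create call.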
Rahul Garg b32cf1b3e8 Disable failing test p2p_copy_coherency
[ROCm/clr commit: f916a7654e]
2017-08-02 14:47:13 +05:30
Siu Chi Chan 6598eabd24 implement __threadfence_system
[ROCm/clr commit: b6e22b5c93]
2017-08-02 08:50:18 +00:00
Ben Sander ddf72a1b2c Merge pull request #136 from mangupta/update_install_markdown
[doc] Add details of HIP build time dependencies to INSTALL.md

[ROCm/clr commit: 3061e394bf]
2017-07-31 11:05:30 -05:00
Ben Sander 10a2a2e3e6 Merge pull request #128 from mangupta/cleanup_amdgpu_target_logic
[hipcc] Cleanup amdgpu target logic

[ROCm/clr commit: 3ed8c6c034]
2017-07-31 11:05:06 -05:00
Maneesh Gupta e090aef2d2 [doc] Add details of HIP build time dependencies to INSTALL.md
Change-Id: Iee87d1345aba716d0ce49f7afb45ba073a16be88


[ROCm/clr commit: 87e71ae681]
2017-07-31 14:57:32 +05:30
Maneesh Gupta d3d3cedc1d Merge pull request #134 from gargrahul/fix_hipMemcpy2DAsync
fix hipMemcpy2DAsync

[ROCm/clr commit: 9ded1d02a1]
2017-07-31 10:27:18 +05:30
Maneesh Gupta 602c3466f9 Merge pull request #135 from bensander/fix_tracing
Some fixes to tracing.

[ROCm/clr commit: 06b51109c6]
2017-07-31 10:24:41 +05:30
Maneesh Gupta 5e2f8804ba [hipcc] Fix amdgpu target selection logic
- Refactoring introduced a bug when user does not specify any target via
--amdgpu-target, but has an invalid target specified in
HCC_AMDGPU_TARGET. In this case the selection logic was defaulting to
gfx803.
- Removed defaulting to any specific target if rocm_agent_enumerator
  fails. hipcc will report this and die if linking was required.

Change-Id: I76131867049fef92331807dd19a926406dcc1d02


[ROCm/clr commit: 8f6c150134]
2017-07-31 10:11:19 +05:30
Ben Sander 6079b4b4f6 Some fixes to tracing.
[ROCm/clr commit: 9e9f384899]
2017-07-28 22:13:43 -05:00
Rahul Garg 5a1578847e fix hipMemcpy2DAsync
[ROCm/clr commit: 320ae86d44]
2017-07-29 06:50:56 +05:30
Ben Sander 5a891afc7c Merge pull request #119 from emankov/master
[HIPIFY] CUDA RT Textures support

[ROCm/clr commit: fad59ea8c3]
2017-07-28 12:12:41 -05:00
Maneesh Gupta 8e35e7a287 [hipcc] Cleanup amdgpu target logic
Existing logic has a bug. If the user specifies targetA via command-line
options while the enumerator returns targetB, hipcc will create a fatbin
containing both targetA and targetB. The enumerator should only be used
when no target is specified by the user (command line or env var).

Change-Id: I6da857f86860c0e671b5988cd858644a08f723b9


[ROCm/clr commit: 2a2c7575eb]
2017-07-28 16:18:15 +05:30
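
The target-selection behavior described in commits 8e35e7a287 and 5e2f8804ba can be summarized as a precedence order. The sketch below is an illustration in Python, not the hipcc source. The function `select_targets` and its parameters are hypothetical, and it assumes that command-line targets take precedence over HCC_AMDGPU_TARGET and that the environment variable holds a comma-separated list. Validation of individual target names is not modeled.

```python
def select_targets(cli_targets, env_value, enumerate_agents, linking_required):
    """Return the list of AMDGPU targets to build for.

    cli_targets      -- targets given via --amdgpu-target (may be empty)
    env_value        -- contents of HCC_AMDGPU_TARGET, or None
    enumerate_agents -- callable standing in for rocm_agent_enumerator
    linking_required -- True if this invocation needs to link device code
    """
    # Targets named by the user win; the enumerator is not consulted at all,
    # so its result can never be mixed into the fatbin.
    if cli_targets:
        return list(cli_targets)

    env_targets = [t.strip() for t in (env_value or "").split(",") if t.strip()]
    if env_targets:
        return env_targets

    # Only when the user specified nothing do we ask the enumerator.
    detected = list(enumerate_agents())
    if detected:
        return detected

    # No silent default target: report the failure if linking needs a target.
    if linking_required:
        raise RuntimeError("no AMDGPU target specified or detected")
    return []
```

For example, `select_targets(["targetA"], None, lambda: ["targetB"], True)` yields only `targetA`, which is the behavior the cleanup commit asks for.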
Maneesh Gupta 2405c239e1 Merge pull request #127 from bensander/pass_module_kname
Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile

[ROCm/clr commit: 791063ad6e]
2017-07-28 10:19:15 +05:30
Ben Sander 004080bff8 Add workweek check to make sure we have a new enough compiler
[ROCm/clr commit: 2781293e89]
2017-07-27 23:00:58 -05:00
Maneesh Gupta 8b7ce77fc0 Merge pull request #122 from bensander/enable_async_null_stream
Set HIP_SYNC_NULL_STREAM=0.

[ROCm/clr commit: 8330fb3fe0]
2017-07-28 09:15:56 +05:30
Maneesh Gupta 0f78021303 Merge pull request #121 from syifan/master
Allow hip cmake to link libraries

[ROCm/clr commit: 6835f5ac59]
2017-07-28 09:12:15 +05:30
Ben Sander c8b72cb926 Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile
[ROCm/clr commit: a282351763]
2017-07-27 22:00:15 -05:00
Evgeny Mankov 6e010bec22 Update Cuda2Hip.cpp
[ROCm/clr commit: 6c5adf42dd]
2017-07-28 01:45:24 +03:00
Evgeny Mankov f9780654b8 Update CUDA_Runtime_API_functions_supported_by_HIP.md
[ROCm/clr commit: 4486f91350]
2017-07-28 01:41:14 +03:00
Evgeny Mankov 4940eea975 Update CUDA_Runtime_API_functions_supported_by_HIP.md
[ROCm/clr commit: 681dd9e14d]
2017-07-28 01:38:50 +03:00
Ben Sander 82a347659c Merge pull request #110 from sunway513/fix_hipEnvVar_test
Fix hipEnvVar test

[ROCm/clr commit: bd563bf761]
2017-07-27 13:03:43 -05:00
sunway513 67afe0ec04 Change to use hipDeviceGetPCIBusId api for hipEnvVarDriver test
[ROCm/clr commit: b7c34fb80f]
2017-07-27 17:41:49 +00:00
Ben Sander 1aa0656d34 Set HIP_SYNC_NULL_STREAM=0.
Optimizes null stream synchronization so it uses GPU-side dependency
resolution. Requires HCC __hcc_workweek__ > 17300.


[ROCm/clr commit: cd42711134]
2017-07-27 11:11:54 -05:00
Yifan Sun 1edab9f118 Allow hip cmake to link libraries
[ROCm/clr commit: 679075e2f0]
2017-07-27 11:46:18 -04:00
Evgeny Mankov e586926eb7 [HIPIFY] CUDA RT Textures support
[ROCm/clr commit: fa7719b5b5]
2017-07-27 18:39:40 +03:00
Maneesh Gupta 97edbda54d Merge pull request #116 from bensander/enable_coh_host
Make host memory allocations coherent by default.

[ROCm/clr commit: 9086c14936]
2017-07-27 10:45:32 +05:30
Ben Sander a5b3f825c2 Merge pull request #100 from weixingzhang/texture
HIP Texture Support

[ROCm/clr commit: 702651fc85]
2017-07-26 19:28:15 -05:00
Ben Sander 03e01439aa Merge pull request #114 from emankov/master
[HIPIFY] CUDA Stream Callback support

[ROCm/clr commit: 1be14ad733]
2017-07-26 19:27:05 -05:00
Ben Sander 70783958f3 Merge pull request #113 from mangupta/avoid_rebuild_cache
[cmake] No longer rebuild cmake cache on each build

[ROCm/clr commit: 9e120e3dd3]
2017-07-26 19:24:43 -05:00
Ben Sander 4f06b4157d Merge pull request #115 from bensander/README
Update release notes.

[ROCm/clr commit: ed9647ad25]
2017-07-26 19:23:26 -05:00
Ben Sander 235b60165f Make host memory allocations coherent by default.
Associated change is to optimize event recording so it uses
agent-scope release (since it was only using system-scope release
to support non-coherent host mem).

Flags and environment variables exist to obtain previous behavior
if desired.  Options are documented in new performance guide.


[ROCm/clr commit: 6576201ec2]
2017-07-26 19:20:34 -05:00
Ben Sander 7eeb6e4d9d Update release notes.
[ROCm/clr commit: e29a53f4a6]
2017-07-26 19:03:32 -05:00
Evgeny Mankov 7428857399 [HIPIFY] CUDA Stream Callback support
[ROCm/clr commit: af9dadf20a]
2017-07-26 19:29:50 +03:00
Maneesh Gupta 013f6cd0cc [cmake] No longer rebuild cmake cache on each build
The CMake cache was being rebuilt on each build in order to update
HIP_VERSION, HCC_VERSION, .hipInfo and .hipVersion.
However, rebuilding the cache also re-runs the HIT parser, which is slow.
Removing the cache rebuild should speed up the build, but the user
now needs to rebuild the cache explicitly by calling "make rebuild_cache"
if HIP_VERSION or HCC_VERSION changes.

Change-Id: Ia5476eb7105aa614239c4dc7968c37f5e6cb0b29


[ROCm/clr commit: 1b5d19ff36]
2017-07-26 13:51:32 +05:30
Evgeny Mankov 7fab0b5ecc [HIPIFY] Finally finished syncing with CUDA 8.0.61 Driver and Runtime… (#105)
[hipify-clang] Finally finished syncing with CUDA 8.0.61 Driver and Runtime API (including missing data types, D3D, OpenGL, VDPAU and EGL interop).

    + All the Modules are supported now:
	1) 4.1 – 4.31 from CUDA 8.0.61 Driver API
	2) CUDA_Driver_API_functions_supported_by_HIP.md updated accordingly
	3) 4.1 – 4.31 from CUDA 8.0.61 Runtime API
	4) CUDA_Runtime_API_functions_supported_by_HIP.md updated accordingly
    + Typos fixing
    + Annotating

[ROCm/clr commit: 9b10efe419]
2017-07-26 09:38:03 +05:30
Maneesh Gupta f2ee8ffd69 Merge pull request #111 from bensander/master
Enable HCC_OPT_FLUSH=1 (if HCC compiler new enough)

[ROCm/clr commit: c5790aed94]
2017-07-25 12:02:33 +05:30
Ben Sander 0e9272cfd7 Enable HCC_OPT_FLUSH=1 (if HCC compiler new enough)
[ROCm/clr commit: cdc4291431]
2017-07-24 18:57:19 -05:00
sunway513 3015b8a12d Update Jenkinsfile to run all the built tests
[ROCm/clr commit: af8d028aa4]
2017-07-24 19:02:00 +00:00
sunway513 cf850a4bc7 Re-enable hipEnvVarDriver in Jenkinsfile
[ROCm/clr commit: ebd9708dfe]
2017-07-24 18:53:46 +00:00