Wykres commitów

714 Commity

Autor SHA1 Wiadomość Data
Maneesh Gupta 1be2dbb150 Add specifying ROCm target support to HIP
Change-Id: I42930d8ccf0dafe408ce5bebe2d95f99e544ee8d
2016-07-15 10:08:27 +05:30
Maneesh Gupta 2410d285a2 Merge branch 'release_0.90.00' into amd-develop
Conflicts:
	bin/hipcc

Change-Id: Iced6290fe205551f5d30f656088530b0ea813193
2016-07-15 10:04:03 +05:30
Maneesh Gupta b0dc9d208a Whitespace cleanup in hipcc
Change-Id: I36d32808e9e393df44ffd80b49d09ccf68b5751e
2016-07-13 11:26:03 +05:30
Maneesh Gupta b4b9ab1d28 Cleanup hip_hcc packaging script
Change-Id: I41168b3135b36263369b97974a2d2646516e8401
2016-07-12 13:08:21 +05:30
Maneesh Gupta 7d3217a3e3 Fix whitespace in cmakefile
Change-Id: Ide671e19ea15d57cc8619ec57ad10bca615b3858
2016-07-12 13:00:42 +05:30
Maneesh Gupta 22941235e1 cmake queries hip version from hipconfig
Change-Id: I8c65ca508ddc288111eaa1d818213c72ef27ac1b
2016-07-12 12:40:35 +05:30
Maneesh Gupta 1b184c8527 Move hip version information to hipconfig
Change-Id: I2892a9eb9356ac5310b89a433d858c1c3eb986ee
2016-07-11 16:38:41 +05:30
Evgeny Mankov 9f77666802 clang-hipify: Populate replacement rules and fix typos.
+ fix typo in hipStreamWaitEvent.
+ hipHostAlloc -> hipHostMalloc.
+ Memory Types, Pointer Attributes are added.
+ Event, Host Malloc and Register, Device and Stream Flags are added.
+ exclude cudacommon.h.prehip from includes' replacement rules.
2016-07-07 18:01:41 +03:00
Maneesh Gupta 41e80065ca Merge branch 'amd-develop' into amd-master
Change-Id: I8743f4653c63b6b19d9f055b2c2df76e1ad185cb
2016-07-05 21:42:43 +05:30
Maneesh Gupta 5947731054 Merge branch 'amd-master' into amd-develop 2016-07-05 21:40:22 +05:30
Maneesh Gupta 589664743b Merge branch 'hipblas' into amd-master
Change-Id: I73bcf3476174b1d01ef7109fe026323fba577ad8
2016-07-05 10:05:43 +05:30
Maneesh Gupta e936f584f7 Merge remote-tracking branch 'hipblas' into hipblas 2016-07-05 10:04:10 +05:30
Maneesh Gupta fa41794911 Merge commit 'hipblas~5' into hipblas 2016-07-05 10:03:51 +05:30
Maneesh Gupta 5bb51a3150 don't bundle source in rocm packages
Change-Id: I36bd93e2bff9db1310724dd79c6c736ee17aa806
2016-07-05 08:18:30 +05:30
Evgeny Mankov 5d04a17a44 clang-hipify: add Replacement Excludes
Excludes are not replaced, for instance, CHECK_CUDA_ERROR and CUDA_SAFE_CALL.
Add check for excludes in MacroExpands and CallExpr routines.
2016-07-01 19:58:14 +03:00
Aditya Atluri adad6ef71f added more nvcc event functions
Change-Id: I79ee20ef444d4c1ab6ada3c0d56730ce754ab6b6
2016-06-30 21:03:19 -05:00
Maneesh Gupta 2577b6158f Merge branch 'amd-develop' into amd-master
Change-Id: I04f85b207e15e66c1a546675dc0937726ee08362
2016-06-30 18:36:07 +05:30
Fan Cao dc0a787984 Replace GPU agent with CPU agent properly for memory async copy API
ihipStream_t::copySync use GPU agent in memory async copy API, even
if the src/dst memory does not belong to GPU, which cause the hsa
runtime to choose a slower copy engine.

SWDEV-95191

Change-Id: If3cab3d493c0c96ed63721cdcf28247a1193887c
2016-06-30 18:23:29 +05:30
Aditya Atluri 38720f8a4e moved half support to a source file
Change-Id: I7c09b41877e22c1b743dea25a585e5307427dafd
2016-06-30 18:23:29 +05:30
Aditya Atluri 4a6ebce880 added fp16 software support
Change-Id: Ic0fdd9f8248a66911169fc00d3af71f50b36e233
2016-06-30 18:23:29 +05:30
Evgeny Mankov 53401727a8 clang-hipify: fix typos and populate replacement rules
+ typos in cudaDeviceAttr and cudaDeviceAttr
+ Device Attribules are added
+ More error codes are added
2016-06-29 20:32:05 +03:00
Maneesh Gupta 88ca6f7312 Update release notes
Change-Id: I6ae7e5c44b1f7d084abb7cedd353276b81531f82
2016-06-29 20:50:14 +05:30
Maneesh Gupta 4138461080 Update markdown documentation
Change-Id: I92cf4411d601e5bbd62ce6c3346cdac0ea2945a8
2016-06-29 20:49:40 +05:30
Evgeny Mankov f19992760e clang-hipify: Array of elements of typedef type translation support.
Example: cudaStream_t streams[2]
2016-06-28 19:57:08 +03:00
Evgeny Mankov ab2a7f436d cublas to hipblas translation support in clang-hipify
+ CUBLAS V2 functions support
+ annotating, formatting
2016-06-28 15:33:38 +03:00
7SK 54034e5048 NVCC_COMPAT
add support for both cuda compatible implementation and hcc(faster)
implementation with test

Change-Id: I79a22344f458391d7dffac5f147619a542e97e4e
2016-06-28 09:36:06 +05:30
Maneesh Gupta 058cdf2a8c hip_samples package now contains uncompressed samples
Change-Id: I69e773ebeff59733e016abd5a90bd1637798b1f5
2016-06-24 21:17:52 +05:30
Rahul Garg dc5ca03143 Included code to calculate value of maxThreadsPerMultiprocessor property
Change-Id: Ie7cad7442f36a7163e715048de5a309febc28664
2016-06-24 21:17:30 +05:30
Maneesh Gupta 32438e69ff Merge branch 'amd-develop' into amd-master
Change-Id: Ibb383539352e3ad5bcdaaf22f88ec6954b2fb55e
2016-06-24 21:15:44 +05:30
Maneesh Gupta dca8fca8eb Merge branch 'amd-master' into amd-develop 2016-06-24 21:13:11 +05:30
Maneesh Gupta 75f14acb79 hip_samples package now contains uncompressed samples
Change-Id: I69e773ebeff59733e016abd5a90bd1637798b1f5
2016-06-24 21:10:37 +05:30
Rahul Garg 226aa917e7 Included code to calculate value of maxThreadsPerMultiprocessor property
Change-Id: Ie7cad7442f36a7163e715048de5a309febc28664
2016-06-24 15:10:11 +05:30
Evgeny Mankov aa75bff6ee Initial cublas to hipblas translation support in clang-hipify.
To run clang-hipify with hipblas support please specify corresponding hipblas include directory, for example:

./hipify-clang --print-stats matrixMult.cpp.cuda -- -I/srv/git/HIP/HIP-Examples/hipblas/include

Additionally:
+ typedef translation support is added (cudaEvent_t and cudaStream_t are converted now).
+ anonymous typedef enum translation support.
+ function name macro expansion support.
+ clang options propagation is restored.
P.S. In order to avoid the following error message:
“Could not auto-detect compilation database for file "… .cu"
No compilation database found in /srv/git/HIP/build/clang-hipify or any parent directory
json-compilation-database: Error while opening JSON database: No such file or directory
Running without flags.”
please append “--“ to the end of clang-hipify’s command line.
2016-06-22 19:15:03 +03:00
Aditya Atluri d2c1fb6f13 added device functions to docs
Change-Id: I11ad1af86274620782986e227888b2d5f0544d8f
2016-06-22 14:51:50 +05:30
Ben Sander 5fecff5a59 Use GRID_LAUNCH_VERSION define provided by HCC grid_launch.h
Change-Id: I5a7b31f13c4055cdca3a8854728d59db7dc5a7bb
2016-06-21 23:52:25 -05:00
Ben Sander e27b5cc927 Grid-launch updates to 2.0 and cleanup of old.
_ Use fields from GRID_LAUNCH_20 structure
  (See USE_GRID_LAUNCH_20 define, currently set to 0)
  "1" will require HCC support.
- Remove old DISABLE_GRID_LAUNCH support.

Change-Id: I584ce648d217251789a6283cf27feb24cb7dc8d1
2016-06-21 23:24:38 -05:00
Ben Sander fefc3f3f1e Switch to hc_* coordinate builtins (replace amp_)
Change-Id: I0a8871f0c9f047eb45a7391fd032100af2bbd4e0
2016-06-21 23:22:38 -05:00
Aditya Atluri f201f27317 added device functions to docs
Change-Id: I11ad1af86274620782986e227888b2d5f0544d8f
2016-06-21 11:11:20 -05:00
Maneesh Gupta f620c846a9 Merge branch 'amd-develop' into amd-master
Change-Id: I8a95f76f270eb6937f3069c90feee07048a43028
2016-06-21 05:58:33 -04:00
Maneesh Gupta 28b905e128 Update hip_samples packaging script
Change-Id: I8c33d2f02f02bb0553fd8e42b8ca6a12f7d5a9f7
2016-06-21 15:16:32 +05:30
Maneesh Gupta d3a0de99a1 Merge branch 'amd-master' into amd-develop 2016-06-21 15:14:04 +05:30
Maneesh Gupta 47b26ff5f0 Merge "Add compiler annotations to make_ functions" into amd-master 2016-06-21 05:34:18 -04:00
Maneesh Gupta 61641cb8e8 Updated version string
Change-Id: I86695e5deba87ef3a8d390d9a78a88be815dff0e
2016-06-21 08:39:47 +05:30
Maneesh Gupta ba29adcffa Remove generatable doxygen documentation from repo
Change-Id: Iaa0aa69189c5a165ec81caa5d01995bec6a856a2
2016-06-21 08:15:42 +05:30
Maneesh Gupta f8a283a092 Add target 'make doc' to cmakelists.txt
Change-Id: I382201b02cf3cc66bc64bd305d7f8549e25c2bb9
2016-06-21 08:13:19 +05:30
Maneesh Gupta 77933298dd Fix cmake issues in directed tests
Change-Id: Ie2ae78022550d5d37c3dbfaecece7d879fb2d37a
2016-06-20 22:00:52 +05:30
Maneesh Gupta e01e8f562a hip_doc package updates doxygen documentation
Change-Id: I5e1b08d8aa5c87564ee2f0eefea63a689e75bf9a
2016-06-20 14:44:11 +05:30
Maneesh Gupta c6858bc75d doxygen input needs HIP_PATH environment variable
Change-Id: I05e027b765fde6339693564bb9dcc584a4ce43bd
2016-06-20 13:02:11 +05:30
Maneesh Gupta 2d50e4b9e0 default value of uninitialized dim3 elements should be 1
Change-Id: Idff38fac8dfca68f38f1714f8fdec64df2890a6a
2016-06-20 10:13:46 +05:30
Aditya Atluri ffcfc95360 able to pass non-dim launch parm to kernel launch
Change-Id: I0411849a27efcba597a1a9aa08be179635e04988
2016-06-18 11:28:20 -05:00