Maneesh Gupta
1be2dbb150
Add specifying ROCm target support to HIP
...
Change-Id: I42930d8ccf0dafe408ce5bebe2d95f99e544ee8d
2016-07-15 10:08:27 +05:30
Maneesh Gupta
2410d285a2
Merge branch 'release_0.90.00' into amd-develop
...
Conflicts:
bin/hipcc
Change-Id: Iced6290fe205551f5d30f656088530b0ea813193
2016-07-15 10:04:03 +05:30
Maneesh Gupta
b0dc9d208a
Whitespace cleanup in hipcc
...
Change-Id: I36d32808e9e393df44ffd80b49d09ccf68b5751e
2016-07-13 11:26:03 +05:30
Maneesh Gupta
b4b9ab1d28
Cleanup hip_hcc packaging script
...
Change-Id: I41168b3135b36263369b97974a2d2646516e8401
2016-07-12 13:08:21 +05:30
Maneesh Gupta
7d3217a3e3
Fix whitespace in cmakefile
...
Change-Id: Ide671e19ea15d57cc8619ec57ad10bca615b3858
2016-07-12 13:00:42 +05:30
Maneesh Gupta
22941235e1
cmake queries hip version from hipconfig
...
Change-Id: I8c65ca508ddc288111eaa1d818213c72ef27ac1b
2016-07-12 12:40:35 +05:30
Maneesh Gupta
1b184c8527
Move hip version information to hipconfig
...
Change-Id: I2892a9eb9356ac5310b89a433d858c1c3eb986ee
2016-07-11 16:38:41 +05:30
Evgeny Mankov
9f77666802
clang-hipify: Populate replacement rules and fix typos.
...
+ fix typo in hipStreamWaitEvent.
+ hipHostAlloc -> hipHostMalloc.
+ Memory Types, Pointer Attributes are added.
+ Event, Host Malloc and Register, Device and Stream Flags are added.
+ exclude cudacommon.h.prehip from includes' replacement rules.
2016-07-07 18:01:41 +03:00
Maneesh Gupta
41e80065ca
Merge branch 'amd-develop' into amd-master
...
Change-Id: I8743f4653c63b6b19d9f055b2c2df76e1ad185cb
2016-07-05 21:42:43 +05:30
Maneesh Gupta
5947731054
Merge branch 'amd-master' into amd-develop
2016-07-05 21:40:22 +05:30
Maneesh Gupta
589664743b
Merge branch 'hipblas' into amd-master
...
Change-Id: I73bcf3476174b1d01ef7109fe026323fba577ad8
2016-07-05 10:05:43 +05:30
Maneesh Gupta
e936f584f7
Merge remote-tracking branch 'hipblas' into hipblas
2016-07-05 10:04:10 +05:30
Maneesh Gupta
fa41794911
Merge commit 'hipblas~5' into hipblas
2016-07-05 10:03:51 +05:30
Maneesh Gupta
5bb51a3150
don't bundle source in rocm packages
...
Change-Id: I36bd93e2bff9db1310724dd79c6c736ee17aa806
2016-07-05 08:18:30 +05:30
Evgeny Mankov
5d04a17a44
clang-hipify: add Replacement Excludes
...
Excludes are not replaced, for instance, CHECK_CUDA_ERROR and CUDA_SAFE_CALL.
Add check for excludes in MacroExpands and CallExpr routines.
2016-07-01 19:58:14 +03:00
Aditya Atluri
adad6ef71f
added more nvcc event functions
...
Change-Id: I79ee20ef444d4c1ab6ada3c0d56730ce754ab6b6
2016-06-30 21:03:19 -05:00
Maneesh Gupta
2577b6158f
Merge branch 'amd-develop' into amd-master
...
Change-Id: I04f85b207e15e66c1a546675dc0937726ee08362
2016-06-30 18:36:07 +05:30
Fan Cao
dc0a787984
Replace GPU agent with CPU agent properly for memory async copy API
...
ihipStream_t::copySync use GPU agent in memory async copy API, even
if the src/dst memory does not belong to GPU, which cause the hsa
runtime to choose a slower copy engine.
SWDEV-95191
Change-Id: If3cab3d493c0c96ed63721cdcf28247a1193887c
2016-06-30 18:23:29 +05:30
Aditya Atluri
38720f8a4e
moved half support to a source file
...
Change-Id: I7c09b41877e22c1b743dea25a585e5307427dafd
2016-06-30 18:23:29 +05:30
Aditya Atluri
4a6ebce880
added fp16 software support
...
Change-Id: Ic0fdd9f8248a66911169fc00d3af71f50b36e233
2016-06-30 18:23:29 +05:30
Evgeny Mankov
53401727a8
clang-hipify: fix typos and populate replacement rules
...
+ typos in cudaDeviceAttr and cudaDeviceAttr
+ Device Attribules are added
+ More error codes are added
2016-06-29 20:32:05 +03:00
Maneesh Gupta
88ca6f7312
Update release notes
...
Change-Id: I6ae7e5c44b1f7d084abb7cedd353276b81531f82
2016-06-29 20:50:14 +05:30
Maneesh Gupta
4138461080
Update markdown documentation
...
Change-Id: I92cf4411d601e5bbd62ce6c3346cdac0ea2945a8
2016-06-29 20:49:40 +05:30
Evgeny Mankov
f19992760e
clang-hipify: Array of elements of typedef type translation support.
...
Example: cudaStream_t streams[2]
2016-06-28 19:57:08 +03:00
Evgeny Mankov
ab2a7f436d
cublas to hipblas translation support in clang-hipify
...
+ CUBLAS V2 functions support
+ annotating, formatting
2016-06-28 15:33:38 +03:00
7SK
54034e5048
NVCC_COMPAT
...
add support for both cuda compatible implementation and hcc(faster)
implementation with test
Change-Id: I79a22344f458391d7dffac5f147619a542e97e4e
2016-06-28 09:36:06 +05:30
Maneesh Gupta
058cdf2a8c
hip_samples package now contains uncompressed samples
...
Change-Id: I69e773ebeff59733e016abd5a90bd1637798b1f5
2016-06-24 21:17:52 +05:30
Rahul Garg
dc5ca03143
Included code to calculate value of maxThreadsPerMultiprocessor property
...
Change-Id: Ie7cad7442f36a7163e715048de5a309febc28664
2016-06-24 21:17:30 +05:30
Maneesh Gupta
32438e69ff
Merge branch 'amd-develop' into amd-master
...
Change-Id: Ibb383539352e3ad5bcdaaf22f88ec6954b2fb55e
2016-06-24 21:15:44 +05:30
Maneesh Gupta
dca8fca8eb
Merge branch 'amd-master' into amd-develop
2016-06-24 21:13:11 +05:30
Maneesh Gupta
75f14acb79
hip_samples package now contains uncompressed samples
...
Change-Id: I69e773ebeff59733e016abd5a90bd1637798b1f5
2016-06-24 21:10:37 +05:30
Rahul Garg
226aa917e7
Included code to calculate value of maxThreadsPerMultiprocessor property
...
Change-Id: Ie7cad7442f36a7163e715048de5a309febc28664
2016-06-24 15:10:11 +05:30
Evgeny Mankov
aa75bff6ee
Initial cublas to hipblas translation support in clang-hipify.
...
To run clang-hipify with hipblas support please specify corresponding hipblas include directory, for example:
./hipify-clang --print-stats matrixMult.cpp.cuda -- -I/srv/git/HIP/HIP-Examples/hipblas/include
Additionally:
+ typedef translation support is added (cudaEvent_t and cudaStream_t are converted now).
+ anonymous typedef enum translation support.
+ function name macro expansion support.
+ clang options propagation is restored.
P.S. In order to avoid the following error message:
“Could not auto-detect compilation database for file "… .cu"
No compilation database found in /srv/git/HIP/build/clang-hipify or any parent directory
json-compilation-database: Error while opening JSON database: No such file or directory
Running without flags.”
please append “--“ to the end of clang-hipify’s command line.
2016-06-22 19:15:03 +03:00
Aditya Atluri
d2c1fb6f13
added device functions to docs
...
Change-Id: I11ad1af86274620782986e227888b2d5f0544d8f
2016-06-22 14:51:50 +05:30
Ben Sander
5fecff5a59
Use GRID_LAUNCH_VERSION define provided by HCC grid_launch.h
...
Change-Id: I5a7b31f13c4055cdca3a8854728d59db7dc5a7bb
2016-06-21 23:52:25 -05:00
Ben Sander
e27b5cc927
Grid-launch updates to 2.0 and cleanup of old.
...
_ Use fields from GRID_LAUNCH_20 structure
(See USE_GRID_LAUNCH_20 define, currently set to 0)
"1" will require HCC support.
- Remove old DISABLE_GRID_LAUNCH support.
Change-Id: I584ce648d217251789a6283cf27feb24cb7dc8d1
2016-06-21 23:24:38 -05:00
Ben Sander
fefc3f3f1e
Switch to hc_* coordinate builtins (replace amp_)
...
Change-Id: I0a8871f0c9f047eb45a7391fd032100af2bbd4e0
2016-06-21 23:22:38 -05:00
Aditya Atluri
f201f27317
added device functions to docs
...
Change-Id: I11ad1af86274620782986e227888b2d5f0544d8f
2016-06-21 11:11:20 -05:00
Maneesh Gupta
f620c846a9
Merge branch 'amd-develop' into amd-master
...
Change-Id: I8a95f76f270eb6937f3069c90feee07048a43028
2016-06-21 05:58:33 -04:00
Maneesh Gupta
28b905e128
Update hip_samples packaging script
...
Change-Id: I8c33d2f02f02bb0553fd8e42b8ca6a12f7d5a9f7
2016-06-21 15:16:32 +05:30
Maneesh Gupta
d3a0de99a1
Merge branch 'amd-master' into amd-develop
2016-06-21 15:14:04 +05:30
Maneesh Gupta
47b26ff5f0
Merge "Add compiler annotations to make_ functions" into amd-master
2016-06-21 05:34:18 -04:00
Maneesh Gupta
61641cb8e8
Updated version string
...
Change-Id: I86695e5deba87ef3a8d390d9a78a88be815dff0e
2016-06-21 08:39:47 +05:30
Maneesh Gupta
ba29adcffa
Remove generatable doxygen documentation from repo
...
Change-Id: Iaa0aa69189c5a165ec81caa5d01995bec6a856a2
2016-06-21 08:15:42 +05:30
Maneesh Gupta
f8a283a092
Add target 'make doc' to cmakelists.txt
...
Change-Id: I382201b02cf3cc66bc64bd305d7f8549e25c2bb9
2016-06-21 08:13:19 +05:30
Maneesh Gupta
77933298dd
Fix cmake issues in directed tests
...
Change-Id: Ie2ae78022550d5d37c3dbfaecece7d879fb2d37a
2016-06-20 22:00:52 +05:30
Maneesh Gupta
e01e8f562a
hip_doc package updates doxygen documentation
...
Change-Id: I5e1b08d8aa5c87564ee2f0eefea63a689e75bf9a
2016-06-20 14:44:11 +05:30
Maneesh Gupta
c6858bc75d
doxygen input needs HIP_PATH environment variable
...
Change-Id: I05e027b765fde6339693564bb9dcc584a4ce43bd
2016-06-20 13:02:11 +05:30
Maneesh Gupta
2d50e4b9e0
default value of uninitialized dim3 elements should be 1
...
Change-Id: Idff38fac8dfca68f38f1714f8fdec64df2890a6a
2016-06-20 10:13:46 +05:30
Aditya Atluri
ffcfc95360
able to pass non-dim launch parm to kernel launch
...
Change-Id: I0411849a27efcba597a1a9aa08be179635e04988
2016-06-18 11:28:20 -05:00