Fan Cao
eb0db4e549
Replace GPU agent with CPU agent properly for memory async copy API
...
ihipStream_t::copySync uses the GPU agent in the memory async copy API
even if the src/dst memory does not belong to the GPU, which causes the
HSA runtime to choose a slower copy engine.
SWDEV-95191
Change-Id: If3cab3d493c0c96ed63721cdcf28247a1193887c
[ROCm/clr commit: dc0a787984 ]
2016-06-30 18:23:29 +05:30
Aditya Atluri
cf3186cd63
moved half support to a source file
...
Change-Id: I7c09b41877e22c1b743dea25a585e5307427dafd
[ROCm/clr commit: 38720f8a4e ]
2016-06-30 18:23:29 +05:30
Aditya Atluri
29c5bf181a
added fp16 software support
...
Change-Id: Ic0fdd9f8248a66911169fc00d3af71f50b36e233
[ROCm/clr commit: 4a6ebce880 ]
2016-06-30 18:23:29 +05:30
Maneesh Gupta
28915dd491
Merge branch 'amd-master' into amd-develop
...
[ROCm/clr commit: dca8fca8eb ]
2016-06-24 21:13:11 +05:30
Maneesh Gupta
6cf55619f3
hip_samples package now contains uncompressed samples
...
Change-Id: I69e773ebeff59733e016abd5a90bd1637798b1f5
[ROCm/clr commit: 75f14acb79 ]
2016-06-24 21:10:37 +05:30
Rahul Garg
38987ce023
Included code to calculate value of maxThreadsPerMultiprocessor property
...
Change-Id: Ie7cad7442f36a7163e715048de5a309febc28664
[ROCm/clr commit: 226aa917e7 ]
2016-06-24 15:10:11 +05:30
Ben Sander
5edb4185a2
Use GRID_LAUNCH_VERSION define provided by HCC grid_launch.h
...
Change-Id: I5a7b31f13c4055cdca3a8854728d59db7dc5a7bb
[ROCm/clr commit: 5fecff5a59 ]
2016-06-21 23:52:25 -05:00
Ben Sander
050430789c
Grid-launch updates to 2.0 and cleanup of old.
...
- Use fields from GRID_LAUNCH_20 structure
(See USE_GRID_LAUNCH_20 define, currently set to 0)
"1" will require HCC support.
- Remove old DISABLE_GRID_LAUNCH support.
Change-Id: I584ce648d217251789a6283cf27feb24cb7dc8d1
[ROCm/clr commit: e27b5cc927 ]
2016-06-21 23:24:38 -05:00
Ben Sander
5e376bf121
Switch to hc_* coordinate builtins (replace amp_)
...
Change-Id: I0a8871f0c9f047eb45a7391fd032100af2bbd4e0
[ROCm/clr commit: fefc3f3f1e ]
2016-06-21 23:22:38 -05:00
Aditya Atluri
041b0a544e
added device functions to docs
...
Change-Id: I11ad1af86274620782986e227888b2d5f0544d8f
[ROCm/clr commit: f201f27317 ]
2016-06-21 11:11:20 -05:00
Maneesh Gupta
1f31d0a49e
Update hip_samples packaging script
...
Change-Id: I8c33d2f02f02bb0553fd8e42b8ca6a12f7d5a9f7
[ROCm/clr commit: 28b905e128 ]
2016-06-21 15:16:32 +05:30
Maneesh Gupta
22edc8805a
Merge branch 'amd-master' into amd-develop
...
[ROCm/clr commit: d3a0de99a1 ]
2016-06-21 15:14:04 +05:30
Maneesh Gupta
72feba608b
Merge "Add compiler annotations to make_ functions" into amd-master
...
[ROCm/clr commit: 47b26ff5f0 ]
2016-06-21 05:34:18 -04:00
Maneesh Gupta
2f54161c11
Updated version string
...
Change-Id: I86695e5deba87ef3a8d390d9a78a88be815dff0e
[ROCm/clr commit: 61641cb8e8 ]
2016-06-21 08:39:47 +05:30
Maneesh Gupta
6d87abcf20
Remove generatable doxygen documentation from repo
...
Change-Id: Iaa0aa69189c5a165ec81caa5d01995bec6a856a2
[ROCm/clr commit: ba29adcffa ]
2016-06-21 08:15:42 +05:30
Maneesh Gupta
b593fe4012
Add target 'make doc' to cmakelists.txt
...
Change-Id: I382201b02cf3cc66bc64bd305d7f8549e25c2bb9
[ROCm/clr commit: f8a283a092 ]
2016-06-21 08:13:19 +05:30
Maneesh Gupta
df12d5a0f9
Fix cmake issues in directed tests
...
Change-Id: Ie2ae78022550d5d37c3dbfaecece7d879fb2d37a
[ROCm/clr commit: 77933298dd ]
2016-06-20 22:00:52 +05:30
Maneesh Gupta
a2df45b1da
hip_doc package updates doxygen documentation
...
Change-Id: I5e1b08d8aa5c87564ee2f0eefea63a689e75bf9a
[ROCm/clr commit: e01e8f562a ]
2016-06-20 14:44:11 +05:30
Maneesh Gupta
2c637d4259
doxygen input needs HIP_PATH environment variable
...
Change-Id: I05e027b765fde6339693564bb9dcc584a4ce43bd
[ROCm/clr commit: c6858bc75d ]
2016-06-20 13:02:11 +05:30
Maneesh Gupta
a0006c5df0
default value of uninitialized dim3 elements should be 1
...
Change-Id: Idff38fac8dfca68f38f1714f8fdec64df2890a6a
[ROCm/clr commit: 2d50e4b9e0 ]
2016-06-20 10:13:46 +05:30
Aditya Atluri
adace999e7
able to pass non-dim launch parm to kernel launch
...
Change-Id: I0411849a27efcba597a1a9aa08be179635e04988
[ROCm/clr commit: ffcfc95360 ]
2016-06-18 11:28:20 -05:00
Maneesh Gupta
7540ec158f
Disable hipMemcpyAll test
...
Change-Id: I58ff2e1baa91e3af335dd869ce4f6d56cd81028c
[ROCm/clr commit: 763dd0cb76 ]
2016-06-18 15:48:42 +05:30
Maneesh Gupta
63936a27b3
Disable running tests that are not being built
...
Change-Id: Ic1ae492780f4c7c6315e815e0b3c291b354927fd
[ROCm/clr commit: 0e3128c1e5 ]
2016-06-18 15:48:14 +05:30
Ben Sander
caf2bc8208
Fix test project names
...
Change-Id: I105c3759208ec832f3bdf6068029b4bfa2827909
[ROCm/clr commit: 20a164be6a ]
2016-06-17 15:18:57 -05:00
Ben Sander
12b1c634ef
Initial release notes for 0.95.
...
Change-Id: I6d43f643f4c6ecd4b4e4434c5cc1dfecf540a6d3
[ROCm/clr commit: c82cd5822c ]
2016-06-17 15:18:57 -05:00
Ben Sander
a03c51dc9e
Clean up old work-week and USE_* refs
...
Change-Id: I929c979fa085f8e5205194cbccca46e9b5516aa9
[ROCm/clr commit: 44d117ba63 ]
2016-06-17 15:18:57 -05:00
Ben Sander
b31b23ff1d
Refactor directed test infrastructure.
...
- Add hierarchy. Tests now live in directories, each with its own
CMakeLists.txt. Reduces merge conflicts.
- Change make_hip_executable -> build_hip_executable.
- Refresh docs.
- Enable some tests that were previously built but not run.
Change-Id: I8c5de3c954400bf233904282b8b42861a2b7c536
[ROCm/clr commit: 3feb13c8f6 ]
2016-06-17 15:18:57 -05:00
Ben Sander
d2d921f7be
Fix hierarchical tests. Set compiler before project to avoid loop
...
Change-Id: Icede4e76cb47820b39403a7f1ba3f4bd3f47ed03
[ROCm/clr commit: 883d733174 ]
2016-06-17 15:18:57 -05:00
Aditya Atluri
1e390cd4af
added tests for host math functions
...
Change-Id: I66a5c574a27190e32054586f07ecf20e1ff71292
[ROCm/clr commit: ba262ea855 ]
2016-06-17 15:05:33 -05:00
Aditya Atluri
f13c55fb2a
added copyright
...
Change-Id: Ic77c09ff11e031836f4c3d66c9b8ba1894d2e374
[ROCm/clr commit: c07646fddd ]
2016-06-17 11:51:36 -05:00
Maneesh Gupta
75ecf38bb7
hipLaunchKernel supports empty kernel with no arguments
...
Change-Id: Iaa46ced7fe0e3d72734d02250dfb48b98bef615d
[ROCm/clr commit: 275c5516b6 ]
2016-06-17 14:11:46 +05:30
Aditya Atluri
6837ae644c
added bessel nth order function
...
Change-Id: I18a64d894dda9330b39638535dfafd7ce31bb968
[ROCm/clr commit: 75fc024308 ]
2016-06-17 09:22:23 +05:30
Ben Sander
ac73c9331c
Resolve cmake infinite loop issue
...
Change-Id: I8af4eb8dee754c97b72c108d4613071b0de023bd
[ROCm/clr commit: debddd9b45 ]
2016-06-16 19:43:07 -05:00
Ben Sander
e317f07cb3
Add NVCC implementation for hipPeekAtLastError
...
[ROCm/clr commit: 7492351a3c ]
2016-06-16 09:38:34 -05:00
Ben Sander
62c5658e93
remove old KERNELBEGIN/KERNELEND macros
...
[ROCm/clr commit: dbffc7bba2 ]
2016-06-16 09:38:07 -05:00
Ben Sander
4679da2c45
Add include guards for HIP_PLATFORM_NVCC
...
[ROCm/clr commit: c2f9fb32a1 ]
2016-06-16 09:37:10 -05:00
Ben Sander
a8cba75ce4
Update launch_bounds test
...
[ROCm/clr commit: 46e27c7568 ]
2016-06-16 09:29:03 -05:00
Ben Sander
c6ace6f647
Add first steps for CMake test hierarchy + initial launch_bounds.
...
[ROCm/clr commit: 71b9369256 ]
2016-06-16 08:41:32 -05:00
Maneesh Gupta
475fd56684
Add compiler annotations to make_ functions
...
Change-Id: Ic16cfe402dc5cb387b1b63ad311ce8b57ea07361
[ROCm/clr commit: bdfd943b4a ]
2016-06-16 16:18:35 +05:30
Ben Sander
c46b2e7450
NVCC improvements.
...
- Complete translation tables for cudaError <-> hipError_t.
- Remove some odd errors that were not correctly translated or not used.
- Add HIPCHECK_API to test infrastructure. Used for negative testing
an API; if a mismatch occurs it shows the expected return error
code. Can also print a warning rather than error.
- Enable hipMemoryAllocate on NV system, and review error codes.
- Add hipErrorName to nvcc.
Change-Id: I680427dcf32a5796d5913cf9e7f3b4c6f6b91599
Conflicts:
tests/src/CMakeLists.txt
Bug fixes and improved docs for hipFree and hipHostFree.
- Passing a NULL pointer initializes the runtime and returns hipSuccess
(not an error like before).
- add negative test for this. (hipMemoryAllocate, improved)
- Match NVCC errors for invalid pointers, add to test.
- Update hipFree and hipHostFree docs.
- hipGetDevicePointer always sets *devicePointer=NULL, even for
invalid flags.
- Gate shared memory usage on specific HCC work-week.
Change-Id: I533b4fd3280a3d6cdbf05eb768976f0c7506c012
[ROCm/clr commit: 6a2a140f34 ]
2016-06-16 06:13:51 +05:30
Aditya Atluri
d01b51eab9
added more host functions and tests
...
Change-Id: I9904e65e14c5479ba33d836c5c0b763cb5af71e3
[ROCm/clr commit: c4e667cf90 ]
2016-06-15 11:45:19 -05:00
Maneesh Gupta
cf468b9a7b
Merge "added host device functions" into amd-master
...
[ROCm/clr commit: 8755f2f1bc ]
2016-06-15 02:45:55 -04:00
Maneesh Gupta
8b9f354f60
Merge "cmake picks HIP version from hipcc" into amd-master
...
[ROCm/clr commit: 065b2f532f ]
2016-06-15 01:47:18 -04:00
Maneesh Gupta
e003279b6e
Merge "hipcc supports printing HIP version" into amd-master
...
[ROCm/clr commit: 35624f33f9 ]
2016-06-15 01:46:11 -04:00
Aditya Atluri
f56bda32bf
added host device functions
...
Change-Id: I8f299752fb8dd8e8947da62e4ad88842c1c19f62
[ROCm/clr commit: bb02880a12 ]
2016-06-14 18:14:44 -05:00
Maneesh Gupta
862ee762cd
cmake picks HIP version from hipcc
...
Change-Id: I334d9573bb6d60fe907a13083486f06504a621e4
[ROCm/clr commit: ea953cb76a ]
2016-06-14 14:57:56 +05:30
Maneesh Gupta
4009b60b44
hipcc supports printing HIP version
...
Change-Id: I9f1cd9a29ded9d354b9ebe265dcd91b060e8fc08
[ROCm/clr commit: 29ca733be2 ]
2016-06-14 14:51:03 +05:30
Aditya Atluri
4d16bb272e
added bessel zero and one order functions
...
Change-Id: I57039d54eae7207db00415bc7ba09bbf9cb6425a
[ROCm/clr commit: ce52a8f70c ]
2016-06-14 11:50:48 +05:30
Maneesh Gupta
bdfa974253
Merge "added erfinv software implementation" into amd-master
...
[ROCm/clr commit: 1dbb2f5205 ]
2016-06-14 02:13:01 -04:00
Maneesh Gupta
46e60b1bd1
Merge "hipDispatchLatency: reduce iterations to 5120" into amd-master
...
[ROCm/clr commit: 37257dda35 ]
2016-06-14 00:28:12 -04:00