Michael Kuron
8610128c3e
Merge branch 'master' into getsymboladdress
2018-11-20 12:03:22 +01:00
Aaron Enye Shi
0cfaa52d15
Guard rcp rounded implementation as well
...
Since rcp implementations of non-default rounded versions are not correct or supported in OCML, guard them using the same macro OCML_BASIC_ROUNDED_OPERATIONS. Also update the docs and tests.
2018-11-06 19:53:28 +00:00
Aaron Enye Shi
e59c33250a
Guard the OCML rounded operations instead
...
Instead of commenting all these functions out, guard the functions with a macro OCML_BASIC_ROUNDED_OPERATIONS.
2018-11-06 16:32:14 +00:00
Michael Kuron
49cc703d7a
Test for hipGetSymbolSize and hipGetSymbolAddress
2018-11-06 11:39:34 +01:00
Aaron Enye Shi
2b108a2cfd
Remove non-working non-default-rounded math apis
...
In ROCm-Device-Libs, they have dropped the non-default-rounded versions of add, sub, mul, div, sqrt and fma. Therefore, ocml has removed the rte, rtp, rtn, and rtz counterparts. This will remove the same math APIs in HIP for _ru, _rd, _rn, and _rz.
2018-11-05 22:34:16 +00:00
Alex Voicu
fe959f7bd7
Re-sync with upstream.
2018-10-18 12:27:03 +01:00
Maneesh Gupta
52e320f396
Replace hipLaunchKernel -> hipLaunchKernelGGL
...
Change-Id: I4d99009e1199811d417becf1e1b934ec4d4e30be
2018-10-17 14:32:25 +05:30
Alex Voicu
5312336ce2
Minimal should mean minimal.
2018-10-11 00:21:41 +01:00
Alex Voicu
50265028fc
Trailing whitespace is still not the final frontier.
2018-10-11 00:08:09 +01:00
Alex Voicu
069bbbd7ed
Use HIP math functions, stop using hipLaunchParm.
2018-10-10 11:56:54 +01:00
Alex Voicu
ca375cb8c5
Re-sync with upstream.
2018-10-10 11:43:49 +01:00
Maneesh Gupta
da64156fb2
Implementation for stream priority
...
- Requires ROCm 1.9.x or higher
- Requires HCC with PR#886 merged
Change-Id: Id7c95ea091ee610e80c9ad815f1cb989cba570ca
2018-10-05 16:27:46 +05:30
Maneesh Gupta
5c665acf74
Merge pull request #685 from ROCm-Developer-Tools/hip-trig-return
...
Improve hip_trig test case
2018-09-26 09:50:48 +05:30
Aaron Enye Shi
5179a72cda
Use trig functions from ocml instead
2018-09-25 15:58:36 +00:00
Aaron Enye Shi
3d3820272a
Fix missing HIP_ASSERT in hip_trig
2018-09-24 15:01:24 +00:00
Maneesh Gupta
ca6b916d15
Merge pull request #683 from ROCm-Developer-Tools/hip-float-rounded
...
Disable non-default-rounded functions
2018-09-24 09:32:52 +05:30
Aaron Enye Shi
357b6844fa
Improve hip_trig test case
...
Hip_trig is failing but returning a passing value on HIP-Clang. Also update with debug output and free arrays used.
2018-09-21 15:28:10 -04:00
Maneesh Gupta
255589ae15
Update hipTestFMA.cpp
2018-09-21 11:00:08 +05:30
Aaron Enye Shi
c11220f224
Disable non-default-rounded functions
...
Device library has removed the non-default-rounded functions, so hipFloatMath will fail to build. These include the removal of __ocml_sqrt_rte, __ocml_sqrt_rtn, __ocml_sqrt_rtp, and __ocml_sqrt_rtz. As seen here: https://github.com/RadeonOpenCompute/ROCm-Device-Libs/commit/2fc04e10e1354edee331ce700f98a60f8255effb . Disable these function tests for now, until they are re-enabled, or deleted completely.
2018-09-20 16:33:32 -04:00
Maneesh Gupta
ecd6a212c7
Update hipTestFMA.cpp
...
Fix the test so that it works on nvcc path as well.
2018-09-20 11:23:51 +05:30
Yaxun Sam Liu
bd622a4b4a
Add fma function with float and _Float16 arguments
2018-09-19 09:59:33 -04:00
Rahul Garg
8da3a96f2b
Fixed get alignment offset test for NVCC
...
- Removed dtest hipGetAlgntoffset2D.cpp
- hipGetTextureAlignmentOffset is being tested now in hipBindTexRef1DFetch.cpp
2018-09-18 14:10:03 +05:30
Maneesh Gupta
cca2c5afc8
[dests] Fix hipTestClock, hipTestNew & hipTestGlobalVariable tests for nvcc
...
nvcc does not support global kernels in struct/class
Change-Id: I2d7297e0c3725564215e20dbdd31c0bb8d7a07de
2018-09-17 15:32:05 +05:30
Maneesh Gupta
94a0589eb5
[dtests] Fix deviceLib/hipSimpleAtomicsTest for nvcc
...
Change-Id: I23cbea2820d41da6f6a1bcab4bc3b59ac21799dd
2018-09-17 15:26:45 +05:30
Maneesh Gupta
0b211bb429
[dtests] Fix deviceLib/hipTestIncludeMath on nvcc path
...
Change-Id: Ifb121886bec6c1134e98bd194ff1713f28454d33
2018-09-17 15:15:49 +05:30
Alex Voicu
c6720e882b
Align with HC Next.
2018-09-17 11:50:29 +03:00
Maneesh Gupta
30b1494c3d
Merge pull request #606 from ROCm-Developer-Tools/add-test-global-var
...
Add HIP directed test hipTestGlobalVariable.cpp
2018-09-17 11:11:51 +05:30
Maneesh Gupta
66f863d1f3
Merge branch 'master' into support-malloc
2018-09-17 10:17:25 +05:30
Maneesh Gupta
48cc4f0d9e
Merge pull request #639 from ROCm-Developer-Tools/skudchad-hip
...
Check for hipEnvVar at the same level if directed_tests location fails
2018-09-16 18:31:37 +05:30
Maneesh Gupta
9b2107749c
Revert changes to runtime/stream/hipStreamSync2
2018-09-15 13:23:38 +05:30
Maneesh Gupta
cb348421d7
Merge pull request #650 from ROCm-Developer-Tools/hip-clang-new
...
Support placement new in hip-clang
2018-09-15 11:21:01 +05:30
Maneesh Gupta
87bf9aa9d1
Merge pull request #658 from phani544/hipClangFrndClass
...
[HIP-Clang]Add friend class/function test
2018-09-14 08:59:40 +05:30
fpadmin
a8bc26344f
[HIP-Clang]Add friend class/function test
2018-09-05 11:48:49 +05:30
Yaxun (Sam) Liu
338eaefa84
Support placement new in hip-clang
2018-09-04 16:53:46 -04:00
Maneesh Gupta
a3a93a68d4
Merge pull request #657 from yxsamliu/fix-clz
...
Fix ambiguity of __clz and __clzll
2018-09-03 16:34:42 +05:30
Maneesh Gupta
7459defec1
Merge pull request #651 from yxsamliu/hip-clang-clock
...
Add clock() and clock64()
2018-09-03 16:32:30 +05:30
Maneesh Gupta
943c4d874c
Merge pull request #640 from aaronenyeshi/hip-clang-add-bit-funcs
...
Add bitextract and bitinsert functions
2018-09-03 16:31:42 +05:30
Yaxun Sam Liu
11c91c2fdd
Fix ambiguity of __clz and __clzll
2018-08-31 18:07:57 -04:00
Saleel Kudchadker
1daee67eb6
Fix record_event and hipStreamSync2 tests. The test should expect null stream to complete if synchrionize is called as per the spec
2018-08-31 12:54:23 -07:00
Saleel Kudchadker
cd926a44b7
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into skudchad-hip
2018-08-31 10:38:41 -07:00
Maneesh Gupta
5ab1d757e4
Merge pull request #564 from phani544/hipClangClass8B
...
Adding class size>8bytes tests
2018-08-30 10:33:47 +05:30
Aaron Enye Shi
bbcb3fa0c0
Remove testing bitinsert/extract on nvcc path
2018-08-29 15:29:53 +00:00
Yaxun Sam Liu
1f74a66bea
Add clock() and clock64()
2018-08-29 10:51:00 -04:00
fpadmin
0022f53a10
Adding class size>8bytes tests
2018-08-29 15:16:43 +05:30
Aaron Enye Shi
5893bbc1f4
Add tests for bitextract and bitinsert
2018-08-28 20:01:54 +00:00
Saleel Kudchadker
8e0e373f69
Check for hipEnvVar at the same level if directed_tests location fails
2018-08-17 12:14:42 -07:00
Yaxun Sam Liu
a6c7aeed72
Add HIP directed test hipTestGlobalVariable.cpp
2018-08-10 08:15:36 -04:00
Yaxun Sam Liu
b8a0c14171
Fix __HIP_ARCH_* not defined after including math_functions.h
...
hcc_detail/math_functions.h used to include hcc_detail/hip_runtime.h.
Removing it has caused regression in TensorFlow 1.8.
Put it back for backward compatibiliity.
2018-08-08 08:55:28 -04:00
Yaxun Sam Liu
450ea49df0
Fix __HIP_DEVICE_COMPILE__ not defined when hip/math_functions.h is included
...
This fixes build failure in TensorFlow 1.8 for HCC
2018-08-07 17:14:39 -04:00
Yaxun Sam Liu
dacb18414e
Fix declaration conflict when hip/math_functions.h is included first
...
This fixes build failure in TensorFlow 1.8 for HCC
2018-08-07 15:44:59 -04:00