Deven Desai
76a7d7e374
Fixing a copy/paste error in my previous checkin
2018-04-27 12:59:51 +00:00
Deven Desai
4be4cf644f
Checkin to add unit tests for fixes in my previous commit
...
This change adds unit tests for
1. __hisinf
2. __hisnan
2. abs(long long)
3. lgamma(double)
2018-04-25 17:53:21 +00:00
Deven Desai
af586bbbf2
Checkin to fix bugs in math functions.
...
This change fixes the following bugs that were discovered while debuggnig TF unit test failures (cwise_ops_test)
1. __hisinf and __hisnan routines
Both had incorrect implementations.
2. abs
A "long long" (64bit int) version was missing, resulting in the 32bit version being used for 64bit ints (which resulted in incorrect results, when the value passed in was outside the 32bit int range)
3. lgamma
We seemed to have a custom version for the 'double' datatype (which was giving incorrect results). Replaced it with a call to the 'double' version of the underlying 'hc::precision_math::lgamma'
2018-04-24 18:10:07 +00:00
Evgeny Mankov
b36001bbbf
Merge pull request #413 from emankov/master
...
[HIPIFY] undo apply .clangformat to HIPIFY source files
2018-04-23 20:29:43 +03:00
Evgeny Mankov
8499e79e2b
[HIPIFY] undo apply .clangformat to HIPIFY source files
2018-04-23 20:22:15 +03:00
Maneesh Gupta
35a8deec00
Merge pull request #404 from gargrahul/hipMemset2DAsync_support
...
Added hipMemset2DAsync support
2018-04-18 14:19:41 +05:30
Rahul Garg
3cfb9c0d40
Added hipMemset2DAsync support
2018-04-17 18:27:27 +05:30
Maneesh Gupta
33ada06de9
Merge pull request #400 from gargrahul/hipModule_cleanup
...
hip_module code cleanup
2018-04-17 09:00:15 +05:30
Aaron Enye Shi
5a80c16cd2
Clean up triple_to_hsa_isa naming ( #403 )
2018-04-17 06:47:55 +05:30
Rahul Garg
1446f78799
hip_module code cleanup
...
-Fixed missing ihipLogStatus in hipModuleLoad()
-Fixed some ihipXXX functions
2018-04-16 15:35:04 +05:30
Aaron Enye Shi
1a281240d6
Fix bug when using older rocr naming ( #399 )
2018-04-14 15:47:46 +05:30
Aaron Enye Shi
ab7e727fa2
Fix new gfx target naming convention ( #397 )
...
This was introduced in recent xnack changes naming changes which affected our offload bundler. This patch will fix issues in HIP samples, ROCR tests, and others.
2018-04-13 16:24:39 +05:30
Maneesh Gupta
bda18af06d
Merge pull request #395 from lmoriche/master
...
Add missing __device__ function attributes.
2018-04-12 14:51:32 +05:30
Maneesh Gupta
97f67ced66
Merge pull request #393 from mangupta/cmake_update_deps
...
[cmake] Update target build dependencies
2018-04-12 14:36:49 +05:30
Maneesh Gupta
b1057f0be6
Merge pull request #394 from gargrahul/hipMemset_refactoring
...
hipMemset refactoring
2018-04-12 14:01:49 +05:30
Rahul Garg
16c89d101a
Correct missed ihipMemsetCopyDataType change
2018-04-12 10:27:19 +05:30
Rahul Garg
3d6eb75828
Changed ihipMemsetCopyDataType to ihipMemsetDataType
2018-04-12 09:29:22 +05:30
Laurent Morichetti
211c18e674
Add missing __device__ function attributes.
2018-04-11 09:29:37 -07:00
Rahul Garg
294bf50f68
Fix hipMemset stream resolution
2018-04-11 19:01:53 +05:30
Rahul Garg
412a35be20
hipMemset refactoring
2018-04-11 15:58:48 +05:30
Maneesh Gupta
b1be325745
[cmake] Update target build dependencies
...
Change-Id: Ib1c34135c0d04f88daa695e7a90272225d7e64b0
2018-04-11 15:30:13 +05:30
Maneesh Gupta
7c43396596
Merge pull request #391 from lmoriche/master
...
Create one hipModule_t per device
2018-04-11 13:06:58 +05:30
Laurent Morichetti
91f996035c
Create one hipModule_t per device
2018-04-10 08:57:34 -07:00
Maneesh Gupta
399a17d755
Merge pull request #390 from mangupta/hipmemcpy_fix
...
hipMemcpyAsync returns success when trying to copy 0 bytes
2018-04-09 14:02:36 +05:30
Maneesh Gupta
ac7c5dfd0b
Merge pull request #388 from lmoriche/master
...
Add support for clang offload bundles and <<<>>> kernel launch.
2018-04-09 12:40:28 +05:30
Maneesh Gupta
03eca1c57e
hipMemcpyAsync returns success when trying to copy 0 bytes
...
Change-Id: I4c0ee7ccc7563e2df657b50356cdd7fec9a1ef15
2018-04-09 12:39:44 +05:30
Maneesh Gupta
4ac59148a9
[ci] Disable nvcc CI builds
2018-04-09 12:23:56 +05:30
Maneesh Gupta
d98eaf5923
Merge pull request #389 from scchan/adjust_texture_hcc_workweek
...
adjust the hcc work week for the texture fix
2018-04-09 10:54:16 +05:30
Siu Chi Chan
f48e471dd4
adjust the hcc work week for the texture fix
2018-04-06 15:46:55 -04:00
Laurent Morichetti
bc6fa2bc62
Don't include __clang_cuda_builtin_vars.h
2018-04-05 16:27:05 -07:00
Laurent Morichetti
0c1893ea62
Add support for clang offload bundles and <<<>>> kernel launch.
2018-04-04 13:32:33 -07:00
Maneesh Gupta
b2c5f7a3e9
Merge pull request #387 from mangupta/cleanup_bin
...
Cleanup bin directory
2018-04-04 15:06:57 +05:30
Maneesh Gupta
228fae1224
Cleanup bin directory
...
Remove hccgenco.sh and hsa-api-filter-cxl.txt
Change-Id: I5942aec911a2fe6c871fe0eb5f44a0b243552b79
2018-04-04 14:31:40 +05:30
Maneesh Gupta
0785e75150
Merge pull request #383 from gargrahul/remove_hcc_check_texref_sample
...
Remove textureObj kernel argument for HIP/HCC path
2018-04-04 10:35:58 +05:30
Maneesh Gupta
860249194a
Merge pull request #384 from yxsamliu/patch-1
...
Update HIP language spec to support both <<< >>> kernel launching mec…
2018-04-04 10:35:34 +05:30
Maneesh Gupta
fdcc6558f4
Merge pull request #385 from gargrahul/fix_texture_trsf_flag
...
Fix texture driver api TRFS flags
2018-04-04 10:34:09 +05:30
Maneesh Gupta
69e1185ee7
Merge pull request #386 from ROCm-Developer-Tools/mangupta-patch-4
...
[ci] Re-enable testing against hcc_1.7
2018-04-04 10:01:41 +05:30
Maneesh Gupta
d9cb38c764
[ci] Re-enable testing against hcc_1.7
2018-04-03 14:04:33 +05:30
Rahul Garg
62fc6c85e0
Fix texture driver api TRFS flags
2018-04-02 21:47:20 +05:30
Yaxun (Sam) Liu
cf78d85638
Update HIP language spec to support both <<< >>> kernel launching mechanism and hipLaunchKernel
2018-04-02 11:58:32 -04:00
Rahul Garg
017a6fb059
Remove textureObj kernel argument for HIP/HCC path
2018-03-28 15:36:57 +05:30
Maneesh Gupta
0b25a2cb40
Merge pull request #382 from scchan/fix_texture_addr_space
...
Change constant address space to 4
2018-03-27 07:48:50 +05:30
Siu Chi Chan
0d8b97274a
Change constant address space to 4
...
when compiling with a newer hcc. This is
due to a recent change to address space
mapping in the amdgpu compiler backend.
2018-03-26 18:13:59 -04:00
Maneesh Gupta
dbd818c4bd
Merge pull request #379 from lmoriche/master
...
Add HIP_KERNEL_NAME/HIP_SYMBOL definitions for HIP-clang, and rename …
2018-03-26 15:38:02 +05:30
Maneesh Gupta
21e0adc700
[ci] Get CI builds to work again ( #377 )
...
* [ci] Enable tests against HCC from ROCm1.7 as well
* [ci] rocm 1.7 requires nodes with dkms label
* [ci] Temporarily drop hcc_1_6 and hcc_1_7 builds
2018-03-26 15:21:45 +05:30
Laurent Morichetti
a5e04c9706
Merge remote-tracking branch 'upstream/master'
2018-03-21 11:17:52 -07:00
Laurent Morichetti
a06b21c500
Add HIP_KERNEL_NAME/HIP_SYMBOL definitions for HIP-clang, and rename hipLaunch->hipLaunchByPtr.
2018-03-21 11:07:21 -07:00
Maneesh Gupta
fe3e3dd09a
Merge pull request #373 from gargrahul/fix_function_not_found_tex_drv_sample
...
Fixed function not found issue in texture driver api sample
2018-03-19 14:11:28 +05:30
Rahul Garg
ed2d4ddfc7
Removed hidden args and hipLaunchParm from HIP/HCC path
2018-03-16 22:50:25 +05:30
Rahul Garg
be25556aed
Change co file name
2018-03-16 12:54:44 +05:30