Граф коммитов

2404 Коммитов

Автор SHA1 Сообщение Дата
Deven Desai 76a7d7e374 Fixing a copy/paste error in my previous checkin 2018-04-27 12:59:51 +00:00
Deven Desai 4be4cf644f Checkin to add unit tests for fixes in my previous commit
This change adds unit tests for

1. __hisinf
2. __hisnan
2. abs(long long)
3. lgamma(double)
2018-04-25 17:53:21 +00:00
Deven Desai af586bbbf2 Checkin to fix bugs in math functions.
This change fixes the following bugs that were discovered while debuggnig TF unit test failures (cwise_ops_test)

1. __hisinf and __hisnan routines
   Both had incorrect implementations.

2. abs
   A "long long" (64bit int) version was missing, resulting in the 32bit version being used for 64bit ints (which resulted in incorrect results, when the value passed in was outside the 32bit int range)

3. lgamma
  We seemed to have a custom version for the 'double' datatype (which was giving incorrect results). Replaced it with a call to the 'double' version of the underlying 'hc::precision_math::lgamma'
2018-04-24 18:10:07 +00:00
Evgeny Mankov b36001bbbf Merge pull request #413 from emankov/master
[HIPIFY] undo apply .clangformat to HIPIFY source files
2018-04-23 20:29:43 +03:00
Evgeny Mankov 8499e79e2b [HIPIFY] undo apply .clangformat to HIPIFY source files 2018-04-23 20:22:15 +03:00
Maneesh Gupta 35a8deec00 Merge pull request #404 from gargrahul/hipMemset2DAsync_support
Added hipMemset2DAsync support
2018-04-18 14:19:41 +05:30
Rahul Garg 3cfb9c0d40 Added hipMemset2DAsync support 2018-04-17 18:27:27 +05:30
Maneesh Gupta 33ada06de9 Merge pull request #400 from gargrahul/hipModule_cleanup
hip_module code cleanup
2018-04-17 09:00:15 +05:30
Aaron Enye Shi 5a80c16cd2 Clean up triple_to_hsa_isa naming (#403) 2018-04-17 06:47:55 +05:30
Rahul Garg 1446f78799 hip_module code cleanup
-Fixed missing ihipLogStatus in hipModuleLoad()
-Fixed some ihipXXX functions
2018-04-16 15:35:04 +05:30
Aaron Enye Shi 1a281240d6 Fix bug when using older rocr naming (#399) 2018-04-14 15:47:46 +05:30
Aaron Enye Shi ab7e727fa2 Fix new gfx target naming convention (#397)
This was introduced in recent xnack changes naming changes which affected our offload bundler. This patch will fix issues in HIP samples, ROCR tests, and others.
2018-04-13 16:24:39 +05:30
Maneesh Gupta bda18af06d Merge pull request #395 from lmoriche/master
Add missing __device__ function attributes.
2018-04-12 14:51:32 +05:30
Maneesh Gupta 97f67ced66 Merge pull request #393 from mangupta/cmake_update_deps
[cmake] Update target build dependencies
2018-04-12 14:36:49 +05:30
Maneesh Gupta b1057f0be6 Merge pull request #394 from gargrahul/hipMemset_refactoring
hipMemset refactoring
2018-04-12 14:01:49 +05:30
Rahul Garg 16c89d101a Correct missed ihipMemsetCopyDataType change 2018-04-12 10:27:19 +05:30
Rahul Garg 3d6eb75828 Changed ihipMemsetCopyDataType to ihipMemsetDataType 2018-04-12 09:29:22 +05:30
Laurent Morichetti 211c18e674 Add missing __device__ function attributes. 2018-04-11 09:29:37 -07:00
Rahul Garg 294bf50f68 Fix hipMemset stream resolution 2018-04-11 19:01:53 +05:30
Rahul Garg 412a35be20 hipMemset refactoring 2018-04-11 15:58:48 +05:30
Maneesh Gupta b1be325745 [cmake] Update target build dependencies
Change-Id: Ib1c34135c0d04f88daa695e7a90272225d7e64b0
2018-04-11 15:30:13 +05:30
Maneesh Gupta 7c43396596 Merge pull request #391 from lmoriche/master
Create one hipModule_t per device
2018-04-11 13:06:58 +05:30
Laurent Morichetti 91f996035c Create one hipModule_t per device 2018-04-10 08:57:34 -07:00
Maneesh Gupta 399a17d755 Merge pull request #390 from mangupta/hipmemcpy_fix
hipMemcpyAsync returns success when trying to copy 0 bytes
2018-04-09 14:02:36 +05:30
Maneesh Gupta ac7c5dfd0b Merge pull request #388 from lmoriche/master
Add support for clang offload bundles and <<<>>> kernel launch.
2018-04-09 12:40:28 +05:30
Maneesh Gupta 03eca1c57e hipMemcpyAsync returns success when trying to copy 0 bytes
Change-Id: I4c0ee7ccc7563e2df657b50356cdd7fec9a1ef15
2018-04-09 12:39:44 +05:30
Maneesh Gupta 4ac59148a9 [ci] Disable nvcc CI builds 2018-04-09 12:23:56 +05:30
Maneesh Gupta d98eaf5923 Merge pull request #389 from scchan/adjust_texture_hcc_workweek
adjust the hcc work week for the texture fix
2018-04-09 10:54:16 +05:30
Siu Chi Chan f48e471dd4 adjust the hcc work week for the texture fix 2018-04-06 15:46:55 -04:00
Laurent Morichetti bc6fa2bc62 Don't include __clang_cuda_builtin_vars.h 2018-04-05 16:27:05 -07:00
Laurent Morichetti 0c1893ea62 Add support for clang offload bundles and <<<>>> kernel launch. 2018-04-04 13:32:33 -07:00
Maneesh Gupta b2c5f7a3e9 Merge pull request #387 from mangupta/cleanup_bin
Cleanup bin directory
2018-04-04 15:06:57 +05:30
Maneesh Gupta 228fae1224 Cleanup bin directory
Remove hccgenco.sh and hsa-api-filter-cxl.txt

Change-Id: I5942aec911a2fe6c871fe0eb5f44a0b243552b79
2018-04-04 14:31:40 +05:30
Maneesh Gupta 0785e75150 Merge pull request #383 from gargrahul/remove_hcc_check_texref_sample
Remove textureObj kernel argument for HIP/HCC path
2018-04-04 10:35:58 +05:30
Maneesh Gupta 860249194a Merge pull request #384 from yxsamliu/patch-1
Update HIP language spec to support both <<< >>> kernel launching mec…
2018-04-04 10:35:34 +05:30
Maneesh Gupta fdcc6558f4 Merge pull request #385 from gargrahul/fix_texture_trsf_flag
Fix texture driver api TRFS flags
2018-04-04 10:34:09 +05:30
Maneesh Gupta 69e1185ee7 Merge pull request #386 from ROCm-Developer-Tools/mangupta-patch-4
[ci] Re-enable testing against hcc_1.7
2018-04-04 10:01:41 +05:30
Maneesh Gupta d9cb38c764 [ci] Re-enable testing against hcc_1.7 2018-04-03 14:04:33 +05:30
Rahul Garg 62fc6c85e0 Fix texture driver api TRFS flags 2018-04-02 21:47:20 +05:30
Yaxun (Sam) Liu cf78d85638 Update HIP language spec to support both <<< >>> kernel launching mechanism and hipLaunchKernel 2018-04-02 11:58:32 -04:00
Rahul Garg 017a6fb059 Remove textureObj kernel argument for HIP/HCC path 2018-03-28 15:36:57 +05:30
Maneesh Gupta 0b25a2cb40 Merge pull request #382 from scchan/fix_texture_addr_space
Change constant address space to 4
2018-03-27 07:48:50 +05:30
Siu Chi Chan 0d8b97274a Change constant address space to 4
when compiling with a newer hcc.  This is
due to a recent change to address space
mapping in the amdgpu compiler backend.
2018-03-26 18:13:59 -04:00
Maneesh Gupta dbd818c4bd Merge pull request #379 from lmoriche/master
Add HIP_KERNEL_NAME/HIP_SYMBOL definitions for HIP-clang, and rename …
2018-03-26 15:38:02 +05:30
Maneesh Gupta 21e0adc700 [ci] Get CI builds to work again (#377)
* [ci] Enable tests against HCC from ROCm1.7 as well

* [ci] rocm 1.7 requires nodes with dkms label

* [ci] Temporarily drop hcc_1_6 and hcc_1_7 builds
2018-03-26 15:21:45 +05:30
Laurent Morichetti a5e04c9706 Merge remote-tracking branch 'upstream/master' 2018-03-21 11:17:52 -07:00
Laurent Morichetti a06b21c500 Add HIP_KERNEL_NAME/HIP_SYMBOL definitions for HIP-clang, and rename hipLaunch->hipLaunchByPtr. 2018-03-21 11:07:21 -07:00
Maneesh Gupta fe3e3dd09a Merge pull request #373 from gargrahul/fix_function_not_found_tex_drv_sample
Fixed function not found issue in texture driver api sample
2018-03-19 14:11:28 +05:30
Rahul Garg ed2d4ddfc7 Removed hidden args and hipLaunchParm from HIP/HCC path 2018-03-16 22:50:25 +05:30
Rahul Garg be25556aed Change co file name 2018-03-16 12:54:44 +05:30