Γράφημα Υποβολών

2464 Υποβολές

Συγγραφέας SHA1 Μήνυμα Ημερομηνία
Aaron Enye Shi 8e1a7fdd09 Implement min/max functions in HIP header
Remove using hc::precise_math min and max. Instead we can use ocml directly for device and std:: for host.


[ROCm/hip commit: 47d78e372e]
2018-07-05 20:15:41 +00:00
Aaron Enye Shi 0ea959ba9d Implement Memory Fence Functions in header
Enabled __llvm_fence_* functions for seq_cst.


[ROCm/hip commit: 930a16bccd]
2018-07-04 23:35:24 +00:00
Aaron Enye Shi 6bcb3cca86 Workaround cast warning of smaller integer type for __to_local
For now, guard the __to_local function for device compile only since a local pointer should be same size as unsigned int on GPU compile. Also change to void* instead of char*.


[ROCm/hip commit: 07de5cb334]
2018-07-04 20:52:55 +00:00
Aaron Enye Shi e8fac0449e Merge branch 'master' into hipclang-add-amdgcn-funcs
[ROCm/hip commit: 2975f2a10a]
2018-07-04 17:36:08 +00:00
Maneesh Gupta f9ea86e749 Merge pull request #539 from ROCm-Developer-Tools/fix-tf-link
Add workaround to hipcc for build failure in tensorflow due to missin…

[ROCm/hip commit: 66f12daaa4]
2018-07-04 11:08:52 +05:30
Maneesh Gupta a4eb081242 Merge pull request #530 from ROCm-Developer-Tools/hipclang-compute-mode
Add HIP Compute Mode

[ROCm/hip commit: 4b4d78fdb1]
2018-07-04 10:55:48 +05:30
Maneesh Gupta 86abb2f7b8 Merge pull request #517 from ROCm-Developer-Tools/fix-hipcc-link
Let hipcc handle static library for hip-clang only if it contains bun…

[ROCm/hip commit: 7927f66326]
2018-07-04 10:51:44 +05:30
Maneesh Gupta 19cb14d54c Update hip_hcc_internal.h
Adding missing include for hip_hcc_internal in order to build with HCC

[ROCm/hip commit: 0c2f985553]
2018-07-04 09:33:51 +05:30
Maneesh Gupta b7367558ef Update hip-targets.cmake for hip::host & hip::device
[ROCm/hip commit: 4b660c8382]
2018-07-02 10:37:20 +05:30
Maneesh Gupta 54d7f22c92 Merge pull request #528 from ROCm-Developer-Tools/fix-hipclang-hipcc
HIPCC - Use clang if clang++ is not compiled

[ROCm/hip commit: b326b8a0a6]
2018-06-28 12:36:49 +05:30
Maneesh Gupta a31dc511f9 Merge pull request #503 from ROCm-Developer-Tools/fix-complex
Fix hip_complex.h

[ROCm/hip commit: 2ce48fbc05]
2018-06-28 12:15:22 +05:30
Maneesh Gupta e3996a5e16 Merge pull request #495 from ROCm-Developer-Tools/hip-clang-upstream
add more device functions for hip-clang

[ROCm/hip commit: 76b2d0f82d]
2018-06-28 11:42:01 +05:30
Maneesh Gupta 47804939a0 Merge pull request #522 from pradeepisro/cmake_verbose_fix
added changes to FindHIP.cmake which would allow us to build using ninja #467

[ROCm/hip commit: c004b15fc2]
2018-06-28 11:39:03 +05:30
Maneesh Gupta fe5afd44f4 Fix typo
[ROCm/hip commit: dde875f23b]
2018-06-28 11:19:22 +05:30
Maneesh Gupta bf2a6352b7 Merge pull request #536 from ROCm-Developer-Tools/hotfix_remove_unimplemented_math
Removes use of unimplemented OCML functionality.

[ROCm/hip commit: 525716bf91]
2018-06-28 11:15:57 +05:30
Aaron Enye Shi 6e2ee0f857 Implement __shfl_* funcs into HIP headers
[ROCm/hip commit: 9ac31e0bb6]
2018-06-26 18:32:11 +00:00
Yaxun (Sam) Liu af0e9b54de Add workaround to hipcc for build failure in tensorflow due to missing symbol __cpu_model
https://github.com/tensorflow/tensorflow/issues/9593


[ROCm/hip commit: 021728cd86]
2018-06-25 18:14:51 -04:00
Yaxun (Sam) Liu c205c090f4 Include host_defines.h in hip_fp16.h since it uses __host__ __device__ attributes
[ROCm/hip commit: 02b160491d]
2018-06-25 15:34:34 -04:00
Alex Voicu 9fed6fed2b Removes use of unimplemented OCML functionality.
[ROCm/hip commit: 6c7a64efa2]
2018-06-25 19:16:27 +01:00
Aaron Enye Shi 260e50769b Add HIP Compute Mode
[ROCm/hip commit: ff924ecb3d]
2018-06-22 14:37:19 +00:00
Aaron Enye Shi a7baff62e7 HIPCC - Use clang if clang++ is not compiled
[ROCm/hip commit: 73023e0a54]
2018-06-21 18:12:55 +00:00
Aaron Enye Shi 20674cfcb2 Replace __hip_hc_ir_ inline asm with __ockl_* functions
[ROCm/hip commit: 8ac864c2e3]
2018-06-20 20:40:14 +00:00
Aaron Enye Shi e2f8af2f3d Implement __ballot, __any, __all into HIP headers
[ROCm/hip commit: 6dc16bbf04]
2018-06-20 17:39:39 +00:00
Aaron Enye Shi 2a2f34b42d Implement hip_hc.ll into HIP headers
Move all __hip_hc_ir_* functions from hip_hc.ll into HIP header as inline asm. Remove hip_hc.ll and build dependencies from HIP.


[ROCm/hip commit: 2142eb4d12]
2018-06-20 17:39:31 +00:00
Aaron Enye Shi 0fca41a7ce Implement device_functions.cpp into HIP headers
Move all Integer Intrinsics, device_functions.cpp definitions and HIP specific device functions into HIP headers. Implement the device functions using llvm_intrinsics and device-libs functions instead of calling hc::__* functions. Remove device_functions.cpp since everything is now defined in header.


[ROCm/hip commit: e02fc7e680]
2018-06-20 17:39:23 +00:00
Aaron Enye Shi 14f721702a Move hipclang funcs into corresponding headers
[ROCm/hip commit: fe4e6c53fc]
2018-06-20 17:39:15 +00:00
Aaron Enye Shi 8d65776127 Split __llvm and device lib funcs into new headers
[ROCm/hip commit: cfe37484c9]
2018-06-20 17:39:06 +00:00
Aaron Enye Shi cb46df1654 Add get_dynamicgroupbaseptr def and remove hc_
[ROCm/hip commit: 27f600b425]
2018-06-20 17:38:59 +00:00
Aaron Enye Shi 9b03a036c3 Add prefix __ to memory scope and order
[ROCm/hip commit: 871cfc6fd2]
2018-06-20 17:38:52 +00:00
Aaron Enye Shi cdd7931ac1 Add __llvm_fence funcs and __ prefixes
[ROCm/hip commit: cfa8fc1ca5]
2018-06-20 17:38:45 +00:00
Aaron Enye Shi b34868005a Add hipclang amdgcn functions
These are moving from hipclang in device library to hip headers. These are required for the functionality of HIPclang project.


[ROCm/hip commit: c453b42bff]
2018-06-20 17:38:37 +00:00
Maneesh Gupta a4b9e5305f Merge pull request #507 from ROCm-Developer-Tools/fix-forward
Add __device__ to device functions in hip_fp16_math_fwd.h

[ROCm/hip commit: fc80fb4ab3]
2018-06-20 14:21:46 +05:30
Maneesh Gupta 6932d45c14 Merge pull request #504 from ROCm-Developer-Tools/fix-vector3
Fix channel_descriptor.h about vector 3 for gcc

[ROCm/hip commit: cffc5ad273]
2018-06-20 14:20:29 +05:30
Maneesh Gupta 27e2af1773 Merge pull request #490 from ROCm-Developer-Tools/feature_decouple_atomics_from_hc
Switch the atomic implementation to use Clang  builtins.

[ROCm/hip commit: 946c8da88a]
2018-06-20 14:16:43 +05:30
Maneesh Gupta 4cdc20a6ce Merge pull request #457 from whchung/hip-reinit
HIP program state re-initialization logic

[ROCm/hip commit: 836627279f]
2018-06-20 09:37:27 +05:30
Yaxun (Sam) Liu 8696038043 Let hipcc handle library with extension lo for hip-clang
[ROCm/hip commit: 46d3c1d51e]
2018-06-19 10:51:08 -04:00
Yaxun (Sam) Liu b8ae2784e9 Add conj, operator-,==,!= for hipFloatComplex/hipDoubleComplex
[ROCm/hip commit: 84da72dae8]
2018-06-19 10:49:59 -04:00
pradeepisro 4c9fad371f added option HIP_BUILD_VERBOSE to enable verbosity in HIP build
[ROCm/hip commit: f6938961b9]
2018-06-19 13:50:24 +05:30
pradeepisro 1eb91f1bab added changes to FindHIP.cmake which would allow us to build using ninja #467
[ROCm/hip commit: 4f0aefbd63]
2018-06-19 12:26:18 +05:30
Maneesh Gupta 163e55fe10 Merge pull request #516 from bddppq/empty-generator-expression
Properly handle (empty) cmake generator expression

[ROCm/hip commit: 1b88a2ce2f]
2018-06-19 09:38:21 +05:30
Maneesh Gupta 3475fe6efa Merge pull request #520 from ntrost57/master
added missing hipCmul() to nvcc_detail/hip_complex.h

[ROCm/hip commit: 523e7fd9b2]
2018-06-19 09:37:50 +05:30
Wen-Heng (Jack) Chung d214b045c3 Keep the map which tracks GPU kernel symbols to grow monotonically
[ROCm/hip commit: 32789a8b7d]
2018-06-18 16:54:18 -05:00
Yaxun (Sam) Liu a46f62a5c0 Add abs/real/imag functions for hipFloatComplex/hipDoubleComplex
[ROCm/hip commit: 9181fbb0b7]
2018-06-18 11:57:57 -04:00
Yaxun (Sam) Liu fa9e73ccdc Add missing __device__ __host__ to complex constructor
Also add missing typedef value_type


[ROCm/hip commit: 7a5605d006]
2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu 0e29f327e2 Includes <cmath> or <math.h> by __cplusplus in hip_complex.h
[ROCm/hip commit: 2523c39a37]
2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu cf6cdab029 Include cmath instead of math.h in hip_complex.h
[ROCm/hip commit: 325cf3ccf0]
2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu 0b10ab2b7e Add missing macro MAKE_COMPONENT_CONSTRUCTOR_TWO_COMPONENT to hip_complex.h
[ROCm/hip commit: 5eeb57b0a6]
2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu acaf783f52 Fix handling of static library in hipcc for hip-clang
[ROCm/hip commit: 2b32dbd414]
2018-06-17 20:11:10 -04:00
Yaxun (Sam) Liu 0573ee79de Let hipcc handle static library for hip-clang only if it contains bundles
[ROCm/hip commit: f0935e6d4f]
2018-06-17 20:11:10 -04:00
Wen-Heng (Jack) Chung c6b975bf13 Improve performance of re-initialization logic
Keep track of shared libaries already discovered. Do not build HSA executables
for them.


[ROCm/hip commit: ece4539c1d]
2018-06-15 18:07:33 -05:00