Граф коммитов

  • f2c1bf3bc0 Remove ! USE_PINNED_HOST support Ben Sander 2016-02-11 23:13:43 -06:00
  • c1fd0e1608 Remove ! USE_PINNED_HOST support Ben Sander 2016-02-11 23:13:43 -06:00
  • 0370cd1cfc Remove ! USE_PINNED_HOST support Ben Sander 2016-02-11 23:13:43 -06:00
  • 712750e1a5 Use memtracker 'appID' to store deviceID associated with ptr Ben Sander 2016-02-11 23:07:19 -06:00
  • c04b5d3afb Use memtracker 'appID' to store deviceID associated with ptr Ben Sander 2016-02-11 23:07:19 -06:00
  • 305076d78f Use memtracker 'appID' to store deviceID associated with ptr Ben Sander 2016-02-11 23:07:19 -06:00
  • 00fd172c64 Use memtracker 'appID' to store deviceID associated with ptr Ben Sander 2016-02-11 23:07:19 -06:00
  • 2089e549eb Tracker improvements Ben Sander 2016-02-11 22:03:01 -06:00
  • 7216727fba Tracker improvements Ben Sander 2016-02-11 22:03:01 -06:00
  • 680b600b4a Tracker improvements Ben Sander 2016-02-11 22:03:01 -06:00
  • de45e2291e Tracker improvements Ben Sander 2016-02-11 22:03:01 -06:00
  • fe67be1134 Create address tracker for am_alloc. Ben Sander 2016-02-10 11:52:42 -06:00
  • 721508cc2f Create address tracker for am_alloc. Ben Sander 2016-02-10 11:52:42 -06:00
  • d4a90f8afd Create address tracker for am_alloc. Ben Sander 2016-02-10 11:52:42 -06:00
  • 4ee2a5229b Create address tracker for am_alloc. Ben Sander 2016-02-10 11:52:42 -06:00
  • a50fa0f78e Fix bug in device bounds comparison. Ben Sander 2016-02-09 08:39:08 -06:00
  • f1bc9af294 Fix bug in device bounds comparison. Ben Sander 2016-02-09 08:39:08 -06:00
  • 0a6e6e3b7e Fix bug in device bounds comparison. Ben Sander 2016-02-09 08:39:08 -06:00
  • e483eea85b Fix bug in device bounds comparison. Ben Sander 2016-02-09 08:39:08 -06:00
  • b5a2ca138d Fixed buffer overflow in ReduceOrCopy Nathan Luehr 2016-02-11 12:59:31 -08:00
  • 9442285526 Fixed buffer overflow in ReduceOrCopy Nathan Luehr 2016-02-11 12:59:31 -08:00
  • e8327090b9 gfx803: Add performance counter information Harish Kasiviswanathan 2016-02-11 16:43:30 -05:00
  • 04b92b8e05 gfx803: Add performance counter information Harish Kasiviswanathan 2016-02-11 16:43:30 -05:00
  • f4f0ffc8cb Adding missing performance counters for gfx801 Harish Kasiviswanathan 2016-02-11 15:10:27 -05:00
  • 1a0f915957 Adding missing performance counters for gfx801 Harish Kasiviswanathan 2016-02-11 15:10:27 -05:00
  • b33f9613fa Populate Cpu and Gpu nodes into different agent lists Ramesh Errabolu (xN/A) TX 2016-02-12 16:14:39 -05:00
  • 2280190f70 Populate Cpu and Gpu nodes into different agent lists Ramesh Errabolu (xN/A) TX 2016-02-12 16:14:39 -05:00
  • 0dc374e1a4 Fix double free issue and pointer alignment Ben Goz 2016-02-11 17:53:00 +02:00
  • b37f99a01e Fix double free issue and pointer alignment Ben Goz 2016-02-11 17:53:00 +02:00
  • 1b6994a2dc Fix build location for thunk RPM Kent Russell 2016-02-12 08:28:57 -05:00
  • cd6d75880f Fix build location for thunk RPM Kent Russell 2016-02-12 08:28:57 -05:00
  • 00d106a777 Documented supported fastmath functions Maneesh Gupta 2016-02-12 14:21:58 +05:30
  • c82511258c Documented supported fastmath functions Maneesh Gupta 2016-02-12 14:21:58 +05:30
  • 5657d7bae7 Documented supported fastmath functions Maneesh Gupta 2016-02-12 14:21:58 +05:30
  • 447201a6a0 Documented supported fastmath functions Maneesh Gupta 2016-02-12 14:21:58 +05:30
  • 7c93bdebaf Updated integer intrinsics documentation Maneesh Gupta 2016-02-12 13:58:35 +05:30
  • 2659e70d48 Updated integer intrinsics documentation Maneesh Gupta 2016-02-12 13:58:35 +05:30
  • 99052abbdb Updated integer intrinsics documentation Maneesh Gupta 2016-02-12 13:58:35 +05:30
  • 2c01480131 Updated integer intrinsics documentation Maneesh Gupta 2016-02-12 13:58:35 +05:30
  • 5a0961fdec P4 to Git Change 1236626 by johtaylo@johtaylo-JTBUILDER03-increment on 2016/02/12 03:00:10 foreman 2016-02-12 03:09:11 -05:00
  • 0b9a71219a P4 to Git Change 1236626 by johtaylo@johtaylo-JTBUILDER03-increment on 2016/02/12 03:00:10 foreman 2016-02-12 03:09:11 -05:00
  • b1845e0784 P4 to Git Change 1236618 by weizhang@msdnweizhang-opencl on 2016/02/11 22:12:34 foreman 2016-02-11 22:20:56 -05:00
  • fffbe5bb3b P4 to Git Change 1236618 by weizhang@msdnweizhang-opencl on 2016/02/11 22:12:34 foreman 2016-02-11 22:20:56 -05:00
  • 6add51ef8c Fix typo: maxThreadsPerMultiProcessor -> MaxSharedMemoryPerMultiprocessor Device property MaxSharedMemoryPerMultiprocessor set equal to totalGlobalMem (HIP path). Reason: MaxSharedMemoryPerMultiprocessor should be as the same as group memory size. Group memory will not be paged out, so, the physical memory size = total shared memory size = group region size. NVCC path remains untouched: CUDA's device property MaxSharedMemoryPerMultiprocessor is reported. Evgeny Mankov 2016-02-12 01:29:20 +03:00
  • 460b501cbb Fix typo: maxThreadsPerMultiProcessor -> MaxSharedMemoryPerMultiprocessor Device property MaxSharedMemoryPerMultiprocessor set equal to totalGlobalMem (HIP path). Reason: MaxSharedMemoryPerMultiprocessor should be as the same as group memory size. Group memory will not be paged out, so, the physical memory size = total shared memory size = group region size. NVCC path remains untouched: CUDA's device property MaxSharedMemoryPerMultiprocessor is reported. Evgeny Mankov 2016-02-12 01:29:20 +03:00
  • fcd154097f Fix typo: maxThreadsPerMultiProcessor -> MaxSharedMemoryPerMultiprocessor Device property MaxSharedMemoryPerMultiprocessor set equal to totalGlobalMem (HIP path). Reason: MaxSharedMemoryPerMultiprocessor should be as the same as group memory size. Group memory will not be paged out, so, the physical memory size = total shared memory size = group region size. NVCC path remains untouched: CUDA's device property MaxSharedMemoryPerMultiprocessor is reported. Evgeny Mankov 2016-02-12 01:29:20 +03:00
  • ea8f99702d Fix typo: maxThreadsPerMultiProcessor -> MaxSharedMemoryPerMultiprocessor Device property MaxSharedMemoryPerMultiprocessor set equal to totalGlobalMem (HIP path). Reason: MaxSharedMemoryPerMultiprocessor should be as the same as group memory size. Group memory will not be paged out, so, the physical memory size = total shared memory size = group region size. NVCC path remains untouched: CUDA's device property MaxSharedMemoryPerMultiprocessor is reported. Evgeny Mankov 2016-02-12 01:29:20 +03:00
  • 735d4738ad Device property maxThreadsPerMultiProcessor set equal to totalGlobalMem (HIP path). Reason: maxThreadsPerMultiProcessor should be as the same as group memory size. Group memory will not be paged out, so, the physical memory size = total shared memory size = group region size. Evgeny Mankov 2016-02-12 00:04:14 +03:00
  • 1025341300 Device property maxThreadsPerMultiProcessor set equal to totalGlobalMem (HIP path). Reason: maxThreadsPerMultiProcessor should be as the same as group memory size. Group memory will not be paged out, so, the physical memory size = total shared memory size = group region size. Evgeny Mankov 2016-02-12 00:04:14 +03:00
  • c587f89de7 Device property maxThreadsPerMultiProcessor set equal to totalGlobalMem (HIP path). Reason: maxThreadsPerMultiProcessor should be as the same as group memory size. Group memory will not be paged out, so, the physical memory size = total shared memory size = group region size. Evgeny Mankov 2016-02-12 00:04:14 +03:00
  • 9f05a52c74 Device property maxThreadsPerMultiProcessor set equal to totalGlobalMem (HIP path). Reason: maxThreadsPerMultiProcessor should be as the same as group memory size. Group memory will not be paged out, so, the physical memory size = total shared memory size = group region size. Evgeny Mankov 2016-02-12 00:04:14 +03:00
  • a8b7647f8b BDFID (BusID/DeviceID/FunctionID) support. Except FunctionID (or DomainID in CUDA) support, because cudaDeviceProp::pciDomainID is not reported by CUDA. Evgeny Mankov 2016-02-11 22:26:01 +03:00
  • 658e9f0484 BDFID (BusID/DeviceID/FunctionID) support. Except FunctionID (or DomainID in CUDA) support, because cudaDeviceProp::pciDomainID is not reported by CUDA. Evgeny Mankov 2016-02-11 22:26:01 +03:00
  • 4eade0ce83 BDFID (BusID/DeviceID/FunctionID) support. Except FunctionID (or DomainID in CUDA) support, because cudaDeviceProp::pciDomainID is not reported by CUDA. Evgeny Mankov 2016-02-11 22:26:01 +03:00
  • 33f60c300d BDFID (BusID/DeviceID/FunctionID) support. Except FunctionID (or DomainID in CUDA) support, because cudaDeviceProp::pciDomainID is not reported by CUDA. Evgeny Mankov 2016-02-11 22:26:01 +03:00
  • 1795e55181 P4 to Git Change 1236441 by weizhang@weizhang-lnx-opencl on 2016/02/11 12:33:49 foreman 2016-02-11 12:41:22 -05:00
  • 31c030ff18 P4 to Git Change 1236441 by weizhang@weizhang-lnx-opencl on 2016/02/11 12:33:49 foreman 2016-02-11 12:41:22 -05:00
  • 6bfdfc34a0 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging sunway513 2016-02-11 22:22:47 +05:30
  • fe1000df17 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging sunway513 2016-02-11 22:22:47 +05:30
  • 18119645ab Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging sunway513 2016-02-11 22:22:47 +05:30
  • 1a8bd21eb6 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging sunway513 2016-02-11 22:22:47 +05:30
  • 38cc074f08 Add reminder to keep ROCR runtime on the system library path sunway513 2016-02-11 22:22:00 +05:30
  • c7cbcfa2e9 Add reminder to keep ROCR runtime on the system library path sunway513 2016-02-11 22:22:00 +05:30
  • 286f4e783c Add reminder to keep ROCR runtime on the system library path sunway513 2016-02-11 22:22:00 +05:30
  • d15f22113f Add reminder to keep ROCR runtime on the system library path sunway513 2016-02-11 22:22:00 +05:30
  • cd7a3bed15 P4 to Git Change 1236419 by asalmanp@asalmanp-opencl-stg on 2016/02/11 11:36:16 foreman 2016-02-11 11:49:08 -05:00
  • c16578eb71 P4 to Git Change 1236419 by asalmanp@asalmanp-opencl-stg on 2016/02/11 11:36:16 foreman 2016-02-11 11:49:08 -05:00
  • f7b58ebd17 adding ability to build in llvm source tree, updated README dfukalov 2016-02-11 15:27:00 +03:00
  • bae0f00e69 adding ability to build in llvm source tree, updated README dfukalov 2016-02-11 15:27:00 +03:00
  • 1b85fb3c4a adding ability to build in llvm source tree, updated README dfukalov 2016-02-11 15:27:00 +03:00
  • dca4a70bb7 adding ability to build in llvm source tree, updated README dfukalov 2016-02-11 15:27:00 +03:00
  • cd9dabd76d P4 to Git Change 1236333 by johtaylo@johtaylo-JTBUILDER03-increment on 2016/02/11 03:00:10 foreman 2016-02-11 03:08:59 -05:00
  • 837ab19841 P4 to Git Change 1236333 by johtaylo@johtaylo-JTBUILDER03-increment on 2016/02/11 03:00:10 foreman 2016-02-11 03:08:59 -05:00
  • f826c7aaae Updated readme for test Maneesh Gupta 2016-02-11 13:06:58 +05:30
  • ed2d86f3a9 Updated readme for test Maneesh Gupta 2016-02-11 13:06:58 +05:30
  • f8290141b1 Updated readme for test Maneesh Gupta 2016-02-11 13:06:58 +05:30
  • a87c7988e8 Updated readme for test Maneesh Gupta 2016-02-11 13:06:58 +05:30
  • 50260adbe0 Initial version of CLANG based HIPIFY tool for CUDA -> HIP sources conversion dfukalov 2016-02-10 20:20:11 +03:00
  • 9d10ae4325 Initial version of CLANG based HIPIFY tool for CUDA -> HIP sources conversion dfukalov 2016-02-10 20:20:11 +03:00
  • 4518a1de70 Initial version of CLANG based HIPIFY tool for CUDA -> HIP sources conversion dfukalov 2016-02-10 20:20:11 +03:00
  • 5f224280ce Initial version of CLANG based HIPIFY tool for CUDA -> HIP sources conversion dfukalov 2016-02-10 20:20:11 +03:00
  • 03720306b9 Make hsaKmtAllocMemory more compliant with the Thunk spec Felix Kuehling 2016-02-06 18:47:40 -05:00
  • 887b32fe86 Make hsaKmtAllocMemory more compliant with the Thunk spec Felix Kuehling 2016-02-06 18:47:40 -05:00
  • 7879d2f103 Initial commit Ben Sander 2016-02-10 09:29:29 -06:00
  • f3a2e03a6b Initial commit Ben Sander 2016-02-10 09:29:29 -06:00
  • 48a445168b Initial commit Ben Sander 2016-02-10 09:29:29 -06:00
  • 09fe3cc7d4 Initial commit Ben Sander 2016-02-10 09:29:29 -06:00
  • cedd1c0947 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging Evgeny Mankov 2016-02-10 17:21:53 +03:00
  • 3139c72756 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging Evgeny Mankov 2016-02-10 17:21:53 +03:00
  • 1a9bd34ca2 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging Evgeny Mankov 2016-02-10 17:21:53 +03:00
  • 12d9d9be82 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging Evgeny Mankov 2016-02-10 17:21:53 +03:00
  • 2478fc078f Formatting, no functional changes Evgeny Mankov 2016-02-10 17:21:18 +03:00
  • d9a94191f2 Formatting, no functional changes Evgeny Mankov 2016-02-10 17:21:18 +03:00
  • 3a032ff317 Formatting, no functional changes Evgeny Mankov 2016-02-10 17:21:18 +03:00
  • 254da4ec53 Formatting, no functional changes Evgeny Mankov 2016-02-10 17:21:18 +03:00
  • 03c2768897 Remove test for atomicInc and atomicDec streamhsa 2016-02-10 21:02:52 +08:00
  • 90add185fd Remove test for atomicInc and atomicDec streamhsa 2016-02-10 21:02:52 +08:00
  • 4c861f0284 Remove test for atomicInc and atomicDec streamhsa 2016-02-10 21:02:52 +08:00
  • 51bff8757c Remove test for atomicInc and atomicDec streamhsa 2016-02-10 21:02:52 +08:00
  • 8359dc3119 Disable scratch Host allocation - via debug registration flags. Yair Shachar 2016-02-07 17:56:40 +02:00