cf6a53e81c
Changed rocshmem_get_device_ctx() to properly copy the full rocshmem_ctx_t structure and return only the ctx_opaque pointer instead of trying to copy directly to a void pointer. Prior implementation would cause undefined behavior or memory corruption as it was copying 16 bytes of data to 8 bytes. It worked so far beucase ctx_opaque field is at proper offsest, but incorrectly memcpy would overwrite some other allocations and cause issues. This fixes the context memory handling when passing device context from host to device kernels.