f484ff17b9
* msccl: add templated kernel * Use defines to improve code readability * Fix kernel indexing and review feedback