Skip to content

Conversation

@YdrMaster
Copy link
Contributor

No description provided.

#ifndef __INFINIOP_REDUCE_CUDA_H__
#define __INFINIOP_REDUCE_CUDA_H__

#include <cub/block/block_reduce.cuh>
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

添加comment:
Important Note: This is a device-independent header file containing reduce kernels for all cuda-supporting platforms. Include device-specific headers (such as <cub/block/block_reduce.cuh> for nvidia) in your source file and then include this file for proper usage.

@YdrMaster YdrMaster force-pushed the distinct-cuda branch 7 times, most recently from 87fb529 to 7771e57 Compare July 10, 2025 03:07
Signed-off-by: YdrMaster <ydrml@hotmail.com>
Signed-off-by: YdrMaster <ydrml@hotmail.com>
Signed-off-by: YdrMaster <ydrml@hotmail.com>
Signed-off-by: YdrMaster <ydrml@hotmail.com>
Signed-off-by: YdrMaster <ydrml@hotmail.com>
@YdrMaster YdrMaster force-pushed the distinct-cuda branch 6 times, most recently from fde0c8f to f06b504 Compare July 10, 2025 10:57
Signed-off-by: PanZezhong <panzezhong@qiyuanlab.com>
@YdrMaster YdrMaster force-pushed the distinct-cuda branch 2 times, most recently from d6bdf81 to 5b9de98 Compare July 11, 2025 06:04
Signed-off-by: PanZezhong <panzezhong@qiyuanlab.com>
Signed-off-by: YdrMaster <ydrml@hotmail.com>
@PanZezhong1725 PanZezhong1725 added 准备好了 and removed 需要修改 被阻塞 需等待其他修改合并 labels Jul 11, 2025
@PanZezhong1725 PanZezhong1725 merged commit e4605f7 into InfiniTensor:main Jul 11, 2025
4 checks passed
@YdrMaster YdrMaster deleted the distinct-cuda branch July 11, 2025 06:44
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[DEV] 合并英伟达和沐曦的 cuda 代码

2 participants