Skip to content

Commit a928ded

Browse files
ElizaWszolamgoin
andauthored
[Kernel] Split Marlin MoE kernels into multiple files (#8661)
Co-authored-by: mgoin <[email protected]>
1 parent cc4325b commit a928ded

File tree

7 files changed

+1552
-1427
lines changed

7 files changed

+1552
-1427
lines changed

CMakeLists.txt

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -316,6 +316,11 @@ set(VLLM_MOE_EXT_SRC
316316

317317
if(VLLM_GPU_LANG STREQUAL "CUDA")
318318
list(APPEND VLLM_MOE_EXT_SRC
319+
"csrc/moe/marlin_kernels/marlin_moe_kernel.h"
320+
"csrc/moe/marlin_kernels/marlin_moe_kernel_ku4b8.h"
321+
"csrc/moe/marlin_kernels/marlin_moe_kernel_ku4b8.cu"
322+
"csrc/moe/marlin_kernels/marlin_moe_kernel_ku8b128.h"
323+
"csrc/moe/marlin_kernels/marlin_moe_kernel_ku8b128.cu"
319324
"csrc/moe/marlin_moe_ops.cu")
320325
endif()
321326

0 commit comments

Comments
 (0)