Skip to content

Commit

Permalink
Turn off FlashInfer for mlc runtime
Browse files Browse the repository at this point in the history
  • Loading branch information
MasterJH5574 committed Jun 7, 2024
1 parent 5f163aa commit 7934130
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion scripts/build_mlc_llm_wheel_manylinux.sh
Original file line number Diff line number Diff line change
Expand Up @@ -110,7 +110,7 @@ elif [[ ${GPU} == cuda* ]]; then
echo set\(USE_CUBLAS ON\) >>config.cmake
echo set\(USE_THRUST ON\) >>config.cmake
echo set\(USE_NCCL ON\) >>config.cmake
echo set\(USE_FLASHINFER ON\) >>config.cmake
echo set\(USE_FLASHINFER OFF\) >>config.cmake
echo set\(FLASHINFER_ENABLE_FP8 OFF\) >>config.cmake
echo set\(FLASHINFER_ENABLE_BF16 OFF\) >>config.cmake
echo set\(FLASHINFER_GEN_GROUP_SIZES 1 4 6 8\) >>config.cmake
Expand Down

0 comments on commit 7934130

Please sign in to comment.