if use_gpu:
    backend = "cuda"
    backend_config = "cuda"
    args = ["--iree-cuda-llvm-target-arch=sm_80", "--iree-hal-cuda-disable-loop-nounroll-wa"]
    ireert.flags.FUNCTION_INPUT_VALIDATION = False
    ireert.flags.parse_flags("--cuda_allow_inline_execution")
...
# Setting up inputs on the host and moving them to the device.
host_inputs = [encoded_input["input_ids"], encoded_input["attention_mask"], encoded_input["token_type_ids"]]
if use_gpu:
    device_inputs = [ireert.asdevicearray(config.device, a) for a in host_inputs]
else:
    device_inputs = host_inputs
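As a rough illustration of the branch above (a self-contained sketch, not the actual IREE runtime: `to_device`, the fake `"cuda"` tag, and the hypothetical `encoded_input` values are stand-ins), the conditional host-to-device transfer looks like:

```python
import numpy as np

def to_device(array, device):
    # Stand-in for ireert.asdevicearray: tags the array with a device
    # name so the dispatch logic can be exercised without a GPU.
    return {"device": device, "data": np.asarray(array)}

# Hypothetical tokenizer output: three integer arrays, matching the
# keys used in the snippet above.
encoded_input = {
    "input_ids": np.array([[101, 2023, 102]]),
    "attention_mask": np.array([[1, 1, 1]]),
    "token_type_ids": np.array([[0, 0, 0]]),
}

use_gpu = True
host_inputs = [
    encoded_input["input_ids"],
    encoded_input["attention_mask"],
    encoded_input["token_type_ids"],
]

if use_gpu:
    # Mirrors: [ireert.asdevicearray(config.device, a) for a in host_inputs]
    device_inputs = [to_device(a, "cuda") for a in host_inputs]
else:
    device_inputs = host_inputs
```

The key point is that the module invocation downstream receives the same list shape either way; only the array wrapper differs between the CPU and GPU paths.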
We need to model the CUDA backend in SHARK to be similar to:
https://github.com/nod-ai/transformer-benchmarks/blob/435984a420a2f285f717aa4752c14c0cabfd8c96/benchmark.py#L397-L437