We want to implement a CUDA backend for the following functions * `std::fill` see https://en.cppreference.com/w/cpp/algorithm/fill.html * `std::fill_n` see https://en.cppreference.com/w/cpp/algorithm/fill_n.html