Opening an issue to track this. I'd like to get an idea of how easy it would be for us to plumb support through codegen for outputting scalar values.
Currently, the cases we are looking at are ones where the input is just a scalar, so this can already be done with ExpressionEvaluation. The plumbing might be a little ugly on the runtime side, since we need to glue those results together with the results from the kernel execution, but I expect support in codegen would be equally tedious/ugly?
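To make the runtime-side "glue" concrete, here is a rough sketch of what I have in mind. It is a toy model in plain Python, not our actual runtime code: `evaluate_scalar_outputs` and `glue_outputs` are hypothetical names, and the lambda stands in for whatever ExpressionEvaluation would compute on the host.

```python
from typing import Any, Callable, Dict, List, Union

Number = Union[int, float]
# Hypothetical stand-in for a scalar expression evaluated on the host:
# maps the bound scalar inputs to a single scalar value.
ScalarExpr = Callable[[Dict[str, Number]], Number]


def evaluate_scalar_outputs(
    scalar_exprs: Dict[int, ScalarExpr],
    scalar_inputs: Dict[str, Number],
) -> Dict[int, Number]:
    """Evaluate every scalar-only output on the host, keyed by output position."""
    return {pos: expr(scalar_inputs) for pos, expr in scalar_exprs.items()}


def glue_outputs(
    num_outputs: int,
    kernel_results: List[Any],
    scalar_results: Dict[int, Number],
) -> List[Any]:
    """Interleave kernel outputs and host-evaluated scalars in the original output order.

    kernel_results holds only the outputs produced by the kernel, in order;
    scalar_results maps an output position to its host-evaluated value.
    """
    kernel_iter = iter(kernel_results)
    return [
        scalar_results[pos] if pos in scalar_results else next(kernel_iter)
        for pos in range(num_outputs)
    ]


# Output 1 depends only on scalar inputs, so it never goes through the kernel.
scalar_results = evaluate_scalar_outputs(
    {1: lambda s: s["a"] * s["b"]}, {"a": 2, "b": 3}
)
outputs = glue_outputs(3, ["tensor_out_0", "tensor_out_2"], scalar_results)
```

The ugly part is exactly the bookkeeping in `glue_outputs`: the runtime has to remember which output positions were skipped by the kernel and put them back in order.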
I'm quite neutral on how this should be supported.
cc'ing @kevinstephano @naoyam