What can we do to make this fast? At the moment, calling j.sin is 10x slower than math.sin or numpy.sin.
I'm guessing that this is due to type inference issues with the PyCall.jl callback. Is there a way we could provide annotations or hints for individual functions?