Unclear how many of these we will actually need to do. - [ ] Put hook output matricies on cpu - [ ] Optionally put models on cpu when possible (moving section to gpu when needed for gradients) - [ ] Use float32 when possible