Don't use SSE4.1 instructions unless the target supports it …
Fixes #48. Enabling the SIMD code path now requires building with `RUSTFLAGS="-C target-feature=+sse4.1"` or similar.