Adds Optional AVX2 Support, Cache Alignment, and Enhances Model Export Speed #94

…ment in Config

This commit includes:
1. Optional AVX2 support for matmul and rmsnorm functions.
2. Fused matrix multiplies with new matmul2 and matmul3 functions.
3. Cache aligned allocations for better performance and compatibility with SIMD/Vector intrinsics.
4. Updated Config struct to support cache alignment. NOTE: Previous models should be re-exported due to this change.
5. Enhanced performance of serialization in Llama export code.
6. Updated Makefile to support AVX2 and OMP AVX2 builds.
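To illustrate how an optional AVX2 path and cache-aligned allocation can fit together, here is a minimal sketch. It is not the code from this commit. The names `alloc_aligned_floats`, `matmul_sketch`, and `CACHE_ALIGN`, as well as the 64-byte alignment, are assumptions made for the example. The vector path is compiled only when the compiler defines `__AVX2__` (for example with `-mavx2`); otherwise the scalar loop does all the work.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>
#ifdef __AVX2__
#include <immintrin.h>
#endif

#define CACHE_ALIGN 64 /* assumed cache-line size in bytes (hypothetical constant) */

/* Allocate `count` floats on a CACHE_ALIGN boundary using C11 aligned_alloc.
   The byte size is rounded up because aligned_alloc expects a multiple of
   the alignment. Release the buffer with free(). */
static float *alloc_aligned_floats(size_t count) {
    size_t bytes = count * sizeof(float);
    size_t rounded = (bytes + CACHE_ALIGN - 1) / CACHE_ALIGN * CACHE_ALIGN;
    if (rounded == 0) rounded = CACHE_ALIGN;
    return (float *)aligned_alloc(CACHE_ALIGN, rounded);
}

/* out[i] = dot(row i of w, x), with w stored row-major as d rows of n floats. */
static void matmul_sketch(float *out, const float *x, const float *w, int n, int d) {
    assert(out && x && w);
    for (int i = 0; i < d; i++) {
        const float *row = w + (size_t)i * n;
        float sum = 0.0f;
        int j = 0;
#ifdef __AVX2__
        /* Process 8 floats per iteration. Unaligned loads are used because a
           row start is only aligned when n is a multiple of 8. */
        __m256 acc = _mm256_setzero_ps();
        for (; j + 8 <= n; j += 8) {
            __m256 wv = _mm256_loadu_ps(row + j);
            __m256 xv = _mm256_loadu_ps(x + j);
            acc = _mm256_add_ps(acc, _mm256_mul_ps(wv, xv));
        }
        /* Horizontal sum of the 8 accumulator lanes. */
        float lanes[8];
        _mm256_storeu_ps(lanes, acc);
        for (int k = 0; k < 8; k++) sum += lanes[k];
#endif
        /* Scalar tail, or the whole row when AVX2 is not enabled. */
        for (; j < n; j++) sum += row[j] * x[j];
        out[i] = sum;
    }
}
```

Keeping the scalar loop as both the fallback and the tail handler means the same function builds with or without AVX2, which is one way to keep the feature optional at compile time.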

Commits on Jul 26, 2023