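In cmd/compile/internal/ssa/gen/ARM64.rules, lines 1468-1498, the rule seems to combine eight single-byte loads into a single little-endian 64-bit (doubleword) load. However, the final REV is wrong: the bytes are assembled in little-endian order, which is already the target's native byte order, so the loaded value must not be byte-reversed. I will correct it as part of my CL that touches that part of the file.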