Bug fixes by mutalibmohammed · Pull Request #192 · UoB-HPC/SimEng

mutalibmohammed · 2021-09-16T11:44:55Z

When SimEng flushes an instruction that sets multiple destinations as same register, the rewinding of register renaming fails. This is because the order of applying rewinding by historyTable_ is in the wrong way.

Ultimately, it keeps the wrong physical register (which was freed) in mappingTable_. Therefore, the order of calling rewind() was reversed to have the correct order of updating mappingTable_ with historyTable_.

In FetchUnit::tick, If the pc_ was not aligned to the blockSize boundary and the fetchBuffer_ was empty, the fetchData would not be copied but used directly as an optimization. However, if the fetchData was not enough to start decoding right away, the function would exit and the fetchData would be lost. To fix the bug, the optimization was removed and fetchData is always copied onto the fetchBuffer_. There was no observed performance difference.

Additionally, the pointer to the buffer passed to predecode is not guaranteed to be aligned. This caused misalignment bugs as aarch64::predecode was expecting it to be 4 byte aligned. A workaround proposed by @FinnWilkinson fixed the bug by copying the buffer into a local variable.

The ROR implementation was found to be buggy, hence a modified version using modular arithmetic was implemented.

jj16791 · 2021-09-16T13:15:14Z

#rerun tests

FinnWilkinson

All looks good to me

jj16791 · 2021-09-25T23:29:03Z

#rerun tests

seunghun1ee

Looks good to me

seunghun1ee

I found a bug that can be critical, so I commit the fix straight to this branch.
Please have a look

FinnWilkinson

All looks good

Reversing the order of rewinding was done to rewind destination registers in correct history order. This change prevents register alias table leaving wrong mapping on rewinding. Ultimately, this fixes the issue where some operands get their values from incorrect register because of the wrong mapping.

If the pc_ was not aligned to blockSize boundary and the fetchBuffer_ was empty, the fetchData would not be copied but used directly as an optimization. However, if the fetchData was not enough to start decoding, the function would exit and the fetchData would be loss. To fix the bug, the optimization was removed and fetchData is always copied onto the fetchBuffer_. The optimization did not provide any performance improvement on the M1 Mac Mini.

The new ror implementation only works for type widths that are a power of 2. Instead of using arithmetic substraction, we are computing the modular inverse of amount (mod type_width). Using modular inverse instead of subtraction will not cause undefined behaviour when amount is 0. That is the only difference.

On decode, operand 0 of RET was set to LR. This was problematic as it always used LR even if an operand was given. To stop this, `InstructionMetadata.cc` now sets operand 0 of RET as LR only when `operandCount` is zero.

mutalibmohammed requested review from FinnWilkinson, jj16791 and seunghun1ee September 16, 2021 12:26

FinnWilkinson approved these changes Sep 17, 2021

View reviewed changes

seunghun1ee approved these changes Sep 27, 2021

View reviewed changes

seunghun1ee requested a review from FinnWilkinson September 30, 2021 04:32

seunghun1ee reviewed Sep 30, 2021

View reviewed changes

seunghun1ee mentioned this pull request Sep 30, 2021

Minifmm aarch64 coverage #194

Closed

FinnWilkinson reviewed Sep 30, 2021

View reviewed changes

Comment thread src/lib/arch/aarch64/InstructionMetadata.cc

seunghun1ee added the bug Something isn't working label Oct 1, 2021

seunghun1ee requested a review from FinnWilkinson October 1, 2021 04:11

FinnWilkinson approved these changes Oct 1, 2021

View reviewed changes

FinnWilkinson mentioned this pull request Oct 1, 2021

CloverLeaf and TeaLeaf Benchmark Support #195

Merged

jj16791 approved these changes Oct 1, 2021

View reviewed changes

seunghun1ee and others added 6 commits October 1, 2021 15:17

Fixed load of misaligned address sanitizer error

e8138f9

Fix RET instruction not to override operand with LR

5955b03

On decode, operand 0 of RET was set to LR. This was problematic as it always used LR even if an operand was given. To stop this, `InstructionMetadata.cc` now sets operand 0 of RET as LR only when `operandCount` is zero.

Add tests for ret instruction

120dfd4

jj16791 force-pushed the Bug-fixes branch from b0f904c to 120dfd4 Compare October 1, 2021 14:22

jj16791 merged commit 9e0b874 into main Oct 1, 2021

jj16791 mentioned this pull request Oct 5, 2021

Minifmm support #196

Closed

seunghun1ee mentioned this pull request Oct 5, 2021

Minifmm support #197

Merged

jj16791 deleted the Bug-fixes branch December 16, 2021 10:21

jj16791 restored the Bug-fixes branch December 16, 2021 10:21

jj16791 deleted the Bug-fixes branch December 16, 2021 10:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug fixes#192

Bug fixes#192
jj16791 merged 6 commits intomainfrom
Bug-fixes

mutalibmohammed commented Sep 16, 2021

Uh oh!

jj16791 commented Sep 16, 2021

Uh oh!

FinnWilkinson left a comment

Uh oh!

jj16791 commented Sep 25, 2021

Uh oh!

seunghun1ee left a comment

Uh oh!

seunghun1ee left a comment

Uh oh!

Uh oh!

FinnWilkinson left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

mutalibmohammed commented Sep 16, 2021

Uh oh!

jj16791 commented Sep 16, 2021

Uh oh!

FinnWilkinson left a comment

Choose a reason for hiding this comment

Uh oh!

jj16791 commented Sep 25, 2021

Uh oh!

seunghun1ee left a comment

Choose a reason for hiding this comment

Uh oh!

seunghun1ee left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

FinnWilkinson left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants