difftest-as-dut: allow ref model to advance one more step on mismatch #407

shinezyy · 2024-07-18T07:42:57Z

When trapping on exception, occasionally, NEMU has executed
csrrw tp,mscratch,tp, but Spike has not.
After advancing Spike for one more step,
Spike's architectural state will match with NEMU again.

shinezyy · 2024-07-18T08:08:04Z

Current implementation will corrupt the site of CPU_state during second checking

shinezyy · 2024-07-18T10:24:43Z

Current implementation will corrupt the site of CPU_state during second checking

resolved

cebarobot

LGTM

shinezyy · 2024-07-22T09:08:18Z

@cebarobot @poemonsense

Can we do regression on difftest for bugs detection? Because current CI only ensure `` no difference ''.

If I mistakenly modify NEMU to never report difference in difftest, it will also pass current CI.

cebarobot · 2024-07-22T09:10:51Z

I think that is useful, but how to do that?

poemonsense · 2024-07-22T09:11:42Z

I know few about NEMU as DUT.

Generally, we need a failure testcase to test the functionality?

shinezyy · 2024-07-22T09:15:28Z

I know few about NEMU as DUT.

Generally, we need a failure testcase to test the functionality?

Yes, for example, use a buggy Spike as ref, and check output.

I am thinking about this because I am afraid that this patch (or similar patches) silently breaks NEMU as DUT (never report difference) but passes CI

poemonsense · 2024-07-22T09:21:13Z

I know few about NEMU as DUT.
Generally, we need a failure testcase to test the functionality?

Yes, for example, use a buggy Spike as ref, and check output.

I am thinking about this because I am afraid that this patch (or similar patches) silently breaks NEMU as DUT (never report difference) but passes CI

This feature looks important. The difftest repo also lacks this CI feature.

checking the failure can be done by checking the return code of a command. GitHub CI is internally a bash script. I think it can be done by like https://stackoverflow.com/questions/26675681/how-to-check-the-exit-status-using-an-if-statement I never tried this but it probably works.

false || exit_code=$?
if [[ ${exit_code} -ne 0 ]]; then echo ${exit_code}; fi

To inject a bug into Spike/NEMU, maybe we can add a probablistic return code of nonzero for the isa_difftest_checkregs function?

poemonsense · 2024-07-22T09:23:03Z

I know few about NEMU as DUT.
Generally, we need a failure testcase to test the functionality?

Yes, for example, use a buggy Spike as ref, and check output.
I am thinking about this because I am afraid that this patch (or similar patches) silently breaks NEMU as DUT (never report difference) but passes CI

This feature looks important. The difftest repo also lacks this CI feature.

checking the failure can be done by checking the return code of a command. GitHub CI is internally a bash script. I think it can be done by like https://stackoverflow.com/questions/26675681/how-to-check-the-exit-status-using-an-if-statement I never tried this but it probably works.
false || exit_code=$?
if [[ ${exit_code} -ne 0 ]]; then echo ${exit_code}; fi
To inject a bug into Spike/NEMU, maybe we can add a probablistic return code of nonzero for the isa_difftest_checkregs function?

I can have a try for these in the difftest repo as well. We may need to add the similar feature to other repos. It's really important

- When trapping on exception, occasionally, NEMU has executed `csrrw tp,mscratch,tp`, but Spike has not. After advancing Spike for one more step, Spike's architectural state will match with NEMU again.

shinezyy · 2024-07-24T03:20:22Z

Because my CI guard for NEMU as DUT depends on misa.rvb, I will add it in RVB patch

NewPaulWalker · 2024-07-30T08:32:24Z

When trapping on exception, occasionally, NEMU has executed
csrrw tp,mscratch,tp, but Spike has not.
After advancing Spike for one more step,
Spike's architectural state will match with NEMU again.

This is actually because when certain exceptions occur, NEMU does not let Spike continue executing the instruction that caused the exception. As a result, Spike falls one instruction behind NEMU, leading to a mismatch. After letting Spike execute one more instruction, they match again. I think all exceptions (excluding interrupts) are caused by the instructions themselves, so I decided that whenever an exception occurs, NEMU should let Spike execute one instruction, which is the instruction that cause the exception.
https://github.com/OpenXiangShan/NEMU/blob/master/src/isa/riscv64/system/intr.c#L71

poemonsense · 2024-07-30T08:35:34Z

I think all exceptions (excluding interrupts) are caused by the instructions themselves, so I decided that whenever an exception occurs, NEMU should let Spike execute one instruction, which is the instruction that cause the exception.

This is how difftest should work. Upon exception/interrupt, should tick one step.

shinezyy assigned cebarobot Jul 18, 2024

shinezyy force-pushed the one-more-step-on-mismatch branch 3 times, most recently from 197089e to 64376bb Compare July 18, 2024 07:49

shinezyy force-pushed the one-more-step-on-mismatch branch from 64376bb to 47c2f05 Compare July 18, 2024 08:24

cebarobot approved these changes Jul 22, 2024

View reviewed changes

cebarobot mentioned this pull request Jul 22, 2024

rv-b: mark B-ext in misa #397

Merged

shinezyy force-pushed the one-more-step-on-mismatch branch 3 times, most recently from ae591cc to 9fbc0df Compare July 23, 2024 09:48

difftest-as-dut: allow ref model to advance one more step on mismatch

77291e5

- When trapping on exception, occasionally, NEMU has executed `csrrw tp,mscratch,tp`, but Spike has not. After advancing Spike for one more step, Spike's architectural state will match with NEMU again.

shinezyy force-pushed the one-more-step-on-mismatch branch from 9fbc0df to 77291e5 Compare July 24, 2024 03:18

shinezyy merged commit 911f2b3 into master Jul 24, 2024
5 checks passed

shinezyy deleted the one-more-step-on-mismatch branch July 24, 2024 03:27

shinezyy mentioned this pull request Jul 24, 2024

On exception handle, NEMU (as DUT) mismatch with Spike #394

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

difftest-as-dut: allow ref model to advance one more step on mismatch #407

difftest-as-dut: allow ref model to advance one more step on mismatch #407

shinezyy commented Jul 18, 2024

shinezyy commented Jul 18, 2024

shinezyy commented Jul 18, 2024

cebarobot left a comment

shinezyy commented Jul 22, 2024 •

edited

Loading

cebarobot commented Jul 22, 2024

poemonsense commented Jul 22, 2024

shinezyy commented Jul 22, 2024

poemonsense commented Jul 22, 2024

poemonsense commented Jul 22, 2024

shinezyy commented Jul 24, 2024

NewPaulWalker commented Jul 30, 2024

poemonsense commented Jul 30, 2024

difftest-as-dut: allow ref model to advance one more step on mismatch #407

difftest-as-dut: allow ref model to advance one more step on mismatch #407

Conversation

shinezyy commented Jul 18, 2024

shinezyy commented Jul 18, 2024

shinezyy commented Jul 18, 2024

cebarobot left a comment

Choose a reason for hiding this comment

shinezyy commented Jul 22, 2024 • edited Loading

cebarobot commented Jul 22, 2024

poemonsense commented Jul 22, 2024

shinezyy commented Jul 22, 2024

poemonsense commented Jul 22, 2024

poemonsense commented Jul 22, 2024

shinezyy commented Jul 24, 2024

NewPaulWalker commented Jul 30, 2024

poemonsense commented Jul 30, 2024

shinezyy commented Jul 22, 2024 •

edited

Loading