
Parametric/exact Rmin: wrong result #15

Closed
kleinj opened this issue May 15, 2017 · 1 comment

kleinj commented May 15, 2017

At https://groups.google.com/d/topic/prismmodelchecker/vQI-1srXSdk/discussion, Shalini Ghosh reported a small MDP with a strange result for an Rmin formula in the parametric engine. The model is attached.

The problematic formula is Rmin=?[ F "changed" ].

prism lanechange_mdp.prism -pf 'Rmin=?[F "changed" ]' --param p=0:1
Result (minimum expected reward): ([0.0,1.0]): { 1  | 2  }

prism lanechange_mdp.prism -pf 'Rmin=?[F "changed" ]' --exact --const p=0.626
Result (minimum expected reward): 1/2

prism lanechange_mdp_simple_parametric.prism -pf 'Rmin=?[F "changed" ]' --const p=0.626
Result: 1.002137421843802 (value in the initial state)

Manually restricting the MDP to the DTMC induced by the minimizing strategy yields the expected results:

prism lanechange_dtmc.prism -pf 'R=?[F "changed" ]' --exact --param p=0:1
Result (expected reward): ([0.0,1.0]): { 2 p - 5  | 10 p - 10  }

prism lanechange_dtmc.prism -pf 'R=?[F "changed" ]' --exact --const p=0.626
Result (expected reward): 937/935

prism lanechange_dtmc.prism -pf 'R=?[F "changed" ]' --const p=0.626
Result: 1.0021390374331551 (value in the initial state)
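As a sanity check (my own, not part of the original report), the parametric solution for the DTMC can be evaluated with exact rational arithmetic; the parametric, exact, and floating-point results above are mutually consistent:

```python
from fractions import Fraction

# Parametric solution reported for the DTMC: (2p - 5) / (10p - 10)
p = Fraction(626, 1000)              # p = 0.626 exactly
r = (2 * p - 5) / (10 * p - 10)

assert r == Fraction(937, 935)       # matches the --exact result
assert abs(float(r) - 1.0021390374331551) < 1e-12  # matches the numeric result
```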

Some preliminary debugging suggests that the precomputation step for Rmin in the parametric/exact engine is the most likely culprit.

lanechange_mdp.prism.txt
lanechange_dtmc.prism.txt

kleinj added a commit to kleinj/prism-svn that referenced this issue Aug 15, 2017
…olicy iteration

To ensure that the policy iteration (performed on an MDP where the parameters have been replaced
by a parameter valuation) converges, and converges to the right result, we initialise the
policy iteration for Rmin[F] with a proper scheduler, i.e., one that reaches the goal
with probability one (see [BertsekasTsitsiklis91]).

Together with the previous commit, this fixes issue
prismmodelchecker/prism#15.

As there is no check against negative rewards, those remain problematic.
kleinj added a commit to kleinj/prism that referenced this issue Aug 28, 2017
…olicy iteration

To ensure that the policy iteration (performed on an MDP where the parameters have been replaced
by a parameter valuation) converges, and converges to the right result in the presence of
zero-reward end components, we initialise the policy iteration for Rmin[F] with a
proper scheduler, i.e., one that reaches the goal with probability one (see [BertsekasTsitsiklis91]).

Together with the previous commit, this fixes prismmodelchecker#4 and prismmodelchecker#15.
davexparker pushed a commit that referenced this issue Aug 31, 2017
…olicy iteration

To ensure that the policy iteration (performed on an MDP where the parameters have been replaced
by a parameter valuation) converges, and converges to the right result in the presence of
zero-reward end components, we initialise the policy iteration for Rmin[F] with a
proper scheduler, i.e., one that reaches the goal with probability one (see [BertsekasTsitsiklis91]).

Together with the previous commit, this fixes #4 and #15.
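The failure mode fixed by this commit can be illustrated with a toy stochastic shortest-path problem (a hypothetical minimal sketch, not PRISM code): a single state with a zero-reward self-loop and a reward-1 transition to the goal. Policy iteration started from the improper self-loop policy evaluates to 0 and never leaves it, whereas starting from a proper scheduler reaches the correct minimum expected reward of 1:

```python
# Toy stochastic shortest path (hypothetical, for illustration only).
# State s has two actions:
#   'loop': stay in s with probability 1, reward 0  -> a zero-reward end component
#   'go'  : move to the goal with probability 1, reward 1
# The only proper scheduler is 'go', so the correct Rmin[F goal] is 1.

def evaluate(policy):
    """Expected accumulated reward until the goal under a deterministic policy."""
    return 1.0 if policy == 'go' else 0.0   # 'loop' never reaches the goal, accumulates 0

def improve(v, current):
    """One-step greedy improvement; ties are broken toward the current policy."""
    q = {'go': 1.0 + 0.0, 'loop': 0.0 + v}  # immediate reward + value of successor
    return current if q[current] == min(q.values()) else min(q, key=q.get)

# Started from the improper policy, the evaluated value (0) looks optimal,
# so policy iteration terminates immediately at the wrong answer:
assert improve(evaluate('loop'), 'loop') == 'loop'

# Started from a proper policy, the correct fixed point is reached:
assert improve(evaluate('go'), 'go') == 'go'
assert evaluate('go') == 1.0
```

Nothing in the improper fixed point signals that the goal is unreachable, which is why a finiteness/reachability check (or a proper initial scheduler, as in the commit above) is needed rather than relying on convergence alone.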
@davexparker

Fixed by the commit referenced above, now merged.
