Skip to content

A question about {} -> {} syntax in Learning queries of UPPAAL #227

Answered by petergjoel
Szpilman2 asked this question in Q&A
Discussion options

You must be logged in to vote

Hi Peyman,

The variables i, j, d, f relates to the observable part of the state-space (to the learning agent).

As an example, consider the bouncing ball in Figure 4 in Teaching Stratego to Play Ball.

We may want to derive a controller that is ignorant to the velocity of the ball (as this might be hard to estimate in the application), we can describe this delimitation as follows:

minE(cost) [<=10] {} -> {h}

where h is the height of the ball.

This is opposed to

minE(cost) [<=10] {} -> {h, v}

which observes the full state of the ball.

Theoretically this induces an Partially Observable EMDP for which Q-learning is not garuanteed to converge. However, it does (at least to a usefull degree) f…

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@mikucionisaau
Comment options

@Szpilman2
Comment options

Answer selected by Szpilman2
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants