Use only RLInterface.jl interface #29

zsunberg · 2020-03-05T22:00:34Z

One of my students discovered that this package uses POMDPs.actionindex. Would it be possible to make the package only use functions from the RLInterface.jl interface? (i.e. we would need to construct our own action map)

The text was updated successfully, but these errors were encountered:

MaximeBouton · 2020-03-06T01:01:30Z

It is using actionindex to take slices of the batches of q values. It is listed in the requirements I believe.
As soon as the action is taken, it is converted as an integer and stored that way in the buffer.

Where would the custom action map definition be living? Are you suggesting to extend RLInterface or to use the existing actions function?
The solver would need to have access to this ordering somehow.

zsunberg · 2020-03-06T05:15:10Z

I think we would just use the existing actions function. The action map would be internal to the solver. I'll make a PR.

zsunberg · 2020-03-06T05:15:41Z

PR: #30

MaximeBouton closed this as completed in 363deaf Mar 6, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use only RLInterface.jl interface #29

Use only RLInterface.jl interface #29

zsunberg commented Mar 5, 2020

MaximeBouton commented Mar 6, 2020 •

edited

Loading

zsunberg commented Mar 6, 2020

zsunberg commented Mar 6, 2020

Use only RLInterface.jl interface #29

Use only RLInterface.jl interface #29

Comments

zsunberg commented Mar 5, 2020

MaximeBouton commented Mar 6, 2020 • edited Loading

zsunberg commented Mar 6, 2020

zsunberg commented Mar 6, 2020

MaximeBouton commented Mar 6, 2020 •

edited

Loading