MDP Playground

I'm making this repo as an environment for experimenting with MDPs and the algorithms used to construct and solve them. It's meant to improve my knowledge of both Rust and MDPs for COMP4620 @ ANU.

Examples

Use cargo run --example <example> to run an example from the ./examples/ directory.

So far I've implemented the following examples:

cookie_monster: Question 2 from the first lab.
(run with cargo run --example cookie_monster)

Features

Listing features here so I don't forget what I've built.

`model.rs`

A simple way to build an Mdp. The MdpBuilder takes a state / action based approach.
You pass an initial "world_state" to the builder. This state may have many variables (e.g. num_visits and is_banned). The values of these variables must be integers.
Then you add actions using the ActionBuilder. Each action has a

string name.
list of preconditions. Each precondition must resolve to true for the action to be available. The preconditions can use variables from the world state.
list of outcomes. Each outcome is a function with mutable access to the world state, which it can mutate to induce a state transition. The function must return the probability of this particular outcome occurring and the reward associated with it.
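To make the shape of an action concrete, here is a minimal standalone sketch of the idea. It is not the actual code in model.rs: the `WorldState` alias, the `Action` struct, its methods and the `visit_action` helper are all hypothetical names made up for illustration, and the real ActionBuilder API may look quite different.

```rust
use std::collections::BTreeMap;

/// World state: named integer variables (hypothetical representation).
type WorldState = BTreeMap<String, i64>;
/// A precondition reads the state and says whether the action is allowed.
type Precondition = Box<dyn Fn(&WorldState) -> bool>;
/// An outcome mutates the state and returns (probability, reward).
type Outcome = Box<dyn Fn(&mut WorldState) -> (f64, f64)>;

/// Hypothetical action type: a name, preconditions and outcomes.
struct Action {
    name: String,
    preconditions: Vec<Precondition>,
    outcomes: Vec<Outcome>,
}

impl Action {
    fn new(name: &str) -> Self {
        Action { name: name.to_string(), preconditions: Vec::new(), outcomes: Vec::new() }
    }

    fn precondition(mut self, p: impl Fn(&WorldState) -> bool + 'static) -> Self {
        self.preconditions.push(Box::new(p));
        self
    }

    fn outcome(mut self, o: impl Fn(&mut WorldState) -> (f64, f64) + 'static) -> Self {
        self.outcomes.push(Box::new(o));
        self
    }

    /// The action is available only if every precondition holds.
    fn is_available(&self, state: &WorldState) -> bool {
        self.preconditions.iter().all(|p| p(state))
    }

    /// Run each outcome on its own copy of `state`, returning
    /// (next state, probability, reward) per outcome.
    fn apply(&self, state: &WorldState) -> Vec<(WorldState, f64, f64)> {
        self.outcomes
            .iter()
            .map(|o| {
                let mut next = state.clone();
                let (probability, reward) = o(&mut next);
                (next, probability, reward)
            })
            .collect()
    }
}

/// Example action (made up): visiting usually pays off, but sometimes gets you banned.
fn visit_action() -> Action {
    Action::new("visit")
        .precondition(|s| s["is_banned"] == 0)
        .outcome(|s| {
            *s.get_mut("num_visits").unwrap() += 1;
            (0.9, 1.0)
        })
        .outcome(|s| {
            s.insert("is_banned".to_string(), 1);
            (0.1, -5.0)
        })
}
```

Each outcome works on its own clone of the state, so one outcome's mutation never leaks into another. The outcome probabilities of a single action should sum to 1.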

`mdp.rs`

This is a struct that stores the information about an MDP, namely its states, transitions and actions.

You can construct an Mdp using the MdpBuilder::build() function. This will start at the initial state provided.
It then applies every action whose preconditions allow it and builds new "states" that encode the changes made by the action's outcomes. This continues until no more states can be reached by applying actions to states.

Finally, it organizes the available actions and the transitions they produce in a hashmap for each state.
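The expansion described above is essentially a breadth-first search over reachable states. The sketch below shows that idea in isolation and is not the code in mdp.rs: the `explore` function, the `Transition` alias and the `successors` callback are hypothetical names, and the state type is left generic.

```rust
use std::collections::{HashMap, VecDeque};
use std::hash::Hash;

/// One transition: (action name, next state, probability, reward).
type Transition<S> = (String, S, f64, f64);

/// Breadth-first expansion from `initial`. `successors` is assumed to return
/// every transition of every allowed action in the given state.
fn explore<S, F>(initial: S, successors: F) -> HashMap<S, Vec<Transition<S>>>
where
    S: Clone + Eq + Hash,
    F: Fn(&S) -> Vec<Transition<S>>,
{
    let mut table: HashMap<S, Vec<Transition<S>>> = HashMap::new();
    let mut queue = VecDeque::from([initial]);

    while let Some(state) = queue.pop_front() {
        if table.contains_key(&state) {
            continue; // already expanded via another path
        }
        let transitions = successors(&state);
        for (_, next, _, _) in &transitions {
            if !table.contains_key(next) {
                queue.push_back(next.clone());
            }
        }
        table.insert(state, transitions);
    }
    table
}
```

The loop stops once the queue is empty, i.e. once no action leads to a state that has not been seen before. This only terminates if the reachable state space is finite.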

`solver.rs`

This is a very simple value iteration solver for an Mdp. It can also generate a policy once solved.
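For reference, value iteration repeatedly replaces each state's value with the best expected return over its actions until the values stop changing, and a policy then picks the best action per state. The following is a minimal sketch of that technique, not the code in solver.rs: states are plain indices, and the `Model` and `Branch` aliases, the function names, the discount factor `gamma` and the `tolerance` stopping rule are all assumptions for illustration.

```rust
use std::collections::HashMap;

/// One possible result of an action: (next state index, probability, reward).
type Branch = (usize, f64, f64);
/// For each state (by index): action name -> possible results.
type Model = Vec<HashMap<String, Vec<Branch>>>;

/// Expected return of one action under the current value estimates.
fn q_value(branches: &[Branch], values: &[f64], gamma: f64) -> f64 {
    branches
        .iter()
        .map(|&(next, p, r)| p * (r + gamma * values[next]))
        .sum()
}

/// Sweep over all states until the largest update falls below `tolerance`.
fn value_iteration(model: &Model, gamma: f64, tolerance: f64) -> Vec<f64> {
    let mut values = vec![0.0; model.len()];
    loop {
        let mut max_change: f64 = 0.0;
        for (state, actions) in model.iter().enumerate() {
            if actions.is_empty() {
                continue; // terminal state: value stays 0
            }
            let best = actions
                .values()
                .map(|branches| q_value(branches, &values, gamma))
                .fold(f64::NEG_INFINITY, f64::max);
            max_change = max_change.max((best - values[state]).abs());
            values[state] = best;
        }
        if max_change < tolerance {
            return values;
        }
    }
}

/// Pick the action with the highest expected return in each state
/// (`None` for states without actions).
fn greedy_policy(model: &Model, values: &[f64], gamma: f64) -> Vec<Option<String>> {
    model
        .iter()
        .map(|actions| {
            actions
                .iter()
                .max_by(|a, b| {
                    q_value(a.1, values, gamma).total_cmp(&q_value(b.1, values, gamma))
                })
                .map(|(name, _)| name.clone())
        })
        .collect()
}
```

With `gamma` below 1 the updates shrink geometrically, so the loop terminates. With `gamma` equal to 1 it is only guaranteed to stop if every path eventually reaches a terminal state.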

Please note that the code was hastily written to get something working quickly :)
