GitHub

Workspace for collaborating on the ASEN 5519 (Decision Making Under Uncertainty) Project.

Contributors:

The project has three major components:

We have implemented the proposed approach on a 5X5 gridworld. The environment is as follows:

Our final results show how the learned reward compares to the true reward for the MDP

Run the file 'test.py' and it will compute the expert policy, generate trajectories, and get the IRL rewards for the MDP.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.vscode		.vscode
Plots_new		Plots_new
plots		plots
plots_final		plots_final
q-learning		q-learning
sourav-vi		sourav-vi
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
ValueIteration.py		ValueIteration.py
dmu_proj_gw.drawio		dmu_proj_gw.drawio
gridworld.png		gridworld.png
gridworld.py		gridworld.py
maxEntIRL.py		maxEntIRL.py
new_R_55_100.png		new_R_55_100.png
new_r_55_250.png		new_r_55_250.png
result.png		result.png
test.py		test.py
trajs.csv		trajs.csv

tuhina2313/DMU_Project