Learning a plan by asking questions

The aim of this research effort is to extend the work by Branavan et al. (2012). Our work will introduce the actions of asking questions as part of the agent planning process.

Run

This assumes you have docker installed on your machine.

$ mkdir tmp && cd tmp
$ wget http://groups.csail.mit.edu/rbg/code/planning/data/env.cache.bz2
$ bzip2 -d env.cache.bz2
$ cd ../
$ sh ./start.sh

How to develop

$ sh ./start.sh # run the entire system
$ sh ./start_test.sh # run 2 iterations
$ sh ./start_gdb.sh # start with gdb

Changelog / Updates

Week 4

Dividing the tasks
Running Branavan's code (and understanding why it did not stop working after running 1 h)
Built a suite for testing

Week 3

(issue) There is no space left on the physical machine given by Karthik
(success) Discover how to augment the policy with information goals
- (code) We model our questions as PDDL predicates, and load it to the same "possible next subgoal" vector as regular subgoals.
(theory) Investigating the policy for predicting the next subgoal
(code) After deciding that a subgoal is a question, we need to execute the question, update C before continuing to sample
- (code/theory) For a trivial retrieval system, we can load all the answers in memory
(success) 42 million new actions are now down to 517 thanks to Nicola's hardcoding of questions and Adam's hate for thresholds.
(success) We found three type of questions:
1. Objects (T)
2. Subgoal (P*T)
3. Comparing two subgoals in the sampled sequence
4. Actions (A)

T=50
A=72
P=7

Week 2

Finalizing where to add questions and planning how to do so
Finally getting Branavan's code running
Setting up the machine given for computation by Regina/Karthik
Write Dockerfile that would compile the code & prepare the environment to run the agent
Setting up GDB to simplify C++ debugging

Week 1

Understanding the paper and the problem
Trying to get Branavan's code to work
Researching on where to add questions

Credits

This work is being actively research by Adam Yala, Nicola Greco, and Sebastien Boyer

Name		Name	Last commit message	Last commit date
Latest commit History 194 Commits
annotations		annotations
bin		bin
cfg		cfg
code		code
data		data
lib		lib
output/t1		output/t1
.dockerignore		.dockerignore
.gdbinit		.gdbinit
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
gdb-learner		gdb-learner
notes.txt		notes.txt
readme		readme
run_cache.sh		run_cache.sh
run_client.sh		run_client.sh
run_everything.sh		run_everything.sh
run_gdb.sh		run_gdb.sh
run_ir.sh		run_ir.sh
run_learner.sh		run_learner.sh
run_pdb.sh		run_pdb.sh
run_test.sh		run_test.sh
start.sh		start.sh
start_gdb.sh		start_gdb.sh
start_test.sh		start_test.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Learning a plan by asking questions

Run

How to develop

Changelog / Updates

Week 4

Week 3

Week 2

Week 1

Credits

About

Releases

Packages

Contributors 3

Languages

master-patch/learning_from_questions

Folders and files

Latest commit

History

Repository files navigation

Learning a plan by asking questions

Run

How to develop

Changelog / Updates

Week 4

Week 3

Week 2

Week 1

Credits

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages