Stochastic Temporal Difference Learning

In this respository, we implmented our proposed Stochastic Temporal Difference (STD) Learning for sequential data.

Stochastic Temporal Difference Learning is a general method to learn temporal abstraction that predict future latent states directly without going through all intermediate states. This learning style mimics human to think and plan with multiple time steps rather than act step by step. We introduce multiple jumpy states in our model, and use VAEs inspired lower bound to learn the latent representation. The latent states not only have the information about how to reconstruct to observation, but also contain the information about how to transit to future states.

Data Description

Moving MNIST
CarRacing-v0 (OpenAI Gym)
Penn TreeBank

Experimental Results

Moving MNIST
- Rollout result
- Latent representation
CarRacing-v0
- Rollout result
- Latent representation

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
MNIST		MNIST
PTB		PTB
figure		figure
openai_CarRacing		openai_CarRacing
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Stochastic Temporal Difference Learning

Data Description

Experimental Results

Code References

About

Releases

Packages

Languages

Raychiu123/Stochastic-Temporal-Difference-Learning

Folders and files

Latest commit

History

Repository files navigation

Stochastic Temporal Difference Learning

Data Description

Experimental Results

Code References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages