Generative_NLP_RL_GAN

Trying to train an NLP generative model in a reinforcement learning setting.

I am currently trying to train a Rainbow DQN (https://arxiv.org/abs/1710.02298) to generate text from a truncated Google Billion Word dataset. Of the Rainbow components, I only use noisy networks, C51 and prioritized experience replay, since the paper seemed to show that these give the biggest gains by far.
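As a rough illustration of the prioritized experience replay component, here is a minimal NumPy sketch of proportional sampling with importance-sampling weights. It is not the code in the PriorityExperienceReplay folder: the function names, the `alpha` and `beta` defaults, and the choice to normalize weights by the batch maximum are all assumptions made for the example.

```python
import numpy as np


def per_probabilities(priorities, alpha=0.6):
    """Turn raw priorities into sampling probabilities: P(i) = p_i^alpha / sum_k p_k^alpha."""
    scaled = np.asarray(priorities, dtype=np.float64) ** alpha
    return scaled / scaled.sum()


def per_sample(priorities, batch_size, alpha=0.6, beta=0.4, rng=None):
    """Sample transition indices in proportion to priority.

    Returns the sampled indices and their importance-sampling weights,
    w_i = (N * P(i))^(-beta), rescaled so the largest weight in the batch is 1
    (assumption: normalizing over the batch rather than the whole buffer).
    """
    rng = np.random.default_rng() if rng is None else rng
    probs = per_probabilities(priorities, alpha)
    idx = rng.choice(len(probs), size=batch_size, p=probs)
    weights = (len(probs) * probs[idx]) ** (-beta)
    return idx, weights / weights.max()
```

With `alpha=0` every transition is sampled uniformly; larger values of `alpha` focus sampling on transitions with high priority (typically a large TD error), and the weights correct the resulting bias in the loss.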

The general idea is to use my DQN to beat several different environments that get progressively harder. In this case, the environment is a discriminator that is trained to differentiate between my DQN's output and the dataset until its loss reaches a threshold (0.1 in this case). The reward is the output of the discriminator, and we consider the environment beaten when the loss of the discriminator reaches another threshold (0.9 in this case).
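The two thresholds above suggest a two-phase loop per environment, which could be sketched roughly as follows. The hooks `train_discriminator_step` and `train_agent_step` are hypothetical stand-ins for the real training code, and the step budget is an assumption; only the 0.1 and 0.9 thresholds come from the description above.

```python
from typing import Callable

D_TRAINED_LOSS = 0.1  # discriminator counts as trained at or below this loss
D_BEATEN_LOSS = 0.9   # environment counts as beaten at or above this loss


def play_environment(
    train_discriminator_step: Callable[[], float],
    train_agent_step: Callable[[], float],
    max_steps: int = 10_000,
) -> bool:
    """Run one round: fit the discriminator, then let the agent try to beat it.

    Each hook performs one update and returns the discriminator's current loss.
    Returns True if the agent pushes that loss up to D_BEATEN_LOSS in time.
    """
    # Phase 1: train the discriminator until it separates real from generated text.
    for _ in range(max_steps):
        if train_discriminator_step() <= D_TRAINED_LOSS:
            break
    else:
        return False  # the discriminator never reached the threshold

    # Phase 2: train the agent against the now fixed discriminator.
    for _ in range(max_steps):
        if train_agent_step() >= D_BEATEN_LOSS:
            return True
    return False


class ToyLoss:
    """Toy stand-in that only simulates how the discriminator loss moves."""

    def __init__(self) -> None:
        self.loss = 0.7

    def d_step(self) -> float:
        self.loss *= 0.5  # discriminator training lowers its loss
        return self.loss

    def a_step(self) -> float:
        self.loss = min(1.0, self.loss + 0.05)  # a better generator raises it
        return self.loss
```

Calling `play_environment(toy.d_step, toy.a_step)` on a fresh `ToyLoss()` walks through both phases. In a real run, a harder environment would then be obtained by retraining the discriminator against the improved generator.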

The model adds one word at a time to a word vector of fixed size, and the discriminator then evaluates the full vector.
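One way to picture this generation loop is the sketch below. It assumes that words are integer ids, that unfilled positions hold a padding id of 0, and that the discriminator scores the whole vector after every added word; the names `policy` and `discriminator` are placeholders, not the repository's actual interfaces.

```python
import numpy as np

PAD_ID = 0  # assumed id for positions that have no word yet


def generate_episode(policy, discriminator, seq_len):
    """Fill a fixed-size vector of word ids one position at a time.

    policy(state) returns the next word id; discriminator(state) returns a
    score for the whole (partly padded) vector, which is used as the reward.
    Returns the final vector and the list of
    (state, action, reward, next_state, done) transitions for the replay buffer.
    """
    state = np.full(seq_len, PAD_ID, dtype=np.int64)
    transitions = []
    for t in range(seq_len):
        action = policy(state)
        next_state = state.copy()
        next_state[t] = action                # add one word
        reward = discriminator(next_state)    # score the full vector
        done = t == seq_len - 1
        transitions.append((state, action, reward, next_state, done))
        state = next_state
    return state, transitions
```

Because the vector has a fixed size, every episode has exactly `seq_len` steps, and the discriminator always sees an input of the same shape.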

Folders and files:

Agent
Environnement
PriorityExperienceReplay
.gitignore
LICENSE
LSTM_Model.py
NoisyDense.py
README.md

LuEE-C/Generative_NLP_RL_GAN

Folders and files

Latest commit

History

Repository files navigation

Generative_NLP_RL_GAN

About

Resources

License

Stars

Watchers

Forks

Languages