text_generator

Text generation with LSTMs.

This project is based on Karpathy's article: http://karpathy.github.io/2015/05/21/rnn-effectiveness/

The aim of this project is to apply clean code techniques on a deep learning project.

Run the tests

Create a python 3.6 venv:

python3.6 -m venv myvenv
source myvenv/bin/activate
pip install -r requirements.txt
pytest -vv tests

Train a model

To train your model with your own data:

Create a directory in data folder and put all your .txt files inside. (We'll use in the following examples zweig with Zweig's novel inside)
Run the train command with the specified arguments:
- --data-dir-name is just the name of the folder in data (not the path)
- --sequence-length is a parameter to choose wisely
- --epoch-number is the number of iteration
- --batch-size: a big batch-size allows GPU to be used more efficiently

run train --data-dir-name=zweig --sequence-length=20 --epoch-number=1000 --batch-size=300

Once trained, get your models checkpoints in models/zweig/checkpoints (replace zweig with your dir name)
You can also find the character list: models/zweig/character_list_in_training_data.json (replace zweig with your dir name)

Predict a text

To predict a text with your best model:

Pick your best model in the checkpoints folder
Move it in the directory above and rename it in model.hdf5. (in this example: models/zweig/model.hdf5)
Run the predict command with the specified arguments:
- --data-dir-name use the same folder name as before (the train part)
- --text-starter is the beginning of the prediction.
  - The length must be equal to the parameter --sequence-length in the training part.
  - The characters of the starter must be present in the training text.
- --prediction-length is the length of the desired text prediction
- --temperature: A low temperature will give something conservative. With a high temperature the predictions will be more original, but with potentially more mistakes.

run predict --data-dir-name=zweig --text-starter="starter of lenght 20" --prediction-length=1000

The prediction will be prompted and written in models/zweig/prediction.txt (replace zweig with your dir name)

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
app		app
data		data
exploration_sandbox		exploration_sandbox
tests		tests
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

app

app

data

data

exploration_sandbox

exploration_sandbox

tests

tests

.gitignore

.gitignore

README.md

README.md

requirements.txt

requirements.txt

setup.py

setup.py

Repository files navigation

text_generator

Run the tests

Train a model

Predict a text

About

Releases

Packages

Contributors 5

Languages

Saxamos/text-generator

Folders and files

Latest commit

History

Repository files navigation

text_generator

Run the tests

Train a model

Predict a text

About

Resources

Stars

Watchers

Forks

Languages