This project implements a simple MNIST classifier with a multi-threaded FIFOQueue.
In the usual supervised-learning coding style, the main pipeline looks like:
```python
for iter in range(max_iters):
    # load the next batch into memory (CPU)
    inputs, labels = data_loader.load_next_batch()
    feed_dict = {inputs_tf: inputs, labels_tf: labels}
    # run the parameter update (GPU)
    _, summary_str = sess.run([train_op, summary_op], feed_dict)
    writer.add_summary(summary_str, iter)
    if iter % eval_per_iters == 0:
        eval(......)
```
This is fairly straightforward and easy to implement. However, its main drawback is that while the model is loading data into memory, the GPU sits idle, which slows down training 😓
`tf.FIFOQueue` provides another way to fully utilize the available compute. In the pipeline above, the model sequentially loads the data into memory (CPU) and then performs the update (GPU). What if we could run the update and simultaneously prepare the next batch? The steps are:
- construct a binary file to save your data in the TensorFlow format
  - use `tf.python_io.TFRecordWriter` (recommended), which writes `tf.train.Example` protocol buffers (these contain `Features` as a field) - this step constructs a binary file at the specified path
  - example code in `construct_binary.py` (see the first sketch after this list)
- read the binary file as tensors
  - use `tf.TFRecordReader` (recommended) and decode the binary file the same way you encoded it (step 1) - this step returns your encoded data (e.g. inputs, labels). Note that the returned tensors represent a single example
  - if you call `sess.run([inputs, labels])`, the command will always return the next pair
  - example code in `reader.py` (see the second sketch after this list)
- create batches
  - use `tf.train.shuffle_batch` to create batches from the single-example tensors (also shown in the second sketch below)
- start the queue runners
  - use `tf.train.start_queue_runners(sess=sess)` to launch the threads that fill the queues (see the last sketch after this list)
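As an illustration of step 1, here is a minimal sketch of what the writing side might look like; the feature names (`image_raw`, `label`) and the helper functions are assumptions for illustration, and the actual `construct_binary.py` may differ:

```python
# Sketch of serializing (image, label) pairs into a TFRecord binary file.
# The feature names 'image_raw' and 'label' are assumptions for illustration.
import numpy as np
import tensorflow as tf

def _bytes_feature(value):
    return tf.train.Feature(bytes_list=tf.train.BytesList(value=[value]))

def _int64_feature(value):
    return tf.train.Feature(int64_list=tf.train.Int64List(value=[value]))

def write_tfrecord(images, labels, path):
    """Write each (image, label) pair as a tf.train.Example to `path`."""
    writer = tf.python_io.TFRecordWriter(path)
    for image, label in zip(images, labels):
        example = tf.train.Example(features=tf.train.Features(feature={
            'image_raw': _bytes_feature(image.astype(np.uint8).tostring()),
            'label': _int64_feature(int(label)),
        }))
        writer.write(example.SerializeToString())
    writer.close()
```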
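For steps 2 and 3, a sketch of the reading side, assuming the same feature names as in the writer sketch; the 28x28x1 shape and the normalization are MNIST-specific assumptions:

```python
import tensorflow as tf

def read_single_example(filename_queue):
    """Read and decode one (image, label) pair from a TFRecord file."""
    reader = tf.TFRecordReader()
    _, serialized = reader.read(filename_queue)
    features = tf.parse_single_example(serialized, features={
        'image_raw': tf.FixedLenFeature([], tf.string),
        'label': tf.FixedLenFeature([], tf.int64),
    })
    image = tf.decode_raw(features['image_raw'], tf.uint8)
    image = tf.reshape(image, [28, 28, 1])
    image = tf.cast(image, tf.float32) / 255.0  # normalize to [0, 1]
    label = tf.cast(features['label'], tf.int32)
    return image, label

def input_pipeline(tfrecord_path, batch_size=128):
    """Build batches; shuffle_batch adds a queue filled by background threads."""
    filename_queue = tf.train.string_input_producer([tfrecord_path])
    image, label = read_single_example(filename_queue)
    images, labels = tf.train.shuffle_batch(
        [image, label], batch_size=batch_size,
        capacity=2000, min_after_dequeue=1000, num_threads=4)
    return images, labels
```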
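Finally, a sketch of how a training loop ties everything together: once `tf.train.start_queue_runners` launches the input threads, `sess.run` dequeues ready-made batches with no `feed_dict`. The toy linear model, the file path, and `max_iters` below are placeholder assumptions, not the actual code in `main.py`:

```python
import tensorflow as tf
from reader import input_pipeline  # hypothetical import of the sketch above

max_iters = 10000  # placeholder

images, labels = input_pipeline('mnist_train.tfrecords')  # path is an assumption
# a toy linear classifier, only to make the sketch self-contained
W = tf.Variable(tf.zeros([784, 10]))
b = tf.Variable(tf.zeros([10]))
logits = tf.matmul(tf.reshape(images, [-1, 784]), W) + b
loss = tf.reduce_mean(
    tf.nn.sparse_softmax_cross_entropy_with_logits(labels=labels, logits=logits))
train_op = tf.train.GradientDescentOptimizer(0.5).minimize(loss)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    # launch the background threads that keep the queues filled
    coord = tf.train.Coordinator()
    threads = tf.train.start_queue_runners(sess=sess, coord=coord)
    try:
        for step in range(max_iters):
            # no feed_dict: the next batch is already waiting in the queue
            _, loss_val = sess.run([train_op, loss])
    finally:
        coord.request_stop()
        coord.join(threads)
```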
Requirements:
- python2
- tensorflow (>0.12)
- cuda (>8.0)
- other requirements: `pip install --user -r requirements.txt`
Available options include:
- `--lr` (default `3e-4`, initial learning rate)
- `--batch_size` (default `128`, batch size)
To run the model:
```
python main.py [args]
```
Note: the results shown in TensorBoard/the terminal are the training loss/accuracy.