Can I get the new data you used? #1

peternara · 2018-06-05T05:03:20Z

Can I get the new data(included training format) you used?
thanks.:)

symoon11 · 2018-06-06T03:24:35Z

I uploaded training dataset(4000 images) and test dataset(1000 images).
Each dataset consists of 3 gz files(image, label-digit, label-color) and has the same format as the ordinary MNIST dataset.

When you use the dataset

Unzip the datasets.
Each file is a byte code. So you have to change it to integer. Here is an example of decoding the datasets.


import numpy as np
from struct import *

images = open('test-images-ubyte', 'rb')
digits = open('test-label-digit-ubyte', 'rb')
colors = open('test-label-color-ubyte', 'rb')

for i in range(4000):
    image_byte = images.read(28 * 28 * 3)
    digit_byte = digits.read(1)
    color_byte = colors.read(3)

    image = np.reshape(unpack(len(image_byte) * 'B', image_byte), [28, 28, 3])
    digit = unpack(len(digit_byte) * 'B', digit_byte)
    color = unpack(len(color_byte) * 'B', color_byte)

Then, you can get an image and the corresponding digit and color,
Note that digit is not a one-hot vector, but a scalar value.

If there is any problem with the datasets, feel free to contact me.
Thanks.

peternara · 2018-06-08T01:16:36Z

@drillermoon I try it!! thanks :) I appreciate the code you have also released.

peternara closed this as completed Jun 8, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can I get the new data you used? #1

Can I get the new data you used? #1

peternara commented Jun 5, 2018

symoon11 commented Jun 6, 2018

peternara commented Jun 8, 2018 •

edited

Loading

Can I get the new data you used? #1

Can I get the new data you used? #1

Comments

peternara commented Jun 5, 2018

symoon11 commented Jun 6, 2018

peternara commented Jun 8, 2018 • edited Loading

peternara commented Jun 8, 2018 •

edited

Loading